Site Reliability Engineer (Gamedev)

Формат работы

onsite

Тип работы

fulltime

Грейд

senior

Английский

Страна

Australia

Описание вакансии

Текст:

TL;DR

Site Reliability Engineer: Influencing design and operational decisions for the overall stability of a cloud gaming service with an accent on production ownership, code quality, and deployments. Focus on ensuring operational readiness, incident response, and implementing automation to reduce toil in a large-scale web services infrastructure.

Location: Onsite in Adelaide, Australia

Company

hirify.global is a global leader in entertainment, producing the PlayStation family of products and services, including consoles, VR, and acclaimed software titles from PlayStation Studios.

What you will do

Lead technical discussions focused on reliability and scalability improvements.
Contribute to High-Level Designs for new products and platforms.
Mentor junior SRE staff.
Lead incident response and post-mortem activities within your assigned service team.
Collaborate with other Engineers in a cross-functional team to prioritize reliability improvements and address technical debt.
Contribute to code to improve reliability and implement automation to reduce ongoing toil.

Requirements

Minimum of 5+ years working experience in Software Development and/or Linux Systems Administration role.
Strong interpersonal, written and verbal communication skills.
Available to be scheduled in on-call rotation.
Proficient as a Linux Production Systems Engineer, with experience managing large scale Web Services infrastructure.
Development experience in one or more of the following programming languages: Python (preferred), Bash, Go, Java, C++, or Rust.
Experience with at least 3 of the following topics: Distributed data storage (Hadoop, Ceph), NoSQL (MongoDB, Redis, Cassandra), Data Aggregation (ElasticSearch, Kafka), RDBMS (PostgreSQL, MySQL) with High Availability, Monitoring & Alerting (Prometheus, Grafana), Kubernetes and/or AWS, Software Distribution, Configuration Management (Ansible, SaltStack, Puppet, Chef).

Nice to have

QA or SDET experience.

Culture & Benefits

Work for a global leader in entertainment, contributing to the cloud gaming revolution.
Be part of an inclusive environment that empowers employees and embraces diversity.
Influence design and operational decisions towards overall service stability.
Engage throughout the software development lifecycle to ensure operational readiness.