This vacancy is archived
Company hidden
updated 2 months ago

Site Reliability Engineer (Gamedev)

Work format
onsite
Employment type
fulltime
Grade
senior
English
b2
Country
Australia

Job description

TL;DR

Site Reliability Engineer: influence design and operational decisions for the overall stability of a cloud gaming service, with an emphasis on production ownership, code quality, and deployments. The role focuses on ensuring operational readiness, responding to incidents, and implementing automation to reduce toil in a large-scale web services infrastructure.

Location: Onsite in Adelaide, Australia

Company

hirify.global is a global leader in entertainment, producing the PlayStation family of products and services, including consoles, VR, and acclaimed software titles from PlayStation Studios.

What you will do

  • Lead technical discussions focused on reliability and scalability improvements.
  • Contribute to High-Level Designs for new products and platforms.
  • Mentor junior SRE staff.
  • Lead incident response and post-mortem activities within your assigned service team.
  • Collaborate with other engineers in a cross-functional team to prioritize reliability improvements and address technical debt.
  • Contribute code that improves reliability and implement automation to reduce ongoing toil.

Requirements

  • At least 5 years of working experience in a Software Development and/or Linux Systems Administration role.
  • Strong interpersonal, written and verbal communication skills.
  • Available to be scheduled in an on-call rotation.
  • Proficient as a Linux Production Systems Engineer, with experience managing large-scale web services infrastructure.
  • Development experience in one or more of the following programming languages: Python (preferred), Bash, Go, Java, C++, or Rust.
  • Experience with at least three of the following areas: Distributed data storage (Hadoop, Ceph), NoSQL (MongoDB, Redis, Cassandra), Data Aggregation (Elasticsearch, Kafka), RDBMS (PostgreSQL, MySQL) with High Availability, Monitoring & Alerting (Prometheus, Grafana), Kubernetes and/or AWS, Software Distribution, Configuration Management (Ansible, SaltStack, Puppet, Chef).

Nice to have

  • QA or SDET experience.

Culture & Benefits

  • Work for a global leader in entertainment, contributing to the cloud gaming revolution.
  • Be part of an inclusive environment that empowers employees and embraces diversity.
  • Influence design and operational decisions towards overall service stability.
  • Engage throughout the software development lifecycle to ensure operational readiness.