This vacancy is archived
Updated 2 months ago
Site Reliability Engineer (Gamedev)
Job description
TL;DR
Site Reliability Engineer (Gamedev): ensure the stability and reliability of cloud gaming services by influencing design and operational decisions, with an emphasis on production ownership, code quality, and deployment strategies. The role centers on high-level design, incident response, and automation that reduces toil and improves service reliability and scalability.
Company
The company is a global leader in entertainment that produces PlayStation products and services and strives to create an inclusive environment.
What you will do
- Lead team technical discussions to improve reliability and scalability.
- Create High Level Designs (HLDs) for new products and platforms.
- Mentor junior SRE staff and set them up for success.
- Lead incident response and post-mortem activities within your assigned service team.
- Prioritize reliability improvements to address technical debt and toil in a cross-functional team.
- Contribute code that improves reliability and implement automation to reduce ongoing toil.
Requirements
- At least 5 years of working experience in software development and/or Linux systems administration.
- Strong interpersonal, written, and verbal communication skills.
- Available to be scheduled in an on-call rotation.
- Proficient as a Linux Production Systems Engineer, with experience managing large-scale Web Services infrastructure.
- Development experience in Python (preferred), Bash, Go, Java, C++, or Rust.
- Experience with distributed data storage, NoSQL, data aggregation, scaling RDBMS, monitoring & alerting, Kubernetes/AWS, software distribution, and configuration management.
Culture & Benefits
- Inclusive environment that empowers employees and embraces diversity.
- Candidates are encouraged to apply regardless of background.
- Fair Chance employer: qualified applicants with arrest and conviction records will be considered.