12 часов назад
Site Reliability Engineering Manager (SRE)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
Текст:
TL;DR
Site Reliability Engineering Manager (SRE): Leading the reliability, scalability, and performance of critical production platforms with an accent on database expertise and operational excellence. Focus on building a culture of automation, managing SLIs/SLOs, and optimizing high-availability systems.
Location: Montevideo
Company
A global leader in simulation, training, and mission readiness, supporting critical operations worldwide for nearly 80 years.
What you will do
- Lead and develop a team of reliability engineers across database and platform domains.
- Own the availability, performance, and resilience of production databases and dependent platform services.
- Define and enforce SLIs, SLOs, and error budgets for critical services.
- Drive Infrastructure as Code (IaC), operational automation, and self-healing mechanisms to reduce manual toil.
- Define the observability strategy, including metrics, logs, traces, and alerting for proactive detection.
- Oversee disaster recovery planning, RTO/RPO alignment, and backup strategies.
Requirements
- Location: Must be based in Montevideo.
- 7+ years of experience in Site Reliability Engineering, Database Engineering, or production engineering.
- 2+ years of experience managing engineers or technical teams.
- Strong expertise in relational databases such as PostgreSQL, Oracle, or SQL Server.
- Experience operating reliable services in AWS or hybrid environments.
- Proficiency with Kubernetes, containerized workloads, and platform engineering practices.
Nice to have
- Experience with distributed systems and large-scale production environments.
- Expertise in automation frameworks, IaC, and CI/CD pipelines.
- Experience with FinOps, cloud cost optimization, or capacity modeling.
- Familiarity with modern observability platforms and reliability review processes.
Culture & Benefits
- Purpose-driven organization focused on making the world a safer place.
- Collaborative environment where bold ideas are encouraged.
- Opportunities for professional growth and development within a global corporate structure.
- Commitment to equal opportunity and reasonable accommodations for all employees.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →
Похожие вакансии
3 дня назад
Senior Site Reliability Engineer (SRE)
3 дня назад
Senior Site Reliability Engineer (AWS)
135 000 - 150 000$
Okta
2 дня назад
Manager, Site Reliability Engineering (AWS)
204 000 - 306 000$
22 часа назад
Senior Reliability Engineer (SRE)
100 300 - 150 000CAD
3 дня назад
Site Reliability Engineer
125 000 - 135 000$
SigNoz
4 дня назад
Senior Site Reliability Engineer (SRE) (Observability)
5 000 000 - 7 000 000INR