Manager of Site Reliability Engineering
Job description
TL;DR
Manager of Site Reliability Engineering (SRE): Leading reliability, performance, operational excellence, and cost efficiency of production systems across hybrid cloud and on-prem environments, with an emphasis on uptime, SLAs, FinOps, and observability. The role focuses on managing SRE operations, incident management, infrastructure automation with Terraform and GitOps, and cross-team collaboration for continuous delivery.
Location: US - Remote
Company
Provider of high-availability SaaS platforms supporting business growth.
What you will do
- Lead SRE operations for 24/7/365 availability, owning uptime, SLAs, SLIs, SLOs, error budgets, MTTR, and incident trends.
- Oversee incident management, on-call rotations, and post-incident reviews.
- Drive FinOps practices, cost optimization, right-sizing, and infrastructure waste elimination with visibility and reporting.
- Define observability standards using tools like Coralogix, OpenTelemetry, and FireHydrant across AWS, Azure, and vSphere.
- Champion GitOps, pull request governance, and Terraform-based infrastructure automation.
- Partner with Product, Engineering, Infrastructure, Finance, and Support teams; lead, mentor, and develop the SRE team.
Requirements
- Leadership experience managing SRE, DevOps, or Infrastructure teams.
- Experience operating hybrid (cloud and on-prem) production environments.
- Proven experience with FinOps and cost optimization initiatives.
- Experience with GitOps workflows, Terraform, and observability tooling.
- Must be eligible to work remotely in the US.
Nice to have
- Bachelor’s degree in Computer Science, Engineering, or related field.
- 8+ years in SRE, DevOps, or Infrastructure, including people leadership.
- Cloud certifications (AWS Solutions Architect, Google Cloud Architect, Azure).
- Experience in Agile/Scrum, Jira, high-availability SaaS, CI/CD frameworks, and application modernization.