Senior Manager, Site Reliability Engineering
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Senior Manager, Site Reliability Engineering (SRE): Lead a team of ~10 SREs and own both day-to-day operations and the long-term reliability strategy with an accent on scaling multi-tenant infrastructure, improving SLO/SLI-driven alerting, and driving proactive reliability partnership with product engineering. Focus on cloud cost management and FinOps, developer self-service/platform engineering, and leveraging AI tooling in SRE workflows.
Location: Remote (USA)
Salary: $187,000 - $243,000 USD
Company
builds AI-enabled healthcare solutions and value-based care technology.
What you will do
- Lead and grow an SRE team (~10 engineers) across multiple time zones, including hiring, retention, career development, and performance management.
- Partner with product engineering pillars to shift SRE from reactive, ticket-based support to proactive co-ownership of reliability outcomes.
- Scale multi-tenant infrastructure to support new customer onboarding and growing patient populations.
- Own cloud cost management and FinOps practices, balancing cost control with reliability and performance.
- Build developer self-service and platform engineering capabilities, define SLOs/SLIs, and improve alert quality.
- Ensure SRE workflows fully leverage AI tooling (e.g., Claude Code) for IaC generation, log analysis, root cause investigation, and automation.
Requirements
- Location: Must be based in the USA
- 6+ years managing an SRE team and 10+ years hands-on SRE or infrastructure engineering experience.
- Strong familiarity with Kubernetes, GCP (GKE, Cloud SQL, Pub/Sub, GCS), Terraform, Helm, ArgoCD, PostgreSQL, and Prometheus/Grafana.
- Strong programming skills in Python and/or Go, including writing and reviewing infrastructure tooling code.
- Experience with CI/CD pipelines (GitHub Actions) and building/improving developer tooling and automation.
- Experience leading teams across multiple time zones and developing engineers into strong technical contributors.
Culture & Benefits
- Remote-first culture with flexibility and collaboration across global teams.
- Medical, dental, and vision coverage for employees and their families.
- No-Meeting Fridays, monthly company holidays, access to mental health resources, and generous flexible time-off.
- 401k matching, performance-based bonus program, and regular compensation reviews.
- Equity opportunities including ESPP, plus reimbursement for office setup expenses and a monthly cell phone & internet stipend.
- Paid parental leave for all new parents.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →