Назад
Company hidden
9 часов назад

Manager of Site Reliability Engineering

Формат работы
remote (только USA)
Тип работы
fulltime
Грейд
lead
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Manager of Site Reliability Engineering (SRE): Leading reliability, performance, operational excellence, and cost efficiency of production systems across hybrid cloud and on-prem environments with an accent on uptime, SLAs, FinOps, and observability. Focus on managing SRE operations, incident management, infrastructure automation with Terraform and GitOps, and cross-team collaboration for continuous delivery.

Location: US - Remote

Company

Provider of high-availability SaaS platforms supporting business growth.

What you will do

  • Lead SRE operations for 24/7/365 availability, owning uptime, SLAs, SLIs, SLOs, error budgets, MTTR, and incident trends.
  • Oversee incident management, on-call rotations, and post-incident reviews.
  • Drive FinOps practices, cost optimization, right-sizing, and infrastructure waste elimination with visibility and reporting.
  • Define observability standards using tools like Coralogix, Open Telemetry, and FireHydrant across AWS, Azure, and Vsphere.
  • Champion GitOps, pull request governance, and Terraform-based infrastructure automation.
  • Partner with Product, Engineering, Infrastructure, Finance, and Support teams; lead, mentor, and develop SRE team.

Requirements

  • Leadership experience managing SRE, DevOps, or Infrastructure teams.
  • Experience operating hybrid (cloud and on-prem) production environments.
  • Proven experience with FinOps and cost optimization initiatives.
  • Experience with GitOps workflows, Terraform, and observability tooling.
  • Must be eligible to work remotely in the US.

Nice to have

  • Bachelor’s degree in Computer Science, Engineering, or related field.
  • 8+ years in SRE, DevOps, Infrastructure, including people leadership.
  • Cloud certifications (AWS Solutions Architect, Google Cloud Architect, Azure).
  • Experience in Agile/Scrum, Jira, high-availability SaaS, CI/CD frameworks, and application modernization.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →