Назад
Company hidden
6 дней назад

Lead Site Reliability Engineer (SaaS)

136 000 - 177 000$
Формат работы
remote (только USA)
Тип работы
fulltime
Грейд
lead
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Lead Site Reliability Engineer (SaaS): Leading reliability strategy and cross-team execution for a modern split-plane, multi-region enterprise platform with an accent on system design, resilience, and MTTR reduction. Focus on operationalizing SLOs, advancing BCDR architecture, and mentoring senior engineering teams to ensure platform scalability and cost efficiency.

Location: Must be based in the United States

Salary: $136,000–$177,000

Company

hirify.global is a leading analytics platform provider empowering organizations to transform data into insights through automation, AI, and modern software solutions.

What you will do

  • Define and drive reliability strategy for control-plane and data-plane systems.
  • Establish and operationalize SLOs, SLAs, and error budgets to guide engineering decisions.
  • Lead architecture reviews focusing on scalability, multi-region resilience, and cost efficiency.
  • Drive incident management and systemic fixes to minimize MTTR and prevent recurrences.
  • Champion modern infrastructure automation, CI/CD practices, and AI-driven reliability initiatives.
  • Mentor senior engineers and influence cross-functional technical decisions.

Requirements

  • 6+ years of experience leading delivery of complex distributed systems or SaaS platforms.
  • Proven track record of improving SLOs, MTTR, and reliability at scale.
  • Deep expertise in multi-region, split-plane architectures and Kubernetes (multi-cluster).
  • Strong background in Infrastructure as Code, CI/CD, GitOps, and disaster recovery.
  • Proficiency in at least one language: Python, Java, C++, or JavaScript.
  • Must be legally authorized to work in the U.S. and capable of complying with U.S. export controls.

Nice to have

  • Experience with chaos engineering and large-scale reliability automation.
  • Expertise in modern observability platforms like Datadog or Grafana.
  • Background in enterprise SaaS platforms or split-plane architectures.

Culture & Benefits

  • Comprehensive benefits package including medical, retirement, financial, and wellness programs.
  • Focus on professional growth and a diverse, inclusive team environment.
  • Flexible time off and competitive compensation packages.
  • Culture of innovation, curiosity, and excellence in an unconventional tech environment.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →