Назад
Company hidden
1 день назад

Senior System Reliability Engineer (AI)

130 000 - 150 000$
Формат работы
remote (только USA)/hybrid
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Senior System Reliability Engineer (AI): Designing and maintaining reliable, scalable cloud-native infrastructure for a behavior change AI platform with an accent on automation, monitoring, and incident response. Focus on driving operational excellence through SLO definition, reducing toil, and mentoring teams in SRE best practices.

Location: Remote within the US, with a mandatory one-week onsite onboarding in Tennessee (travel expenses covered).

Salary: $130,000–$150,000

Company

hirify.global is a health technology company utilizing behavioral science and advanced AI to deliver personalized health interventions and improve consumer engagement.

What you will do

  • Architect and maintain automated solutions for deployment, monitoring, and incident response across AWS and Azure.
  • Define and enforce SLIs, SLOs, and error budgets for critical platform services.
  • Reduce operational toil through infrastructure-as-code (Terraform) and platform improvements.
  • Lead incident response, root cause analysis, and postmortem practices for production issues.
  • Collaborate with product and platform teams to embed security and reliability into the development lifecycle.
  • Mentor and coach engineering peers on reliability engineering principles and operational ownership.

Requirements

  • Must be authorized to work in the US without sponsorship.
  • Must be able to attend the first week of onboarding onsite in Tennessee.
  • 5-7 years of related engineering experience.
  • Strong expertise in cloud platforms (AWS/Azure), Kubernetes, and container orchestration.
  • Proficiency in infrastructure-as-code tools and scripting/programming languages like Java, TypeScript, or Python.
  • Deep understanding of Linux networking, distributed systems, and observability tools like Datadog.

Culture & Benefits

  • Comprehensive medical (with HSA), dental, and vision insurance.
  • Company-paid life, AD&D, and short/long-term disability coverage.
  • 401k plan with company match.
  • Flexible time-off policy and 10 paid holidays.
  • Quarterly company closure dates and additional holiday week closures.
  • 6 weeks of paid parental leave.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →