Назад
Company hidden
7 часов назад

Platform Site Reliability Engineer

174 000 - 272 000$
Формат работы
hybrid
Тип работы
fulltime
Грейд
middle/senior
Английский
b2
Страна
US, Switzerland
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Platform Site Reliability Engineer (SRE): Strengthening infrastructure and accelerating the ability to deploy, monitor, and scale systems effectively for a multi-tenant SaaS platform with an accent on reliability, security, and scalability. Focus on incident response, troubleshooting, and embedding reliability and observability into every service.

Location: Must be located in West or Mountain Time Zone.

Salary: 174000 - 272000

Company

hirify.global is the leader in digital employee experience management software, providing IT leaders with insights to diagnose and fix issues impacting employees.

What you will do

  • Design, build, and maintain infrastructure powering a multi-tenant SaaS platform.
  • Implement and manage cloud-native systems (AWS) using best-in-class tools and automation.
  • Operate and enhance Kubernetes clusters and deployment pipelines.
  • Establish and enforce SLOs, SLAs, and error budgets.
  • Improve incident response practices and reduce mean time to detect and recover.
  • Support automated testing, canary deployments, and rollback strategies.

Requirements

  • Minimum BS in Computer Science/Engineering.
  • 5+ years in an SRE/platform engineering role supporting SaaS platforms.
  • Strong hands-on experience with public cloud services (AWS, GCP, Azure).
  • Proficiency with Kubernetes and CI/CD pipelines.
  • Strong programming or scripting skills (Python, Go, Bash...).
  • Strong system-level troubleshooting skills and a proactive mindset toward incident prevention.
  • Comfort with being part of a rotating on-call schedule, including handling critical incidents and conducting post-incident reviews.

Nice to have

  • Familiarity with service mesh.
  • Knowledge of zero-downtime deployment strategies.
  • Exposure to compliance standards such as SOC 2, ISO 27001, or HIPAA.
  • Experience with chaos engineering or resilience testing practices.

Culture & Benefits

  • Flexible hours and unlimited vacation.
  • Hybrid work model that balances office and remote work.
  • Free access to professional training platforms.
  • 401(k) plan featuring up to 4% company matching contributions.
  • Bonuses for referring successful hires.

Будьте осторожны: если вас просят войти в iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →