Platform Site Reliability Engineer

174 000 - 272 000$

Формат работы

hybrid

Тип работы

fulltime

Грейд

middle/senior

Английский

Страна

US, Switzerland

Вакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:

TL;DR

Platform Site Reliability Engineer (SRE): Strengthening infrastructure and accelerating the ability to deploy, monitor, and scale systems effectively for a multi-tenant SaaS platform with an accent on reliability, security, and scalability. Focus on incident response, troubleshooting, and embedding reliability and observability into every service.

Location: Must be located in West or Mountain Time Zone.

Salary: 174000 - 272000

Company

hirify.global is the leader in digital employee experience management software, providing IT leaders with insights to diagnose and fix issues impacting employees.

What you will do

Design, build, and maintain infrastructure powering a multi-tenant SaaS platform.
Implement and manage cloud-native systems (AWS) using best-in-class tools and automation.
Operate and enhance Kubernetes clusters and deployment pipelines.
Establish and enforce SLOs, SLAs, and error budgets.
Improve incident response practices and reduce mean time to detect and recover.
Support automated testing, canary deployments, and rollback strategies.

Requirements

Minimum BS in Computer Science/Engineering.
5+ years in an SRE/platform engineering role supporting SaaS platforms.
Strong hands-on experience with public cloud services (AWS, GCP, Azure).
Proficiency with Kubernetes and CI/CD pipelines.
Strong programming or scripting skills (Python, Go, Bash...).
Strong system-level troubleshooting skills and a proactive mindset toward incident prevention.
Comfort with being part of a rotating on-call schedule, including handling critical incidents and conducting post-incident reviews.

Nice to have

Familiarity with service mesh.
Knowledge of zero-downtime deployment strategies.
Exposure to compliance standards such as SOC 2, ISO 27001, or HIPAA.
Experience with chaos engineering or resilience testing practices.

Culture & Benefits

Flexible hours and unlimited vacation.
Hybrid work model that balances office and remote work.
Free access to professional training platforms.
401(k) plan featuring up to 4% company matching contributions.
Bonuses for referring successful hires.

Будьте осторожны: если вас просят войти в iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →