Company hidden
6 hours ago

Sr. Site Reliability Engineer

$175,000 - $200,000
Work format
onsite
Employment type
fulltime
Grade
senior
English
B2
Country
US
Vacancy from Hirify Global, a list of international tech companies

Job description

TL;DR

Sr. Site Reliability Engineer (SRE): Create and evolve systems that automatically run a suite of products and services reliably and consistently, with an emphasis on SLO/SLA success criteria, observability, and incident-driven reliability improvements. The role focuses on building recoverability, eliminating single points of failure, and delivering automation and developer-experience tooling across Kubernetes-based, distributed, event-driven platforms.

Location: Seattle, Washington, United States

Salary: $175,000-$200,000 annually

Company

hirify.global is a Morningstar company building enterprise products and services.

What you will do

  • Build and maintain internal platform services, Kubernetes operators, and observability tooling for enterprise reliability at scale.
  • Define service level indicators (SLIs), service level objectives (SLOs), and error budgets; ensure systems consistently meet or exceed targets.
  • Implement recoverability across services (DR, backups/recovery, multi-AZ/multi-region cloud constructs) and improve failover readiness.
  • Design high-availability and scalability patterns (clustering, load balancing) for containerized cloud-native environments.
  • Develop reusable observability systems (monitoring, telemetry, tracing), including alerting and dashboards.
  • Operate and continuously improve reliability, scalability, performance, security, and uptime; participate in 24/7 on-call response.

Requirements

  • 5+ years building and maintaining Linux/UNIX-based systems in cloud environments (preferably GCP and AWS).
  • 5+ years in Reliability Engineering, DevOps, or infrastructure roles using infrastructure-as-code (Terraform, Puppet, Ansible, Chef).
  • 5+ years coding in an object-oriented language such as Java, Python, Go, or Kotlin.
  • 2+ years with containers and orchestration platforms including Kubernetes and Docker.
  • Deep knowledge of infrastructure systems, networking, and security in cloud environments; experience with scalability, recoverability, and capacity planning.
  • Must be authorized to work in the United States without visa sponsorship now or in the future.

Culture & Benefits

  • Comprehensive health benefits plus life and disability insurance.
  • Paid sabbatical after four years, paid family and paternity leave, and generous vacation/sick/volunteer days.
  • Annual educational stipend and tuition reimbursement; robust training programs.
  • 401(k) match and shared-ownership employee stock program; monthly transportation stipend.
  • Role is expected to be in the office 5 days a week.

Hiring process

  • Interviews to evaluate reliability engineering experience, system design, and operational ownership.
  • Assessment of collaboration and communication in stressful incident scenarios.

Be careful: if an employer asks you to log in to their system using iCloud/Google, send a code or password, or run code or software, do not do it; these are scammers. Be sure to click "Report" or contact support.