Company hidden
14 hours ago

Senior Site Reliability Engineer (Kubernetes/Terraform)

$150,000 - $200,000
Work format
remote (USA only)
Employment type
fulltime
Grade
senior
English
b2
Country
US

Job description

TL;DR

Senior Site Reliability Engineer (Kubernetes/Terraform): Ensuring the stability, scalability, and reliability of cloud storage services, with an emphasis on automation, observability, and incident response. The role focuses on designing scalable automation solutions, managing container orchestration, and embedding reliability practices into the development lifecycle.

Location: Remote (Must be based in the US)

Salary: $150,000 - $200,000

Company

hirify.global is a leader in open cloud object storage, managing data for over 500,000 customers worldwide.

What you will do

  • Drive the availability, durability, and performance of critical services across all production environments.
  • Design and architect scalable automation solutions using Terraform, Ansible, and Jenkins to eliminate operational toil.
  • Manage and evolve observability frameworks including Prometheus, Grafana, and ELK for comprehensive system monitoring.
  • Lead critical incident response and post-incident reviews, translating findings into long-term architectural improvements.
  • Develop production-grade reliability tools and system enhancements using Python, Go, or Bash.
  • Partner with engineering and product teams on resilient system design and lead the Production Readiness Review (PRR) process.

Requirements

  • 8+ years of progressive experience in site reliability, systems engineering, or operations.
  • Expert-level Linux systems administration and advanced troubleshooting skills.
  • Advanced proficiency in Python or Go.
  • Proven experience designing and operating Kubernetes, Docker, and HashiCorp products (Nomad, Vault, Terraform).
  • Experience scaling and operating large-scale production-grade distributed systems.
  • Must be based in the United States.

Nice to have

  • Significant experience in SaaS or hyper-scale distributed systems environments.
  • Deep familiarity with ITIL/OSS practices and experience defining SLO/SLA standards.
  • Advanced experience with cloud platforms such as AWS, GCP, or Azure in production.

Culture & Benefits

  • Comprehensive healthcare for you and your family, including dental and vision.
  • Competitive compensation, 401(k), RSU grants, and an ESPP.
  • Flexible vacation policy and maternity/paternity leave.
  • MacBook Pro and a generous stipend to personalize your workstation.
  • Learning and development program and a culture supporting healthy work-life balance.
  • Childcare bonus and fertility treatment support.
