Senior Site Reliability Engineer (Kubernetes/Terraform)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Senior Site Reliability Engineer (Kubernetes/Terraform): Ensuring the stability, scalability, and reliability of cloud storage services with an accent on automation, observability, and incident response. Focus on designing scalable automation solutions, managing container orchestration, and embedding reliability practices into the development lifecycle.
Location: Remote (Must be based in the US)
Salary: $150,000 - $200,000
Company
is a leader in open cloud object storage, managing data for over 500,000 customers worldwide.
What you will do
- Drive the availability, durability, and performance of critical services across all production environments.
- Design and architect scalable automation solutions using Terraform, Ansible, and Jenkins to eliminate operational toil.
- Manage and evolve observability frameworks including Prometheus, Grafana, and ELK for comprehensive system monitoring.
- Lead critical incident response and post-incident reviews, translating findings into long-term architectural improvements.
- Develop production-grade reliability tools and system enhancements using Python, Go, or Bash.
- Partner with engineering and product teams on resilient system design and lead the Production Readiness Review (PRR) process.
Requirements
- 8+ years of progressive experience in site reliability, systems engineering, or operations.
- Expert-level Linux systems administration and advanced troubleshooting skills.
- Advanced proficiency in Python or Go.
- Proven experience designing and operating Kubernetes, Docker, and Hashicorp products (Nomad, Vault, Terraform).
- Experience scaling and operating large-scale production-grade distributed systems.
- Must be based in the United States.
Nice to have
- Significant experience in SaaS or hyper-scale distributed systems environments.
- Deep familiarity with ITIL/OSS practices and experience defining SLO/SLA standards.
- Advanced experience with cloud platforms such as AWS, GCP, or Azure in production.
Culture & Benefits
- Comprehensive healthcare for family, including dental and vision.
- Competitive compensation, 401K, RSU grants, and ESPP program.
- Flexible vacation policy and maternity/paternity leave.
- MacBook Pro and a generous stipend to personalize your workstation.
- Learning and development program and a culture supporting healthy work-life balance.
- Childcare bonus and fertility treatment support.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →