Site Reliability Engineer (Hosted Infra) - Platform (Cloud)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Site Reliability Engineer (Cloud/Golang): Building and scaling multi-cloud infrastructure across 4 providers and 70+ regions with an accent on automation and Infrastructure as Code. Focus on engineering software to eliminate toil, improving observability, and maintaining high system reliability for the Search AI Platform.
Location: United States
Salary: $143,100—$175,000 USD
Company
is a Search AI company providing cloud-based solutions for search, security, and observability used by over 50% of the Fortune 500.
What you will do
- Engineer software in Golang to automate large-scale systems and internal tools.
- Optimize host reliability and lifecycles across multiple cloud providers.
- Develop alerting and monitoring systems to prioritize incident prevention over response.
- Scale global infrastructure and evolve management processes to meet growing demand.
- Participate in a balanced SRE on-call rotation, leading postmortems and runbook improvements.
- Contribute to code reviews, technical documentation, and team mentorship.
Requirements
- Experience building software with Golang.
- Production experience operating large-scale cloud compute (100+ hosts) via automated workflows.
- Deep expertise in Linux systems and OS-level debugging.
- Proficiency with containerized workloads in production environments.
- Strong communication skills for asynchronous and real-time collaboration across time zones.
- Ability to create maintainable software designs, runbooks, and architecture diagrams.
Nice to have
- Experience with Terraform, Puppet, Ansible, Argo CD, Argo Workflows, or Kubernetes.
- Proficiency with observability tools like Stack, Prometheus, or Influx.
- Hands-on experience engineering solutions using the Stack.
Culture & Benefits
- Competitive base salary and eligibility for the company stock program.
- Company-matched 401k up to 6% of eligible earnings.
- Comprehensive health coverage for employees and their families in many locations.
- Flexible locations and schedules supporting a distributed work culture.
- Generous vacation allowance and a minimum of 16 weeks of parental leave.
- Support for volunteer work (up to 40 hours/year) and matching financial donations up to $2,000.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →