Назад
Company hidden
9 часов назад

Staff Site Reliability Engineer (Observability)

194 000 - 267 000$
Формат работы
hybrid
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Staff Site Reliability Engineer (Observability/GCP): Building and expanding a comprehensive, scalable Observability Platform within Google Cloud Platform with an accent on infrastructure as code and automated telemetry collection. Focus on optimizing data processing for Splunk and Grafana, eliminating toil through automation, and ensuring high reliability of distributed systems.

Location: Hybrid in San Francisco, California. Must be a U.S. Person (Citizen, National, Lawful Permanent Resident, Refugee, or Asylee)

Salary: $194,000 — $267,000 USD

Company

hirify.global is a leader in identity security, providing the trusted infrastructure that enables organizations to safely embrace AI and digital transformation.

What you will do

  • Design, build, and maintain scalable observability infrastructure using Terraform.
  • Optimize collection, processing, and storage of observability data for Splunk and Grafana services in GCP.
  • Automate the deployment and scaling of observability agents and collectors to eliminate toil.
  • Lead post-incident reviews and participate in on-call rotations to drive systemic improvements.

Requirements

  • Minimum 5+ years of experience scaling and managing observability in GCP/GKE.
  • Minimum 3+ years in SRE, DevOps, or Systems Engineering focusing on high-availability systems.
  • Strong coding proficiency in Python or Go for building internal tools.
  • Expertise in creating actionable dashboards in Splunk or Grafana.
  • Deep understanding of Linux internals, TCP/IP, DNS, and Kubernetes.
  • Must be able to provide documentation establishing U.S. Person status upon hire.

Nice to have

  • Hands-on experience with OpenTelemetry (OTel) or Vector.
  • Experience migrating Splunk to Grafana Loki.
  • Experience managing observability native tools within AWS.

Culture & Benefits

  • Comprehensive health, dental, and vision insurance.
  • 401(k) and flexible spending accounts.
  • Paid time off and parental leave.
  • Immersive in-person onboarding experience to accelerate impact.
  • Culture focused on well-being, social impact, and talent development.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →