Назад
1 день назад

Staff Site Reliability Engineer (GCP)

194 000 - 267 000$
Формат работы
hybrid
Тип работы
fulltime
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Staff Site Reliability Engineer (GCP): Building and expanding a world-class, scalable observability platform within Google Cloud with an accent on infrastructure as code, automation, and distributed systems reliability. Focus on optimizing data collection for Splunk and Grafana, eliminating toil through automation, and driving observability-driven development across complex environments.

Location: Must be based in or able to work from Bellevue, WA; Chicago, IL; New York, NY; San Francisco, CA; or Washington, DC. This role requires U.S. Person status for access to federal data.

Salary: $194,000–$267,000 USD

Company

Okta is a leading identity and access management provider, securing digital identities for organizations worldwide.

What you will do

  • Design and maintain scalable observability infrastructure using Terraform.
  • Optimize the collection, processing, and storage of observability data for Splunk and Grafana services.
  • Automate the deployment and scaling of agents and collectors to eliminate toil.
  • Participate in on-call rotations and lead post-incident reviews.
  • Develop internal tools and automation workflows using Python or Go.
  • Collaborate with SRE teams to improve system reliability and performance.

Requirements

  • 5+ years of experience scaling and managing observability in GCP.
  • 3+ years of experience in an SRE, DevOps, or Systems Engineering role.
  • Expertise in creating actionable Splunk or Grafana dashboards.
  • Strong coding proficiency in Python or Go.
  • Deep understanding of Linux internals, networking, and Kubernetes/GKE.
  • Must be a U.S. Person (Citizen, Permanent Resident, etc.) to access federal data.

Nice to have

  • Hands-on experience with OpenTelemetry (OTel) or Vector.
  • Experience migrating Splunk to Grafana Loki.
  • Experience managing observability tools in AWS.

Culture & Benefits

  • Comprehensive health, dental, and vision insurance.
  • 401(k) retirement plan with company matching.
  • Flexible spending accounts (HSA/FSA).
  • Paid leave including PTO and parental leave.
  • Equity and bonus programs.
  • Global community with a focus on innovation and social impact.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →