Job description
TL;DR
Staff Site Reliability Engineer (GCP): Build and expand a world-class, scalable observability platform within Google Cloud, with an emphasis on infrastructure as code, automation, and distributed systems reliability. The role focuses on optimizing data collection for Splunk and Grafana, eliminating toil through automation, and driving observability-driven development across complex environments.
Location: Must be based in or able to work from Bellevue, WA; Chicago, IL; New York, NY; San Francisco, CA; or Washington, DC. This role requires U.S. Person status for access to federal data.
Salary: $194,000–$267,000 USD
Company
Okta is a leading identity and access management provider, securing digital identities for organizations worldwide.
What you will do
- Design and maintain scalable observability infrastructure using Terraform.
- Optimize the collection, processing, and storage of observability data for Splunk and Grafana services.
- Automate the deployment and scaling of agents and collectors to eliminate toil.
- Participate in on-call rotations and lead post-incident reviews.
- Develop internal tools and automation workflows using Python or Go.
- Collaborate with SRE teams to improve system reliability and performance.
Requirements
- 5+ years of experience scaling and managing observability in GCP.
- 3+ years of experience in an SRE, DevOps, or Systems Engineering role.
- Expertise in creating actionable Splunk or Grafana dashboards.
- Strong coding proficiency in Python or Go.
- Deep understanding of Linux internals, networking, and Kubernetes/GKE.
- Must be a U.S. Person (Citizen, Permanent Resident, etc.) to access federal data.
Nice to have
- Hands-on experience with OpenTelemetry (OTel) or Vector.
- Experience migrating Splunk to Grafana Loki.
- Experience managing observability tools in AWS.
Culture & Benefits
- Comprehensive health, dental, and vision insurance.
- 401(k) retirement plan with company matching.
- Flexible spending accounts (HSA/FSA).
- Paid leave including PTO and parental leave.
- Equity and bonus programs.
- Global community with a focus on innovation and social impact.