2 месяца назад

Staff Site Reliability Engineer (Observability)

147 000 - 202 000$

Формат работы

hybrid

Тип работы

fulltime

Грейд

senior

Английский

Страна

Вакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:

TL;DR

Staff Site Reliability Engineer (Observability): Building and evolving a comprehensive, scalable Observability Platform with an accent on Splunk ecosystem optimization and infrastructure as code. Focus on automating the deployment of agents and collectors across complex distributed systems to eliminate toil and ensure high reliability.

Location: Bellevue, Washington. Must be a U.S. Person (Citizen, National, Lawful Permanent Resident, Refugee, or Asylee). Requires in-person onboarding in San Francisco during the first week.

Salary: $147,000 — $202,000 USD

Company

Okta secures AI and human identities by building trusted, neutral infrastructure that enables organizations to safely embrace the AI era.

What you will do

Design, build, and maintain scalable observability infrastructure using Terraform.
Optimize the collection, processing, and storage of log data within Splunk Cloud to ensure high reliability and low latency.
Participate in on-call rotations and lead post-incident reviews to drive systemic improvements.
Automate the deployment and scaling of observability agents and collectors to eliminate operational toil.

Requirements

U.S. Person status required (U.S. Citizen, National, Lawful Permanent Resident, Refugee, or Asylee).
Ability to attend in-person onboarding in San Francisco.
5+ years of experience scaling and managing Splunk Cloud (1000+ SVCs), including Workload Management (WLM) and HEC optimization.
5+ years of experience in SRE, DevOps, or Systems Engineering for high-availability systems.
Strong coding proficiency in SPL and Go for building internal tools and automating workflows.
Deep understanding of Linux internals, networking (TCP/IP, DNS, Load Balancing), and container orchestration (Kubernetes/EKS).

Nice to have

Hands-on experience with OpenTelemetry (OTel), Vector, or similar instrumentation frameworks.
Experience implementing Splunk charge-back apps for usage reporting.
Experience managing native observability tools within AWS or GCP.

Culture & Benefits

Comprehensive health, dental, and vision insurance.
401(k), flexible spending accounts, and paid leave (including PTO and parental leave).
Immersive, in-person onboarding experience to accelerate impact.
Global community spanning over 20 offices worldwide.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →

Похожие вакансии

Staff Site Reliability Engineer (Observability)

Okta

Мэтч & Сопровод

Описание вакансии

TL;DR

Company

What you will do

Requirements

Nice to have

Culture & Benefits

Похожие вакансии

Senior Site Reliability Engineer (AWS)

Site Reliability Engineer

Staff Platform Engineer (DevOps)

Staff Site Reliability Engineer (SaaS)

Platform Operations Engineer (AWS)

Monitoring & Observability Lead (DevOps)

Разработка

Game Dev

Design и Creative

Аналитика

Менеджмент

People & Business

Staff Site Reliability Engineer (Observability)

Okta

Мэтч & Сопровод

Описание вакансии

TL;DR

Company

What you will do

Requirements

Nice to have

Culture & Benefits

Categories

Похожие вакансии

Senior Site Reliability Engineer (AWS)

Site Reliability Engineer

Staff Platform Engineer (DevOps)

Staff Site Reliability Engineer (SaaS)

Platform Operations Engineer (AWS)

Monitoring & Observability Lead (DevOps)