Company hidden
23 hours ago

Staff Site Reliability Engineer (Observability)

$147,000 - $202,000
Work format
hybrid
Employment type
full-time
Grade
senior
English
B2
Country
US

Job description

TL;DR

Staff Site Reliability Engineer (Observability): Build and evolve a comprehensive, scalable Observability Platform, with an emphasis on Splunk ecosystem optimization and infrastructure as code. The role focuses on automating the deployment of agents and collectors across complex distributed systems to eliminate toil and ensure high reliability.

Location: Bellevue, Washington. Must be a U.S. Person (Citizen, National, Lawful Permanent Resident, Refugee, or Asylee). Requires in-person onboarding in San Francisco during the first week.

Salary: $147,000 — $202,000 USD

Company

hirify.global secures AI and human identities by building trusted, neutral infrastructure that enables organizations to safely embrace the AI era.

What you will do

  • Design, build, and maintain scalable observability infrastructure using Terraform.
  • Optimize the collection, processing, and storage of log data within Splunk Cloud to ensure high reliability and low latency.
  • Participate in on-call rotations and lead post-incident reviews to drive systemic improvements.
  • Automate the deployment and scaling of observability agents and collectors to eliminate operational toil.

Requirements

  • U.S. Person status required (U.S. Citizen, National, Lawful Permanent Resident, Refugee, or Asylee).
  • Ability to attend in-person onboarding in San Francisco.
  • 5+ years of experience scaling and managing Splunk Cloud (1000+ SVCs), including Workload Management (WLM) and HEC optimization.
  • 5+ years of experience in SRE, DevOps, or Systems Engineering for high-availability systems.
  • Strong coding proficiency in SPL and Go for building internal tools and automating workflows.
  • Deep understanding of Linux internals, networking (TCP/IP, DNS, Load Balancing), and container orchestration (Kubernetes/EKS).

Nice to have

  • Hands-on experience with OpenTelemetry (OTel), Vector, or similar instrumentation frameworks.
  • Experience implementing Splunk charge-back apps for usage reporting.
  • Experience managing native observability tools within AWS or GCP.

Culture & Benefits

  • Comprehensive health, dental, and vision insurance.
  • 401(k), flexible spending accounts, and paid leave (including PTO and parental leave).
  • Immersive, in-person onboarding experience to accelerate impact.
  • Global community spanning over 20 offices worldwide.
