Senior Site Reliability Engineer (Observability)
Job description
TL;DR
Senior Site Reliability Engineer (Observability): Owning and evolving the Splunk ecosystem to deliver a scalable observability platform, with an emphasis on automation and log management. Focus on eliminating toil through infrastructure as code and optimizing high-availability distributed systems.
Location: Must be based in the US (Bellevue, WA) and be a U.S. Person (Citizen, National, Lawful Permanent Resident, etc.) for federal data access. In-person onboarding in San Francisco is required for the first week.
Salary: $147,000 — $202,000 USD
Company
A trusted identity infrastructure company securing AI and human identities globally.
What you will do
- Design and maintain scalable observability infrastructure using Terraform.
- Optimize Splunk Cloud collection, processing, and storage for high reliability and low latency.
- Participate in on-call rotations and lead post-incident reviews to drive systemic improvements.
- Automate the deployment and scaling of observability agents and collectors to eliminate toil.
- Collaborate with SRE teams and business partners to deliver a world-class observability platform.
Requirements
- 5+ years of experience scaling Splunk Cloud (1000+ SVCs), including workload management (WLM) and HTTP Event Collector (HEC) optimization.
- 3+ years of experience in SRE, DevOps, or Systems Engineering focusing on high-availability systems.
- Proficiency in SPL and Go for building internal tools and automating workflows.
- Deep understanding of Linux internals, networking (TCP/IP, DNS, Load Balancing), and Kubernetes/EKS.
- Must be a U.S. Person (Citizen, National, Lawful Permanent Resident, Refugee, or Asylee) to access protected federal data.
Nice to have
- Experience with OpenTelemetry (OTel), Vector, or similar instrumentation frameworks.
- Experience implementing Splunk charge-back applications for usage reporting.
- Experience managing native observability tools within AWS or GCP.
Culture & Benefits
- Comprehensive health, dental, and vision insurance.
- 401(k) and flexible spending account (FSA).
- Paid time off (PTO) and parental leave.
- Immersive, in-person onboarding experience to accelerate impact.
- Commitment to diversity, equity, and inclusion as an Equal Opportunity Employer.