Lead Site Reliability Engineer (AWS/Kubernetes)
Job description
TL;DR
Lead Site Reliability Engineer (AWS/Kubernetes): Lead reliability consulting across multiple teams to improve resilience and sustainable engineering practices, with an emphasis on SLO governance, error budgets, and systemic risk reduction. The role focuses on designing reusable reliability mechanisms, optimizing observability strategies, and mentoring senior engineers.
Location: Remote (Must be based in Florida, Arizona, Virginia, or Texas, USA)
Company
The world's largest live entertainment company and global leader in live event ticketing products and services.
What you will do
- Lead reliability consulting from discovery to delivery, aligning stakeholders on priorities and measurable outcomes.
- Define reliability targets and trade-offs using SLOs and error budgets.
- Design and implement reusable reliability mechanisms, templates, and tooling for adoption across teams.
- Drive observability strategy by improving signal quality, alerting philosophy, and operational dashboards.
- Lead complex incident investigations and ensure learnings translate into durable fixes for systemic risks.
- Mentor senior engineers and influence internal platform roadmaps to accelerate SRE adoption.
Requirements
- Deep practical understanding of SRE principles, including SLO governance and error budget policy.
- Strong experience with Kubernetes and AWS, including governance and cost trade-offs.
- Proven ability to design and troubleshoot distributed systems with cross-service failure modes.
- Strong software engineering fundamentals to deliver high-quality changes in enterprise codebases.
- Advanced incident analysis skills focused on systemic risk reduction and organizational learning.
- Must be based in Florida, Arizona, Virginia, or Texas, USA.
Culture & Benefits
- Comprehensive medical, vision, dental, and mental health benefits (HSA/FSA available).
- 401(k) program with company match and stock reimbursement program.
- Generous paid time off, including holidays, sick time, and personal days.
- Unique perks including free concert tickets.
- Career development programs through the School of Live, tuition reimbursement, and student loan repayment.