Staff Site Reliability Engineer
Job description
TL;DR
Staff Site Reliability Engineer (AWS EKS/Terraform): Lead the technical architecture and execution of a migration from legacy GCP to scalable AWS EKS infrastructure, with an emphasis on GitOps patterns, SLO-driven reliability, and developer tooling. Focus on designing critical automation in Node.js/Go, evolving the K6-based load testing platform, and optimizing the Istio service mesh for production demands.
Location: Remote, United States
Company
Sports media platform elevating overlooked communities, athletes, and sports through diverse, inclusive engineering teams using blind recruiting.
What you will do
- Lead migration from GCP to AWS EKS, architecting core infrastructure with Terraform and GitOps patterns for organization-wide adoption.
- Champion an SLO-driven culture using the four Golden Signals (latency, traffic, errors, saturation) for critical user journeys.
- Design and develop tooling/automation in Node.js and Go to eliminate developer toil.
- Evolve the in-house K6 load testing platform and act as the Istio service mesh expert.
- Spearhead agentic workflows, proactive scaling, and automated remediation initiatives.
- Participate in the on-call rotation, mentor engineers, and drive post-mortems toward systemic improvements.
Requirements
- 8-10+ years in SRE/DevOps/Software Engineering at Staff level.
- Proven technical leadership: mentoring senior engineers and leading large-scale projects.
- Expert polyglot coder with deep Node.js/Go experience building critical services.
- Architectural Kubernetes/EKS expertise including networking and control plane.
- Terraform IaC expert designing large-scale reusable frameworks.
- Observability architect with Datadog for SLOs; CI/CD with GitHub Actions.
- Systems thinker decomposing complex cross-functional problems.
Nice to have
- Agentic systems and intelligent automation for SRE.
- Cloud migration leadership (GCP to AWS).
- K6 performance testing and scaling frameworks.
- Istio in large multi-tenant environments.
- Serverless with SST; legacy config deprecation.
Culture & Benefits
- Code-first SRE culture with shared stability responsibility and pragmatism.
- Control over a flexible work schedule; all-hands meetings in Austin, Texas.
- Comprehensive medical/dental/vision, 401(k) match, disability/life insurance.
- Progressive parental leave, flexible PTO, hackathons, team events.
- Equity awards, stocked snacks, catered meals.