Назад
Company hidden
2 месяца назад

Staff Site Reliability Engineer

Формат работы
remote (только USA)
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Staff Site Reliability Engineer (AWS EKS/Terraform): Lead technical architecture and execution of migration from legacy GCP to scalable AWS EKS infrastructure with an accent on GitOps patterns, SLO-driven reliability, and developer tooling. Focus on designing critical automation in Node.js/Go, evolving K6-based load testing platform, and optimizing Istio service mesh for production demands.

Location: Remote, United States

Company

Sports media platform elevating overlooked communities, athletes, and sports through diverse, inclusive engineering teams using blind recruiting.

What you will do

  • Lead migration from GCP to AWS EKS, architecting core infrastructure with Terraform and GitOps patterns for organization-wide adoption.
  • Champion SLO-driven culture using four Golden Signals for critical user journeys.
  • Design and develop tooling/automation in Node.js and Go to eliminate developer toil.
  • Evolve in-house K6 load testing platform and act as Istio service mesh expert.
  • Spearhead agentic workflows, proactive scaling, and automated remediation initiatives.
  • Participate in on-call rotation, mentor engineers, and drive post-mortems to systemic improvements.

Requirements

  • 8-10+ years in SRE/DevOps/Software Engineering at Staff level.
  • Proven technical leadership mentoring seniors and leading large-scale projects.
  • Expert polyglot coder with deep Node.js/Go experience building critical services.
  • Architectural Kubernetes/EKS expertise including networking and control plane.
  • Terraform IaC expert designing large-scale reusable frameworks.
  • Observability architect with Datadog for SLOs; CI/CD with GitHub Actions.
  • Systems thinker decomposing complex cross-functional problems.

Nice to have

  • Agentic systems and intelligent automation for SRE.
  • Cloud migration leadership (GCP to AWS).
  • K6 performance testing and scaling frameworks.
  • Istio in large multi-tenant environments.
  • Serverless with SST; legacy config deprecation.

Culture & Benefits

  • Code-first SRE culture with shared stability responsibility and pragmatism.
  • Flexible work schedule control and all-hands in Austin, Texas.
  • Comprehensive medical/dental/vision, 401(K) match, disability/life insurance.
  • Progressive parental leave, flexible PTO, hack-a-thons, team events.
  • Equity awards, stocked snacks, catered meals.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →