Назад
Company hidden
13 часов назад

Staff Site Reliability Engineer

Формат работы
remote (только USA)
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Staff Site Reliability Engineer (AWS EKS/Terraform): Lead technical architecture and execution of migration from legacy GCP to scalable AWS EKS infrastructure with an accent on GitOps patterns, SLO-driven reliability, and developer tooling. Focus on designing critical automation in Node.js/Go, evolving K6-based load testing platform, and optimizing Istio service mesh for production demands.

Location: Remote, United States

Company

Sports media platform elevating overlooked communities, athletes, and sports through diverse, inclusive engineering teams using blind recruiting.

What you will do

  • Lead migration from GCP to AWS EKS, architecting core infrastructure with Terraform and GitOps patterns for organization-wide adoption.
  • Champion SLO-driven culture using four Golden Signals for critical user journeys.
  • Design and develop tooling/automation in Node.js and Go to eliminate developer toil.
  • Evolve in-house K6 load testing platform and act as Istio service mesh expert.
  • Spearhead agentic workflows, proactive scaling, and automated remediation initiatives.
  • Participate in on-call rotation, mentor engineers, and drive post-mortems to systemic improvements.

Requirements

  • 8-10+ years in SRE/DevOps/Software Engineering at Staff level.
  • Proven technical leadership mentoring seniors and leading large-scale projects.
  • Expert polyglot coder with deep Node.js/Go experience building critical services.
  • Architectural Kubernetes/EKS expertise including networking and control plane.
  • Terraform IaC expert designing large-scale reusable frameworks.
  • Observability architect with Datadog for SLOs; CI/CD with GitHub Actions.
  • Systems thinker decomposing complex cross-functional problems.

Nice to have

  • Agentic systems and intelligent automation for SRE.
  • Cloud migration leadership (GCP to AWS).
  • K6 performance testing and scaling frameworks.
  • Istio in large multi-tenant environments.
  • Serverless with SST; legacy config deprecation.

Culture & Benefits

  • Code-first SRE culture with shared stability responsibility and pragmatism.
  • Flexible work schedule control and all-hands in Austin, Texas.
  • Comprehensive medical/dental/vision, 401(K) match, disability/life insurance.
  • Progressive parental leave, flexible PTO, hack-a-thons, team events.
  • Equity awards, stocked snacks, catered meals.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →