Staff SRE (AWS)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Staff SRE (AWS): Build tools, frameworks, and automation for monitoring, reliability, and scalability of on-prem and cloud infrastructure with an accent on observability, incident management, and operational efficiency. Focus on designing SLOs/SLIs, leading post-mortems, reducing toil via IaC and GitOps, and evaluating cloud technologies for production stability.
Location: Remote - Virginia (US-based)
Salary: $183,500 - $232,500
Company
Technology company building solutions to enhance road safety through video telematics and fleet management.
What you will do
- Build monitoring tools and frameworks to ensure high uptime in production environments.
- Mentor SRE team on best practices and foster innovation culture.
- Enhance 24/7 on-call processes, incident management, run-books, and SOPs for cloud services.
- Collaborate with architects, DBAs, developers, and DevOps to integrate reliability and scalability early.
- Lead blameless post-mortems, RCA publications, SLO/SLI definitions, and service owner initiatives.
- Research cloud technologies, reduce operational toil with IaC and GitOps, and resolve production incidents.
Requirements
- 8+ years as SRE in AWS at medium/large scale
- 6+ years with observability tools (Prometheus, New Relic, Grafana)
- Proficiency in Python, Groovy, Bash; database management (SQL/NoSQL)
- 5+ years building pipelines with Git, Terraform, Helm, Jenkins/ArgoCD
- Expertise in AWS services (VPCs, EKS, IAM, EC2, RDS, etc.) and Linux systems
- Experience with Kubernetes, 24/7 on-call rotations, run-books across geo-locations
- Ability to work under pressure in challenging environments
Nice to have
- Managing AWS networks (Direct Connects, Transit Gateways, VPNs, BGP)
- Cloud databases (RDS, Mongo, Elasticsearch, Snowflake)
- AWS, Kubernetes, Linux, Programming, CI/CD certifications
Culture & Benefits
- Medical, dental, vision insurance; HSA, FSA, telehealth
- 401(k) with match, life/AD&D insurance, short/long-term disability
- FTO/PTO, 11 paid holidays + 1 inclusive holiday, volunteer time off
- Employee well-being program, referral program, education reimbursement
- Recognition programs and additional perks
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →