17 часов назад
Lead Site Reliability Engineer (AWS)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
Текст:
TL;DR
Lead Site Reliability Engineer (AWS): Architecting and improving cloud-native systems for a health and wellness platform with an accent on reliability, performance at scale, and automation. Focus on shaping SLIs/SLOs, leading high-severity incident response, and evolving the resilience of distributed AWS environments.
Location: London, UK
Company
is a health and wellness retailer transforming into a product- and platform-led technology organization.
What you will do
- Architect cloud-native systems focusing on reliability, SLIs/SLOs, and capacity planning.
- Lead high-severity incident response and conduct post-incident reviews to drive measurable improvements.
- Mentor SREs and platform engineers while championing DevSecOps and observability practices.
- Develop CI/CD pipelines and Infrastructure-as-Code environments to remove toil and accelerate teams.
- Collaborate with Security and Platform teams to align reliability with overall resilience goals.
- Implement resilience validation via load testing, stress testing, and chaos engineering.
Requirements
- 5–8+ years of experience in SRE, Platform, or Cloud Infrastructure roles.
- Deep expertise in AWS (EC2, EKS, Lambda, VPC, DynamoDB, S3, CloudFront, RDS, IAM, KMS).
- Strong coding proficiency in Python, Go, or Bash for automation.
- Expertise with observability stacks including Datadog, Prometheus, Grafana, and OpenTelemetry.
- Proficiency with Terraform, CloudFormation, or AWS CDK.
- Proven experience in incident response leadership and root-cause analysis.
Nice to have
- Experience mentoring or leading engineers within SRE or platform teams.
- Hands-on experience with load testing, stress testing, and chaos engineering.
- Passion for uplifting engineering culture through tooling and reliability-first thinking.
Culture & Benefits
- Wellbeing benefits: Health Cash Plan, Life Assurance, Virtual GP, and Private Medical care.
- Financial perks: Bonus scheme based on company and personal performance and Pension Contribution scheme.
- Discounts: 25% colleague discount and annual product allowance.
- Learning & Development: Access to Level 2-5 Apprenticeships, workshops, and a Digital Learning Library.
- Lifestyle: Access to 'Wellhub' with gyms, studios, and wellbeing apps.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →
Похожие вакансии
Checkr
3 дня назад
Software Engineer, Reliability (SRE)
3 дня назад
Senior Site Reliability Engineer (SecOps)
6 дней назад
Senior Platform Engineer (AWS)
3 дня назад
Site Reliability Engineer (SRE) (AI)
Wheely
2 дня назад
Infrastructure Engineer (DevOps)
4 дня назад
Site Reliability Engineer (AI)
142 696 - 158 303$