Senior Site Reliability Engineer
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Senior Site Reliability Engineer: Driving architectural change and evolving ’s Kubernetes platform, designing, upgrading and optimising clusters at scale. Focus on troubleshooting complex production issues, designing automation tools, and enhancing AWS estate for cost efficiency and security.
Location: Hybrid, based in London, England, United Kingdom, with a minimum of 2-3 days per week in-office depending on level
Company
is the UK's #1 credit score and report app, helping over 20 million users globally make better financial decisions through cutting-edge technology and insightful analytics.
What you will do
- Drive architectural change through RFCs, architecture forums, and platform-wide initiatives to improve reliability, scalability, and efficiency.
- Lead and evolve ’s Kubernetes platform, designing, upgrading, and optimising clusters at scale.
- Troubleshoot and resolve complex production issues independently using deep understanding of distributed systems and containerisation.
- Design and contribute to Kubernetes controllers and automation tools to enhance infrastructure and developer experience.
- Enhance the AWS estate, ensuring cost efficiency, security, and scalability while promoting best practices.
- Build and maintain CI/CD pipelines from scratch for new use cases, manage migrations, and introduce new tooling.
Requirements
- Expert-level Kubernetes knowledge, including cluster upgrades, networking (CNI), and container runtimes.
- Strong AWS expertise, covering architecture, networking, and cost management.
- Deep understanding of Linux internals, containerisation, and operating system-level performance tuning.
- Proficiency in at least one compiled language (e.g., Go, Rust, C++) and one interpreted language (e.g., Python, Bash).
- Proven ability to automate infrastructure, deployments, and monitoring with strong scripting skills.
- Experience designing, deploying, and operating distributed systems with complex failure modes.
Culture & Benefits
- 25 paid holidays plus a “duvet day” for your birthday.
- Private health and dental cover, including mental health support through Bupa.
- Up to 6% matched pension and life assurance scheme.
- Flexible hybrid work environment focused on output, not time spent at a screen.
- Regular Lunch and Learns, dog-friendly office, daily breakfast, and free snacks.
- Continued investment into learning and development, including leadership-led training and an in-house psychotherapist.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →