Site Reliability Technical Lead (EdTech)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Site Reliability Technical Lead (EdTech): Leading the site reliability team to design and maintain robust, scalable, and secure infrastructure for school management tools with an accent on system architecture, observability, and automation. Focus on mentoring engineers, driving reliability standards, and optimizing incident response frameworks to ensure high-performance service delivery.
Location: Remote (Must be based in the UK, no visa sponsorship provided)
Salary: £80,000 - £90,000
Company
provides MIS and school management tools to over 7,000 schools, focusing on transforming school operations and improving staff wellbeing through actionable data insights.
What you will do
- Define and guide system architecture to balance scalability, maintainability, and security.
- Champion system observability and ensure adherence to Service Level Objectives (SLOs).
- Lead Root Cause Analysis (RCA) and optimize incident response frameworks.
- Drive automation initiatives to reduce operational toil and improve system efficiency.
- Mentor and coach engineers to foster a culture of quality, reliability, and technical excellence.
- Collaborate with Product and Engineering Managers to align technical direction with product strategy.
Requirements
- Must have the right to work in the UK (no visa sponsorship).
- Extensive professional experience in SRE, DevOps, or Platform Engineering on complex, scalable systems.
- Deep expertise with AWS and distributed cloud architectures.
- Proven experience operating platforms serving high request volumes (~1000 req/sec).
- Advanced proficiency with Terraform and configuration management tools.
- Strong programming skills in Python, Go, or similar languages.
- Expert understanding of distributed systems, microservices, containerization (Docker/Kubernetes), and CI/CD pipelines.
Nice to have
- Experience with chaos engineering and reliability testing.
- Knowledge of security best practices and compliance frameworks.
- Background in agile and lean methodologies.
- Contributions to open-source projects or the SRE community.
Culture & Benefits
- 32 days holiday (25 days annual leave + 7 company-wide days).
- Private Dental Insurance with Bupa and AIG Smart Health wellness benefits.
- Enhanced maternity, adoption, and paternity pay.
- Salary sacrifice pension scheme.
- Dedicated professional development training budget.
- Flexible working environment with a focus on wellbeing and mental health support.
Hiring process
- Phone screen.
- 1st stage interview.
- 2nd stage interview.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →