TL;DR
Site Reliability Engineer (AWS): Developing and maintaining a reliable, scalable, and observable cloud-native platform on AWS with an accent on incident response, observability tooling, and infrastructure-as-code. Focus on improving system reliability, reducing MTTR, and partnering with product engineering teams to ensure safe delivery.
Location: Hybrid (London, UK), minimum 60% office attendance required. Work from Abroad policy for 28 days.
Salary: £55,000–£63,000
Company
hirify.global is Europe’s number 1 downloaded rail app, enabling millions of travelers to find and book tickets across 40+ countries.
What you will do
- Develop an understanding of system architecture, dependencies, and failure modes.
- Participate in production incident response, supporting investigation, mitigation, and communication.
- Contribute to post-incident reviews and follow-up actions to improve reliability, scalability, and resilience.
- Design, build, and maintain observability using metrics, logs, events, and traces.
- Support AWS-hosted infrastructure and shared platform services using infrastructure-as-code and CI/CD tooling.
- Collaborate with product engineering teams to ensure services are operationally ready and deployed safely.
Requirements
- Experience with SRE concepts such as SLI, SLO, and error budgets.
- Hands-on experience with observability tooling (e.g., New Relic, Elastic, Grafana).
- Experience working with cloud providers (preferably AWS).
- Experience troubleshooting Linux operating systems.
- Scripting experience in at least one language (preferably Python).
- Understanding of load balancing, reverse proxy, and application architecture concepts.
- Experience with build, deployment & configuration management tooling (e.g., GitHub Actions, Terraform).
Culture & Benefits
- Private healthcare & dental insurance and EV Scheme.
- Generous work from abroad policy (28 days).
- 2-for-1 share purchase plans, extra festive time off, and family-friendly benefits.
- Clear career paths, transparent pay bands, personal learning budgets, and regular learning days.
- Hybrid work model with a minimum of 60% office attendance over a 12-week period.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →