Lead Site Reliability Engineer (Azure)
Job description
TL;DR
Lead Site Reliability Engineer (Azure): Keep cloud platforms observable, reliable, and scalable for Public Safety & Justice SaaS solutions, with an emphasis on observability, automation, and infrastructure as code. The role focuses on leading root cause investigations, establishing SLOs/SLAs, and optimizing system performance and cost across microservices architectures.
Location: Hybrid in Southampton, United Kingdom
Company
The company provides state-of-the-art SaaS solutions for the Public Safety & Justice market to a worldwide customer base.
What you will do
- Lead root cause investigations into outages, performance bottlenecks, and cost issues.
- Manage the SRE work backlog and develop reliability improvements, acting as a production gatekeeper.
- Collaborate with DevOps and engineering teams to establish and enforce SLOs, SLAs, and error budgets.
- Install and configure observability platforms including Grafana, Prometheus, and Azure Monitor.
- Develop Bicep modules for monitoring infrastructure and optimize system security and performance.
- Provide technical leadership and oversight to Cloud Operations and Support teams.
Requirements
- 6+ years of experience in Site Reliability Engineering.
- Deep expertise with Azure cloud, Kubernetes (AKS), and containerization.
- Strong proficiency in programming or advanced scripting using Python, PowerShell, or C#.
- Experience with Infrastructure as Code using Bicep, ARM, and Git.
- Proven track record managing monitoring tools like Prometheus, Grafana, and Elasticsearch.
- Knowledge of compliance frameworks such as ISO 27001 or FedRAMP.
Nice to have
- Exposure to Azure DevOps pipelines (CI/CD).
- Experience using AI tools to automate and accelerate workflows.
- Relevant certifications: AZ-104, AZ-305, AZ-500, AZ-700, or CKA.
Culture & Benefits
- Opportunity to work for a market leader in AI, cloud, and digital innovation.
- High-standard environment focused on challenging limits and achieving excellence.
- Collaborative culture within a global company spanning 30+ countries.
- Professional growth through a highly hands-on technical leadership role.