Site Reliability Engineer (AWS)

Формат работы

remote (только United_kingdom)

Тип работы

fulltime

Английский

Страна

Вакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:

TL;DR

Site Reliability Engineer (SRE/DevOps): Maintaining 24/7 service reliability and incident response for global software products with an accent on operational automation and observability. Focus on engineering away repetitive toil, optimizing MTTD/MTTR, and implementing self-healing infrastructure.

Location: Remote (United Kingdom)

Company

hirify.global is a global innovation powerhouse (NASDAQ: hirify.global) providing AI, cloud, and digital software for customer experience and financial crime prevention.

What you will do

Act as a primary or escalation responder in a 24x7 on-call rotation for major incident response and mitigation.
Design and maintain alerting strategies and service health monitoring using Grafana, Prometheus, Datadog, Splunk, or CloudWatch.
Automate repetitive operational tasks and develop scripts in Python, Bash, or Go to reduce manual toil.
Support and troubleshoot Linux-based systems, Kubernetes, and cloud platforms (AWS, Azure, GCP).
Drive blameless post-incident reviews (PIRs) and track corrective actions to improve system reliability.
Partner with engineering teams to optimize system design for better operational readiness.

Requirements

Must be based in the United Kingdom.
Strong experience in Linux systems administration and production support.
Proficiency with cloud infrastructure (AWS preferred) and container orchestration (Kubernetes, Docker).
Scripting or programming experience in Python, Bash, Go, or similar.
Solid understanding of networking fundamentals, including DNS, TCP/IP, and load balancing.
Experience working in 24x7 NOC or production operations environments.

hirify.global-to-have">hirify.global to have

Experience defining and operating according to SLIs and SLOs.
Proficiency with Infrastructure as Code (IaC) tools such as Terraform and Ansible.
Exposure to security, compliance, or regulated environments.
Prior experience migrating from a traditional NOC to an SRE model.

Culture & Benefits

Remote work arrangement.
Opportunity to work at a market leader used by 85 of the Fortune 100 corporations.
Ambitious, high-standard environment focused on challenging limits.
Equal opportunity employer with a diverse global team across 30+ countries.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →

Похожие вакансии

Site Reliability Engineer (AWS)

Мэтч & Сопровод

Описание вакансии

TL;DR

Company

What you will do

Requirements

hirify.global-to-have">hirify.global to have

Culture & Benefits

Похожие вакансии

Site Reliability Engineer

Site Reliability Engineer (AI)

Site Reliability Technical Lead (AWS)

Site Reliability Engineer (AI)

Site Reliability Engineer (Web3)

Senior DevOps Engineer (Voice)

Разработка

Game Dev

Design и Creative

Аналитика

Менеджмент

People & Business