Site Reliability Engineer (K8s)

Employment type

Full-time

Grade

Middle

English

Country

Vacancy from Hirify RU Global, a list of companies with Eastern European roots

Job description

TL;DR

Site Reliability Engineer (GCP/K8s): Ensuring the availability and performance of GCP-hosted APIs and data infrastructure for the BI team, with an emphasis on service health, observability, and cloud cost optimization. The role focuses on automating incident response, building monitoring dashboards, and improving the reliability of data pipelines and BigQuery workflows.

Location: United Kingdom

Company

hirify.global is a technology company providing data-driven advertising solutions.

What you will do

Monitor and maintain uptime for GCP-hosted APIs and services to meet performance targets.
Lead incident response for BI platform services, including triage, resolution, and post-mortem analysis.
Build and manage observability infrastructure, including dashboards, alerts, and logging across GCP services.
Track GCP cloud spend and implement cost alerting to optimize cloud budgets.
Identify and resolve security gaps in IAP configurations, service account permissions, and API access controls.
Collaborate with backend and data engineers to improve the reliability of data pipelines and BigQuery workflows.

Requirements

2+ years of experience in Site Reliability, DevOps, or Cloud Infrastructure roles in a production environment.
Bachelor's degree in Computer Science, Engineering, or equivalent hands-on experience.
Practical experience with GCP, specifically Cloud Run, API Gateway, and BigQuery.
Experience with monitoring and observability tools such as Cloud Monitoring or Datadog.
Strong understanding of cloud security fundamentals, including IAM and network controls.
Proficiency with Git and version control in a team setting.

Nice to have

Experience with CI/CD pipelines and deployment automation (GitHub Actions, Cloud Build).
Knowledge of Terraform or other infrastructure-as-code tools.
Python proficiency for scripting and automation.
Deep experience with MySQL, Spanner, or BigQuery.
Experience using dbt or Looker.
Ability to work across CET/EST hours in a distributed team.
