20 hours ago
Site Reliability Engineer (K8s)
Job description
TL;DR
Site Reliability Engineer (GCP/K8s): Ensuring the availability and performance of GCP-hosted APIs and data infrastructure for the BI team, with an emphasis on service health, observability, and cloud cost optimization. Focus on automating incident response, building monitoring dashboards, and improving the reliability of data pipelines and BigQuery workflows.
Location: United Kingdom
Company
A technology company providing data-driven advertising solutions.
What you will do
- Monitor and maintain uptime for GCP-hosted APIs and services to meet performance targets.
- Lead incident response for BI platform services, including triage, resolution, and post-mortem analysis.
- Build and manage observability infrastructure, including dashboards, alerts, and logging across GCP services.
- Track GCP cloud spend and implement cost alerting to optimize cloud budgets.
- Identify and resolve security gaps in IAP configurations, service account permissions, and API access controls.
- Collaborate with backend and data engineers to improve the reliability of data pipelines and BigQuery workflows.
Requirements
- 2+ years of experience in Site Reliability, DevOps, or Cloud Infrastructure roles in a production environment.
- Bachelor's degree in Computer Science, Engineering, or equivalent hands-on experience.
- Practical experience with GCP, specifically Cloud Run, API Gateway, and BigQuery.
- Experience with monitoring and observability tools such as Cloud Monitoring or Datadog.
- Strong understanding of cloud security fundamentals, including IAM and network controls.
- Proficiency with Git and version control in a team setting.
Nice to have
- Experience with CI/CD pipelines and deployment automation (GitHub Actions, Cloud Build).
- Knowledge of Terraform or other infrastructure-as-code tools.
- Python proficiency for scripting and automation.
- Deep experience with MySQL, Spanner, or BigQuery.
- Experience using dbt or Looker.
- Ability to work across CET/EST hours in a distributed team.