Middle/High-Middle DevOps/SRE Engineer (Gamedev)
Job description
TL;DR
Middle/High-Middle DevOps/SRE Engineer (Gamedev): Operate and improve a production platform on GCP/GKE fronted by Cloudflare, with an emphasis on reliability improvements, monitoring coverage, and CI/CD workflows. Focus on implementing IaC via Terraform/Helm, expanding Datadog observability, incident response, security basics, and cost optimization in high-load systems with traffic spikes.
Location: Onsite in Lisbon, Portugal
Company
The company helps game developers achieve financial and creative independence by providing solutions to launch, run, and grow their businesses.
What you will do
- Operate and support production systems on GCP, primarily GKE and managed services, executing improvements delegated by seniors.
- Implement infrastructure changes via Terraform (and Terragrunt), maintain Helm charts and Kubernetes manifests.
- Improve GitHub Actions/CI/CD workflows, deployment automation, and reliability.
- Build and maintain Datadog dashboards/monitors, close monitoring gaps, reduce noisy alerts.
- Participate in incident response: triage, mitigation, escalation, postmortems.
- Run security tooling, triage findings, support secure practices, identify cost optimizations.
Requirements
- Hands-on production experience with Kubernetes (ideally GKE) and basic cluster operations.
- Working experience with Terraform and Helm in PR-based workflows.
- Familiarity with GCP services (Cloud SQL, BigQuery, BigTable, Pub/Sub, Cloud Run, Memorystore).
- Monitoring/alerting and troubleshooting skills (preferably Datadog).
- Strong scripting/automation mindset, reliability awareness under SLA constraints.
- Cloudflare basics, experience with runbooks/postmortems, exposure to SOC 2/PCI-DSS.
Nice to have
- Experience in high-load consumer products or game dev.
Culture & Benefits
- Cloud-only, high-load environment with real engineering challenges.
- Small team with ownership, autonomy, quick iteration.
- Strong opportunity to grow into platform ownership and SRE leadership.
- Direct impact on reliability, scalability, developer velocity.