Staff SRE (Platform Engineering)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Staff SRE (Platform Engineering): Building and optimizing Project Volcano, an internal developer platform for 's engineering ecosystem with an accent on reliability, multi-region Kubernetes infrastructure, and managed data services. Focus on establishing SRE practices from the ground up, designing GitOps pipelines, and scaling multi-tenant PostgreSQL clusters.
Location: Remote (United States)
Company
is a leading developer of API and AI connectivity technologies, providing infrastructure that powers the agentic era through its unified platform, Konnect.
What you will do
- Own end-to-end reliability for Volcano services, defining SLOs, error budgets, and incident response practices.
- Architect multi-region Kubernetes infrastructure, networking, and data planes for edge deployment pipelines.
- Establish GitOps and CI/CD backbones using ArgoCD, Helm, and Terraform/Terragrunt.
- Scale and harden multi-tenant PostgreSQL clusters, Redis caching layers, and object storage.
- Implement comprehensive observability using Datadog, Prometheus, and Grafana.
- Mentor engineers on reliability principles and foster a blameless engineering culture through postmortems.
Requirements
- BS in Computer Science or equivalent degree.
- Substantial experience at Staff or Principal IC level in SRE or Platform Engineering.
- Proven track record of building SRE/platform engineering practices for developer-facing PaaS/SaaS products.
- Deep expertise in Kubernetes, including multi-tenant cluster design, networking (CNI, service mesh), and security hardening.
- Must be based in the United States.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →