Staff Software Engineer (Kubernetes)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Staff Software Engineer (Kubernetes): Building and operating the foundational Kubernetes infrastructure that powers the AI platform across multi-tenant SaaS and on-prem environments with an accent on unified deployment experiences and zero-downtime upgrades. Focus on designing scalable control plane services, implementing GitOps patterns, and mentoring engineers to improve developer velocity.
Location: Boston, Seattle, San Francisco (US), Kyiv, Lviv (Ukraine), Remote Canada (ON), Remote Poland. Must be available to attend in-person company trainings and meetings.
Company
delivers an AI platform that enables organizations to develop, deliver, and govern predictive and generative AI at scale.
What you will do
- Architect and implement scalable, secure Kubernetes-based infrastructure for multi-cloud and hybrid environments.
- Lead technical direction for core Fleet initiatives, including control plane services, tenancy models, and deployment pipelines.
- Mentor engineers across the team to foster a strong engineering culture of ownership and excellence.
- Drive modernization efforts by introducing GitOps, Policy-as-Code (Kyverno), Cilium networking, and autoscaling.
- Collaborate with SRE, Platform, and Application teams to align infrastructure capabilities with product demands.
- Champion best practices in CI/CD, reliability, and container lifecycle management.
Requirements
- 7-10+ years of engineering experience, with 5+ years in infrastructure, platform, or backend systems.
- Deep expertise in Kubernetes internals, including networking, scheduling, scaling, and controller patterns.
- Strong proficiency in Go or Python for building production-quality, reliable, and observable systems.
- Experience operating across multiple cloud providers (AWS, GCP, Azure) and/or hybrid environments.
- Strong experience with Helm, container orchestration patterns, and CI/CD automation.
- Comfortable working with IaC (Terraform, Pulumi) and GitOps workflows.
Nice to have
- Familiarity with Cilium, Kyverno, KEDA, Gateway API, or OPA.
- Experience building and running multi-tenant SaaS platforms or on-prem delivery models.
- Experience with performance tuning for large-scale data or compute workloads.
- Experience working with GPU infrastructure for training and inference.
Culture & Benefits
- Comprehensive benefits package including Medical, Dental, and Vision Insurance.
- Flexible Time Off Program and Paid Holidays.
- Paid Parental Leave and Global Employee Assistance Program (EAP).
- High-performance culture based on rigor, overcommunication, and continuous improvement.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →