2 days ago
Principal Software Engineer (ML Platform)
This vacancy comes directly from the website of a company on the extended list of "global companies for Russian-speaking specialists", which includes companies with Eastern European roots.
These roles usually require English at roughly B2 level and a location outside Russia/Belarus (and/or sole-proprietor status). A VPN may be needed for access.
Job description
Text:
TL;DR
Principal Software Engineer (ML Platform): Architect and lead the core infrastructure for ML model deployment, observability, and lifecycle management, with an emphasis on scalable model serving, GPU/CPU orchestration, and CI/CD workflows. The role focuses on building production-grade ML platforms, enabling rapid iteration, and mentoring engineers in a game development context.
Location: Los Angeles, United States
Company
is a leading game development company focused on creating player-centric gaming experiences and innovative technology platforms.
What you will do
- Architect and implement scalable ML inference infrastructure for live and batch model serving with GPU and CPU orchestration.
- Collaborate with researchers, game teams, and platform engineers to deliver reusable ML solutions.
- Build and maintain CI/CD pipelines for ML artifacts supporting rapid development and production promotion.
- Manage environment and dependency tooling for ML runtimes ensuring security and reliability.
- Instrument platform metrics for observability, model monitoring, and performance SLAs.
- Lead technical strategy, mentor engineers, and contribute to long-term platform architecture and hiring.
Requirements
- 10+ years of software engineering experience with leadership in platform or infrastructure teams
- Proven experience building large-scale distributed and production ML systems
- Expertise with cloud-native systems including Kubernetes, containerization, and autoscaling
- Experience with ML inference serving frameworks and GPU orchestration
- Strong background in CI/CD automation, infrastructure as code, and Python ML ecosystems
- Ability to mentor engineers and influence cross-functional teams
Nice to have
- Experience in real-time or latency-sensitive ML infrastructure
- Familiarity with ML workflow tools and drift monitoring
- Exposure to A/B testing and online model evaluation frameworks
- Experience founding or building greenfield ML platforms
- Passion for game systems and player experience
- Experience with technical deployments in China, especially with Tencent
Culture & Benefits
- Open paid time off policy and flexible work schedules
- Medical, dental, and life insurance coverage
- Parental leave for employees and their families
- 401k retirement plan with company match
- Collaborative teams empowered to bring unique perspectives