Engineering Manager, Serverless Compute Platform
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Engineering Manager, Serverless Compute Platform: Own end-to-end delivery of a greenfield Execution Sandbox service and scale the engineers building it with an accent on platform-wide blast radius, multi-cloud reliability, and unifying CPU/GPU provisioning. Focus on architecting and launching production-grade distributed systems with strong operational rigor, observability, and SLO-driven on-call practices.
Company
Databricks builds a data and AI infrastructure platform used by thousands of organizations worldwide.
What you will do
- Own 0→1 delivery of the Execution Sandbox service from inception to production scale, covering non-Spark compute workloads.
- Unify fragmented CPU/GPU cluster management into a single provisioning service to eliminate parity bugs and ensure consistent product experiences.
- Ensure strong execution health and production-grade reliability across multiple use cases (e.g., GPU onboarding, UDF generalization, managed REPL).
- Collaborate across 5+ partner organizations to align on API contracts and shared milestones across Serverless Platform and related teams.
- Partner with Product Management to shape product strategy for future sandbox-based offerings (e.g., serverless command execution APIs, FaaS-style workloads).
Requirements
- 5+ years managing engineers building and operating distributed systems in production, ideally control-plane or orchestration services.
- BS or higher in Computer Science (or related field) or equivalent practical experience.
- Deep technical fluency in infrastructure systems, including reviewing architecture, challenging tradeoffs (state machine design, API boundaries), and coaching senior ICs.
- Experience deploying and operating services across multiple clouds and/or regions (AWS, Azure, GCP).
- Strong operational rigor: observability, SLOs, pre-mortems, and healthy on-call culture.
- Ability to build and scale a high-caliber team (manage and elevate L3–L5 engineers; hire 2–3 additional engineers).
Culture & Benefits
- Pay range transparency: $180,500–$225,600 USD for local pay range.
- Compensation may include annual performance bonus and equity, plus region-specific benefits.
- Emphasis on operational excellence with observability, SLOs, and structured incident prevention (pre-mortems).
- Support for building and scaling teams with clear ownership boundaries and architectural doctrine.
Hiring process
- Interviews focused on engineering leadership, distributed systems/infrastructure depth, and operational rigor.
- Evaluation of ability to architect, align cross-org on APIs/milestones, and scale execution from preview to production.
Location: Bellevue, Washington
Salary: $180,500 — $225,600 USD
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →