Senior Engineering Manager (Compute)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Senior Engineering Manager (Compute): Leading the development and operational strategy of the compute substrate for planet-scale AI workloads with an accent on reliability, fleet efficiency, and multi-tenant isolation. Focus on building a high-ownership team, driving architectural decisions for large-scale infrastructure, and ensuring the platform remains invisible to customers while handling complex compute requirements.
Location: Must be based in the United States
Compensation: $320,000 - $335,000
Company
provides an open-source durable execution layer that simplifies code and enables reliable application development for the agentic AI era.
What you will do
- Define the strategic direction and standards for the compute layer supporting global AI agents.
- Lead, hire, and mentor a high-ownership engineering team while staying deeply involved in technical design and code reviews.
- Drive the roadmap for next-generation compute platforms based on customer feedback and design-partner requirements.
- Own operational excellence, including on-call rotations, incident response, and blameless postmortems.
- Manage capacity, supply planning, and the cost-per-unit-of-compute profile for the fleet.
- Partner with cross-functional leadership to align priorities and ensure reliable delivery.
Requirements
- Must be based in the United States
- 12+ years in software or infrastructure engineering with 7+ years of people management experience.
- Proven experience leading teams that build and operate large-scale, multi-tenant compute platforms.
- Deep expertise in distributed systems and compute infrastructure.
- Strong operational rigor with a track record of managing live-site reliability and incident response.
- Excellent communication skills for partnering across technical and non-technical teams.
Nice to have
- Experience with MicroVMs (Firecracker, gVisor) or managed-compute primitives like AWS Fargate or GCP Cloud Run.
- Background in building serverless or hosted-compute products from 0 to 1.
- Knowledge of GPU scheduling, fractional GPUs, and accelerated compute infrastructure.
- Experience with multi-cloud delivery across AWS and GCP.
Culture & Benefits
- Comprehensive medical, dental, and vision coverage with 100% premiums paid.
- Unlimited PTO plus 14 holidays per year.
- 401(k) plan with company participation.
- Generous stipends for WFH meals, internet, lifestyle spending, and professional development.
- Support for in-home office setup and company-issued hardware.
- Collaborative, globally distributed team culture with occasional in-person events and offsites.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →