Vice President of Infrastructure (AI)
Job description
TL;DR
Vice President of Infrastructure (AI): Building and scaling the foundational infrastructure for an open-access AI cloud platform, with an emphasis on GPU orchestration, distributed systems, and platform reliability. The role centers on leading engineering teams while remaining hands-on with architecture design, debugging critical production issues, and driving infrastructure strategy.
Location: Must be based in or able to work from San Francisco, CA (Hybrid)
Company
The company is an AI cloud platform startup democratizing access to computing power through an innovative GPU marketplace and inference service.
What you will do
- Lead the design and evolution of the AI cloud platform, including GPU orchestration and compute scheduling.
- Build and scale large GPU clusters to support customer AI training and inference workloads.
- Remain deeply involved in technical direction by contributing directly to architecture reviews and infrastructure design.
- Establish SRE and Platform Engineering functions, defining reliability standards and operational excellence.
- Recruit and develop world-class infrastructure teams while fostering a high-performance engineering culture.
- Manage infrastructure budgets, vendor relationships, and long-term capacity planning.
Requirements
- 12+ years of experience building and operating large-scale infrastructure systems.
- Proven track record of leading infrastructure organizations while maintaining hands-on technical involvement.
- Deep expertise in Kubernetes, Linux, networking, and distributed systems.
- Experience building or operating GPU-native cloud infrastructure or AI/ML compute platforms.
- Experience scaling infrastructure within high-growth startup environments.
- Must be able to work in a hybrid capacity in San Francisco, CA.
Nice to have
- Experience with GPU scheduling tools like Slurm, Ray, or Kubernetes GPU operators.
- Background in managing thousands of GPUs in production environments.
- Expertise in Infrastructure-as-Code and advanced observability frameworks.
Culture & Benefits
- Opportunity to lead infrastructure at a Series A startup led by PhD-level founders.
- Direct impact on democratizing AI and redefining cloud computing.
- Collaborative environment focused on ownership, execution, and technical excellence.
- Commitment to diversity and an inclusive workplace culture.