Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Head of Infrastructure Operations (AI): Leading end-to-end operational management of GPU data center portfolios with an accent on operational excellence, safety, and reliability. Focus on scaling physical infrastructure, managing cross-functional regional teams, and ensuring compliance for a high-performance AI cloud platform.
Location: Must be based in the US. Regular travel to data centers is essential.
Salary: $180,000 - $277,000 USD
Company
Nscale is a GPU cloud provider engineered for AI, delivering cost-effective, high-performance infrastructure for AI startups and large enterprises.
What you will do
- Own the strategic vision and execution of data center infrastructure operations across the region to align with business growth.
- Build, mentor, and lead high-performing operations teams across multiple data center sites.
- Oversee physical facility performance, including power distribution, cooling systems, security, and asset inventory.
- Establish SLOs/SLIs for availability and lead root-cause analysis and remediation for operational failures.
- Manage critical vendor relationships, oversee SLAs, and handle procurement for equipment and maintenance.
- Partner with Engineering, Security, and Finance teams on capacity planning and site commissioning.
Requirements
- 10+ years of experience in data center operations, infrastructure, or facilities management at scale.
- Proven track record of leading regional or multi-site operations in high-growth environments.
- Deep technical understanding of power systems, cooling, networking, and physical security.
- Familiarity with compliance frameworks such as ISO 22237, ISO 27001, SOC 2, and ISO 22301.
- Understanding of GPU/HPC infrastructure and the specific operational needs of AI cloud platforms.
- Must be based in the US.
Nice to have
- Experience with Palantir Foundry or similar data platforms for operational analytics.
- Background in sustainability and energy efficiency optimization.
- Knowledge of Kubernetes, container orchestration, or hybrid cloud architectures.
- Security certifications or experience with GRC tooling.
Culture & Benefits
- Highly competitive compensation package including base salary and equity.
- Human-first flexibility and autonomy to shape your own workday.
- Comprehensive benefits including medical, dental, vision, paid time off, and retirement plans.
- Dynamic progression plans and the opportunity to work at a fast-growing AI tech startup.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →