Назад
15 дней назад

Infrastructure Operations Manager (AI)

Формат работы
onsite
Тип работы
fulltime
Грейд
lead
Английский
b2
Страна
Iceland
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Infrastructure Operations Manager (AI/HPC): Managing and optimizing the physical and technical infrastructure of an AI-focused GPU data center with an accent on service reliability, HPC hardware deployment, and team leadership. Focus on ensuring 24/7 operational uptime, managing power and cooling systems, and coordinating with vendors to maintain high-performance AI workloads.

Location: On-site in Iceland (Datacenter)

Company

Nscale is a GPU cloud provider engineered for AI, providing high-performance infrastructure for startups and enterprises.

What you will do

  • Oversee overall accountability for all devices and infrastructure in the AI Data Center to ensure 24/7 reliability.
  • Manage power, cooling, and environmental conditions to proactively mitigate operational risks.
  • Lead, mentor, and schedule a team of engineers and technicians.
  • Serve as the primary point of contact for clients regarding SLAs and KPIs.
  • Coordinate procurement and maintenance of spare parts and GPU/HPC hardware with external vendors.
  • Monitor resource utilization and implement improvements for scalability and cost-effectiveness.

Requirements

  • Bachelor’s degree in Computer Science, Engineering, or a related field.
  • 5+ years of experience managing data centers, specifically in HPC and GPU environments.
  • Proven experience in team leadership and personnel development.
  • Strong technical knowledge of power, cooling, and environmental systems in data centers.
  • Ability to work on-site in Iceland and handle on-call duties.

Nice to have

  • Certifications in data center management (e.g., CDCP, CDCS).
  • Hands-on experience with NVIDIA GPUs, CUDA, and AI frameworks.
  • Familiarity with hybrid cloud/HPC environments.

Culture & Benefits

  • Competitive compensation package including base salary and equity.
  • Annual performance and salary reviews every 12 months.
  • Opportunity to work in a fast-growing AI tech startup.
  • Dynamic progression plan tailored to individual ambitions.
  • Human-first flexibility and autonomy in shaping your workday.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →