Эта вакансия в архиве

Посмотреть похожие вакансии ↓
Company hidden
обновлено 1 месяц назад

HPC Systems Engineer (AI)

Формат работы
remote (только Europe)
Тип работы
fulltime
Грейд
middle
Английский
b2
Страна
UK/Norway/Netherlands

Описание вакансии

Текст:
/

TL;DR

HPC Systems Engineer (AI): Designing, deploying, and operating large-scale HPC clusters and GPU-based compute environments for AI infrastructure with an accent on InfiniBand/Ethernet networking and performance tuning. Focus on automating infrastructure provisioning, optimizing cluster hardware architectures, and ensuring seamless integration of complex compute and storage layers.

Location: Must be based in the EMEA region (Netherlands, Norway, or UK). This role will require 20-30% travel to European sites.

Company

hirify.global is a GPU cloud provider engineered for AI, offering high-performance infrastructure for startups and enterprises to accelerate AI development.

What you will do

  • Design, deploy, and operate large-scale HPC clusters and GPU-compute environments.
  • Create and maintain hardware architectures, including BOMs and rack elevations.
  • Implement and manage HPC scheduling systems like Slurm.
  • Design and optimize high-speed network topologies, including InfiniBand and Ethernet.
  • Automate provisioning, configuration, and operations using Python or Bash.
  • Collaborate with cross-functional teams to troubleshoot cluster performance across compute, storage, and interconnect layers.

Requirements

  • Must be based in the Netherlands, Norway, or the UK with ability to travel to European sites (20-30%).
  • Proven experience in designing and operating large-scale compute clusters.
  • Strong knowledge of Slurm or equivalent workload management systems.
  • Experience with InfiniBand networking, performance tuning, and high-speed Ethernet protocols.
  • Proficiency in scripting with Python or Bash for automation.
  • Strong understanding of hardware BOMs, physical layer design, and rack architecture.

Culture & Benefits

  • Highly competitive compensation package including base salary and equity.
  • Dynamic progression plan tailored to individual ambitions and career growth.
  • Human-first flexible work culture with a remote-first team setup.
  • Opportunity to contribute to cutting-edge AI infrastructure at a high-growth startup.
  • Collaborative and transparent environment with a strong sense of ownership.