Эта вакансия в архиве
Посмотреть похожие вакансии ↓обновлено 1 месяц назад
HPC Systems Engineer (AI)
Описание вакансии
Текст:
TL;DR
HPC Systems Engineer (AI): Designing, deploying, and operating large-scale HPC clusters and GPU-based compute environments for AI infrastructure with an accent on InfiniBand/Ethernet networking and performance tuning. Focus on automating infrastructure provisioning, optimizing cluster hardware architectures, and ensuring seamless integration of complex compute and storage layers.
Location: Must be based in the EMEA region (Netherlands, Norway, or UK). This role will require 20-30% travel to European sites.
Company
is a GPU cloud provider engineered for AI, offering high-performance infrastructure for startups and enterprises to accelerate AI development.
What you will do
- Design, deploy, and operate large-scale HPC clusters and GPU-compute environments.
- Create and maintain hardware architectures, including BOMs and rack elevations.
- Implement and manage HPC scheduling systems like Slurm.
- Design and optimize high-speed network topologies, including InfiniBand and Ethernet.
- Automate provisioning, configuration, and operations using Python or Bash.
- Collaborate with cross-functional teams to troubleshoot cluster performance across compute, storage, and interconnect layers.
Requirements
- Must be based in the Netherlands, Norway, or the UK with ability to travel to European sites (20-30%).
- Proven experience in designing and operating large-scale compute clusters.
- Strong knowledge of Slurm or equivalent workload management systems.
- Experience with InfiniBand networking, performance tuning, and high-speed Ethernet protocols.
- Proficiency in scripting with Python or Bash for automation.
- Strong understanding of hardware BOMs, physical layer design, and rack architecture.
Culture & Benefits
- Highly competitive compensation package including base salary and equity.
- Dynamic progression plan tailored to individual ambitions and career growth.
- Human-first flexible work culture with a remote-first team setup.
- Opportunity to contribute to cutting-edge AI infrastructure at a high-growth startup.
- Collaborative and transparent environment with a strong sense of ownership.
Похожие вакансии
1 день назад
HPC Engineer (AI)
350 000 - 450 000$
7 дней назад
Senior Systems HPC Engineer
18 часов назад
Lead Quantum/HPC Integration Engineer (Quantum Computing)
3 дня назад
Staff Engineer, Datacenter Server Lifecycle (AI)
320 000 - 405 000$
3 дня назад
IT Systems and Support Engineer (AI)
4 дня назад
Professional Services Engineer (HPC)
88 000 - 117 000$