Systems Engineer (HPC)

Формат работы

remote (только Europe)/hybrid

Тип работы

fulltime

Английский

Страна

France/UK/Spain +3 еще

Вакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:

TL;DR

Systems Engineer (HPC): Designing and operating large-scale infrastructure for AI platforms with an accent on Linux administration, automation, and system reliability. Focus on scaling clusters to thousands of nodes, managing petabyte-scale storage, and optimizing HPC environments for research workloads.

Location: Hybrid in Paris, London, Amsterdam, Barcelona, Madrid, Berlin, Munich, Frankfurt, or Lausanne; Remote options available.

Company

hirify.global builds high-performance, open, and efficient AI systems designed to power the next generation of applications.

What you will do

Operate and maintain large-scale Linux environments across bare metal, clusters, and cloud.
Scale infrastructure toward thousands of nodes and manage petabyte-scale storage systems.
Automate operational tasks and system lifecycle management using Python, Bash, Ansible, or Terraform.
Collaborate with HPC, DevOps, and research teams to ensure high availability and performance.
Monitor system health, troubleshoot complex incidents, and support production workloads.

Requirements

Strong Linux systems administration experience.
Experience working with HPC clusters or large-scale cloud infrastructure.
Proficiency with Job schedulers, specifically Slurm.
Solid troubleshooting skills across systems, hardware, and networks.
Must be based in or able to work from one of the specified European locations due to the hybrid nature of the role.

Nice to have

Experience with containers and orchestration tools like Kubernetes.
Knowledge of storage systems such as Ceph, Lustre, or NFS.
Understanding of networking fundamentals (Ethernet; InfiniBand is a plus).
Practical experience with GPU infrastructure or AI/ML workloads.

Culture & Benefits

Opportunity to play a pivotal role in scaling cutting-edge AI infrastructure.
Chance to shape data center operations from the ground up in a high-growth startup.
Low-ego, collaborative, and highly technical work environment.
Competitive compensation and comprehensive benefits.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →