Company hidden
2 hours ago

Systems Engineer (HPC)

Work format
remote (Europe only)/hybrid
Employment type
full-time
English
B2
Country
France/UK/Spain +3 more

Job description

TL;DR

Systems Engineer (HPC): Design and operate large-scale infrastructure for AI platforms, with an emphasis on Linux administration, automation, and system reliability. The role focuses on scaling clusters to thousands of nodes, managing petabyte-scale storage, and optimizing HPC environments for research workloads.

Location: Hybrid in Paris, London, Amsterdam, Barcelona, Madrid, Berlin, Munich, Frankfurt, or Lausanne; Remote options available.

Company

hirify.global builds high-performance, open, and efficient AI systems designed to power the next generation of applications.

What you will do

  • Operate and maintain large-scale Linux environments across bare metal, clusters, and cloud.
  • Scale infrastructure toward thousands of nodes and manage petabyte-scale storage systems.
  • Automate operational tasks and system lifecycle management using Python, Bash, Ansible, or Terraform.
  • Collaborate with HPC, DevOps, and research teams to ensure high availability and performance.
  • Monitor system health, troubleshoot complex incidents, and support production workloads.

Requirements

  • Strong Linux systems administration experience.
  • Experience working with HPC clusters or large-scale cloud infrastructure.
  • Proficiency with job schedulers, specifically Slurm.
  • Solid troubleshooting skills across systems, hardware, and networks.
  • Must be based in or able to work from one of the specified European locations due to the hybrid nature of the role.

Nice to have

  • Experience with containers and orchestration tools like Kubernetes.
  • Knowledge of storage systems such as Ceph, Lustre, or NFS.
  • Understanding of networking fundamentals (Ethernet; InfiniBand is a plus).
  • Practical experience with GPU infrastructure or AI/ML workloads.

Culture & Benefits

  • Opportunity to play a pivotal role in scaling cutting-edge AI infrastructure.
  • Chance to shape data center operations from the ground up in a high-growth startup.
  • Low-ego, collaborative, and highly technical work environment.
  • Competitive compensation and comprehensive benefits.
