Systems Engineer (HPC)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Systems Engineer (HPC): Designing and operating large-scale infrastructure for AI platforms with an accent on Linux administration, automation, and system reliability. Focus on scaling clusters to thousands of nodes, managing petabyte-scale storage, and optimizing HPC environments for research workloads.
Location: Hybrid in Paris, London, Amsterdam, Barcelona, Madrid, Berlin, Munich, Frankfurt, or Lausanne; Remote options available.
Company
builds high-performance, open, and efficient AI systems designed to power the next generation of applications.
What you will do
- Operate and maintain large-scale Linux environments across bare metal, clusters, and cloud.
- Scale infrastructure toward thousands of nodes and manage petabyte-scale storage systems.
- Automate operational tasks and system lifecycle management using Python, Bash, Ansible, or Terraform.
- Collaborate with HPC, DevOps, and research teams to ensure high availability and performance.
- Monitor system health, troubleshoot complex incidents, and support production workloads.
Requirements
- Strong Linux systems administration experience.
- Experience working with HPC clusters or large-scale cloud infrastructure.
- Proficiency with Job schedulers, specifically Slurm.
- Solid troubleshooting skills across systems, hardware, and networks.
- Must be based in or able to work from one of the specified European locations due to the hybrid nature of the role.
Nice to have
- Experience with containers and orchestration tools like Kubernetes.
- Knowledge of storage systems such as Ceph, Lustre, or NFS.
- Understanding of networking fundamentals (Ethernet; InfiniBand is a plus).
- Practical experience with GPU infrastructure or AI/ML workloads.
Culture & Benefits
- Opportunity to play a pivotal role in scaling cutting-edge AI infrastructure.
- Chance to shape data center operations from the ground up in a high-growth startup.
- Low-ego, collaborative, and highly technical work environment.
- Competitive compensation and comprehensive benefits.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →