Company hidden
3 days ago

Senior HPC Developer (RDMA Networking)

$150,000 - $230,000
Work format: On-site
Employment type: Full-time
Grade: Senior
English: B2
Country: US
Listing from Hirify.Global, a list of international tech companies

Job description

TL;DR

Senior HPC Developer (C++/RDMA): Building and optimizing high-performance GPU and networking subsystems for AI fabrics, with an emphasis on cross-stack observability and workload fault tolerance. The role focuses on debugging performance issues across kernel, driver, and network layers to maximize GPU cluster utilization.

Location: On-site, Palo Alto, California

Salary: $150,000 - $230,000

Company

hirify.global Systems is pioneering a software-driven approach to AI fabrics to increase GPU cluster utilization through cross-stack observability and performance acceleration.

What you will do

  • Build and optimize high-performance GPU and networking subsystems.
  • Work with collective communication libraries and algorithms for multi-node, multi-GPU workloads.
  • Debug performance issues across kernel, driver, GPU, and network layers.
  • Develop and improve GPU-aware networking solutions.
  • Profile, analyze, and tune system performance using low-level tooling.
  • Collaborate with a small engineering team and take ownership of core systems.

Requirements

  • 5+ years of experience in systems, HPC, or performance-critical software development.
  • Strong proficiency in low-level C/C++.
  • Solid understanding of RDMA networking, including InfiniBand, RoCE, and IBVerbs.
  • Experience working with multi-node, multi-GPU workloads.
  • Familiarity with collective communication libraries and communication algorithms.
  • Ability and willingness to debug complex issues across hardware and software boundaries.

Nice to have

  • Experience with congestion control mechanisms such as DCQCN.
  • Exposure to GPU-aware networking or advanced communication optimizations.
  • Experience with performance profiling, tracing, or observability tooling.
  • Background in AI infrastructure, HPC clusters, or distributed systems.

Culture & Benefits

  • Challenging projects in a fast-moving startup environment.
  • Friendly and inclusive workplace culture.
  • Competitive compensation and a comprehensive benefits package.
  • Catered lunch.
