Senior HPC Developer (RDMA Networking)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Senior HPC Developer (C++/RDMA): Building and optimizing high-performance GPU and networking subsystems for AI fabrics with an accent on cross-stack observability and workload fault tolerance. Focus on debugging performance issues across kernel, driver, and network layers to maximize GPU cluster utilization.
Location: On Site, Palo Alto, California
Salary: $150,000 - $230,000
Company
Systems is pioneering a software-driven approach to AI fabrics to increase GPU cluster utilization through cross-stack observability and performance acceleration.
What you will do
- Build and optimize high-performance GPU and networking subsystems.
- Work with collective communication libraries and algorithms for multi-node, multi-GPU workloads.
- Debug performance issues across kernel, driver, GPU, and network layers.
- Develop and improve GPU-aware networking solutions.
- Profile, analyze, and tune system performance using low-level tooling.
- Collaborate with a small engineering team and take ownership of core systems.
Requirements
- 5+ years of experience in systems, HPC, or performance-critical software development.
- Strong proficiency in low-level C/C++.
- Solid understanding of RDMA networking, including InfiniBand, RoCE, and IBVerbs.
- Experience working with multi-node, multi-GPU workloads.
- Familiarity with collective communication libraries and communication algorithms.
- Ability and willingness to debug complex issues across hardware and software boundaries.
Nice to have
- Experience with congestion control mechanisms such as DCQCN.
- Exposure to GPU-aware networking or advanced communication optimizations.
- Experience with performance profiling, tracing, or observability tooling.
- Background in AI infrastructure, HPC clusters, or distributed systems.
Culture & Benefits
- Challenging projects in a fast-moving startup environment.
- Friendly and inclusive workplace culture.
- Competitive compensation and a comprehensive benefits package.
- Catered lunch.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →