Director - Host Software (AI/HPC)
Job description
TL;DR
Director - Host Software (AI/HPC): Lead the engineering team responsible for the host-side software ecosystem, including Linux kernel drivers and high-performance transport libraries, with an emphasis on system-level performance and ecosystem integration. The role focuses on designing low-latency communication paths, optimizing AI/HPC middleware, and scaling networking solutions for GPU/CPU clusters.
Location: Remote (Must be based in the United States)
Company
delivers high-performance scale-out networking solutions for AI and HPC datacenters.
What you will do
- Lead and grow a high-performance host software organization focused on systems programming and ecosystem integration.
- Define and deliver the host software stack for future product generations, aligning capabilities with hardware features.
- Oversee the development of Linux kernel-mode drivers (netdev, RDMA, PCIe) with a focus on low-latency communication.
- Direct the implementation of user-mode libraries and protocol state machines such as libfabric/OFI providers.
- Optimize collective communication libraries (NCCL/RCCL) and MPI/SHMEM for AI/HPC frameworks.
- Partner with hardware, firmware, and switch software teams to ensure end-to-end system stability and performance.
Requirements
- 8+ years of experience in high-performance systems programming in C/C++ on Linux.
- Proven track record in technical leadership or management roles (Team Lead, Manager, or similar).
- Strong understanding of Linux kernel internals and networking transport protocols.
- Hands-on experience with RDMA or high-performance networking concepts.
- Must reside within the United States.
Nice to have
- 12+ years of experience in software engineering with significant time in director-level roles.
- Deep expertise in libfabric/OFI, UCX, or specific interconnects like Omni-Path, InfiniBand, or RoCE.
- Proven track record of meaningful contributions to major open-source projects like the Linux kernel.
- Master’s or PhD in Computer Science, Engineering, or a related discipline.
Culture & Benefits
- Competitive compensation package including equity, cash, and incentives.
- Comprehensive medical, dental, and vision coverage, plus disability and life insurance.
- 401(k) with company match.
- Open Time Off (OTO), sick time, bonding leave, and pregnancy disability leave.
- Flexible work environment collaborating with influential leaders in the semiconductor industry.