Storage Software Engineer (AI)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Storage Software Engineer (AI Infrastructure): Designing and deploying large-scale distributed storage protocol solutions (object, block, file) to power AI training and inference with an accent on storage performance and reliability. Focus on optimizing storage protocol APIs, integrating NVMe/GPU-direct storage, and orchestrating resources across redundant arrays.
Location: Hybrid. Must be based in or be able to work from San Francisco, Bellevue, or San Jose (Note: San Francisco office requires presence 4 days per week).
Salary: $266,000 – $395,000 per year
Company
is a leader in AI cloud infrastructure, providing GPU compute and superintelligence cloud services to researchers and enterprises.
What you will do
- Design, develop, and maintain software for storage systems with a focus on performance, scalability, and reliability.
- Implement and optimize storage protocol APIs for file (NFS, SMB), block (Fibre Channel), and object (S3) access.
- Develop distributed systems for managing and orchestrating storage resources across multiple solutions and redundant arrays.
- Collaborate with hardware and system architects to integrate software with NVMe and GPU-direct storage.
- Lead a high-performance team through deliberate hiring, upskilling, and performance management.
- Partner with fleet and observability teams to ensure seamless deployment and track SLOs/SLIs.
Requirements
- 10+ years of experience in storage engineering, with at least 5+ years in a management or lead role.
- Proficiency in serving one or more storage protocols: object (S3), block (iSCSI), or file (NFS, SMB, Lustre).
- Strong expertise in systems-level programming and storage performance optimization.
- Familiarity with modern storage technologies such as NVMe, RDMA, and DPUs.
- Professional experience as a storage engineer or storage SRE.
- Must be based in the USA (San Francisco, Bellevue, or San Jose).
Nice to have
- Deep experience with Vast, Weka, and/or NetApp in HPC or AI Infrastructure environments.
- Experience implementing CEPH at a scale greater than 100PB.
- Experience with NVidia SuperNIC DPUs for edge-caching (e.g., GPUDirect Storage).
- Proven track record of driving cross-functional engineering management initiatives.
Culture & Benefits
- Generous cash and equity compensation.
- Comprehensive health, dental, and vision coverage for employees and dependents.
- 401k Plan with 2% company match for USA employees.
- Flexible paid time off plan.
- Wellness and commuter stipends for select roles.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →