Senior Storage Engineer (AI)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Senior Storage Engineer (AI): Designing and implementing distributed storage solutions to support scaling data-intensive AI workloads with an accent on exabyte-scale S3-compatible object storage and high-performance data transfer. Focus on optimizing throughput and latency using RDMA and GPU Direct Storage to ensure reliability for demanding AI environments.
Location: Hybrid in Livingston, NJ, New York, NY, Sunnyvale, CA, or Bellevue, WA. Remote work may be considered for candidates located more than 30 miles from an office.
Salary: $143,000 – $210,000
Company
is a specialized cloud provider designed specifically for AI, delivering high-performance infrastructure and tools for AI labs and global enterprises.
What you will do
- Design and implement distributed storage solutions to scale data-intensive AI workloads.
- Develop exabyte-scale, S3-compatible object storage and integrate dedicated clusters into customer environments.
- Optimize storage performance using RDMA, GPU Direct Storage, and protocols like NFS or FUSE.
- Lead efforts to improve reliability, durability, security, and observability of the storage stack.
- Analyze telemetry and system data using ClickHouse, Prometheus, and Grafana to enhance throughput and resilience.
- Mentor other engineers on best practices for building high-performance distributed systems.
Requirements
- 8–10+ years of experience in storage systems engineering or infrastructure.
- Strong hands-on experience with object storage or distributed filesystems (e.g., Ceph, DAOS) in production.
- Proficiency in systems programming languages such as Go, C, or Rust.
- Experience with cloud-native infrastructure, Kubernetes, and scalable architectures.
- Must be a U.S. person (citizen, national, lawful permanent resident, refugee, or asylee) to comply with U.S. Government export regulations.
- Bachelor’s, Master’s, or PhD in Computer Science, Engineering, or a related field.
Nice to have
- Expertise in data persistence on physical media.
- Deep knowledge of high-performance data transfer using RDMA.
- Experience with resilient distributed systems.
Culture & Benefits
- Comprehensive health coverage with medical, dental, and vision insurance 100% paid by the company.
- Financial security through 401(k) with generous employer match and an Employee Stock Purchase Program (ESPP).
- Flexible PTO, paid parental leave, and family-forming support via Carrot.
- Daily catered lunch at office and data center locations.
- Mental wellness benefits through Spring Health and tuition reimbursement.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →