Helix AI Engineer, Backend Infrastructure (AI)
Job description
TL;DR
Helix AI Engineer, Backend Infrastructure (Backend): Architect and scale cloud backend systems for real-time streaming of media and sensor data, with an emphasis on low-latency pipelines and high-throughput infrastructure. The role focuses on integrating ML model serving into real-time pipelines and ensuring system reliability for robot fleet operations.
Location: Onsite in San Jose, CA (5 days/week in-office collaboration required)
Salary: $150,000 – $400,000
Company
An AI robotics company developing a general-purpose humanoid designed for commercial and home tasks.
What you will do
- Architect and scale cloud backend infrastructure for high-concurrency, real-time streaming of media and sensor data.
- Design low-latency data pipelines to ingest, route, and process high-bandwidth streams into the AI stack.
- Own reliability, latency, and throughput SLAs for streaming and data infrastructure.
- Collaborate with AI and robotics teams to integrate ML model serving into real-time pipelines.
- Build observability, alerting, and tooling for live robot traffic.
- Drive architectural decisions and mentor engineers across the team.
Requirements
- Deep experience scaling cloud backend systems handling high-concurrency, real-time data streams (media, sensor, telemetry).
- Strong fundamentals in distributed systems, stream processing, and low-latency architecture.
- Proficiency in one or more backend languages: Go, C++, Python, or Rust.
- Experience with cloud platforms (AWS, GCP, or Azure) and containerized infrastructure.
- Strong communication and cross-functional collaboration skills.
- Must be based in San Jose, CA, or able to work onsite there 5 days per week.
Nice to have
- Experience integrating AI inference serving (Triton, TensorRT, SageMaker) into real-time pipelines.
- Background in robotics, autonomous vehicles, or latency-critical streaming domains.
- Familiarity with WebRTC, RTSP, gRPC, or Kafka for real-time data transport.
- Experience with on-device or edge inference and cloud vs. edge processing tradeoffs.