Назад
Company hidden
1 день назад

Senior ML Inference Engineer (AI)

128 700 - 261 300$
Формат работы
remote (только USA)/hybrid
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
US
Релокация
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Senior ML Inference Engineer (AI/Platform): Building and operating the ML deployment platform that automates the path from trained models to on-vehicle inference for autonomous vehicles with an accent on automation, reliability, and developer experience. Focus on designing agentic tools to diagnose deployment issues and implementing shift-left validation to surface risks early in the cycle.

Location: Remote within the USA, with a hybrid requirement (report to office 3 times a week) for candidates living within a specific radius of GM hubs (Austin, Mountain View, or Sunnyvale).

Salary: $128,700 – $261,300

Company

hirify.global (GM) is an automotive leader developing autonomous vehicle technology to create a world with zero crashes, emissions, and congestion.

What you will do

  • Design and operate the ML deployment platform that automates the transition from trained models (e.g., PyTorch) to on-vehicle inference.
  • Partner with model development teams to drive the deployment of high-value models into the autonomous vehicle stack.
  • Build agentic and LLM-powered tools to diagnose and automate manual deployment-blocking workflows.
  • Develop the developer experience layer, including tooling, dashboards, and observability for ML teams.
  • Implement shift-left validation to identify compile, runtime, and latency risks early in the development cycle.
  • Integrate optimizations from kernels and compiler teams directly into the deployment workflow.

Requirements

  • BS, MS, or PhD in Computer Science or a related technical field.
  • 3+ years of industry experience in building or operating production platform or infrastructure systems.
  • Strong coding proficiency in Python and experience with ML model deployment and inference integration.
  • Experience using coding agents (e.g., Cursor, Claude Code, GitHub Copilot) as part of the engineering workflow.
  • Must be based in the US or eligible for relocation to the US.
  • Ability to travel up to 25%.

Nice to have

  • Experience with orchestration frameworks such as Airflow, Temporal, Flyte, Ray, or Kubeflow.
  • Familiarity with the NVIDIA GPU stack (CUDA, TensorRT, Triton, torch.compile, ONNX).
  • Experience with inference-serving frameworks like vLLM, TorchServe, or Ray Serve.
  • Background in autonomous vehicles, robotics, or safety-critical ML domains.
  • Open-source contributions to PyTorch, Ray, Airflow, or related projects.

Culture & Benefits

  • Comprehensive health, dental, and vision programs, including HSA and FSA options.
  • Retirement savings plan, life insurance, and tuition assistance.
  • Paid vacation, holidays, and sickness/accident benefits.
  • GM vehicle discounts.
  • Hybrid work flexibility for those near company hubs.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →