Назад
Company hidden
7 месяцев назад

Machine Learning Infrastructure Engineer (AI)

150 000 - 350 000$
Формат работы
onsite
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Machine Learning Infrastructure Engineer (AI): Designing, building, and maintaining training and serving infrastructure for ML research with an accent on maximizing GPU allocation and utilization. Focus on diagnosing cluster issues, managing experiments, and supporting ML research.

Location: Redwood City, CA

Compensation Range: $150K - $350K

Company

hirify.global empowers people to connect, learn and tell stories through interactive entertainment.

What you will do

  • Provide infrastructure support to ML research and product.
  • Build tooling to diagnose cluster issues and hardware failures.
  • Monitor deployments, manage experiments, and support research.
  • Maximize GPU allocation and utilization for serving and training.

Requirements

  • 4+ years of experience supporting infrastructure within an ML environment.
  • Experience in developing tools for diagnosing ML infrastructure problems.
  • Experience with cloud platforms (e.g., Compute Engine, Kubernetes, Cloud Storage).
  • Experience working with GPUs.

Nice to have

  • Experience with large GPU clusters and high-performance computing.
  • Experience with supporting large language model training.
  • Experience with ML frameworks like Pytorch/TensorFlow/JAX.
  • Experience with GPU kernel development.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →