Назад
Company hidden
1 месяц назад

Machine Learning Infra Engineer (AI)

150 000 - 300 000$
Формат работы
onsite
Тип работы
fulltime
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Machine Learning Infra Engineer (AI): Building training and inference frameworks for vision models processing enterprise documents with an accent on scaling across multi-node GPU clusters and high-performance serving. Focus on designing distributed systems, developing benchmarks, applying SOTA advances, and creating observability tooling.

Location: On-site in San Francisco office. In-person role required.

Salary: $150K – $300K

Company

hirify.global enables AI teams to ingest unstructured enterprise data like PDFs and spreadsheets using vision models for accurate extraction at scale.

What you will do

  • Build and maintain training and inference stacks emphasizing fast iteration, flexibility, and high performance.
  • Develop benchmarks to identify bottlenecks in training and inference.
  • Explore and apply state-of-the-art advances in training and inference.
  • Design systems for reliable multi-node, multi-GPU training with observability.
  • Scale distributed workloads across GPU clusters for better utilization, reliability, and cost efficiency.
  • Create tooling and abstractions to accelerate ML engineers from experiment to production.

Requirements

  • Strong Python skills and systems engineering background.
  • Comfortable with Kubernetes and distributed training frameworks.
  • High standards for quality, precision, and first-principles problem-solving.
  • Experience with real-world implementation challenges.
  • Ability to thrive in fast-changing, high-growth environments.
  • Effective collaboration across technical and non-technical teams with full ownership.

Nice to have

  • Experience at early-stage or high-growth startups.
  • Contributions to open source training/inference stacks.
  • Excitement for distributed inference on 100s-1000s of GPUs.
  • Focus on technical excellence with business impact.

Culture & Benefits

  • Unlimited PTO for recharging.
  • Daily free lunch with teammates at the office.
  • Reimbursed transportation costs.
  • Generous health, dental, and vision insurance.
  • $150/mo health and wellness budget for gym, classes, etc.
  • Flexible parental leave scheduling.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →