Назад
Company hidden
2 дня назад

ML Engineer, Post-Training and Evaluation (AI)

Формат работы
onsite
Тип работы
fulltime
Грейд
middle
Английский
b2
Страна
US
Релокация
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

ML Engineer, Post-Training and Evaluation (AI): Adapting open-weight models for enterprise customers through fine-tuning and evaluation with an accent on SFT, preference optimization, and RLHF. Focus on building evaluation harnesses, creating reproducible data pipelines, and deploying adapted models to production.

Location: On-site in San Francisco or New York. Relocation support is provided.

Company

Developing open-weight superintelligence models for individuals, agents, and enterprises.

What you will do

  • Fine-tune open-weight models for specific customer use cases using SFT, DPO, and RLHF.
  • Design and maintain evaluation infrastructure, including eval suites and test set curation.
  • Develop reproducible data pipelines to clean and format raw customer inputs.
  • Debug training and inference issues by analyzing loss curves and training dynamics.
  • Deploy fine-tuned models across public cloud, VPC, and on-premises environments.
  • Establish best practices and benchmarks for the company's fine-tuning and evaluation playbooks.

Requirements

  • 3+ years of engineering experience with significant exposure to applied ML or MLE.
  • Hands-on experience with LLM fine-tuning, including dataset preparation and training loops.
  • Strong software engineering fundamentals in Python.
  • Proficiency with GPU compute management and training infrastructure.
  • Experience working in customer-facing environments to translate requirements into training strategies.
  • Must be based in or relocate to San Francisco or New York

Culture & Benefits

  • Top-tier salary and equity package.
  • Comprehensive medical, dental, vision, life, and disability insurance.
  • Fully paid parental leave and financial support for family planning.
  • Daily provided lunch and dinner.
  • Relocation support and regular team off-sites.
  • High-agency environment within a small, talent-dense team of researchers from DeepMind, OpenAI, and Meta.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →