ML Engineer, Post-Training and Evaluation (AI)

Work format

On-site

Employment type

Full-time

Grade

Middle

English

Country

Relocation

Vacancy from Hirify Global, a list of international tech companies

Job description

TL;DR

ML Engineer, Post-Training and Evaluation (AI): Adapt open-weight models for enterprise customers through fine-tuning and evaluation, with an emphasis on SFT, preference optimization, and RLHF. The role focuses on building evaluation harnesses, creating reproducible data pipelines, and deploying adapted models to production.

Location: On-site in San Francisco or New York. Relocation support is provided.

Company

The company develops open-weight superintelligence models for individuals, agents, and enterprises.

What you will do

Fine-tune open-weight models for specific customer use cases using SFT, DPO, and RLHF.
Design and maintain evaluation infrastructure, including eval suites and test set curation.
Develop reproducible data pipelines to clean and format raw customer inputs.
Debug training and inference issues by analyzing loss curves and training dynamics.
Deploy fine-tuned models across public cloud, VPC, and on-premises environments.
Establish best practices and benchmarks for the company's fine-tuning and evaluation playbooks.

Requirements

3+ years of engineering experience with significant exposure to applied ML or ML engineering (MLE).
Hands-on experience with LLM fine-tuning, including dataset preparation and training loops.
Strong software engineering fundamentals in Python.
Proficiency with GPU compute management and training infrastructure.
Experience working in customer-facing environments to translate requirements into training strategies.
Must be based in, or willing to relocate to, San Francisco or New York.

Culture & Benefits

Top-tier salary and equity package.
Comprehensive medical, dental, vision, life, and disability insurance.
Fully paid parental leave and financial support for family planning.
Lunch and dinner provided daily.
Relocation support and regular team off-sites.
High-agency environment within a small, talent-dense team of researchers from DeepMind, OpenAI, and Meta.
