Senior Applied Scientist (AI)

Формат работы

onsite

Тип работы

fulltime

Грейд

senior

Английский

Страна

Вакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:

TL;DR

Senior Applied Scientist (AI): Owning the RL-based post-training pipeline end-to-end to improve clinical reasoning, safety, and alignment of LLMs. Focus on designing and iterating on RL-based post-training methods, building and evaluating reward models, and developing conversational AI environments for healthcare RL training with synthetic data.

Location: Must be in our Palo Alto office five days a week.

Company

hirify.global is the leading generative AI company in healthcare with the only system that can have safe, autonomous, clinical conversations with patients.

What you will do

Design and iterate on RL-based post-training methods (RLHF, RLVR, DPO, and beyond).
Build and evaluate reward models, verifiers, and LLM-as-judge pipelines.
Develop conversational AI environments and simulations for healthcare RL training with synthetic data.
Run rigorous experiments to understand what drives post-training gains.
Collaborate with research, engineering, and clinical teams.

Requirements

MS or PhD in CS or relevant field.
4+ years or experience in NLP, LLM training, RL, or general ML.
1+ years experience in RL for LLM post-training.
Experience with large-scale (50B+ parameter and multi-node) LLM training.
Strong Python and PyTorch coding skills.
Experience with RLHF, RLVR, LLM-as-judge or similar methods for LLM post-training.

Nice to have

Publications at top venues (NeurIPS, ICML, ICLR, ACL, EMNLP)
Healthcare domain experience

Culture & Benefits

Reinvent healthcare with AI that puts safety first.
Work with the people shaping the future.
Backed by the world’s leading healthcare and AI investors.
Build alongside the best in healthcare and AI.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →