Senior Applied Scientist (AI)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Senior Applied Scientist (AI): Owning the RL-based post-training pipeline end-to-end to improve clinical reasoning, safety, and alignment of LLMs. Focus on designing and iterating on RL-based post-training methods, building and evaluating reward models, and developing conversational AI environments for healthcare RL training with synthetic data.
Location: Must be in our Palo Alto office five days a week.
Company
is the leading generative AI company in healthcare with the only system that can have safe, autonomous, clinical conversations with patients.
What you will do
- Design and iterate on RL-based post-training methods (RLHF, RLVR, DPO, and beyond).
- Build and evaluate reward models, verifiers, and LLM-as-judge pipelines.
- Develop conversational AI environments and simulations for healthcare RL training with synthetic data.
- Run rigorous experiments to understand what drives post-training gains.
- Collaborate with research, engineering, and clinical teams.
Requirements
- MS or PhD in CS or relevant field.
- 4+ years or experience in NLP, LLM training, RL, or general ML.
- 1+ years experience in RL for LLM post-training.
- Experience with large-scale (50B+ parameter and multi-node) LLM training.
- Strong Python and PyTorch coding skills.
- Experience with RLHF, RLVR, LLM-as-judge or similar methods for LLM post-training.
Nice to have
- Publications at top venues (NeurIPS, ICML, ICLR, ACL, EMNLP)
- Healthcare domain experience
Culture & Benefits
- Reinvent healthcare with AI that puts safety first.
- Work with the people shaping the future.
- Backed by the world’s leading healthcare and AI investors.
- Build alongside the best in healthcare and AI.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →