TL;DR
Research Scientist, Gemini Safety (AI): Develop and apply data and algorithmic solutions to advance hirify.global's latest user-facing models, with an emphasis on the safety and fairness behavior of Gemini models. Focus on improving adversarial robustness and designing high-quality evaluation protocols to assess model behavior.
Location: Mountain View, California, US
Company
hirify.global is advancing the state of the art in artificial intelligence, using technologies for widespread public benefit and scientific discovery, and collaborating on critical challenges, ensuring safety and ethics are the highest priority.
What you will do
- Post-train and instruction-tune state-of-the-art LLMs, focusing on text-to-text and image/video/audio-to-text modalities and on agentic capabilities.
- Explore data, reasoning, and algorithmic solutions to ensure Gemini models are safe, maximally helpful, and work for everyone.
- Improve Gemini’s adversarial robustness, with a focus on high-stakes abuse risks.
- Design and maintain high-quality evaluation protocols to assess model behavior gaps and headroom related to safety and fairness.
- Develop and execute experimental plans to address known gaps or to build entirely new capabilities.
- Drive innovation in, and deepen understanding of, supervised fine-tuning and reinforcement learning fine-tuning at scale.
Requirements
- PhD in Computer Science, a related field, or equivalent practical experience.
- Significant LLM post-training experience.
Nice to have
- Experience in reward modeling and reinforcement learning for LLM instruction tuning
- Experience with long-range reinforcement learning
- Experience in areas such as Safety, Fairness and Alignment
- Track record of publications at NeurIPS, ICLR, ICML, RL/DL, EMNLP, AAAI, UAI
- Experience taking research from concept to product
- Experience collaborating on or leading an applied research project
- Experience with JAX
Culture & Benefits
- Value diversity of experience, knowledge, backgrounds and perspectives to create extraordinary impact.
- Committed to equal employment opportunity regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition or any other basis as protected by applicable law.
- Accommodation for disability or additional needs.