Назад
Company hidden
3 дня назад

Research Scientist (AI)

Тип работы
fulltime
Грейд
senior
Английский
c1
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Research Scientist (AI): Developing and applying post-training methods and interpretability techniques to enhance the safety and robustness of frontier AI systems with an accent on RLHF, DPO, and GRPO methodologies. Focus on designing post-training pipelines, conducting interpretability-informed evaluations, and translating research findings into actionable safety standards for industry and policymakers.

Location: Must be based in the United States

Company

hirify.global is a leading data and evaluation partner for frontier AI companies, providing high-quality data and full-stack technologies to help enterprises and governments build, deploy, and oversee impactful AI applications.

What you will do

  • Design and execute post-training pipelines to analyze the impact of training choices on model safety, robustness, and alignment.
  • Develop interpretability-informed evaluations to identify and mitigate unsafe, deceptive, or undesirable model behaviors.
  • Collaborate with cross-functional teams, including policymakers and engineers, to translate research into actionable safety standards and benchmarks.
  • Conduct rigorous research on agent robustness, AI control protocols, and risk evaluation.
  • Publish research findings to contribute to the broader scientific understanding of AI capabilities and risks.

Requirements

  • At least three years of experience addressing sophisticated machine learning problems in research or product development.
  • Proven experience with post-training and RL techniques such as RLHF, DPO, or GRPO.
  • Strong track record of published research in machine learning, specifically within generative AI.
  • Excellent written and verbal communication skills for cross-functional collaboration.
  • Commitment to the mission of promoting safe, secure, and trustworthy AI deployments.

Nice to have

  • Experience with mechanistic interpretability, probing, or understanding model internals.
  • Familiarity with red-teaming or adversarial evaluation of post-trained models.
  • Experience studying failure modes like reward hacking, sycophancy, or alignment faking.

Culture & Benefits

  • Comprehensive health, dental, and vision coverage.
  • Retirement benefits and equity-based compensation.
  • Generous paid time off (PTO) and learning/development stipends.
  • Inclusive and equal opportunity workplace culture.
  • Opportunities to collaborate with industry leaders and government agencies.

Hiring process

  • Interviews focus on practical ML prototyping, debugging, and research concept grasp.
  • No LeetCode-style questions are used in the evaluation process.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →