Назад
Company hidden
2 часа назад

Research Scientist (Reinforcement Learning)

Формат работы
onsite
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
UK
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Research Scientist (Reinforcement Learning): Developing and scaling fundamental reinforcement learning algorithms for high-impact AI research with an accent on experimental rigor and large-scale model performance. Focus on implementing novel research hypotheses, conducting end-to-end experiments, and contributing to state-of-the-art developments in the field of AI.

Location: London, UK (Onsite)

Company

hirify.global is a world-leading research organization dedicated to pushing the boundaries of AI, developing transformative technologies like AlphaGo, AlphaZero, and Gemini.

What you will do

  • Initiate and pursue novel research directions through testing and proposal of hypotheses.
  • Implement and manage end-to-end experimental research projects.
  • Build and improve research infrastructure at scale to support complex models.
  • Analyze results, debug failure modes, and iterate on research implementations.
  • Communicate research findings clearly through technical writeups and publications.
  • Collaborate with interdisciplinary teams to empower researchers and scale experiments.

Requirements

  • Research track record in reinforcement learning, including peer-reviewed publications.
  • Strong implementation ability and experience with research codebases.
  • Evidence of owning research experiments end-to-end.
  • PhD in machine learning or equivalent practical experience.
  • High agency, strong prioritization skills, and ability to take initiative.
  • Excellent communication skills with a bias toward transparency and clarity.

Nice to have

  • Experience with sequence models, post-training, or preference-based learning.
  • Proficiency in modern research stacks such as JAX/Flax or PyTorch.
  • Strong experimental judgment regarding baselines and ablations.

Culture & Benefits

  • Collaborative environment with a tight-knit, world-class research team.
  • Commitment to diversity, equity, and inclusion in the workplace.
  • Strong emphasis on continuous learning and professional development.
  • Opportunity to influence cutting-edge AI breakthroughs and product impact.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →

Текст вакансии взят без изменений

Источник - загрузка...