Назад
Company hidden
3 часа назад

Senior ML Engineer (AI Research)

Формат работы
remote (только Europe, mena)
Тип работы
fulltime
Грейд
senior/lead
Английский
c1
Страна
UK, Netherlands, Israel
Вакансия из списка Hirify.GlobalВакансия из Hirify RU Global, списка компаний с восточно-европейскими корнями
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Senior ML Engineer (AI Research): Conducting applied research in AI, focusing on areas like guided search, reinforcement learning for agentic systems, and web-scale problem collection for training agents. Focus on designing experiments, developing efficient training methods for large language models, and exploring novel methods for agent training.

Location: Remote from Europe or Israel, or onsite in Amsterdam (Netherlands) or United Kingdom

Company

hirify.global is a Nasdaq-listed cloud computing company headquartered in Amsterdam, focused on providing tools and resources for the global AI economy.

What you will do

  • Conduct applied research in AI, specifically in areas like guided search, reinforcement learning for agentic systems, web-scale problem collection, and efficient model distillation.
  • Design and execute experiments to train large language models on interaction traces with various environments.
  • Explore methods for guided generation and search in trajectory space.
  • Develop strategies to mine relevant data at web scale and integrate it into model post-training.
  • Conduct experiments with reinforcement learning configurations in verifiable and non-verifiable domains.
  • Collaborate with adjacent teams to apply research findings in practice.

Requirements

  • Profound understanding of machine learning and reinforcement learning foundations.
  • Deep expertise in modern deep learning for language processing and generation.
  • Substantial experience with training large models on multiple computational nodes.
  • Strong software engineering skills (Python, Jax framework).
  • Ability to design, execute, and analyze machine learning experiments with statistical rigor.
  • Strong communication and leadership abilities.
  • Excellent command of the English language.

Nice to have

  • Experience with deep reinforcement learning for LLMs (reward modeling, DPO, PPO).
  • Familiarity with LLM concepts (RoPE, ZeRO/FSDP, Flash Attention, quantization).
  • Experience building and delivering products in a dynamic startup-like environment.
  • Open-source contributions or experience in engineering complex distributed systems.
  • Proficiency in contemporary software engineering approaches (CI/CD, version control, unit testing).

Culture & Benefits

  • Competitive salary and comprehensive benefits package.
  • Opportunities for professional growth.
  • Flexible working arrangements.
  • Dynamic and collaborative work environment that values initiative and innovation.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →

Текст вакансии взят без изменений

Источник - загрузка...