Company hidden
2 days ago

AI Research Engineer (Reinforcement Learning)

Work format
remote (Global)
Employment type
full-time
Grade
senior
English
B2
Vacancy from Hirify Global, a list of international tech companies

Job description

TL;DR

AI Research Engineer (Reinforcement Learning): Drive innovation in reinforcement learning for advanced models to optimize decision-making and adaptive behavior. The role focuses on developing, testing, and implementing novel RL algorithms, curating simulation environments, and resolving pipeline bottlenecks to improve domain-adapted AI performance.

Location: Fully remote, worldwide

Company

hirify.global is pioneering a global financial revolution by providing cutting-edge solutions for integrating reserve-backed tokens across blockchains and driving innovation in energy, AI, and education.

What you will do

  • Develop and implement state-of-the-art reinforcement learning algorithms to optimize decision-making processes in simulated and real-world settings.
  • Build, run, and monitor controlled reinforcement learning experiments, tracking key performance indicators and comparing outcomes against established benchmarks.
  • Identify and curate high-quality simulation environments and training datasets tailored to specific domain challenges.
  • Systematically debug and optimize the reinforcement learning pipeline by analyzing computational efficiency and learning performance metrics.
  • Collaborate with cross-functional teams to integrate reinforcement learning agents into production systems, defining clear success metrics.

Requirements

  • A degree in Computer Science or a related field, ideally a PhD in NLP, Machine Learning, or a similar area, with a solid track record in AI R&D and strong publications at A* conferences.
  • Proven experience with large-scale reinforcement learning experiments, including online RL techniques such as Group Relative Policy Optimization (GRPO).
  • Deep understanding of reinforcement learning algorithms, including state-of-the-art online RL, policy gradient, and actor-critic methods.
  • Strong expertise in PyTorch and relevant reinforcement learning frameworks, with practical experience in developing RL pipelines.
  • Demonstrated ability to apply empirical research to overcome reinforcement learning challenges and design robust evaluation frameworks.

Culture & Benefits

  • Work remotely from anywhere in the world as part of a global talent powerhouse.
  • Opportunity to collaborate with some of the brightest minds in the fintech space.
  • Contribute to an innovative platform, pushing boundaries and setting new standards.
  • Join a fast-growing, lean, and industry-leading team.
  • Excellent English communication skills are required.

Be careful: if you are asked to sign in to iCloud/Google, send a code or password, or run code or software, do not do it; these are scammers. Be sure to click "Report" or contact support.