Назад
Company hidden
18 часов назад

AI Research Engineer (Reinforcement Learning)

Формат работы
remote (Global)
Тип работы
fulltime
Грейд
senior
Английский
c1
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

AI Research Engineer (Reinforcement Learning): Optimizing decision-making and adaptive behavior for advanced models across various systems, from resource-efficient to complex multi-modal architectures with an accent on developing, testing, and implementing novel RL algorithms and training frameworks. Focus on curating specialized simulation environments, strengthening baseline policy performance, and resolving bottlenecks to unlock superior, domain-adapted AI performance.

Location: Remote (Global)

Company

hirify.global is pioneering a global financial revolution, empowering businesses with cutting-edge blockchain solutions, innovative products like USDT, energy solutions, AI-driven data platforms (KEET), and digital education initiatives.

What you will do

  • Develop and implement state-of-the-art reinforcement learning algorithms designed to optimize decision-making processes in simulated and real-world settings.
  • Build, run, and monitor controlled reinforcement learning experiments, tracking key performance indicators and documenting iterative results.
  • Identify and curate high-quality simulation environments and training datasets tailored to specific domain challenges.
  • Systematically debug and optimize the reinforcement learning pipeline by analyzing computational efficiency and learning performance metrics.
  • Collaborate with cross-functional teams to integrate reinforcement learning agents into production systems, ensuring continuous monitoring and iterative refinements.

Requirements

  • A degree in Computer Science or related field; ideally PhD in NLP, Machine Learning, or a related field, complemented by a solid track record in AI R&D (with good publications in A* conferences).
  • Proven experience with large-scale reinforcement learning experiments, including online RL techniques such as Group Relative Policy Optimization (GRPO).
  • Deep understanding of reinforcement learning algorithms, including state-of-the-art online RL methods, policy gradients, actor-critic, and GRPO.
  • Strong expertise in PyTorch and relevant reinforcement learning frameworks; practical experience in developing and deploying RL pipelines in production environments.
  • Demonstrated ability to apply empirical research to overcome reinforcement learning challenges such as sample inefficiency, exploration-exploitation tradeoffs, and training instability.
  • Excellent English communication skills.

Culture & Benefits

  • Join a global, fully remote team working from every corner of the world.
  • Opportunity to make a significant mark in the fintech space.
  • Collaborate with bright minds, pushing boundaries and setting new standards in the industry.
  • Work for a lean, fast-growing industry leader.
  • Benefit from transparency which is the bedrock of all operations.

Hiring process

  • Apply only through official channels on hirify.global.recruitee.com.
  • Verify the recruiter’s identity via verified LinkedIn profiles.
  • All communication will be through official company emails (@hirify.global.to or @hirify.global.io) and platforms, not WhatsApp, Telegram, or SMS.
  • The company will never request payment or financial details during the hiring process.
  • Do not use AI tools when completing the application, as AI-generated answers may lead to disqualification.

Будьте осторожны: если вас просят войти в iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →