AI Research Engineer (Reinforcement Learning)

Формат работы

remote (Global)

Тип работы

fulltime

Грейд

senior

Английский

Вакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:

TL;DR

AI Research Engineer (Reinforcement Learning): Driving innovation in reinforcement learning approaches for advanced models to optimize decision-making and adaptive behavior. Focus on developing, testing, and implementing novel RL algorithms, curating simulation environments, and resolving bottlenecks for superior domain-adapted AI performance.

Location: Fully remote, worldwide

Company

hirify.global is pioneering a global financial revolution by providing cutting-edge solutions for integrating reserve-backed tokens across blockchains and driving innovation in energy, AI, and education.

What you will do

Develop and implement state-of-the-art reinforcement learning algorithms to optimize decision-making processes in simulated and real-world settings.
Build, run, and monitor controlled reinforcement learning experiments, tracking key performance indicators and comparing outcomes against established benchmarks.
Identify and curate high-quality simulation environments and training datasets tailored to specific domain challenges.
Systematically debug and optimize the reinforcement learning pipeline by analyzing computational efficiency and learning performance metrics.
Collaborate with cross-functional teams to integrate reinforcement learning agents into production systems, defining clear success metrics.

Requirements

A degree in Computer Science or related field, ideally PhD in NLP, Machine Learning, or a related field, with a solid track record in AI R&D and good publications in A* conferences.
Proven experience with large-scale reinforcement learning experiments, including online RL techniques such as Group Relative Policy Optimization (GRPO).
Deep understanding of reinforcement learning algorithms, including state-of-the-art online RL methods, policy gradients, and actor-critic.
Strong expertise in PyTorch and relevant reinforcement learning frameworks, with practical experience in developing RL pipelines.
Demonstrated ability to apply empirical research to overcome reinforcement learning challenges and design robust evaluation frameworks.

Culture & Benefits

Work remotely from every corner of the world as part of a global talent powerhouse.
Opportunity to collaborate with some of the brightest minds in the fintech space.
Contribute to an innovative platform, pushing boundaries and setting new standards.
Join a fast-growing, lean, and industry-leading team.
Excellent English communication skills are required.

Будьте осторожны: если вас просят войти в iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →