TL;DR
AI Research Engineer (Reinforcement Learning): Driving innovation in reinforcement learning approaches for advanced models to optimize decision-making and adaptive behavior. Focus on developing, testing, and implementing novel RL algorithms, curating simulation environments, and resolving bottlenecks for superior domain-adapted AI performance.
Location: Fully remote, worldwide
Company
hirify.global is pioneering a global financial revolution by providing cutting-edge solutions for integrating reserve-backed tokens across blockchains and driving innovation in energy, AI, and education.
What you will do
- Develop and implement state-of-the-art reinforcement learning algorithms to optimize decision-making processes in simulated and real-world settings.
- Build, run, and monitor controlled reinforcement learning experiments, tracking key performance indicators and comparing outcomes against established benchmarks.
- Identify and curate high-quality simulation environments and training datasets tailored to specific domain challenges.
- Systematically debug and optimize the reinforcement learning pipeline by analyzing computational efficiency and learning performance metrics.
- Collaborate with cross-functional teams to integrate reinforcement learning agents into production systems, defining clear success metrics.
Requirements
- A degree in Computer Science or related field, ideally PhD in NLP, Machine Learning, or a related field, with a solid track record in AI R&D and good publications in A* conferences.
- Proven experience with large-scale reinforcement learning experiments, including online RL techniques such as Group Relative Policy Optimization (GRPO).
- Deep understanding of reinforcement learning algorithms, including state-of-the-art online RL methods, policy gradients, and actor-critic.
- Strong expertise in PyTorch and relevant reinforcement learning frameworks, with practical experience in developing RL pipelines.
- Demonstrated ability to apply empirical research to overcome reinforcement learning challenges and design robust evaluation frameworks.
Culture & Benefits
- Work remotely from every corner of the world as part of a global talent powerhouse.
- Opportunity to collaborate with some of the brightest minds in the fintech space.
- Contribute to an innovative platform, pushing boundaries and setting new standards.
- Join a fast-growing, lean, and industry-leading team.
- Excellent English communication skills are required.
Будьте осторожны: если вас просят войти в iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →