Назад
Company hidden
2 часа назад

Reinforcement Learning Engineer (AI)

188 000 - 275 000$
Формат работы
remote (только USA)/hybrid
Тип работы
fulltime
Грейд
middle/senior
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Reinforcement Learning Engineer (AI): Developing and optimizing advanced RL workflows for self-improving agents with an accent on data efficiency, training methodologies, and large-scale model optimization. Focus on solving complex research challenges in sequence-level importance ratios and continuous learning, leveraging massive GPU infrastructure to push the boundaries of AI agents.

Location: Remote within the US or hybrid (San Francisco/Bellevue). Must be a U.S. person or eligible for export-controlled information access.

Salary: $188,000–$275,000

Company

hirify.global is an AI Hyperscaler providing high-performance GPU infrastructure and end-to-end platforms for developers to build, train, and scale AI models.

What you will do

  • Generate and investigate research ideas to overcome obstacles in continuous learning.
  • Collaborate with the OpenPipe team to validate research directions on real-world customer tasks.
  • Engineer scalable and data-efficient training methods for self-improving agents.
  • Utilize massive compute clusters to run large-scale training experiments and ablations.
  • Build and maintain production services supporting the RL platform lifecycle.

Requirements

  • Proven experience training LLMs to SOTA on specific tasks.
  • Strong engineering background with a focus on high-impact project results.
  • Deep understanding of RL methods, specifically sequence-level or token-level importance ratios.
  • Eligibility for access to export-controlled information (U.S. person or authorized status required).
  • Proficiency in Python and familiarity with the stack (Kubernetes, Postgres, FastAPI).

Culture & Benefits

  • 100% employer-paid medical, dental, and vision insurance.
  • 401(k) retirement plan with a generous employer match.
  • Comprehensive disability, life insurance, and mental wellness support (Spring Health).
  • Flexible Paid Time Off policy to support work-life balance.
  • Paid parental leave and family-forming support through Carrot.
  • Hybrid work environment with quarterly team gatherings for collaboration.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →