Company hidden

обновлено 11 часов назад

Researcher (Reinforcement Learning)

310 000 - 460 000$

Формат работы

hybrid

Тип работы

fulltime

Грейд

senior

Английский

Страна

Релокация

Вакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:

TL;DR

Researcher (Reinforcement Learning): Developing novel reinforcement learning techniques leveraging synthetic data, environments, and feedback to train and evaluate frontier AI models with an accent on self-play, simulators, and other synthetic evaluations. Focus on designing experiments, analyzing learning dynamics, and translating research insights into production training approaches.

Location: Hybrid (San Francisco, CA) with 3 days in office per week. Relocation assistance to San Francisco, CA is offered.

Salary: $310,000–$460,000 + Offers Equity

Company

hirify.global is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity.

What you will do

Research and develop reinforcement learning algorithms.
Design and run experiments to study training dynamics and model behavior at scale.
Collaborate with engineers and researchers to integrate successful approaches into model training pipelines.

Requirements

Strong background in reinforcement learning, machine learning research, or related fields.
Strong engineering and statistical analysis skills.
Enjoys exploring new problem spaces where data, objectives, and evaluation are imperfect or evolving.
Motivated by seeing research ideas influence real-world AI systems.

Culture & Benefits

Work on open-ended problems with a focus on fast iteration.
Directly shape how frontier models are trained.
Committed to providing reasonable accommodations to applicants with disabilities.
An equal opportunity employer, promoting diversity and inclusion.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →

Похожие вакансии

Researcher (Reinforcement Learning)

Мэтч & Сопровод

Описание вакансии

TL;DR

Company

What you will do

Requirements

Culture & Benefits

Похожие вакансии

Machine Learning Engineer (AI)

AI Research Scientist (New Grad) (AI)

Principal Research Scientist (AI Scaling)

Machine Learning Engineer (AI)

Principal Research Scientist (AI Scaling & Optimization)

People Research Scientist (AI)

Разработка

Game Dev

Design и Creative

Аналитика

Менеджмент

People & Business

Researcher (Reinforcement Learning)

Мэтч & Сопровод

Описание вакансии

TL;DR

Company

What you will do

Requirements

Culture & Benefits

Categories

Похожие вакансии

Machine Learning Engineer (AI)

AI Research Scientist (New Grad) (AI)

Principal Research Scientist (AI Scaling)

Machine Learning Engineer (AI)

Principal Research Scientist (AI Scaling & Optimization)

People Research Scientist (AI)