Research Engineer (AI)

500 000 - 850 000$

Формат работы

hybrid

Тип работы

fulltime

Грейд

senior

Английский

Страна

Вакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:

TL;DR

Research Engineer (AI): Advancing search and knowledge post-training to build trustworthy AI systems with an accent on reinforcement learning, retrieval, and evaluation. Focus on designing controlled search environments, building frontier-discriminating evaluations, and optimizing training rigor.

Location: Must be based in the US (San Francisco, Seattle, or New York City) and attend the office at least 25% of the time.

Salary: $500,000 - $850,000 USD per year

Company

AI safety and research company focused on creating reliable, interpretable, and steerable AI systems.

What you will do

Own end-to-end research directions for search post-training problems, from hypothesis to training.
Build instrumentation for controlled experiments to study how environment factors contribute to capabilities.
Design frontier-discriminating evaluations to distinguish genuine reasoning from pattern matching.
Drive optimization rigor across the stack, including efficient experiment design and training run economics.
Collaborate with post-training, RL infrastructure, and product teams to translate model behavior into signals.
Establish the team's experimental standards for measurement and result validation.

Requirements

Outstanding software engineering skills in Python, spanning data pipelines to RL training.
Proven track record of repeatedly shipping real ML research.
Rigorous quantitative mindset, utilizing ablations, controls, and confidence intervals.
Ability to operate with high autonomy in ambiguous environments.
Strong written and verbal communication skills for defending design choices.
Bachelor's degree or equivalent combination of education and experience.

Nice to have

Hands-on experience with RL on large language models (reward design, scaling behavior).
Background in search, retrieval, RAG, or agents that reason over external sources.
Experience building evaluations for open-ended or knowledge-intensive LLM behavior.
Prior work in frontier AI labs or quant research firms.
Published research on LLMs, RL, retrieval, or calibration.

Culture & Benefits

Collaborative "big science" approach focusing on high-impact research.
Competitive compensation and optional equity donation matching.
Generous vacation and parental leave policies.
Flexible working hours.
Visa sponsorship provided.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →