Назад
Company hidden
2 часа назад

Research Engineer (AI)

500 000 - 850 000$
Формат работы
hybrid
Тип работы
fulltime
Грейд
senior
Английский
c1
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Research Engineer (AI): Advancing search and knowledge post-training to build trustworthy AI systems with an accent on reinforcement learning, retrieval, and evaluation. Focus on designing controlled search environments, building frontier-discriminating evaluations, and optimizing training rigor.

Location: Must be based in the US (San Francisco, Seattle, or New York City) and attend the office at least 25% of the time.

Salary: $500,000 - $850,000 USD per year

Company

AI safety and research company focused on creating reliable, interpretable, and steerable AI systems.

What you will do

  • Own end-to-end research directions for search post-training problems, from hypothesis to training.
  • Build instrumentation for controlled experiments to study how environment factors contribute to capabilities.
  • Design frontier-discriminating evaluations to distinguish genuine reasoning from pattern matching.
  • Drive optimization rigor across the stack, including efficient experiment design and training run economics.
  • Collaborate with post-training, RL infrastructure, and product teams to translate model behavior into signals.
  • Establish the team's experimental standards for measurement and result validation.

Requirements

  • Outstanding software engineering skills in Python, spanning data pipelines to RL training.
  • Proven track record of repeatedly shipping real ML research.
  • Rigorous quantitative mindset, utilizing ablations, controls, and confidence intervals.
  • Ability to operate with high autonomy in ambiguous environments.
  • Strong written and verbal communication skills for defending design choices.
  • Bachelor's degree or equivalent combination of education and experience.

Nice to have

  • Hands-on experience with RL on large language models (reward design, scaling behavior).
  • Background in search, retrieval, RAG, or agents that reason over external sources.
  • Experience building evaluations for open-ended or knowledge-intensive LLM behavior.
  • Prior work in frontier AI labs or quant research firms.
  • Published research on LLMs, RL, retrieval, or calibration.

Culture & Benefits

  • Collaborative "big science" approach focusing on high-impact research.
  • Competitive compensation and optional equity donation matching.
  • Generous vacation and parental leave policies.
  • Flexible working hours.
  • Visa sponsorship provided.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →