Назад
Company hidden
2 дня назад

Researcher, Agent Post-Training (AI)

295 000 - 445 000$
Формат работы
onsite
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Researcher, Agent Post-Training (AI): Training the next generation of frontier agents for Codex and ChatGPT with an accent on tool use, computer operation, and multi-agent coordination. Focus on developing training signals, RLHF pipelines, and evals to improve model reliability and product fit.

Location: San Francisco, USA

Salary: $295K – $445K

Company

hirify.global is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity.

What you will do

  • Design and run experiments to improve agentic model behavior across coding, tool use, function calling, and multi-agent collaboration.
  • Own end-to-end improvements to the post-training stack, including RL, data pipelines, graders, and reward signals.
  • Build evals and environments to expose model failures and convert them into training data or research directions.
  • Partner with product teams (Codex, API, ChatGPT) to translate user needs into concrete model improvements.
  • Develop early-training and alignment interventions using synthetic data, objectives, and eval loops.
  • Optimize large-scale training machinery for improved velocity, reliability, and production readiness.

Requirements

  • Must be based in or able to work from San Francisco, USA
  • Strong technical fundamentals in machine learning, software engineering, systems, or statistics.
  • Hands-on experience with LLMs, RL, RLHF/RLAIF, post-training, evals, or production ML systems.
  • Ability to translate ambiguous behavioral problems into concrete hypotheses and experiments.
  • Experience with coding agents, tool-using agents, or synthetic data generation.
  • Ability to communicate effectively across research, product, and infrastructure boundaries.

Culture & Benefits

  • High-agency role where research and engineering work lands directly in frontier models.
  • Opportunity to shape the capabilities of AI systems used by millions of users globally.
  • Collaborative environment working with world-class researchers, engineers, and safety partners.
  • Inclusive culture committed to safety and ensuring AI benefits all of humanity.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →