Company hidden

3 дня назад

Agent Post-Training, API & Power Users (AI)

295 000 - 445 000$

Тип работы

fulltime

Грейд

senior

Английский

Страна

Вакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:

TL;DR

Agent Post-Training, API & Power Users (AI): Training and refining frontier agentic models for API developers and power users with an accent on tool use, coding, and long-horizon execution. Focus on designing evals from real-world workflows, implementing RLHF/post-training interventions, and improving model reliability in complex agentic systems.

Location: San Francisco; Must be US-based

Salary: $295K – $445K

Company

hirify.global is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity.

What you will do

Design and run experiments to improve model behavior in API and power-user workflows, including function calling, tool use, and planning.
Build evals, graders, and environments from real developer workflows to convert observed failures into training data.
Partner with API and product teams to identify behavior gaps and implement post-training interventions.
Develop feedback loops using power-user traces and API usage patterns to discover new model failures.
Own end-to-end model behavior projects from qualitative failure analysis to launch readiness.
Improve large-scale training machinery, focusing on experiment velocity, reliability, and production readiness.

Requirements

Strong technical fundamentals in ML, software engineering, systems, or statistics.
Hands-on experience with LLMs, post-training, RL/RLHF/RLAIF, evals, or synthetic data.
Proven ability to analyze model traces and form concrete hypotheses for behavioral improvement.
Experience building coding agents or tool-using agents in production ML systems.
Must be based in the United States.

Nice to have

Experience with multi-agent systems.
Experience training models directly against production-like environments.

Culture & Benefits

High-agency role with direct impact on frontier models used by millions.
Collaborative environment working across research, engineering, and safety boundaries.
Opportunity to push the boundaries of general-purpose AI capabilities.
Commitment to equal opportunity and diversity in the workplace.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →

Похожие вакансии

Agent Post-Training, API & Power Users (AI)

Мэтч & Сопровод

Описание вакансии

TL;DR

Company

What you will do

Requirements

Nice to have

Culture & Benefits

Похожие вакансии

Research Engineer, Post-Training (AI)

Staff Enterprise AI Engineer (AI)

Manager, Applied AI Engineering (AI)

Principal Engineer, Agentic AI Systems (AI)

Research Engineer (AI)

Software Engineer (AI/ML GenAI)

Разработка

Game Dev

Design и Creative

Аналитика

Менеджмент

People & Business

Agent Post-Training, API & Power Users (AI)

Мэтч & Сопровод

Описание вакансии

TL;DR

Company

What you will do

Requirements

Nice to have

Culture & Benefits

Categories

Похожие вакансии

Research Engineer, Post-Training (AI)

Staff Enterprise AI Engineer (AI)

Manager, Applied AI Engineering (AI)

Principal Engineer, Agentic AI Systems (AI)

Research Engineer (AI)

Software Engineer (AI/ML GenAI)