Company hidden
2 days ago

Research Engineer - Post-Training for Agentic Coding (AI)

Work format: onsite
Job type: fulltime
Grade: senior
English: c1
Country: UK
Listing from Hirify Global, a list of international tech companies

Job description

TL;DR

Research Engineer - Post-Training for Agentic Coding (AI): Developing and implementing post-training solutions for enterprise-grade coding agents using static analysis and LLM techniques, with an emphasis on reinforcement learning from verifiable rewards, fine-tuning, and safety alignment. Focus on designing experiments, iterating on prototypes, and converting research into high-impact products for agentic software development.

On-site in London

Company

Leader in AI code review and verification, enabling reliable AI-generated code for Fortune 100 companies via hirify.globalQube and agentic tools.

What you will do

  • Develop advanced post-training products for coding agents that generate enterprise-standard code.
  • Collaborate with researchers and engineers to design experiments, build prototypes, and productionize successful ones.
  • Contribute ideas within a cross-disciplinary team to advance post-training of coding models.
  • Stay current with LLM and agentic developments, and explain complex concepts to diverse audiences.

Requirements

  • Master’s or PhD in Computer Science, Machine Learning, or related field
  • Strong industry ML experience with modern software engineering practices
  • Fluency in Python and core ML frameworks; Rust or C#/C++/JS/TS/Java a plus
  • Expertise in post-training techniques: RL from verifiable rewards, GRPO, offline RL, PEFT, SFT, safety alignment
  • Experience with large-scale data processing and cloud infrastructure (AWS, Databricks)
  • Track record of driving research into prototypes and products; excellent English communication skills

Culture & Benefits

  • Global team across hubs in Austin, Bochum, Dubai, Geneva, London, Singapore, Tokyo, Washington D.C.
  • CODE mindset: committed, quality-obsessed, deliberate, effective teamwork
  • Fast-paced growth at a profitable company driving the AI software revolution
  • Diversity, equity, inclusion focus; equal opportunity employer

Hiring process

  • Comprehensive background check and reference verification
  • AI tools may assist in application review; final decisions by humans
  • No agency submissions; contact for accommodations
