TL;DR
Research Scientist (AI/ML): Building a real science of post-training for agents and LLM-based systems with an accent on rigorous experimentation, scaling, and evaluation. Focus on implementing algorithm ideas, designing evaluations that answer real questions, and analyzing complex failure modes.
Location: Onsite in London, UK
Company
hirify.global is a team of scientists, engineers, and machine learning experts working to advance the state of the art in artificial intelligence.
What you will do
- Propose and test research hypotheses in post-training and RL for agents/LLMs.
- Implement algorithm ideas and run end-to-end experiments, including setup, execution, analysis, and iteration.
- Design evaluations and ablations that answer real questions and change minds.
- Analyze results carefully, including debugging and failure analysis.
- Communicate clearly through plots, writeups, and paper-ready narratives and figures.
- Collaborate closely with engineering and research partners to keep the team aligned on findings and strategy.
Requirements
- Work format: Onsite in London, UK
- A research track record in ML/RL, demonstrated through publications or high-quality projects.
- Strong implementation ability and comfort working in research codebases.
- Evidence of owning experiments end-to-end, including analysis and interpretation.
- Strong communication skills and a bias toward clarity and honesty regarding results.
- High agency and drive to push projects forward, prioritize effectively, and take initiative.
- PhD in ML preferred, or equivalent practical experience.
Nice to have
- Experience with RL for sequence models, post-training, preference-based learning, or agentic systems.
- Experience with modern research stacks (e.g., JAX/Flax or PyTorch) and scaling experiments.
- Strong experimental taste and good judgment regarding baselines, ablations, and what is worth testing.
- Comfort with scaling, evaluation methodologies, and diagnosing complex failure modes.
- A focus on craft, caring about doing excellent work while maintaining high velocity.
Culture & Benefits
- Contribute to a culture of first-principles thinking, high standards, and direct, constructive feedback.
- Opportunity to advance state-of-the-art AI for widespread public benefit and scientific discovery.
- Committed to equal employment opportunities regardless of sex, race, religion, or belief.
- Accommodation provided for disabilities or additional needs.
- Employment offers are conditional on the results of a background check.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →