Machine Learning Scientist (AI)

Work format

onsite

Employment type

fulltime

Grade

senior

English

Country

Vacancy from Hirify Global, a list of international tech companies

Job description

TL;DR

Machine Learning Scientist (AI): Developing and analyzing novel evaluation methodologies for AI models with an emphasis on human preference signals, model reliability, and alignment. Focus on designing large-scale experiments, building statistical frameworks to improve model performance, and translating research findings into production-ready evaluation systems.

Location: Must be based in the Bay Area

Company

hirify.global is an open platform created by researchers from UC Berkeley’s SkyLab, dedicated to evaluating AI model performance and building transparent, human-centered benchmarks for the global AI community.

What you will do

Design and conduct experiments to evaluate AI model behavior across reasoning, robustness, and user preference dimensions.
Develop new metrics, methodologies, and protocols that go beyond traditional benchmarks.
Analyze large-scale human interaction and voting data to derive insights into model performance.
Collaborate with engineering and product teams to scale research findings into robust production systems.
Prototype and test research ideas rapidly while maintaining scientific rigor.
Contribute to the scientific integrity of the LMArena leaderboard through internal reports and external publications.

Requirements

PhD or equivalent research experience in Machine Learning, Natural Language Processing, or Statistics.
Deep understanding of LLMs, modern deep learning architectures such as Transformers, and reinforcement learning.
Proficiency in Python and research libraries such as PyTorch, JAX, or TensorFlow.
Demonstrated ability to design experiments with high statistical rigor.
Track record of publishing research or contributing to open-source ML/AI projects.
Ability to translate complex research questions into practical, scalable systems.

Culture & Benefits

Competitive compensation packages with equity.
Comprehensive health and wellness benefits including medical, dental, and vision coverage.
Opportunity to contribute to a mission-driven team working on the cutting edge of AI evaluation.
Collaborative environment valuing transparency, craftsmanship, and curiosity.
Work with experts from leading institutions like Google, DeepMind, and Stanford.
