Company hidden
14 hours ago

Machine Learning Scientist (AI)

Work format: onsite
Job type: fulltime
Grade: senior
English: B2
Country: US

Job description

TL;DR

Machine Learning Scientist (AI): Developing and analyzing novel evaluation methodologies for AI models with an emphasis on human preference signals, model reliability, and alignment. Focus on designing large-scale experiments, building statistical frameworks to improve model performance, and translating research findings into production-ready evaluation systems.

Location: Must be based in the Bay Area

Company

hirify.global is an open platform created by researchers from UC Berkeley’s SkyLab, dedicated to evaluating AI model performance and building transparent, human-centered benchmarks for the global AI community.

What you will do

  • Design and conduct experiments to evaluate AI model behavior across reasoning, robustness, and user preference dimensions.
  • Develop new metrics, methodologies, and protocols that go beyond traditional benchmarks.
  • Analyze large-scale human interaction and voting data to derive insights into model performance.
  • Collaborate with engineering and product teams to scale research findings into robust production systems.
  • Prototype and test research ideas rapidly while maintaining scientific rigor.
  • Contribute to the scientific integrity of the LMArena leaderboard through internal reports and external publications.

Requirements

  • PhD or equivalent research experience in Machine Learning, Natural Language Processing, or Statistics.
  • Deep understanding of LLMs, modern deep learning architectures such as Transformers, and reinforcement learning.
  • Proficiency in Python and research libraries such as PyTorch, JAX, or TensorFlow.
  • Demonstrated ability to design experiments with high statistical rigor.
  • Track record of publishing research or contributing to open-source ML/AI projects.
  • Ability to translate complex research questions into practical, scalable systems.

Culture & Benefits

  • Competitive compensation packages with equity.
  • Comprehensive health and wellness benefits including medical, dental, and vision coverage.
  • Opportunity to contribute to a mission-driven team working on the cutting edge of AI evaluation.
  • Collaborative environment valuing transparency, craftsmanship, and curiosity.
  • Work with experts from leading institutions like Google, DeepMind, and Stanford.
