Company hidden
2 months ago

Research Scientist (AI)

Work format
onsite
Employment type
fulltime
Grade
senior
English
B2
Country
US
Job from Hirify.Global, a list of international tech companies

Job description

TL;DR

Research Scientist (AI): Driving end-to-end ambiguous research problems in RL or mid-training, forming hypotheses, and building training/eval/data to test them. Focus on improving understanding of RL for longer horizon tasks, training graders for coding, and enhancing data quality for model training.

Location: In-person in North Beach, San Francisco or Manhattan, New York

Company

hirify.global is building the best tool for professional programmers by automating coding through inventive research, design, and engineering.

What you will do

  • Own ambiguous, hard research problems end-to-end, forming hypotheses and designing experiments.
  • Build training, evaluation, and data infrastructure to test hypotheses and push results into models.
  • Improve understanding of Reinforcement Learning (RL) for longer horizon tasks with less compute.
  • Train graders to improve performance on coding tasks with non-verifiable reward.
  • Improve the quality and difficulty of datapoints used for model training.

Requirements

  • Deep background in RL and strong machine learning fundamentals.
  • Excellent programmer and software engineer.
  • Ability to handle ambiguous research tasks with little guidance.
  • Strong focus on data quality.
  • Must work in-person from offices in San Francisco or New York.

Culture & Benefits

  • Small, talent-dense, flat organization.
  • Culture of truth-seeking, passion, creativity, spirited debate, and shipping code.
  • Cozy offices in North Beach, San Francisco and Manhattan, New York.
  • Well-stocked libraries.

Job text reproduced without changes