Назад
Company hidden
обновлено 1 день назад

Pre-Training Researcher (AI)

350 000 - 475 000$
Формат работы
onsite
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Pre-Training Researcher (AI): Design and implement methods for sourcing, curating, and analyzing large-scale pre-training datasets to support next-generation AI models with an accent on data quality, ethical considerations, and scalable data systems. Focus on developing data quality metrics, mitigating data risks, and publishing research that advances the AI community.

Location: San Francisco, California, USA

Salary: $350,000 - $475,000 USD per year

Company

hirify.global empowers humanity by advancing collaborative general intelligence, building widely used AI products and open-source projects.

What you will do

  • Design and implement techniques for curating, sourcing, and filtering large-scale text, code, and multimodal data.
  • Develop data quality metrics and analyze coverage, diversity, and representativeness across sources.
  • Collaborate with research and infrastructure teams to scale data processing systems efficiently and reproducibly.
  • Investigate and mitigate data risks including privacy, safety, and licensing concerns.
  • Continuously evaluate dataset improvements by analyzing downstream effects on model learning and behavior.
  • Publish and present research to advance the AI community and share code, datasets, and insights.

Requirements

  • Location: Must be based in San Francisco, California, USA
  • Proficiency in Python and familiarity with deep learning frameworks such as PyTorch, TensorFlow, or JAX.
  • Bachelor’s degree or equivalent experience in Computer Science, Machine Learning, Physics, Mathematics, or related discipline.
  • Strong theoretical and empirical grounding with clarity in communication.
  • Experience with large-scale data curation, preprocessing, and analysis preferred.
  • Knowledge of data ethics, safety, and licensing frameworks relevant to AI dataset creation preferred.

Nice to have

  • PhD or equivalent industry research experience in relevant fields.
  • Contributions to open datasets, research publications, or data tooling.

Culture & Benefits

  • Generous health, dental, and vision benefits.
  • Unlimited PTO and paid parental leave.
  • Visa sponsorship and relocation support as needed.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →