Pre-Training Researcher (AI)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Pre-Training Researcher (AI): Design and implement methods for sourcing, curating, and analyzing large-scale pre-training datasets to support next-generation AI models with an accent on data quality, ethical considerations, and scalable data systems. Focus on developing data quality metrics, mitigating data risks, and publishing research that advances the AI community.
Location: San Francisco, California, USA
Salary: $350,000 - $475,000 USD per year
Company
empowers humanity by advancing collaborative general intelligence, building widely used AI products and open-source projects.
What you will do
- Design and implement techniques for curating, sourcing, and filtering large-scale text, code, and multimodal data.
- Develop data quality metrics and analyze coverage, diversity, and representativeness across sources.
- Collaborate with research and infrastructure teams to scale data processing systems efficiently and reproducibly.
- Investigate and mitigate data risks including privacy, safety, and licensing concerns.
- Continuously evaluate dataset improvements by analyzing downstream effects on model learning and behavior.
- Publish and present research to advance the AI community and share code, datasets, and insights.
Requirements
- Location: Must be based in San Francisco, California, USA
- Proficiency in Python and familiarity with deep learning frameworks such as PyTorch, TensorFlow, or JAX.
- Bachelor’s degree or equivalent experience in Computer Science, Machine Learning, Physics, Mathematics, or related discipline.
- Strong theoretical and empirical grounding with clarity in communication.
- Experience with large-scale data curation, preprocessing, and analysis preferred.
- Knowledge of data ethics, safety, and licensing frameworks relevant to AI dataset creation preferred.
Nice to have
- PhD or equivalent industry research experience in relevant fields.
- Contributions to open datasets, research publications, or data tooling.
Culture & Benefits
- Generous health, dental, and vision benefits.
- Unlimited PTO and paid parental leave.
- Visa sponsorship and relocation support as needed.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →