Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
RE/RS, Data Understanding (AI): Developing high-quality multimodal datasets and quantized representations for large-scale model training with an accent on synthesis, filtering, and tokenization. Focus on building automated data preparation pipelines and measuring the impact of dataset changes on model performance.
Location: Onsite in San Francisco, USA
Salary: $445,000 – $555,000 + Equity
Company
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity.
What you will do
- Develop and curate high-quality multimodal datasets (images, audio, video) and their quantized representations.
- Synthesize multimodal content and supervisions to enhance model training.
- Build and optimize noisy data pipelines, quality filters, and deduplication processes.
- Use ML models to automate data preparation and tokenization.
- Measure and analyze how dataset modifications influence model performance.
- Drive a research agenda from problem identification to implementation and impact.
Requirements
- Strong track record of new or improved ML ideas via publications, projects, or applied research.
- Ability to own and drive a research agenda independently.
- Must be based in or able to work from San Francisco, USA.
Nice to have
- Experience with multimodal learning (audio, vision, video) or synthetic data.
- Background in data-centric ML.
- Experience building high-performance deep learning or large-scale data processing systems.
- Consideration of AI impact, including privacy and data provenance.
Culture & Benefits
- Competitive salary and equity offers.
- Opportunity to work at the forefront of AGI research.
- Collaborative, empirical research environment.
- Equal opportunity employer with a commitment to diversity and inclusion.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →