Staff/Lead Research Engineer (Data)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Staff/Lead Research Engineer (Data): Architecting and scaling data engineering systems for multimodal foundation model training with an accent on large-scale data pipelines and ML data curation. Focus on building robust infrastructure for ingestion, cleaning, and management of diverse datasets to power real-time generative AI.
Location: Must be based in or able to work on-site in Palo Alto, CA
Salary: $185,000 – $400,000
Company
is a high-growth startup building state-of-the-art agentic and multimedia platforms to democratize creative technology.
What you will do
- Take ownership of large-scale data pipeline architecture for text, image, audio, and video datasets.
- Partner with research teams to curate, clean, and manage diverse datasets for model pre-training.
- Develop scalable tools for data ingestion, labeling, filtering, and augmentation.
- Ensure data quality, reliability, and compliance throughout the data lifecycle.
- Optimize data processing and delivery for distributed training pipelines.
- Prototype and productionize new methods for dataset creation and management.
Requirements
- 5+ years of experience building and scaling data pipelines for machine learning at a staff or lead level.
- Strong background in ML data curation for LLMs, VLMs, or multimodal models.
- Expertise in distributed data systems like Spark, Ray, or Hadoop.
- Strong programming skills in Python and SQL.
- Familiarity with cloud data platforms such as AWS, GCP, or Azure.
- Knowledge of data privacy, ethics, and compliance standards.
Culture & Benefits
- Competitive salary and substantial equity package.
- Full health benefits and 401k matching.
- Collaborative, mission-driven environment with significant growth potential.
- Flexible hybrid work policy based out of Palo Alto HQ.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →