Эта вакансия в архиве
Посмотреть похожие вакансии ↓обновлено 28 дней назад
Data Engineer (AI)
Описание вакансии
Текст:
TL;DR
Data Engineer (AI): Build and optimize distributed systems to process petabytes of multimedia data powering AI training pipelines and analytics. Focus on scaling large-scale ETL pipelines, GPU cluster orchestration, and ML pipeline deployment.
Location: On-site in San Francisco, United States
Company
builds next-generation AI creative tools empowering human creativity across text, images, video, sound, and 3D formats.
What you will do
- Build distributed systems to process massive multimedia datasets (images, video, 3D).
- Collaborate with research team to build and deploy ML pipelines.
- Manage large-scale GPU clusters on Kubernetes for compute-intensive workloads.
- Design multi-stage pipelines to transform raw data into clean datasets with metadata and annotations.
- Solve orchestration and scaling challenges in distributed GPU job processing.
Requirements
- Location: Must work onsite in San Francisco, United States
- Experience with Python, PyArrow, DuckDB, SQL, PyTorch, Pandas, NumPy.
- Knowledge of Kubernetes and containerization.
- Experience designing and implementing large-scale ETL and distributed systems.
- Fundamental understanding of operating systems, file systems, and networking.
- English proficiency at least B2 level.
Nice to have
- Machine learning engineering experience or willingness to learn on the job.
Culture & Benefits
- Small, tight-knit team of 12 with millions of active users.
- Backed by $83M in funding from top Silicon Valley investors.
- Focus on building innovative AI creative tooling.
Похожие вакансии
2 дня назад
Data Engineer (AWS)
2 дня назад
Software Engineer (Distributed Systems/AI)
150 000 - 300 000$
1 день назад
Senior Data Platform Engineer (Python/AWS)
16 часов назад
Senior Data Engineer (AI)
150 000 - 180 000$
2 дня назад
Data Engineer (Python/AWS)
2 дня назад
Software Engineer, Data (AI)
180 000 - 220 000$