Эта вакансия в архиве

Посмотреть похожие вакансии ↓
Company hidden
обновлено 28 дней назад

Data Engineer (AI)

Формат работы
onsite
Тип работы
fulltime
Английский
b2
Страна
US

Описание вакансии

Текст:
/

TL;DR

Data Engineer (AI): Build and optimize distributed systems to process petabytes of multimedia data powering AI training pipelines and analytics. Focus on scaling large-scale ETL pipelines, GPU cluster orchestration, and ML pipeline deployment.

Location: On-site in San Francisco, United States

Company

hirify.global builds next-generation AI creative tools empowering human creativity across text, images, video, sound, and 3D formats.

What you will do

  • Build distributed systems to process massive multimedia datasets (images, video, 3D).
  • Collaborate with research team to build and deploy ML pipelines.
  • Manage large-scale GPU clusters on Kubernetes for compute-intensive workloads.
  • Design multi-stage pipelines to transform raw data into clean datasets with metadata and annotations.
  • Solve orchestration and scaling challenges in distributed GPU job processing.

Requirements

  • Location: Must work onsite in San Francisco, United States
  • Experience with Python, PyArrow, DuckDB, SQL, PyTorch, Pandas, NumPy.
  • Knowledge of Kubernetes and containerization.
  • Experience designing and implementing large-scale ETL and distributed systems.
  • Fundamental understanding of operating systems, file systems, and networking.
  • English proficiency at least B2 level.

Nice to have

  • Machine learning engineering experience or willingness to learn on the job.

Culture & Benefits

  • Small, tight-knit team of 12 with millions of active users.
  • Backed by $83M in funding from top Silicon Valley investors.
  • Focus on building innovative AI creative tooling.