Helix AI Engineer, Video Pretraining (AI)

Формат работы

onsite

Тип работы

fulltime

Грейд

middle/senior

Английский

Страна

Вакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:

TL;DR

Helix AI Engineer, Video Pretraining (AI): Leading the development of large-scale video foundation models trained on diverse real-world and robot-collected data with an accent on pretraining models that learn from raw video. Focus on capturing motion, interaction, and temporal structure to enable downstream capabilities in perception, prediction, and embodied reasoning.

Location: Requires 5 days/week in-office collaboration in San Jose, CA

Company

hirify.global is an AI robotics company developing autonomous general-purpose humanoid robots.

What you will do

Design and train large-scale video foundation models on diverse datasets.
Develop pretraining strategies that capture temporal dynamics, motion, and object interaction from raw video sequences.
Build models that learn transferable representations for downstream tasks.
Explore architectures for video understanding and generation.
Implement efficient data pipelines and training strategies for high-throughput video ingestion and large-scale distributed training.
Design evaluation frameworks and benchmarks to measure temporal understanding, prediction quality, and generalization.

Requirements

Experience training large-scale models on video data or other high-dimensional sequential modalities.
Strong understanding of modern deep learning architectures for video, vision, or multimodal systems.
Experience with large-scale pretraining, including dataset curation, training dynamics, and scaling laws.
Proficiency in Python and deep learning frameworks such as PyTorch.
Experience working with distributed training systems and large GPU clusters.
Solid software engineering skills and ability to build scalable, reliable systems.

Nice to have

Experience working on frontier video models or multimodal foundation models.
Background in video diffusion, autoregressive video modeling, or world models.
Experience at leading AI labs such as OpenAI, Google DeepMind, Google, ByteDance, Midjourney, or Adobe.
Familiarity with robotics, embodied AI, or learning from egocentric / first-person video.

Culture & Benefits

Developing core AI systems that power humanoid autonomy.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →