Назад
Company hidden
21 дСнь назад

Helix AI Engineer, Video Pretraining (AI)

Π€ΠΎΡ€ΠΌΠ°Ρ‚ Ρ€Π°Π±ΠΎΡ‚Ρ‹
onsite
Π’ΠΈΠΏ Ρ€Π°Π±ΠΎΡ‚Ρ‹
fulltime
Π“Ρ€Π΅ΠΉΠ΄
middle/senior
Английский
b2
Π‘Ρ‚Ρ€Π°Π½Π°
US
Вакансия ΠΈΠ· списка Hirify.GlobalВакансия ΠΈΠ· Hirify Global, списка ΠΌΠ΅ΠΆΠ΄ΡƒΠ½Π°Ρ€ΠΎΠ΄Π½Ρ‹Ρ… tech-ΠΊΠΎΠΌΠΏΠ°Π½ΠΈΠΉ
Для мэтча ΠΈ ΠΎΡ‚ΠΊΠ»ΠΈΠΊΠ° Π½ΡƒΠΆΠ΅Π½ Plus

ΠœΡΡ‚Ρ‡ & Π‘ΠΎΠΏΡ€ΠΎΠ²ΠΎΠ΄

Для мэтча с этой вакансиСй Π½ΡƒΠΆΠ΅Π½ Plus

ОписаниС вакансии

ВСкст:
/

TL;DR

Helix AI Engineer, Video Pretraining (AI): Leading the development of large-scale video foundation models trained on diverse real-world and robot-collected data with an accent on pretraining models that learn from raw video. Focus on capturing motion, interaction, and temporal structure to enable downstream capabilities in perception, prediction, and embodied reasoning.

Location: Requires 5 days/week in-office collaboration in San Jose, CA

Company

hirify.global is an AI robotics company developing autonomous general-purpose humanoid robots.

What you will do

  • Design and train large-scale video foundation models on diverse datasets.
  • Develop pretraining strategies that capture temporal dynamics, motion, and object interaction from raw video sequences.
  • Build models that learn transferable representations for downstream tasks.
  • Explore architectures for video understanding and generation.
  • Implement efficient data pipelines and training strategies for high-throughput video ingestion and large-scale distributed training.
  • Design evaluation frameworks and benchmarks to measure temporal understanding, prediction quality, and generalization.

Requirements

  • Experience training large-scale models on video data or other high-dimensional sequential modalities.
  • Strong understanding of modern deep learning architectures for video, vision, or multimodal systems.
  • Experience with large-scale pretraining, including dataset curation, training dynamics, and scaling laws.
  • Proficiency in Python and deep learning frameworks such as PyTorch.
  • Experience working with distributed training systems and large GPU clusters.
  • Solid software engineering skills and ability to build scalable, reliable systems.

Nice to have

  • Experience working on frontier video models or multimodal foundation models.
  • Background in video diffusion, autoregressive video modeling, or world models.
  • Experience at leading AI labs such as OpenAI, Google DeepMind, Google, ByteDance, Midjourney, or Adobe.
  • Familiarity with robotics, embodied AI, or learning from egocentric / first-person video.

Culture & Benefits

  • Developing core AI systems that power humanoid autonomy.

Π‘ΡƒΠ΄ΡŒΡ‚Π΅ остороТны: Ссли Ρ€Π°Π±ΠΎΡ‚ΠΎΠ΄Π°Ρ‚Π΅Π»ΡŒ просит Π²ΠΎΠΉΡ‚ΠΈ Π² ΠΈΡ… систСму, ΠΈΡΠΏΠΎΠ»ΡŒΠ·ΡƒΡ iCloud/Google, ΠΏΡ€ΠΈΡΠ»Π°Ρ‚ΡŒ ΠΊΠΎΠ΄/ΠΏΠ°Ρ€ΠΎΠ»ΡŒ, Π·Π°ΠΏΡƒΡΡ‚ΠΈΡ‚ΡŒ ΠΊΠΎΠ΄/ПО, Π½Π΅ Π΄Π΅Π»Π°ΠΉΡ‚Π΅ этого - это мошСнники. ΠžΠ±ΡΠ·Π°Ρ‚Π΅Π»ΡŒΠ½ΠΎ ΠΆΠΌΠΈΡ‚Π΅ "ΠŸΠΎΠΆΠ°Π»ΠΎΠ²Π°Ρ‚ΡŒΡΡ" ΠΈΠ»ΠΈ ΠΏΠΈΡˆΠΈΡ‚Π΅ Π² ΠΏΠΎΠ΄Π΄Π΅Ρ€ΠΆΠΊΡƒ. ΠŸΠΎΠ΄Ρ€ΠΎΠ±Π½Π΅Π΅ Π² Π³Π°ΠΉΠ΄Π΅ β†’