Senior AI Researcher (Multimodal Perception Models)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Senior AI Researcher (Multimodal Perception Models): Leading foundational research on multimodal conversational avatars with an accent on video, audio, and language perception and generation. Focus on designing and training autoregressive and diffusion-based architectures to build real-time human simulation models.
Location: Preferred San Francisco (hybrid) or London. Remote available within the U.S. or Europe.
Company
is a research lab building real-time human simulation models that enable AI avatars to see, hear, and interact with empathy.
What you will do
- Lead research on foundational multimodal models for conversational avatars.
- Build and train models using autoregressive, predictive, and diffusion architectures.
- Design experiments to control visual, auditory, and linguistic responses of avatars.
- Partner with the Applied ML team to translate research into production systems.
- Mentor research team members and define technical roadmaps.
Requirements
- PhD plus 2-3+ years of experience with LLMs, VLMs, or multimodal systems.
- Expertise in sequence modeling for video, audio, and text.
- Strong understanding of autoregressive and diffusion frameworks.
- Proven ability to bridge research and production-grade engineering.
- Strong PyTorch skills.
- Location: Must be based in the U.S. or Europe.
Nice to have
- Publications in top-tier venues (CVPR, ICCV, NeurIPS, ECCV, ACMMM).
- Experience with large-scale model training and optimization.
- Broad familiarity with generative AI paradigms.
Culture & Benefits
- Flexible work schedule.
- Unlimited PTO.
- Competitive healthcare coverage.
- Gear stipends and team-oriented work environment.
- Supportive, fast-growing startup culture focused on innovation.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →