AI Researcher (Multimodal Audio/Video Generation)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
AI Researcher (Multimodal Audio/Video Generation): Developing advanced audio-visual generation models for conversational agents with an accent on diffusion models, neural avatars, and real-time human simulation. Focus on bridging the gap between verbal and non-verbal communication signals and deploying research into production-scale conversational interfaces.
Location: Office presence preferred in San Francisco (hybrid) or London. Remote options available for exceptional candidates within the U.S. or Europe.
Company
is a research lab building AI Humans, a conversational interface designed to create trusted, empathetic digital agents capable of face-to-face interactions.
What you will do
- Research and develop audio-visual generation models for conversational agents.
- Ensure seamless integration between conversation flow and non-verbal signals.
- Experiment with diffusion models, long-video generation, and audio synthesis.
- Collaborate with Applied ML teams to bring research prototypes into production.
- Stay current with advancements in multimodal AI to influence the product roadmap.
Requirements
- PhD in a relevant field or equivalent hands-on research experience.
- Strong foundations in generative modeling and rapid prototyping.
- Deep familiarity with diffusion models and recent efficiency advances.
- Proficiency in PyTorch and GPU-based inference.
- Understanding of video-language models and multimodal generation.
Nice to have
- Experience with long-video or audio generation.
- Skills in 3D graphics or Gaussian splatting.
- Publications in top-tier venues like CVPR, NeurIPS, BMVC, or ICASSP.
- Exposure to software engineering best practices.
Culture & Benefits
- Exposure to cutting-edge research in human-machine interfaces.
- Work in a high-speed, research-driven startup environment.
- Collaborate with a team backed by top-tier investors like Sequoia and Y Combinator.
- Opportunity to see complex research models deployed in real-world production.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →