Назад
Company hidden
5 дней назад

AI Researcher (Multimodal Audio/Video Generation)

Формат работы
remote (Global)/onsite/hybrid
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
UK/US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

AI Researcher (Multimodal Audio/Video Generation): Developing advanced audio-visual generation models for conversational agents with an accent on diffusion models, neural avatars, and real-time human simulation. Focus on bridging the gap between verbal and non-verbal communication signals and deploying research into production-scale conversational interfaces.

Location: Office presence preferred in San Francisco (hybrid) or London. Remote options available for exceptional candidates within the U.S. or Europe.

Company

hirify.global is a research lab building AI Humans, a conversational interface designed to create trusted, empathetic digital agents capable of face-to-face interactions.

What you will do

  • Research and develop audio-visual generation models for conversational agents.
  • Ensure seamless integration between conversation flow and non-verbal signals.
  • Experiment with diffusion models, long-video generation, and audio synthesis.
  • Collaborate with Applied ML teams to bring research prototypes into production.
  • Stay current with advancements in multimodal AI to influence the product roadmap.

Requirements

  • PhD in a relevant field or equivalent hands-on research experience.
  • Strong foundations in generative modeling and rapid prototyping.
  • Deep familiarity with diffusion models and recent efficiency advances.
  • Proficiency in PyTorch and GPU-based inference.
  • Understanding of video-language models and multimodal generation.

Nice to have

  • Experience with long-video or audio generation.
  • Skills in 3D graphics or Gaussian splatting.
  • Publications in top-tier venues like CVPR, NeurIPS, BMVC, or ICASSP.
  • Exposure to software engineering best practices.

Culture & Benefits

  • Exposure to cutting-edge research in human-machine interfaces.
  • Work in a high-speed, research-driven startup environment.
  • Collaborate with a team backed by top-tier investors like Sequoia and Y Combinator.
  • Opportunity to see complex research models deployed in real-world production.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →