Назад
Company hidden
3 дня назад

PhD Research Intern (Multimodal AI, Audio)

53$
Тип работы
fulltime
Грейд
trainee
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

PhD Research Intern (Multimodal AI, Audio): Develop innovative AI algorithms to process audio and/or video data for audio-visual quality evaluation, audio-visual content analysis, and multimodal representations with an accent on multimodal machine learning, multimodal generative AI, and audio-visual quality evaluation pipelines. Focus on building and prototyping deep learning architectures for audio/video and contributing to new formats and evaluation pipelines for generative AI media.

Company

hirify.global’ Advanced Technology Group (ATG) researches and builds technologies for next-generation entertainment experiences across audio, imaging, and cloud.

What you will do

  • Develop AI algorithms to process audio and/or video data for audio-visual quality evaluation and audio-visual content analysis.
  • Build multimodal representations to support new formats and evaluation pipelines for generative AI media.
  • Collaborate with research scientists/engineers/AI researchers across multiple locations within the Multimodal Processing Team.
  • Prototype quickly and iterate on deep learning approaches for audio and/or video applications.

Requirements

  • Working towards a Ph.D. degree in Artificial Intelligence, Electrical Engineering, Computer Science, or a related field (recent grads within six months of graduation are eligible).
  • Experience developing and training deep learning architectures.
  • Experience with deep learning architectures for audio and/or video applications.
  • Programming experience in Python and experience with PyTorch or TensorFlow.
  • First-author publications at top-tier peer-reviewed AI conferences (e.g., CVPR, ICCV, ECCV, NeurIPS, ICML, ICLR, InterSpeech, ICASSP).
  • Must be available to work full-time, Monday to Friday, for 12 weeks between September 2026 and December 2026 (start date: September 21, 2026).

Culture & Benefits

  • Project-based internship experience with exposure to Dolby technology.
  • Collaborative, creative environment with a diverse and welcoming culture.
  • Work on real-world projects with impact used by millions of people daily.
  • Potential to publish and/or patent innovations.

Hiring process

  • Applications reviewed on a rolling basis; submit by June 26, 2026 for best consideration.
  • Recruiter shares the specific hourly range and location-based perks during the hiring process.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →