PhD Research Intern (Multimodal AI, Audio)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
PhD Research Intern (Multimodal AI, Audio): Develop innovative AI algorithms to process audio and/or video data for audio-visual quality evaluation, audio-visual content analysis, and multimodal representations with an accent on multimodal machine learning, multimodal generative AI, and audio-visual quality evaluation pipelines. Focus on building and prototyping deep learning architectures for audio/video and contributing to new formats and evaluation pipelines for generative AI media.
Company
’ Advanced Technology Group (ATG) researches and builds technologies for next-generation entertainment experiences across audio, imaging, and cloud.
What you will do
- Develop AI algorithms to process audio and/or video data for audio-visual quality evaluation and audio-visual content analysis.
- Build multimodal representations to support new formats and evaluation pipelines for generative AI media.
- Collaborate with research scientists/engineers/AI researchers across multiple locations within the Multimodal Processing Team.
- Prototype quickly and iterate on deep learning approaches for audio and/or video applications.
Requirements
- Working towards a Ph.D. degree in Artificial Intelligence, Electrical Engineering, Computer Science, or a related field (recent grads within six months of graduation are eligible).
- Experience developing and training deep learning architectures.
- Experience with deep learning architectures for audio and/or video applications.
- Programming experience in Python and experience with PyTorch or TensorFlow.
- First-author publications at top-tier peer-reviewed AI conferences (e.g., CVPR, ICCV, ECCV, NeurIPS, ICML, ICLR, InterSpeech, ICASSP).
- Must be available to work full-time, Monday to Friday, for 12 weeks between September 2026 and December 2026 (start date: September 21, 2026).
Culture & Benefits
- Project-based internship experience with exposure to Dolby technology.
- Collaborative, creative environment with a diverse and welcoming culture.
- Work on real-world projects with impact used by millions of people daily.
- Potential to publish and/or patent innovations.
Hiring process
- Applications reviewed on a rolling basis; submit by June 26, 2026 for best consideration.
- Recruiter shares the specific hourly range and location-based perks during the hiring process.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →