Company hidden
1 day ago

AI Research Scientist (Multimodal Post-Training) (AI)

€71,000 - €110,000 a year
Work format: hybrid
Employment type: full-time
Grade: senior
English: B2
Country: UK/Portugal

Job description

TL;DR

AI Research Scientist (Multimodal post-training): Design and execute research on multimodal model training with an emphasis on vision-language and speech-language models, including fine-tuning, alignment, and post-training methods (SFT, RLHF) tailored for clinical domains. Focus on developing models for unified patient understanding through video, language, and speech, contributing to the full model development cycle, and advancing long-term goals such as real-time patient state estimation and safety validation.

Location: Hybrid across Europe and the UK, with a preference for candidates based in London (UK) or in Lisbon or Porto (Portugal), where the research team and offices are located. Candidates must possess a valid EU visa and be based in Portugal. No relocation assistance.

Salary: €71,000 - €110,000 a year

Company

hirify.global is a clinical-centric AI lab and applied AI platform reimagining healthcare delivery through AI-native, 24/7 care programs in physical therapy, women’s health, mental health, and more.

What you will do

  • Design and execute research on multimodal model training, focusing on vision-language and speech-language models and using fine-tuning, SFT, and RLHF for clinical applications.
  • Develop models that enable AI agents to perceive patients via video, language, and speech for unified understanding.
  • Handle the full model development cycle: dataset curation, architecture design, cross-modal training, evaluation, and iteration.
  • Collaborate with the AI Engineering, Product, and Clinical teams to deploy research into production patient care systems.
  • Pursue ambitious goals such as real-time multimodal estimation, clinical memory, and safety validation, with immediate milestones.
  • Publish in top-tier AI conferences and clinical journals.

Requirements

  • PhD in Computer Science, Machine Learning, NLP, Computer Vision, or related AI field.
  • Hands-on experience fine-tuning LLMs or multimodal models (pre-training, SFT, RLHF, post-training).
  • Experience with multi-modal models (video+language, image+text, speech+text).
  • Strong publication record in peer-reviewed AI conferences/journals.
  • Proficiency in Python and ML frameworks (PyTorch, JAX).
  • Ability to design rigorous experiments and interpret results.
  • AI proficiency of at least Level 1 (daily use for productivity).

Nice to have

  • First-author papers in NeurIPS, ICML, ICLR, CVPR, ACL, etc.
  • Expertise in vision-language, video understanding, speech-language models, multimodal learning.
  • Experience with video/image models in applied settings (pose estimation, action recognition, medical imaging).
  • Work on LLM-based agents, prompt engineering, memory, workflows.
  • Industry experience post-PhD, research-to-production track record.
  • Comfort in high-uncertainty environments, strong cross-functional collaboration.

Culture & Benefits

  • Health, dental, vision insurance, meal allowance, equity shares.
  • Remote work allowance, flexible hours, work from home, discretionary vacation.
  • Snacks, beverages; high-impact research with compute, data, team support.
  • Operate at frontier of research and product; publish while shipping models.
