Company hidden
6 days ago

AI Engineer (Generative AI, Multimodal)

Work format
remote (Global)
Employment type
full-time
Grade
senior
English
B2
Vacancy from the Hirify.Global list (Hirify RU Global, a list of companies with Eastern European roots)

Job description

TL;DR

Senior AI Engineer (Generative AI, Multimodal): Drive architecture development for cutting-edge AI models of various scales, with an emphasis on video generation, multimodal language models, and scalable training pipelines. The role focuses on exploring and implementing novel techniques and algorithms, resolving pre-training bottlenecks, and prototyping generative AI applications.

Location: Remote (Global)

Company

hirify.global pioneers a global financial revolution by empowering businesses with cutting-edge solutions for integrating reserve-backed tokens across blockchains, enhancing digital finance with transparency and innovation.

What you will do

  • Pioneer multimodal and video-centric AI research, contributing to usable prototypes and scalable systems.
  • Design and implement novel AI architectures for multimodal language models, integrating text, visual, and audio.
  • Engineer scalable training and inference pipelines optimized for large-scale multimodal datasets and distributed GPU systems.
  • Optimize systems and algorithms for efficient data processing, model execution, and pipeline throughput.
  • Collaborate cross-functionally with research and engineering teams to translate innovations into production.
  • Prototype generative AI applications showcasing new capabilities of multimodal foundation models.

Requirements

  • Bachelor’s degree in Computer Science, Computer Engineering, or a related technical field, or equivalent practical experience.
  • Expertise in Python and PyTorch, covering the full development pipeline from data processing to training, inference, and optimization.
  • Experience working with large-scale text data, or interleaved data spanning audio, video, image, and/or text.
  • Direct hands-on experience in developing or benchmarking LLMs, Vision Language Models, Audio Language Models, or generative video models.

Nice to have

  • PhD in Computer Vision, Machine Learning, NLP, Computer Science, or Applied Statistics.
  • Demonstrated expertise in computer vision, video generation foundation models, and/or multimodal research.
  • First-author publications at leading AI conferences (e.g., CVPR, ICCV, ICML, ICLR, NeurIPS).

Culture & Benefits

  • Join a global talent powerhouse, working remotely from every corner of the world.
  • Opportunity to make a mark in the fintech space, collaborating with bright minds.
  • Work with a fast-growing, lean, and industry-leading team.
  • Contribute to the most innovative platform on the planet.
