Назад
Company hidden
обновлено 1 месяц назад

Generative AI Inference Engineer

Формат работы
remote (только USA)
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Generative AI Inference Engineer (AI): Develop and optimize multi-modal generative AI inference systems with an accent on diffusion model architectures and high-performance computing. Focus on designing scalable inference pipelines, model tuning, deployment, and collaboration with cloud providers to deliver hosted AI solutions.

Location: Remote (United States)

Company

hirify.global is a leading company specializing in generative AI technologies and high-performance computing for creative AI applications.

What you will do

  • Lead design and development of customer-facing multi-modal ML inference systems.
  • Collaborate with Platform and Inference teams on optimization, tuning, and deployment of next-generation models.
  • Partner with cloud providers to deliver hosted inference solutions.
  • Drive business impact through strategic machine learning initiatives.
  • Prototype and productionize inference platform improvements and new features.

Requirements

  • Location: Must be based in the United States or able to work remotely within the US
  • 7+ years experience in productionizing machine learning systems and inference pipelines.
  • Expertise in python services, pytorch, and high-performance inference frameworks.
  • Deep understanding of diffusion architectures and GPU profiling tools.
  • Experience with cloud orchestration (Kubernetes) and cloud providers (AWS, GCP, Azure).
  • Strong communication and collaboration skills.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →