Назад
обновлено 10 дней назад

Software Engineer, Inference (AI)

325 000 - 490 000$
Формат работы
onsite
Тип работы
fulltime
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Software Engineer, Inference (AI): Scaling and optimizing OpenAI’s inference infrastructure across emerging GPU platforms with an accent on advancing inference performance on AMD accelerators. Focus on debugging and optimizing distributed inference workloads across memory, network, and compute layers.

Location: Onsite in San Francisco

Company

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity.

What you will do

  • Own bring-up, correctness and performance of the OpenAI inference stack on AMD hardware.
  • Integrate internal model-serving infrastructure into a variety of GPU-backed systems.
  • Debug and optimize distributed inference workloads.
  • Validate correctness, performance, and scalability of model execution on large GPU clusters.
  • Collaborate with partner teams to design and optimize high-performance GPU kernels.
  • Build, integrate and tune collective communication libraries for model execution.

Requirements

  • Experience writing or porting GPU kernels using HIP, CUDA, or Triton.
  • Familiarity with communication libraries like NCCL/RCCL.
  • Experience with distributed inference systems.
  • Ability to solve end-to-end performance challenges.
  • Excitement to build new infrastructure from first principles.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →