Company hidden
7 months ago

Research Engineer, Infrastructure, Inference (AI)

$350,000 - $475,000
Work format
onsite
Job type
fulltime
English
B2
Country
US

Job description

TL;DR

Research Engineer, Infrastructure, Inference (AI): Design, optimize, and scale the systems that power large AI models, with an emphasis on performant and efficient model inference. The role focuses on collaborating with researchers to improve the performance, latency, and reliability of AI infrastructure.

Location: San Francisco, California

Compensation: $350,000 - $475,000 USD

Company

hirify.global empowers humanity by advancing collaborative general intelligence.

What you will do

  • Work alongside researchers and engineers to bring cutting-edge AI models into production.
  • Collaborate with research teams to enable high-performance inference for novel architectures.
  • Design and implement new techniques, tools, and architectures that improve performance, latency, throughput, and efficiency.
  • Optimize our codebase and compute fleet (e.g., GPUs) to fully utilize hardware FLOPs, bandwidth, and memory.
  • Extend orchestration frameworks (e.g., Kubernetes, Ray, SLURM) for distributed inference, evaluation, and large-batch serving.
  • Publish and share learnings through internal documentation, open-source libraries, or technical reports.

Requirements

  • Bachelor’s degree or equivalent experience in computer science, engineering, or a similar field.
  • Understanding of deep learning frameworks (e.g., PyTorch, JAX) and their underlying system architectures.
  • Experience with inference serving systems optimized for throughput and latency.
  • Strong engineering skills, including the ability to contribute performant, maintainable code and to debug complex codebases.
