Назад
Company hidden
обновлено 2 дня назад

Software Engineer, Inference - Performance Optimization (AI)

295 000 - 555 000$
Формат работы
onsite
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Software Engineer, Inference - Performance Optimization (AI): Building and optimizing the inference stack across application, model, and fleet layers with an accent on reducing latency and cost-to-serve. Focus on developing high-fidelity performance models, identifying system bottlenecks, and optimizing hardware efficiency.

Location: San Francisco, USA

Salary: $295K – $555K + Equity

Company

hirify.global is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity.

What you will do

  • Build and refine performance models that translate microbenchmark results into cost-to-serve estimates.
  • Analyze end-to-end inference workloads across applications, models, and fleet infrastructure.
  • Enhance tooling to identify bottlenecks across layers for latency and throughput.
  • Partner with cross-functional teams to turn performance insights into concrete improvements.
  • Project how future architectural changes affect inference performance and capacity.

Requirements

  • Deep expertise in performance profiling, benchmarking, analysis, and optimization.
  • Strong ability to reason from first principles about distributed systems and model inference.
  • Experience working across abstraction layers, from application behavior to kernels, accelerators, and networking.
  • Knowledge of fleet scheduling and hardware efficiency.
  • Must be based in or authorized to work in the US (San Francisco).

Culture & Benefits

  • Opportunity to work at the forefront of AI research and deployment.
  • Competitive compensation package including base salary and equity.
  • Collaborative environment working with world-class engineering and research teams.
  • Commitment to safety and human-centric AI development.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →