Назад
Company hidden
4 дня назад

Staff Backend Software Engineer (AI Platform)

166 000 - 225 000$
Формат работы
onsite
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify RU Global, списка компаний с восточно-европейскими корнями
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Staff Backend Software Engineer (AI Platform): Designing and implementing core systems and APIs for hirify.global Model Serving, ensuring scalability, reliability, and operational excellence with an accent on optimizing performance, throughput, and autoscaling for CPU and GPU serving workloads. Focus on improving latency, availability, and cost-effectiveness across customer-facing and foundational serving layers.

Location: San Francisco, California

Salary: $166,000 — $225,000 USD

Company

hirify.global is the data and AI company that provides a unified platform for data, analytics, and AI, trusted by over 10,000 organizations worldwide.

What you will do

  • Design and implement core systems and APIs that power hirify.global Model Serving, ensuring scalability, reliability, and operational excellence.
  • Drive architectural decisions and trade-offs to optimize performance, throughput, autoscaling, and operational efficiency for CPU and GPU serving workloads.
  • Contribute to key components across the serving infrastructure, from model container builds and deployment workflows to runtime systems like routing, caching, and intelligent autoscaling.
  • Collaborate cross-functionally with product, platform, and research teams to translate customer needs into reliable and performant systems.
  • Lead technical initiatives that improve latency, availability, and cost-effectiveness across both customer-facing and foundational serving layers.
  • Establish best practices for code quality, testing, and operational readiness, and mentor other engineers through design reviews and technical guidance.

Requirements

  • 5+ years of experience building and operating large-scale distributed systems.
  • Experience in model serving, inference systems, or related infrastructure (e.g., routing, scheduling, autoscaling, and observability).
  • Strong foundation in algorithms, data structures, and system design as applied to large-scale, low-latency serving systems.
  • Proven ability to deliver technically complex, high-impact initiatives that create measurable customer or business value.
  • Experience building architecture for large-scale, performance-sensitive CPU/GPU inference systems.
  • Strong communication skills and ability to collaborate across teams in fast-moving environments.

Culture & Benefits

  • Committed to fostering a diverse and inclusive culture where everyone can excel.
  • Hiring practices are inclusive and meet equal employment opportunity standards.
  • Consideration without regard to age, color, disability, ethnicity, family or marital status, gender identity or expression, language, national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, socio-economic status, veteran status, and other protected characteristics.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →