Posted 5 days ago

Solution Specialist (AI Runtime Services)

$207,000 – $275,000
Work format: onsite
Employment type: full-time
Grade: senior
English: B2
Country: US
Vacancy from Hirify Global, a list of international tech companies

Job description

TL;DR

Solution Specialist (AI Runtime Services): Driving the commercial and technical adoption of AI runtime infrastructure, with an emphasis on high-throughput, low-latency model serving and secure execution environments. The role focuses on translating early adopter needs into product roadmaps and optimizing cost-per-token economics for production-scale AI systems.

Location: Livingston, NJ / New York, NY / Sunnyvale, CA / San Francisco, CA / Bellevue, WA. Must be a U.S. person (citizen, green card holder, etc.) due to export control regulations.

Salary: $207,000 – $275,000

Company

CoreWeave is The Essential Cloud for AI™, providing specialized GPU infrastructure for AI labs and enterprises.

What you will do

  • Own the commercial and technical strategy for net new customer wins in AI runtime infrastructure.
  • Drive business opportunities centered on inference latency, throughput, and workload isolation.
  • Translate customer requirements for serving frameworks (vLLM, TensorRT-LLM, TGI) into the product roadmap.
  • Develop technical playbooks and benchmark narratives to help sales and SA teams accelerate opportunities.
  • Design commercial frameworks for large-scale deployments, including throughput modeling and SLAs.
  • Partner with product and infrastructure teams to optimize serving efficiency and operational reliability.

Requirements

  • 10+ years of experience in distributed systems, ML infrastructure, or production AI engineering.
  • 5+ years working with AI runtime systems (model serving, inference optimization) in customer-facing roles.
  • Deep knowledge of GPU memory management, serving frameworks (vLLM, TensorRT-LLM, Triton), and batching strategies.
  • Experience with sandboxed/isolated execution environments and microVM architectures.
  • Proficiency with Kubernetes-native runtime orchestration (autoscaling, GPU operators).
  • Must meet U.S. Government export control regulations (must be a U.S. person).

Nice to have

  • Experience in generative AI, autonomous systems, or financial modeling.
  • Background in technical sales, solution consulting, or product management for inference infrastructure.
  • Advanced degree in Computer Science, Machine Learning, or Engineering.

Culture & Benefits

  • 100% company-paid medical, dental, and vision insurance.
  • 401(k) with generous employer match and Employee Stock Purchase Program (ESPP).
  • Flexible PTO and comprehensive family-forming support (Carrot, Kinside).
  • Mental wellness benefits through Spring Health and tuition reimbursement.
  • Daily catered lunch at office and data center locations.
