Solution Specialist (AI Runtime Services)
Job description
TL;DR
Solution Specialist (AI Runtime Services): Drive the commercial and technical adoption of AI runtime infrastructure, with an emphasis on high-throughput, low-latency model serving and secure execution environments. The role focuses on translating early-adopter needs into product roadmaps and optimizing cost-per-token economics for production-scale AI systems.
Location: Livingston, NJ / New York, NY / Sunnyvale, CA / San Francisco, CA / Bellevue, WA. Must be a U.S. person (citizen, green card holder, etc.) due to export control regulations.
Salary: $207,000 – $275,000
Company
CoreWeave is The Essential Cloud for AI™, providing specialized GPU infrastructure for AI labs and enterprises.
What you will do
- Own the commercial and technical strategy for net new customer wins in AI runtime infrastructure.
- Drive business opportunities focusing on inference latency, throughput, and workload isolation.
- Translate customer requirements for serving frameworks (vLLM, TensorRT-LLM, TGI) into the product roadmap.
- Develop technical playbooks and benchmark narratives to help sales and SA teams accelerate opportunities.
- Design commercial frameworks for large-scale deployments, including throughput modeling and SLAs.
- Partner with product and infrastructure teams to optimize serving efficiency and operational reliability.
Requirements
- 10+ years of experience in distributed systems, ML infrastructure, or production AI engineering.
- 5+ years working with AI runtime systems (model serving, inference optimization) in customer-facing roles.
- Deep knowledge of GPU memory management, serving frameworks (vLLM, TensorRT-LLM, Triton), and batching strategies.
- Experience with sandboxed/isolated execution environments and microVM architectures.
- Proficiency with Kubernetes-native runtime orchestration (autoscaling, GPU operators).
- Must satisfy U.S. Government export control regulations (must be a U.S. person).
Nice to have
- Experience in generative AI, autonomous systems, or financial modeling.
- Background in technical sales, solution consulting, or product management for inference infrastructure.
- Advanced degree in Computer Science, Machine Learning, or Engineering.
Culture & Benefits
- 100% company-paid medical, dental, and vision insurance.
- 401(k) with generous employer match and Employee Stock Purchase Program (ESPP).
- Flexible PTO and comprehensive family-forming support (Carrot, Kinside).
- Mental wellness benefits through Spring Health and tuition reimbursement.
- Daily catered lunch at office and data center locations.