Назад
Company hidden
15 часов назад

Vp Of Product, Research And Training Infrastructure (AI)

233 000 - 341 000$
Формат работы
remote (только USA)/hybrid
Тип работы
fulltime
Грейд
director
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Vp Of Product, Research And Training Infrastructure (AI): Owning the product strategy and engineering execution for services that power AI research labs with an accent on specialized orchestration, evaluation, and iteration tools. Focus on building infrastructure for Reinforcement Learning (RL) and RLHF pipelines, enabling labs to refine foundation models with maximum efficiency.

Location: Livingston, NJ / New York, NY / Sunnyvale, CA / Bellevue, WA. While we prioritize a hybrid work environment, remote work may be considered for candidates located more than 30 miles from an office, based on role requirements for specialized skill sets.

Salary: $233,000 to $341,000. The starting salary will be determined based on job-related knowledge, skills, experience, and market location.

Company

hirify.global is The Essential Cloud for AI™, delivering a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence.

What you will do

  • Oversee the evolution of SUNK (Slurm on Kubernetes) to provide researchers with deterministic, bare-metal performance through a cloud-native interface.
  • Drive the development of next-generation orchestrators and automated training-based evaluation frameworks that ensure model quality throughout the lifecycle.
  • Build the infrastructure required for sophisticated Reinforcement Learning (RL) and RLHF pipelines, enabling labs to refine foundation models with maximum efficiency.
  • Act as the primary technical partner for lead researchers at global AI labs, translating their "future-state" requirements into actionable product roadmaps.

Requirements

  • U.S. person, defined as a (i) U.S. citizen or national, (ii) U.S. lawful permanent resident (green card holder), (iii) refugee under 8 U.S.C. § 1157, or (iv) asylee under 8 U.S.C. § 1158, (B) eligible to access the export controlled information without a required export authorization, or (C) eligible and reasonably likely to obtain the required export authorization from the applicable U.S. government agency.
  • 15+ years of experience in engineering leadership, with at least 5+ years managing large-scale infrastructure at a top-tier research lab or an AI-native cloud provider.
  • Deep, hands-on knowledge of Slurm, Kubernetes, and the specific networking requirements (InfiniBand/RDMA) for distributed training clusters.
  • Background supporting frontier model research (pre-training and post-training) and understand the "pain points" of a research scientist.
  • Track record of delivering mission-critical services on multi-thousand GPU clusters (H100/Blackwell/Rubin architectures).
  • Ability to define "what’s next" in the AI stack, from automated RL loops to specialized sandbox environments.

Culture & Benefits

  • Medical, dental, and vision insurance - 100% paid for by hirify.global.
  • Flexible Spending Account and Health Savings Account.
  • Tuition Reimbursement and Employee Stock Purchase Program (ESPP).
  • Mental Wellness Benefits through Spring Health and Family-Forming support provided by Carrot.
  • Flexible PTO, catered lunch each day in our office and data center locations, and a casual work environment.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →