Назад
Company hidden
4 часа назад

AI Field Engineer (AI)

200 000 - 260 000$
Формат работы
remote (только USA)/hybrid/onsite
Тип работы
fulltime
Грейд
senior
Английский
c1
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

AI Field Engineer (AI): Building and deploying production-grade generative AI infrastructure for high-velocity customers with an accent on inference optimization, fine-tuning pipelines, and scalable model serving. Focus on architecting end-to-end solutions, debugging production issues, and translating complex customer requirements into platform-level product improvements.

Location: Must be based in the USA (New York, San Mateo, or Remote USA).

Compensation: $200,000–$260,000 USD + Equity.

Company

A Series C AI infrastructure startup valued at $4B, founded by veterans of Meta PyTorch and Google Vertex AI, focused on high-performance LLM inference and training.

What you will do

  • Build end-to-end POCs and MVPs directly within customer codebases and infrastructure.
  • Architect inference foundations and size deployments to ensure scalability for GenAI-native products.
  • Run load tests and tune deployments using frameworks like vLLM and SGLang to meet latency and throughput targets.
  • Guide customers through model selection, fine-tuning strategies (SFT, DPO, RFT), and evaluation methodologies.
  • Lead discovery conversations and own the technical relationship from initial engagement to production deployment.
  • Translate recurring customer pain points into concrete product proposals and platform improvements.

Requirements

  • 5+ years in a hands-on, customer-facing technical role such as Forward Deployed Engineer, Applied AI Engineer, or Solutions Architect.
  • Strong Python skills with experience reading, writing, and debugging production code.
  • Working knowledge of the LLM stack, including inference trade-offs, model serving, and fine-tuning workflows.
  • Experience with cloud infrastructure (AWS, Azure, GCP) and deploying models on GPU infrastructure.
  • Exceptional communication skills, capable of presenting to VPs and debugging technical issues with ML engineers.
  • Must be based in the USA and comfortable with on-site customer engagements.

Nice to have

  • 10+ years in technical field or engineering roles.
  • Experience with inference serving frameworks like TensorRT-LLM.
  • Prior experience at a company with a forward-deployed engineering model (e.g., Palantir, Scale AI, OpenAI).
  • Track record of taking GenAI POCs from prototype to production-scale.

Culture & Benefits

  • Meaningful equity in a fast-growing, well-funded startup.
  • Opportunity to work with bleeding-edge technology at the forefront of AI infrastructure.
  • Collaborative environment with world-class researchers and engineers.
  • Comprehensive benefits package.
  • High-impact role with minimal bureaucracy.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →