Company hidden
3 days ago

AI Field Engineer (Microsoft Foundry)

$280,000 - $320,000
Job type: full-time
Grade: senior
English: C1
Country: US

Job description

TL;DR

AI Field Engineer (Azure/LLM): Develop and deploy generative AI infrastructure for the Microsoft ecosystem, with an emphasis on reference architectures, inference optimization, and partner integrations. The role focuses on building production-ready POCs, tuning LLM serving frameworks, and aligning technical roadmaps between hirify.global and Azure Foundry.

Location: San Mateo, CA

Salary: $280,000 - $320,000 USD

Company

hirify.global is a Series C generative AI infrastructure company building the industry's fastest and most scalable LLM inference platform.

What you will do

  • Lead technical co-sell motions with Microsoft, creating reference architectures and integration patterns for Azure Foundry.
  • Build and deploy end-to-end POCs and MVPs within partner codebases and infrastructure.
  • Conduct load testing and tune deployments using vLLM and SGLang to optimize latency and throughput.
  • Guide customers on model selection and execute fine-tuning pipelines using SFT, DPO, and RFT.
  • Translate partner feedback and product gaps into technical requirements for the Fireworks engineering team.

Requirements

  • 3+ years of experience in pre-sales, partner engineering, or forward-deployed technical roles.
  • Proficiency in Python and experience with Kubernetes and infrastructure engineering.
  • Hands-on expertise with LLM inference, including quantization and function calling.
  • Practical experience with fine-tuning techniques, specifically LoRA.
  • Deep familiarity with the Azure AI stack, including Azure Foundry, Azure OpenAI, and AKS.
  • Ability to work from San Mateo, CA.

Nice to have

  • 5+ years of experience managing technical relationships with hyperscalers or major SIs.
  • Experience with TensorRT-LLM or other inference serving frameworks.
  • Prior experience at a hyperscaler, AI-native cloud, or inference provider.
  • Knowledge of agentic frameworks such as LangChain or LlamaIndex.
  • Proven track record of taking GenAI POCs from prototype to production scale.

Culture & Benefits

  • Opportunity to solve complex problems in AI infrastructure and low-latency inference.
  • Direct impact on the future of AI in a fast-growing, non-bureaucratic environment.
  • Collaboration with world-class engineers and AI researchers.
  • Competitive salary and meaningful equity in a Series C startup.
  • Comprehensive benefits package.
