AI Field Engineer (Microsoft Foundry)

280 000 - 320 000$

Тип работы

fulltime

Грейд

senior

Английский

Страна

Вакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:

TL;DR

AI Field Engineer (Azure/LLM): Developing and deploying generative AI infrastructure for the Microsoft ecosystem with an accent on reference architectures, inference optimization, and partner integrations. Focus on building production-ready POCs, tuning LLM serving frameworks, and aligning technical roadmaps between hirify.global and Azure Foundry.

Location: San Mateo, CA

Salary: $280,000 - $320,000 USD

Company

hirify.global is a Series C generative AI infrastructure company building the industry's fastest and most scalable LLM inference platform.

What you will do

Lead technical co-sell motions with Microsoft, creating reference architectures and integration patterns for Azure Foundry.
Build and deploy end-to-end POCs and MVPs within partner codebases and infrastructure.
Conduct load testing and tune deployments using vLLM and SGLang to optimize latency and throughput.
Guide customers on model selection and execute fine-tuning pipelines using SFT, DPO, and RFT.
Translate partner feedback and product gaps into technical requirements for the Fireworks engineering team.

Requirements

3+ years of experience in pre-sales, partner engineering, or forward-deployed technical roles.
Proficiency in Python and experience with Kubernetes and infrastructure engineering.
Hands-on expertise with LLM inference, including quantization and function calling.
Practical experience with fine-tuning techniques, specifically LoRA.
Deep familiarity with the Azure AI stack, including Azure Foundry, Azure OpenAI, and AKS.
Ability to work from San Mateo, CA.

Nice to have

5+ years of experience managing technical relationships with hyperscalers or major SIs.
Experience with TensorRT-LLM or other inference serving frameworks.
Prior experience at a hyperscaler, AI-native cloud, or inference provider.
Knowledge of agentic frameworks such as LangChain or LlamaIndex.
Proven track record of taking GenAI POCs from prototype to production-scale.

Culture & Benefits

Opportunity to solve complex problems in AI infrastructure and low-latency inference.
Direct impact on the future of AI in a fast-growing, non-bureaucratic environment.
Collaboration with world-class engineers and AI researchers.
Competitive salary and meaningful equity in a Series C startup.
Comprehensive benefits package.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →