AI Field Engineer (Microsoft Foundry)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
AI Field Engineer (Azure/LLM): Developing and deploying generative AI infrastructure for the Microsoft ecosystem with an accent on reference architectures, inference optimization, and partner integrations. Focus on building production-ready POCs, tuning LLM serving frameworks, and aligning technical roadmaps between and Azure Foundry.
Location: San Mateo, CA
Salary: $280,000 - $320,000 USD
Company
is a Series C generative AI infrastructure company building the industry's fastest and most scalable LLM inference platform.
What you will do
- Lead technical co-sell motions with Microsoft, creating reference architectures and integration patterns for Azure Foundry.
- Build and deploy end-to-end POCs and MVPs within partner codebases and infrastructure.
- Conduct load testing and tune deployments using vLLM and SGLang to optimize latency and throughput.
- Guide customers on model selection and execute fine-tuning pipelines using SFT, DPO, and RFT.
- Translate partner feedback and product gaps into technical requirements for the Fireworks engineering team.
Requirements
- 3+ years of experience in pre-sales, partner engineering, or forward-deployed technical roles.
- Proficiency in Python and experience with Kubernetes and infrastructure engineering.
- Hands-on expertise with LLM inference, including quantization and function calling.
- Practical experience with fine-tuning techniques, specifically LoRA.
- Deep familiarity with the Azure AI stack, including Azure Foundry, Azure OpenAI, and AKS.
- Ability to work from San Mateo, CA.
Nice to have
- 5+ years of experience managing technical relationships with hyperscalers or major SIs.
- Experience with TensorRT-LLM or other inference serving frameworks.
- Prior experience at a hyperscaler, AI-native cloud, or inference provider.
- Knowledge of agentic frameworks such as LangChain or LlamaIndex.
- Proven track record of taking GenAI POCs from prototype to production-scale.
Culture & Benefits
- Opportunity to solve complex problems in AI infrastructure and low-latency inference.
- Direct impact on the future of AI in a fast-growing, non-bureaucratic environment.
- Collaboration with world-class engineers and AI researchers.
- Competitive salary and meaningful equity in a Series C startup.
- Comprehensive benefits package.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →