Applied AI Engineer
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Applied AI Engineer (Python/FastAPI): Transform research breakthroughs into production AI systems, building scalable applications spanning agentic automation, custom inference services, and next-generation AI products for enterprise customers. Focus on optimizing latency, throughput, and cost across the model serving stack, implementing evaluation harnesses, observability, guardrails, and productionizing novel techniques.
Location: Remote (US)
Company
AI-native engineering firm specializing in enterprise services, domain-specific solutions, and high-performance infrastructure from strategy through deployment for commercial enterprises and government agencies. Certified Service-Disabled Veteran-Owned Small Business (SDVOSB).
What you will do
- Build and operate production AI services including agent runtimes, RAG pipelines, and custom inference endpoints
- Optimize latency, throughput, and cost across the model serving stack
- Implement evaluation harnesses, observability, and guardrails for AI systems
- Collaborate with research scientists to productionize novel techniques
- Contribute to internal frameworks and reusable agent components
Requirements
- 3+ years building production AI/ML applications
- Strong Python and modern web/API stack experience
- Hands-on experience with at least one model serving framework (vLLM, TGI, TensorRT-LLM, Triton Inference Server)
- Familiarity with agent frameworks (LangGraph, LlamaIndex, AutoGen) or custom orchestration
- Cloud-native deployment (Kubernetes, AWS/GCP/Azure)
Nice to have
- MS or PhD in CS or related; or proven track record of shipping AI systems at scale
- Open-source contributions to inference or agent frameworks
- Experience with quantization, distillation, or other inference optimization techniques
Culture & Benefits
- Professional development and skill-building opportunities
- Supportive management with Veteran-owned company values
- Work on impactful client engagements with long-term contract potential
- AI productivity tools and training provided
- Comprehensive benefits: 401(k) with match, medical/dental/vision insurance, short/long-term disability, PTO, profit sharing
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →