TL;DR
Senior AI Engineer (LLMs): Building and owning the LLM + retrieval + context layer that powers various copilots across critical workflows with an accent on making these copilots accurate, auditable, fast, and cost-efficient. Focus on production RAG, context graph development, LLM orchestration, and GPU/inference cost optimization.
Location: Remote (US)
Salary: $146,000–$236,000
Company
hirify.global is a technology-driven company streamlining the life insurance process, making it more accessible and convenient by leveraging predictive analytics and data science.
What you will do
- Own the LLM + retrieval + context layer for LLM-powered copilots, ensuring accuracy, auditability, speed, and cost-efficiency.
- Design and ship the end-to-end pipeline: retrieve → assemble context → generate → cite → log/monitor.
- Improve quality and trust via evaluation, feedback loops, and clear evidence-backed outputs.
- Implement production RAG, including indexing, retrieval, hybrid search, reranking, query rewriting, grounding, and citations.
- Develop Context Graph for entity resolution, linking, provenance, and multi-hop context.
- Optimize GPU inference costs through batching, caching/KV reuse, quantization, and autoscaling.
Requirements
- 7+ years building production systems; 2+ years hands-on LLMs/RAG.
- Proven RAG experience (embeddings, vector DBs, hybrid search, reranking, eval).
- Strong backend/distributed systems and observability experience.
- Track record shipping in high-stakes environments with auditability/correctness.
- Knowledge graph / entity resolution / provenance systems experience.
- GPU inference optimization experience (vLLM/TGI/TensorRT-LLM, quantization AWQ/GPTQ, batching).
- Regulated domain experience (insurance/fintech/healthcare).
Culture & Benefits
- Dedicated to building a diverse, inclusive, and authentic workplace.
- Committed to being an equal opportunity employer.
- US-specific benefits available (not detailed, but mentioned).
- Focus on protecting families and rapid scaling within the insurtech industry.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →