Company hidden
15 hours ago

Senior AI Engineer (LLMs)

$146,000 - $236,000
Work format
remote (USA only)
Employment type
full-time
Grade
senior
English
B2
Country
US
Vacancy from Hirify Global, a list of international tech companies

Job description

TL;DR

Senior AI Engineer (LLMs): Build and own the LLM + retrieval + context layer that powers copilots across critical workflows, with an emphasis on making these copilots accurate, auditable, fast, and cost-efficient. Focus on production RAG, context graph development, LLM orchestration, and GPU/inference cost optimization.

Location: Remote (US)

Salary: $146,000–$236,000

Company

hirify.global is a technology-driven company streamlining the life insurance process, making it more accessible and convenient by leveraging predictive analytics and data science.

What you will do

  • Own the LLM + retrieval + context layer for LLM-powered copilots, ensuring accuracy, auditability, speed, and cost-efficiency.
  • Design and ship the end-to-end pipeline: retrieve → assemble context → generate → cite → log/monitor.
  • Improve quality and trust via evaluation, feedback loops, and clear evidence-backed outputs.
  • Implement production RAG, including indexing, retrieval, hybrid search, reranking, query rewriting, grounding, and citations.
  • Develop a context graph for entity resolution, linking, provenance, and multi-hop context.
  • Optimize GPU inference costs through batching, caching/KV reuse, quantization, and autoscaling.

Requirements

  • 7+ years building production systems; 2+ years of hands-on work with LLMs/RAG.
  • Proven RAG experience (embeddings, vector DBs, hybrid search, reranking, eval).
  • Strong backend/distributed systems and observability experience.
  • Track record of shipping in high-stakes environments that demand auditability and correctness.
  • Knowledge graph / entity resolution / provenance systems experience.
  • GPU inference optimization experience (vLLM/TGI/TensorRT-LLM, AWQ/GPTQ quantization, batching).
  • Regulated domain experience (insurance/fintech/healthcare).

Culture & Benefits

  • Dedicated to building a diverse, inclusive, and authentic workplace.
  • Committed to being an equal opportunity employer.
  • US-specific benefits are mentioned but not detailed.
  • Focus on protecting families and rapid scaling within the insurtech industry.

The vacancy text is reproduced from the source.