Назад
Company hidden
4 дня назад

AI Engineer

Формат работы
remote (Global)
Тип работы
fulltime
Грейд
senior
Английский
b2
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

AI Engineer (AI Engineering): Own and evolve the core multi-agent LLM system powering a healthcare AI assistant with an accent on low-latency real-time text and voice communication, multi-agent orchestration, and prompt optimization. Focus on designing and operating complex real-time systems, reasoning and optimization techniques, and continuous evaluation to ensure high quality and reliability.

Location: Remote with options for hybrid work in Barcelona or London

Company

hirify.global develops Qu, a clinician-built AI assistant providing real-time support across healthcare workflows for providers, payers, and pharmaceutical companies.

What you will do

  • Own and manage the architecture, SLAs, latency, and rollouts of Qu’s brain service end to end.
  • Develop low-latency streaming text and voice communication features including VAD, barge-in, and turn-taking.
  • Implement multi-agent orchestration with planner–executor–critic patterns and coordination protocols.
  • Apply reasoning and optimization techniques such as ReAct, Chain-of-Thought, and Tree-/Graph-of-Thoughts.
  • Optimize programmatic prompts and integrate iterative prompt evolution tools.
  • Build and evaluate high-signal retrieval augmented generation (RAG) pipelines with observability and continuous evaluation.

Requirements

  • Must have 5+ years in ML or backend engineering with recent focus on LLM systems.
  • Expertise in Python, FastAPI, asyncio, and production observability.
  • Experience building or integrating low-latency real-time text/voice systems (LiveKit, Pipecat, WebRTC, SIP).
  • Knowledge of agent patterns and evaluation-driven development.
  • Hands-on with advanced reasoning techniques like ReAct and Chain-of-Thought.
  • Prior startup experience required.

Nice to have

  • Experience with DSPy, MiPRO/GEPA, and LLM evaluation tooling.
  • Familiarity with WebRTC/SRTP, jitter buffers, SIP basics, and TURN/SFU tuning.
  • Experience with GCP services including Cloud Run, GKE, Pub/Sub, Vertex AI, and Cloud Logging.
  • Healthcare data domain knowledge.

Culture & Benefits

  • Work on cutting-edge real-time agent technology in healthtech.
  • Fun off-sites in Barcelona.
  • High-tech laptop and ergonomic development setup.
  • Flexible work options: remote or hybrid in Barcelona/London.

Будьте осторожны: если вас просят войти в iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →