TL;DR
AI Engineer (AI Engineering): Own and evolve the core multi-agent LLM system powering a healthcare AI assistant with an accent on low-latency real-time text and voice communication, multi-agent orchestration, and prompt optimization. Focus on designing and operating complex real-time systems, reasoning and optimization techniques, and continuous evaluation to ensure high quality and reliability.
Location: Remote with options for hybrid work in Barcelona or London
Company
hirify.global develops Qu, a clinician-built AI assistant providing real-time support across healthcare workflows for providers, payers, and pharmaceutical companies.
What you will do
- Own and manage the architecture, SLAs, latency, and rollouts of Qu’s brain service end to end.
- Develop low-latency streaming text and voice communication features including VAD, barge-in, and turn-taking.
- Implement multi-agent orchestration with planner–executor–critic patterns and coordination protocols.
- Apply reasoning and optimization techniques such as ReAct, Chain-of-Thought, and Tree-/Graph-of-Thoughts.
- Optimize programmatic prompts and integrate iterative prompt evolution tools.
- Build and evaluate high-signal retrieval augmented generation (RAG) pipelines with observability and continuous evaluation.
Requirements
- Must have 5+ years in ML or backend engineering with recent focus on LLM systems.
- Expertise in Python, FastAPI, asyncio, and production observability.
- Experience building or integrating low-latency real-time text/voice systems (LiveKit, Pipecat, WebRTC, SIP).
- Knowledge of agent patterns and evaluation-driven development.
- Hands-on with advanced reasoning techniques like ReAct and Chain-of-Thought.
- Prior startup experience required.
Nice to have
- Experience with DSPy, MiPRO/GEPA, and LLM evaluation tooling.
- Familiarity with WebRTC/SRTP, jitter buffers, SIP basics, and TURN/SFU tuning.
- Experience with GCP services including Cloud Run, GKE, Pub/Sub, Vertex AI, and Cloud Logging.
- Healthcare data domain knowledge.
Culture & Benefits
- Work on cutting-edge real-time agent technology in healthtech.
- Fun off-sites in Barcelona.
- High-tech laptop and ergonomic development setup.
- Flexible work options: remote or hybrid in Barcelona/London.
Будьте осторожны: если вас просят войти в iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →