Principal AI/ML Engineer (AI)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Principal AI/ML Engineer (AI/ML): Building and optimizing the AI layer for a large European digital-services ecosystem with an accent on agentic AI orchestration and production-grade LLM deployment. Focus on designing low-latency voice AI pipelines, fine-tuning domain-specific models, and establishing high-standard MLOps observability.
Location: Fully Remote, but must have right to work in Belgium, Germany, Italy, Spain, or Greece
Company
is an ecosystem of 60+ brands providing hosting, domains, and SaaS services to 3.5 million customers across Europe.
What you will do
- Architect and evolve multi-agent orchestration platforms and tool-use pipelines.
- Design low-latency voice AI pipelines (STT/TTS) with sub-300ms end-to-end targets.
- Build RAG pipelines with hybrid search, re-ranking, and retrieval quality measurement.
- Fine-tune and evaluate LLMs using LoRA, QLoRA, and DPO for domain-specific tasks.
- Own the AI observability stack and enforce safety guardrails (hallucination detection, PII redaction).
- Set architectural standards, conduct design reviews, and mentor ML engineers and scientists.
Requirements
- 8+ years in ML Engineering or Applied AI, including 2+ years in a lead or staff-level role.
- Deep hands-on production experience with LLMs, fine-tuning, RAG, and tool use.
- Proficiency in Python, PyTorch, and HuggingFace Transformers.
- Experience with LLM inference serving (vLLM, TensorRT-LLM, or TGI) in latency-sensitive environments.
- Solid understanding of MLOps, containerization (Docker/Kubernetes), and CI/CD for ML.
- Must provide proof of eligibility to work in the country applied for; no sponsorship or relocation available.
Nice to have
- End-to-end voice pipeline experience (VAD → ASR → LLM → TTS → SIP/RTP).
- Advanced RAG techniques such as multi-hop retrieval or RAPTOR-style hierarchical retrieval.
- Contributions to open-source ML projects or published research (arXiv, NeurIPS, etc.).
- Knowledge of quantization techniques (GPTQ, AWQ, GGUF) and multimodal models.
Culture & Benefits
- Fully remote work arrangement.
- Inclusive culture emphasizing respect, openness, and trusted collaboration.
- Strong commitment to ESG and sustainability goals.
- Opportunity to shape scalable AI impacting millions of users daily.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →