Updated 1 hour ago
AI Expert (Telco)
Job description
TL;DR
AI Expert (Telco): Architect and deploy end-to-end RAG pipelines that combine retrieval mechanisms with generative models for enterprise use cases, with an emphasis on fine-tuning and optimizing retrieval models. The role also covers designing GPU-optimized, scalable infrastructure for LLM training and inference while ensuring compliance with security and data governance policies.
Location: Warsaw
Company
The employer is a telecommunications company.
What you will do
- Architect and deploy end-to-end RAG pipelines, combining retrieval mechanisms with generative models for enterprise use cases.
- Implement and customize inference servers using vLLM for efficient LLM serving and LiteLLM for lightweight model orchestration.
- Design GPU-optimized, scalable infrastructure for LLM training and inference, ensuring compliance with security and data governance policies.
- Apply techniques like quantization, pruning, and dynamic batching to maximize resource efficiency in resource-constrained on-prem setups.
- Partner with data engineers to curate and preprocess domain-specific datasets for retrieval and generation tasks.
Requirements
- Bachelor’s/Master’s/PhD in Computer Science, AI, or related field.
- 3+ years in ML/NLP roles, with 2+ years focused on RAG systems.
- Proven experience deploying LLMs in on-prem or hybrid environments.
- Proficiency with vLLM, LiteLLM, and open-source LLMs (e.g., Llama 3.2, DeepSeek, Mistral).
- Strong Python expertise with frameworks like PyTorch, Hugging Face Transformers, and LangChain.
- Familiarity with Linux-based systems and Red Hat OpenShift.
Culture & Benefits
- Hybrid work model.