Company hidden
updated 1 hour ago

AI Expert (Telco)

Work format
hybrid
Employment type
full-time
Grade
middle/senior
English
B2
Country
Poland

Job description

TL;DR

AI Expert (Telco): Architect and deploy end-to-end RAG pipelines that combine retrieval mechanisms with generative models for enterprise use cases, with an emphasis on fine-tuning and optimizing retrieval models. Design GPU-optimized, scalable infrastructure for LLM training and inference that complies with security and data governance policies.

Location: Warsaw

Company

hirify.global is a telecommunications company.

What you will do

  • Architect and deploy end-to-end RAG pipelines, combining retrieval mechanisms with generative models for enterprise use cases.
  • Implement and customize inference servers using vLLM for efficient LLM serving and LiteLLM for lightweight model orchestration.
  • Design GPU-optimized, scalable infrastructure for LLM training and inference, ensuring compliance with security and data governance policies.
  • Apply techniques like quantization, pruning, and dynamic batching to maximize resource efficiency in resource-constrained on-prem setups.
  • Partner with data engineers to curate and preprocess domain-specific datasets for retrieval and generation tasks.

Requirements

  • Bachelor’s/Master’s/PhD in Computer Science, AI, or related field.
  • 3+ years in ML/NLP roles, with 2+ years focused on RAG systems.
  • Proven experience deploying LLMs in on-prem or hybrid environments.
  • Proficiency with vLLM, LiteLLM, and open-source LLMs (e.g., Llama 3.2, DeepSeek, Mistral).
  • Strong Python expertise with frameworks like PyTorch, Hugging Face Transformers, and LangChain.
  • Familiarity with Linux-based systems and Red Hat OpenShift.

Culture & Benefits

  • Hybrid work model
