Назад
Company hidden
2 дня назад

DevOps/MLOps Engineer (ML / LLM Infrastructure)

Формат работы
remote/onsite
Тип работы
fulltime
Английский
b2
Страна
Ukraine
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

DevOps/MLOps Engineer (ML / LLM Infrastructure): Design, build, and operate scalable ML infrastructure on GCP supporting LLMs and NLP systems with an accent on Kubernetes environments, CI/CD pipelines, and observability. Focus on managing training and inference workloads, optimizing resource utilization, and ensuring reliability across distributed GPU/TPU/CPU setups.

Location: Office or remote — it’s up to you. Remote onboarding.

Company

Ukrainian hybrid IT company, subsidiary of Kyivstar telecom operator, creating technological solutions for businesses and users.

What you will do

  • Design, build, and operate scalable ML infrastructure on GCP (GKE) for experimentation and production LLM/NLP workloads.
  • Manage Kubernetes environments: deployment, scaling, upgrades, and reliability for training/inference across GPU/TPU/CPU.
  • Build and maintain CI/CD pipelines (GitHub Actions, Jenkins) for ML services and infrastructure automation.
  • Implement infrastructure as code (Terraform, Ansible) for secure, reproducible cloud resource management.
  • Ensure observability with monitoring, logging, alerting for ML systems and pipelines.
  • Collaborate with ML and Data Engineers on training/inference pipelines; optimize costs and troubleshoot issues.
  • Contribute to best practices: code reviews, automation, platform reliability, and developer experience.

Requirements

  • 4+ years in DevOps, Platform Engineering, or ML Infrastructure with production distributed systems.
  • Hands-on GCP experience; cloud-native architectures for compute/data workloads.
  • Solid Docker, Kubernetes (GKE), Helm, networking; CI/CD (GitHub Actions, Jenkins).
  • Airflow or similar for orchestration; Terraform or IaC tools.
  • Scripting (Bash/Python); observability (Prometheus, Grafana); ML lifecycle familiarity.
  • Ability to collaborate translating ML needs into scalable infrastructure.

Culture & Benefits

  • Office or remote work choice with remote onboarding.
  • Performance bonuses, health/life insurance, wellbeing program, corporate psychologist.
  • Training via company library, internal resources, partner programs.
  • Reimbursement for Kyivstar mobile communication.
  • Entrepreneurial culture focused on innovation and continuous evolution.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →