Назад
Company hidden
15 часов назад

LLMOps Engineer (AI)

Формат работы
remote
Тип работы
fulltime
Грейд
middle
Английский
b2
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

LLMOps Engineer (AI): Deploying, managing, and optimizing large language models in production environments with an accent on scalable GPU infrastructure and security hardening. Focus on architecting auto-scaling inference clusters, implementing LLM observability, and optimizing resource management across multi-cloud environments.

Location: Work From Home (WFH)

Company

hirify.global specializes in building scalable, secure, and high-performance GenAI systems across cloud platforms.

What you will do

  • Deploy and optimize open-source large language models on cloud-based GPU instances.
  • Implement rigorous security protocols for model weight storage, endpoint encryption, and prompt injection mitigation.
  • Architect auto-scaling inference clusters on Azure and GCP to handle high-concurrency requests with low latency.
  • Build custom observability stacks to track hallucination rates and latency for self-hosted deployments.
  • Manage complex GPU scheduling and cost-optimization strategies across Azure, GCP, and AWS.

Requirements

  • 3+ years of experience in LLMOps or related AI engineering roles.
  • Professional experience deploying GenAI workloads on Azure and GCP.
  • Deep understanding of GPU partitioning and memory-efficient inference.
  • Knowledge of GenAI security, including OWASP for LLMs and secure API design.
  • Experience with LLM frameworks and a capability to document system architecture clearly.

Culture & Benefits

  • Remote work arrangement (Work From Home).
  • Agile environment with an emphasis on rapid iteration of prompts and model parameters.
  • Opportunity to work on cutting-edge GenAI production systems.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →