Назад
Company hidden
19 часов Π½Π°Π·Π°Π΄

Senior AI Engineer (AI)

180Β 000 - 200Β 000$
Π€ΠΎΡ€ΠΌΠ°Ρ‚ Ρ€Π°Π±ΠΎΡ‚Ρ‹
onsite
Π’ΠΈΠΏ Ρ€Π°Π±ΠΎΡ‚Ρ‹
fulltime
Π“Ρ€Π΅ΠΉΠ΄
senior
Английский
b2
Π‘Ρ‚Ρ€Π°Π½Π°
US
Вакансия ΠΈΠ· списка Hirify.GlobalВакансия ΠΈΠ· Hirify Global, списка ΠΌΠ΅ΠΆΠ΄ΡƒΠ½Π°Ρ€ΠΎΠ΄Π½Ρ‹Ρ… tech-ΠΊΠΎΠΌΠΏΠ°Π½ΠΈΠΉ
Для мэтча ΠΈ ΠΎΡ‚ΠΊΠ»ΠΈΠΊΠ° Π½ΡƒΠΆΠ΅Π½ Plus

ΠœΡΡ‚Ρ‡ & Π‘ΠΎΠΏΡ€ΠΎΠ²ΠΎΠ΄

Для мэтча с этой вакансиСй Π½ΡƒΠΆΠ΅Π½ Plus

ОписаниС вакансии

ВСкст:
/

TL;DR

Senior AI Engineer (vLLM/RAG): Designing and operating enterprise AI systems across a client portfolio with an accent on inference optimization and end-to-end AI stack implementation. Focus on tuning LLM inference serving, architecting RAG pipelines, and managing high-performance GPU infrastructure.

Location: Must be based in Atlanta, United States

Compensation: $180,000 - $200,000 per year

Company

hirify.global is a leading provider of IT services and solutions specializing in cloud, cybersecurity, infrastructure, and application modernization for enterprise clients.

What you will do

  • Lead end-to-end design and operation of AI systems on AI Factory platforms such as HPE PCAI, Dell AI Factory, and Nutanix Enterprise AI.
  • Engineer and tune LLM inference serving stacks, specifically utilizing vLLM for optimal latency, throughput, and cost.
  • Architect RAG applications with vector databases, focusing on chunking strategies, retrieval tuning, and context-window management.
  • Develop MLOps pipelines covering the model lifecycle, registries, deployment, and observability.
  • Engineer high-performance storage and networking for AI workloads using RDMA fabrics and parallel filesystems.
  • Collaborate directly with client architects and executives to deliver production AI outcomes and provide technical mentorship.

Requirements

  • 7+ years of software, data, or infrastructure engineering, with 3+ years specialized in modern AI/LLM systems.
  • Production-level proficiency in Python, deep Linux system internals, and Docker.
  • Hands-on experience deploying and operating vLLM and AI Factory platforms (HPE, Dell, or Nutanix).
  • Practical experience with vector databases, RAG pipelines, and production-scale prompt engineering.
  • Demonstrated ability to design LLM evaluation harnesses and quality metrics.
  • Location: Based in Atlanta, United States

Nice to have

  • Experience with GPU drivers, CUDA toolchains, and NVIDIA AI Enterprise software stack.
  • Familiarity with Ray for distributed training and inference scaling.
  • Certified Kubernetes Administrator (CKA) or CKAD certifications.
  • Knowledge of LoRA/QLoRA/PEFT and supervised fine-tuning workflows.
  • Experience with Infrastructure as Code (Terraform, Ansible, Helm).

Culture & Benefits

  • Opportunity to work at the center of enterprise AI investment and cutting-edge AI Factory platforms.
  • High-impact role with direct engagement across a diverse client portfolio.
  • Commitment to technical excellence through mentorship and continuous practice improvement.
  • Collaborative environment focused on conquering IT complexity through innovation.

Π‘ΡƒΠ΄ΡŒΡ‚Π΅ остороТны: Ссли Ρ€Π°Π±ΠΎΡ‚ΠΎΠ΄Π°Ρ‚Π΅Π»ΡŒ просит Π²ΠΎΠΉΡ‚ΠΈ Π² ΠΈΡ… систСму, ΠΈΡΠΏΠΎΠ»ΡŒΠ·ΡƒΡ iCloud/Google, ΠΏΡ€ΠΈΡΠ»Π°Ρ‚ΡŒ ΠΊΠΎΠ΄/ΠΏΠ°Ρ€ΠΎΠ»ΡŒ, Π·Π°ΠΏΡƒΡΡ‚ΠΈΡ‚ΡŒ ΠΊΠΎΠ΄/ПО, Π½Π΅ Π΄Π΅Π»Π°ΠΉΡ‚Π΅ этого - это мошСнники. ΠžΠ±ΡΠ·Π°Ρ‚Π΅Π»ΡŒΠ½ΠΎ ΠΆΠΌΠΈΡ‚Π΅ "ΠŸΠΎΠΆΠ°Π»ΠΎΠ²Π°Ρ‚ΡŒΡΡ" ΠΈΠ»ΠΈ ΠΏΠΈΡˆΠΈΡ‚Π΅ Π² ΠΏΠΎΠ΄Π΄Π΅Ρ€ΠΆΠΊΡƒ. ΠŸΠΎΠ΄Ρ€ΠΎΠ±Π½Π΅Π΅ Π² Π³Π°ΠΉΠ΄Π΅ β†’