Назад
Company hidden
3 часа назад

Senior ML Solutions Architect (AI)

215 000 - 275 000$
Формат работы
remote (только USA)
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
US
Вакансия из списка Hirify.GlobalВакансия из Hirify RU Global, списка компаний с восточно-европейскими корнями
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Senior ML Solutions Architect (AI): Designing and implementing customized LLM-based solutions and architecting scalable AI applications using hirify.global Token Factory's serverless inference platform with an accent on prompt engineering, RAG architectures, and model optimization. Focus on building production-ready multimodal LLM applications and guiding customers from POC to production with a focus on performance, reliability, and cost efficiency.

Location: United States

Salary: $215,000–$275,000 OTE

Company

hirify.global is a cloud computing company focused on serving the global AI economy by providing tools and resources for AI/ML challenges.

What you will do

  • Design and implement LLM-based solutions to drive business value and support customer goals.
  • Build production-ready applications using hirify.global Token Factory’s serverless LLM APIs, including multimodal and domain-specific models.
  • Provide technical expertise in prompt engineering, RAG architectures, model selection, and inference optimization.
  • Collaborate with product and engineering teams to integrate customer feedback and shape the platform roadmap.
  • Guide customers in scaling AI applications from POC to production, focusing on performance, reliability, and cost efficiency.

Requirements

  • 5+ years of experience in ML/AI systems, with at least 2 years focused on LLMs and generative AI.
  • Deep knowledge of the LLM ecosystem, including model architectures and fine-tuning approaches.
  • Hands-on experience with prompt engineering, LLM pipeline development, and evaluation.
  • Experience with agentic frameworks like Langchain, Langsmith, or smolagents.
  • Proficiency with vector databases and RAG implementation patterns.
  • Strong Python programming skills and excellent communication abilities.

Nice to have

  • Experience with inference frameworks and libraries (e.g., vLLM, SGLang, TensorRT-LLM, Transformers).
  • Familiarity with inference optimization techniques (quantization, batching, caching).
  • Work with multimodal AI models (vision-language, speech).
  • Proficiency with DevOps tools (Docker, Kubernetes).
  • Contributions to open-source ML/AI projects.

Culture & Benefits

  • 100% company-paid medical, dental, and vision coverage for employees and families.
  • Up to 4% 401(k) company match with immediate vesting.
  • 20 weeks paid parental leave for primary caregivers, 12 weeks for secondary caregivers.
  • Up to $85/month reimbursement for mobile and internet.
  • Company-paid short-term, long-term, and life insurance coverage.
  • Flexible working arrangements and opportunities for professional growth.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →

Текст вакансии взят без изменений

Источник - загрузка...