Назад
Company hidden
19 часов назад

Senior ML Solutions Architect (Token Factory)

Формат работы
remote (только Singapore)
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
Singapore
Вакансия из списка Hirify.GlobalВакансия из Hirify RU Global, списка компаний с восточно-европейскими корнями
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Senior ML Solutions Architect (Token Factory): Design and implement customized LLM-based solutions using serverless inference platform for open-source LLMs across multiple modalities with an accent on prompt engineering, RAG architectures, model selection, and inference optimization. Focus on building production-ready applications, scaling from POC to production, and collaborating with product and engineering teams to shape the platform roadmap.

Location: Remote from Singapore

Company

hirify.global leads a new era in cloud computing for the global AI economy, headquartered in Amsterdam with R&D hubs across Europe, North America, and Israel, and over 1400 employees including 400+ skilled engineers.

What you will do

  • Design and implement LLM-based solutions using hirify.global Token Factory’s inference services to drive business value.
  • Build production-ready applications leveraging serverless LLM APIs, including multimodal models (text, vision, audio) and domain-specific models.
  • Provide expertise in prompt engineering, RAG architectures, model selection, and inference optimization.
  • Collaborate with product and engineering teams to incorporate customer feedback and influence platform roadmap.
  • Guide customers in scaling from POC to production, emphasizing performance, reliability, and cost efficiency.

Requirements

  • 5+ years in ML/AI systems, with 2+ years on LLMs and generative AI
  • Deep knowledge of LLM ecosystem, model architectures, and fine-tuning.
  • Hands-on with prompt engineering, LLM pipeline development and evaluation, agentic frameworks (Langchain, Langsmith, smolagents), vector databases, RAG patterns.
  • Experience deploying LLM applications via APIs from OpenAI, Anthropic, or open-source models.
  • Strong Python programming skills.
  • Excellent communication skills to explain technical concepts

Nice to have

  • Experience with inference frameworks (vLLM, SGLang, TensorRT-LLM, Transformers).
  • Inference optimization techniques (quantization, batching, caching, routing).
  • Work with multimodal AI models (vision-language, speech).
  • Proficiency with DevOps tools (Docker, Kubernetes).
  • Contributions to open-source ML/AI projects.

Culture & Benefits

  • Competitive salary and comprehensive benefits package.
  • Opportunities for professional growth.
  • Flexible working arrangements.
  • Dynamic, collaborative environment valuing initiative and innovation.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →