Senior ML Solutions Architect (Token Factory)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Senior ML Solutions Architect (Token Factory): Design and implement customized LLM-based solutions using serverless inference platform for open-source LLMs across multiple modalities with an accent on prompt engineering, RAG architectures, model selection, and inference optimization. Focus on building production-ready applications, scaling from POC to production, and collaborating with product and engineering teams to shape the platform roadmap.
Location: Remote from Singapore
Company
leads a new era in cloud computing for the global AI economy, headquartered in Amsterdam with R&D hubs across Europe, North America, and Israel, and over 1400 employees including 400+ skilled engineers.
What you will do
- Design and implement LLM-based solutions using Token Factory’s inference services to drive business value.
- Build production-ready applications leveraging serverless LLM APIs, including multimodal models (text, vision, audio) and domain-specific models.
- Provide expertise in prompt engineering, RAG architectures, model selection, and inference optimization.
- Collaborate with product and engineering teams to incorporate customer feedback and influence platform roadmap.
- Guide customers in scaling from POC to production, emphasizing performance, reliability, and cost efficiency.
Requirements
- 5+ years in ML/AI systems, with 2+ years on LLMs and generative AI
- Deep knowledge of LLM ecosystem, model architectures, and fine-tuning.
- Hands-on with prompt engineering, LLM pipeline development and evaluation, agentic frameworks (Langchain, Langsmith, smolagents), vector databases, RAG patterns.
- Experience deploying LLM applications via APIs from OpenAI, Anthropic, or open-source models.
- Strong Python programming skills.
- Excellent communication skills to explain technical concepts
Nice to have
- Experience with inference frameworks (vLLM, SGLang, TensorRT-LLM, Transformers).
- Inference optimization techniques (quantization, batching, caching, routing).
- Work with multimodal AI models (vision-language, speech).
- Proficiency with DevOps tools (Docker, Kubernetes).
- Contributions to open-source ML/AI projects.
Culture & Benefits
- Competitive salary and comprehensive benefits package.
- Opportunities for professional growth.
- Flexible working arrangements.
- Dynamic, collaborative environment valuing initiative and innovation.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →