TL;DR
Senior ML Solutions Architect (AI): Designing and implementing custom LLM-based solutions for customers using the Token Factory inference platform with an accent on RAG architectures, prompt engineering, and model deployment. Focus on scaling applications from POC to production while optimizing inference performance, reliability, and cost efficiency.
Location: Remote (must be based in Europe)
Company
hirify.global is a cloud computing provider building infrastructure to serve the global AI economy and accelerate the transformation of industries through advanced AI/ML tooling.
What you will do
- Design and implement customized LLM solutions to drive business value for clients.
- Build production-ready applications utilizing serverless LLM APIs for text, vision, and audio models.
- Provide technical leadership in prompt engineering, RAG architectures, and inference strategy.
- Collaborate with engineering teams to influence platform roadmaps based on customer feedback.
- Guide customers through scaling initiatives from initial POC to production-grade deployments.
Requirements
- 5+ years of experience in ML/AI systems.
- At least 2 years of focused experience with LLMs and generative AI.
- Strong proficiency in Python.
- Deep understanding of the LLM ecosystem including fine-tuning and model architectures.
- Hands-on experience with vector databases, agentic frameworks like Langchain, and API development.
- Excellent communication skills for conveying technical concepts to diverse stakeholders.
Nice to have
- Experience with inference frameworks like vLLM, SGLang, or TensorRT-LLM.
- Knowledge of optimization techniques such as quantization, caching, and routing.
- Familiarity with DevOps/MLOps tools like Docker and Kubernetes.
- Contributions to open-source ML/AI projects.
Culture & Benefits
- Competitive compensation and comprehensive benefits package.
- Support for professional development and growth within a scaling tech organization.
- Flexible work arrangements.
- Collaborative environment focused on cutting-edge AI cloud infrastructure.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →