TL;DR
Staff Engineer (Machine Learning): Design and build scalable backend services and end-to-end ML workflows including RAG systems, vector search, and model integration with an accent on performance, reliability, and cost optimization. Focus on productionizing ML systems with observability, strong SLO ownership, and collaboration with applied ML teams.
Company
hirify.global is a global digital product engineering company with 17,700+ experts across 39 countries, focused on building inspiring products, services, and experiences at scale.
What you will do
- Design and build core backend services powering AI/ML runtime including orchestration and session management.
- Implement end-to-end retrieval and memory systems covering ingestion, embeddings, indexing, vector search, ranking, caching, and lifecycle management.
- Productionize ML workflows with feature/metadata services, model integration contracts, and evaluation hooks.
- Drive performance, reliability, and cost optimization with strong SLO ownership and observability practices.
- Collaborate with applied ML teams on model routing, prompts/tools, evaluation datasets, and safe releases.
- Lead troubleshooting, root-cause analysis, and POCs to validate technology and design decisions.
Requirements
- Total experience of 5.5 years+
- Strong expertise in Python and backend engineering with scalable microservices experience
- Experience with RAG workflows, vector databases, LLM/NLP engineering, and ML productionization
- Knowledge of REST/gRPC APIs, Docker, Kubernetes, CI/CD, and cloud platforms (AWS/GCP/Azure)
- Bachelor’s or master’s degree in computer science or related field
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →