Staff / Principal Machine Learning Engineer (AI)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Staff / Principal Machine Learning Engineer (AI Serving): Developing and optimizing high-performance real-time multimodal model serving systems with an accent on inference optimization and distributed scaling. Focus on reducing latency, increasing throughput, and ensuring production reliability for thousands of concurrent queries.
Location: Must have the legal right to work in the United Kingdom (no visa sponsorship available)
Salary: £140,000 – £200,000
Company
is a product-oriented research lab developing best-in-class real-time multimodal models and a high-performance orchestration platform.
What you will do
- Optimize model serving using frameworks such as vLLM or TRT-LLM.
- Implement model acceleration via quantization, distillation, caching strategies, and speculative decoding.
- Develop high-performance systems using C++, CUDA, Rust, or highly optimized Python.
- Scale inference across multi-GPU and multi-node environments using Kubernetes and Ray.
- Take models from research to production, handling containerization and ensuring stability.
Requirements
- Legal right to work in the United Kingdom is required.
- Deep understanding of modern serving frameworks and inference optimization techniques.
- Proficiency in high-performance languages (C++, CUDA, Rust, or optimized Python).
- Experience with distributed systems and handling thousands of concurrent connections.
- PhD in CS, Physics, Math, or equivalent practical experience building backend/ML systems.
- Demonstrable track record through public projects or open-source contributions.
Culture & Benefits
- Flat organizational structure with fast iterations and minimal process overhead.
- Environment that values ownership, autonomy, and a bias for impact.
- Support for open-source contributions that advance the field of AI.
- Compensation includes base salary, equity, and benefits.
- Future relocation support and visa sponsorship for the San Francisco Bay Area may be available.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →