Эта вакансия в архиве
Посмотреть похожие вакансии ↓2 месяца назад
Senior Ai Inference Engineer (Ai)
Описание вакансии
Текст:
TL;DR
Senior AI Inference Engineer (AI): Owning the inference backbone behind QVAC's local AI stack, focusing on the C++ systems layer for fast and reliable model execution on user hardware with an accent on runtime quality, startup behavior, memory pressure, and throughput/latency balance. Focus on low-level problem solving and building infrastructure that other teams trust in production, enabling private, on-device AI experiences.
Location: London, England, United Kingdom. 100% Remote
Company
is building cutting-edge solutions that empower businesses to seamlessly integrate reserve-backed tokens across blockchains.
What you will do
- Deploy machine learning models to edge devices using frameworks like llama.cpp, ggml, and onnx.
- Collaborate with researchers to code, train, and transition models from research to production.
- Integrate AI features into existing products.
Requirements
- Excellent programming skills in C++.
- Strong experience with Llama.cpp and ggml inference engines.
- Good understanding of deep learning concepts and model architectures.
- Experience with transformers, LLMs, and Diffusion models.
- Ability to rapidly assimilate new technologies and techniques.
- A degree in Computer Science, AI, Machine Learning, or a related field.
Nice to have
- Experience with Javascript/Typescript.
- Understanding of p2p technology.
- Experience with Vulkan, Metal, and OpenCL.
- Experience with productionized models.
Culture & Benefits
- Global talent powerhouse working remotely.
- Opportunity to make a mark in the fintech space and collaborate with bright minds.
- Fast-growing, lean company and a leader in the industry.
- Excellent English communication skills are essential.