Эта вакансия в архиве
Посмотреть похожие вакансии ↓2 месяца назад
Senior AI Inference Engineer
Описание вакансии
Текст:
TL;DR
Senior AI Inference Engineer (AI): Developing and optimizing the inference backbone for QVAC's local AI stack with an accent on runtime quality, performance, and stability. Focus on low-level problem solving, technical ownership, and building reliable infrastructure for private, on-device AI experiences and peer-to-peer AI products.
Location: Remote, based in Barcelona, Spain, with global remote work allowed.
Company
is a leading fintech product company pioneering blockchain-based digital finance solutions, including the world’s most trusted stablecoin USDT and AI-driven data technologies.
What you will do
- Own and engineer the C++ inference systems layer for AI model deployment on edge devices.
- Collaborate with researchers to transition AI models from research to production.
- Integrate advanced AI features into existing products.
- Ensure runtime quality including startup behavior, memory management, throughput, latency, and stability.
- Define and evolve core abstractions for inference features to maintain performance and maintainability.
Requirements
- Must have excellent C++ programming skills; JavaScript experience is a bonus.
- Strong experience with Llama.cpp and ggml inference engines for GPU model deployment.
- Good understanding of deep learning, transformers, LLMs, and diffusion models.
- Degree in Computer Science, AI, Machine Learning or related field with proven AI R&D experience.
- Excellent English communication skills required.
Nice to have
- Experience with JavaScript/TypeScript.
- Understanding of peer-to-peer technology challenges.
- Experience with Vulkan, Metal, or OpenCL.
- Experience productionizing AI models.
Culture & Benefits
- Fully remote global team with flexible work environment.
- Opportunity to work on cutting-edge fintech and AI technologies.
- Collaborate with a global talent pool in a fast-growing product company.