TL;DR
Lead AI Inference Engineer (AI): Leading a cross-functional pod to build and optimize local AI capabilities, ensuring reliable performance and seamless integration across devices with an accent on C++ inference engines and JavaScript applications. Focus on deploying machine learning models to edge devices using frameworks like llama.cpp and ggml, integrating advanced AI features into existing products, and guiding middleware/foundation engineers.
Location: Remote (Global)
Company
hirify.global is a pioneering company building cutting-edge solutions for digital finance, driving sustainable growth in energy, and fueling breakthroughs in AI and peer-to-peer technology.
What you will do
- Lead a cross-functional pod to ensure reliable and performant local AI capabilities across devices.
- Deploy machine learning models to edge devices using frameworks like llama.cpp, ggml, and onnx.
- Collaborate with researchers to transition models from research to production environments.
- Integrate advanced AI features into existing products.
- Manage a cross-functional team of engineers (middleware, foundation, QA, documentation).
- Ensure robust architectural choices, code quality, and stable releases by following internal processes.
Requirements
- Excellent programming skills in C++.
- Strong experience with Llama.cpp and ggml inference engines, especially for GPU architectures.
- Good understanding of deep learning concepts and model architectures, including transformers and LLMs.
- Demonstrated ability to rapidly assimilate new technologies and techniques.
- Experience managing a small, specialized, cross-functional team (3-5 people).
- A degree in Computer Science, AI, Machine Learning, or a related field with a solid track record in AI R&D.
Nice to have
- Extensive experience with Javascript/Typescript.
- Experience with AWS, containerization, orchestration, and automated testing suites (Maestro, Appium).
- Understanding of p2p technology, MLC, TVM, Vulkan, or CUDA.
- Experience with productionizing models.
Culture & Benefits
- Join a global remote team working from every corner of the world.
- Opportunity to make a mark in the fintech space and collaborate with bright minds.
- Work for an industry leader with a lean and fast-growing team.
- Excellent English communication skills required.
Будьте осторожны: если вас просят войти в iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →