Company hidden
Posted 1 day ago

Lead AI Inference Engineer (AI)

Work format: Remote (Global)
Employment type: Full-time
Grade: Lead
English: C1

Job description

TL;DR

Lead AI Inference Engineer (AI): Lead a cross-functional pod that builds and optimizes local AI capabilities, ensuring reliable performance and seamless integration across devices, with an emphasis on C++ inference engines and JavaScript applications. The role focuses on deploying machine learning models to edge devices using frameworks like llama.cpp and ggml, integrating advanced AI features into existing products, and guiding middleware and foundation engineers.

Location: Remote (Global)

Company

hirify.global is a pioneering company building cutting-edge solutions for digital finance, driving sustainable growth in energy, and fueling breakthroughs in AI and peer-to-peer technology.

What you will do

  • Lead a cross-functional pod to ensure reliable and performant local AI capabilities across devices.
  • Deploy machine learning models to edge devices using frameworks like llama.cpp, ggml, and ONNX.
  • Collaborate with researchers to transition models from research to production environments.
  • Integrate advanced AI features into existing products.
  • Manage a cross-functional team of engineers (middleware, foundation, QA, documentation).
  • Ensure robust architectural choices, code quality, and stable releases by following internal processes.

Requirements

  • Excellent programming skills in C++.
  • Strong experience with the llama.cpp and ggml inference engines, especially on GPU architectures.
  • Good understanding of deep learning concepts and model architectures, including transformers and LLMs.
  • Demonstrated ability to rapidly assimilate new technologies and techniques.
  • Experience managing a small, specialized, cross-functional team (3-5 people).
  • A degree in Computer Science, AI, Machine Learning, or a related field with a solid track record in AI R&D.

Nice to have

  • Extensive experience with JavaScript/TypeScript.
  • Experience with AWS, containerization, orchestration, and automated testing suites (Maestro, Appium).
  • Understanding of P2P technology, MLC, TVM, Vulkan, or CUDA.
  • Experience with productionizing models.

Culture & Benefits

  • Join a global remote team working from every corner of the world.
  • Opportunity to make a mark in the fintech space and collaborate with bright minds.
  • Work for an industry leader with a lean and fast-growing team.
  • Excellent English communication skills required.
