Назад
Company hidden
6 часов назад

Lead AI Inference Engineer (AI)

Формат работы
remote (Global)
Тип работы
fulltime
Грейд
lead
Английский
b2
Страна
Spain
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Lead AI Inference Engineer (AI): Leading a cross-functional pod, responsible for ensuring reliable shipping and performance of local AI capabilities across various devices. Focus on balancing hands-on technical work with team coordination to deliver cohesive, production-ready local AI systems.

Location: Remote

Company

hirify.global is a company building solutions that empower businesses to integrate reserve-backed tokens across blockchains, enabling secure and transparent digital transactions.

What you will do

  • Deploy machine learning models to edge devices using frameworks like llama.cpp, ggml, and ONNX.
  • Collaborate with researchers to assist in coding, training, and transitioning models from research to production.
  • Integrate AI features into existing products, enhancing them with machine learning advancements.
  • Manage a cross-functional pod of middleware, foundation, QA, and documentation engineers to produce high-quality deliverables.
  • Assess the company's market position regarding similar products and platforms.
  • Ensure stable releases by following precise internal release processes.

Requirements

  • Excellent programming skills in C++.
  • Strong experience with Llama.cpp and ggml inference engines.
  • Good understanding of deep learning concepts and model architectures.
  • Experience with transformers and LLMs.
  • Demonstrated ability to rapidly assimilate new technologies and techniques.
  • Experience managing a small, specialized, cross-functional team (pod) of 3-5 people.

Nice to have

  • Extensive experience with Javascript/Typescript.
  • Experience with AWS, containerization platforms, orchestration, and automated testing suites.
  • Understanding of the difficulties, nuances, and importance of p2p technology.
  • Experience with MLC, TVM, or similar frameworks.
  • Experience with Vulkan, CUDA.
  • Models that have been productionized.

Culture & Benefits

  • Work remotely from anywhere in the world.
  • Opportunity to collaborate with bright minds in the fintech space.
  • Contribute to an innovative platform.

Будьте осторожны: если вас просят войти в iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →