Company hidden
Posted 3 hours ago

Senior Software Engineer (AI/ML)

Work format: onsite
Employment type: full-time
Grade: senior
English: B2
Country: Israel
Listing from Hirify Global, a list of international tech companies

Job description

TL;DR

Senior Software Engineer (AI/ML): Design and implement highly optimized GPU-accelerated ML inference systems, with an emphasis on low-level parallelism and performance tuning. The role focuses on improving runtime efficiency across heterogeneous computing environments and building production-grade ML pipelines for next-generation AI products.

Location: Onsite in Haifa, Israel

Company

hirify.global is an AI-first company building next-generation content creation technology. It is known for pioneering consumer creativity with products like Facetune and for LTX-2, an open-source generative video model.

What you will do

  • Design and implement highly optimized GPU-accelerated ML inference systems using CUDA and low-level parallelism.
  • Optimize memory, compute, and data flow to meet real-time or high-throughput constraints.
  • Improve the performance, reliability, and observability of the inference backend across diverse compute targets (CPU/GPU).
  • Collaborate with cross-functional teams to deliver efficient and scalable inference solutions.
  • Contribute to ComfyUI and internal infrastructure to improve model execution flows.
  • Investigate performance bottlenecks and drive innovation in low-level system design for future ML workloads.

Requirements

  • 5+ years of experience in high-performance software engineering.
  • Advanced proficiency in CUDA, C/C++, and Python in production environments.
  • Deep understanding of GPU architecture, memory hierarchies, and optimization techniques.
  • Proven track record of optimizing compute-intensive systems.
  • Strong system architecture fundamentals, especially around performance, concurrency, and parallelism.
  • Ability to independently lead deep technical investigations and deliver clean, maintainable solutions.

Nice to have

  • Experience with low-level profiling and debugging tools (e.g., Nsight, perf, gdb, VTune).
  • Familiarity with machine learning frameworks (e.g., PyTorch, TensorRT, ONNX Runtime).
  • Contributions to performance-critical open-source or ML infrastructure projects.
  • Experience with cloud infrastructure and GPU scheduling at scale.

Culture & Benefits

  • Environment that encourages people to think, create, and explore.
  • Empowerment to experiment, evolve, and elevate together for real impact.
  • Collaborative mindset with a focus on deep tech and creative energy.
  • Commitment to a zero-buzzword culture.
