TL;DR
Senior Software Engineer (AI/ML): Design and implement highly optimized GPU-accelerated ML inference systems, with an emphasis on low-level parallelism and performance tuning. The role focuses on improving runtime efficiency across heterogeneous computing environments and building production-grade ML pipelines for next-generation AI products.
Location: Onsite in Haifa, Israel
Company
hirify.global is an AI-first company building next-generation content creation technology. It is known for consumer creativity products such as Facetune and for LTX-2, an open-source generative video model.
What you will do
- Design and implement highly optimized GPU-accelerated ML inference systems using CUDA and low-level parallelism.
- Optimize memory, compute, and data flow to meet real-time or high-throughput constraints.
- Improve the performance, reliability, and observability of the inference backend across diverse compute targets (CPU/GPU).
- Collaborate with cross-functional teams to deliver efficient and scalable inference solutions.
- Contribute to ComfyUI and internal infrastructure to improve model execution flows.
- Investigate performance bottlenecks and drive innovation in low-level system design for future ML workloads.
Requirements
- 5+ years of experience in high-performance software engineering.
- Advanced proficiency in CUDA, C/C++, and Python in production environments.
- Deep understanding of GPU architecture, memory hierarchies, and optimization techniques.
- Proven track record of optimizing compute-intensive systems.
- Strong system architecture fundamentals, especially around performance, concurrency, and parallelism.
- Ability to independently lead deep technical investigations and deliver clean, maintainable solutions.
Nice to have
- Experience with low-level profiling and debugging tools (e.g., Nsight, perf, gdb, VTune).
- Familiarity with machine learning frameworks (e.g., PyTorch, TensorRT, ONNX Runtime).
- Contributions to performance-critical open-source or ML infrastructure projects.
- Experience with cloud infrastructure and GPU scheduling at scale.
Culture & Benefits
- Environment that encourages people to think, create, and explore.
- Empowerment to experiment, evolve, and elevate together for real impact.
- Collaborative mindset with a focus on deep tech and creative energy.
- Commitment to a zero-buzzword culture.