TL;DR
Senior Software Engineer (AI): Design and optimize GPU-accelerated ML inference systems for generative video models, with an emphasis on low-level parallelism, CUDA, and systems architecture. The role focuses on improving runtime efficiency, resolving performance bottlenecks, and contributing to production-grade ML pipelines.
Location: Onsite in central Israel
Company
hirify.global is an AI-first company that builds generative video models and products like Facetune and LTX Studio, used by hundreds of millions globally.
What you will do
- Design and implement highly optimized GPU-accelerated ML inference systems using CUDA and low-level parallelism techniques.
- Optimize memory, compute, and data flow to meet real-time or high-throughput constraints.
- Improve the performance, reliability, and observability of the inference backend across diverse compute targets (CPU/GPU).
- Collaborate with cross-functional teams (including researchers, developers, and designers) to deliver efficient and scalable inference solutions.
- Contribute to ComfyUI and internal infrastructure to improve usability and performance of model execution flows.
- Investigate performance bottlenecks at all levels of the stack, from Python to kernel-level execution.
Requirements
- 5+ years of experience in high-performance software engineering.
- Advanced proficiency in CUDA, C/C++, and Python, especially in production environments.
- Deep understanding of GPU architecture, memory hierarchies, and optimization techniques.
- Proven track record of optimizing compute-intensive systems.
- Strong system architecture fundamentals, especially around performance, concurrency, and parallelism.
- Ability to independently lead deep technical investigations and deliver clean, maintainable solutions.
Nice to have
- Experience with low-level profiling and debugging tools (e.g., Nsight, perf, gdb, VTune).
- Familiarity with machine learning frameworks (e.g., PyTorch, TensorRT, ONNX Runtime).
- Contributions to performance-critical open-source or ML infrastructure projects.
- Experience with cloud infrastructure and GPU scheduling at scale.
Culture & Benefits
- Daily door-to-door shuttles and Car-to-go subscriptions for several locations in central Israel, plus free parking and train-station pickups.
- Two chef-led restaurants on site by the Machneyuda Group, plus a bakery in the office.
- Access to cutting-edge tools and learning opportunities: workshops, platform access, subscriptions, and clear guidelines for responsible AI use.
- An environment that encourages people to think, create, and explore, with real impact through experimentation and collaboration.
- A focus on craft, challenge, and genuine innovation in pushing the boundaries of what’s possible with AI and video.