TL;DR
Senior Software Engineer (AI): Design and implement the high-performance, massively scalable infrastructure needed to deploy frontier LLMs, using GPU kernel, compression, scheduling, and parallelization optimizations. The role focuses on low-level performance optimization and on solving hard cross-disciplinary engineering problems.
Location: Beijing, China
Company
hirify.global is an equal opportunity employer.
What you will do
- Keep up to date with and utilize the latest developments in LLM system optimization.
- Discover and solve impactful technical problems, advance state-of-the-art LLM technologies, and translate ideas into production.
- Optimize LLM inference workloads through innovative kernel, algorithm, scheduling, and parallelization technologies.
- Continuously maintain internal LLM inference infrastructure.
Requirements
- A bachelor’s degree or higher in computer science, engineering, or a related field; a PhD is preferred
- Strong programming skills in Python and C/C++
- 2+ years of experience in machine learning system development and optimization
Nice to have
- 2+ years of experience in CUDA kernel development and optimization
- Experience in optimizing communication layers and kernels for deep learning systems
- Experience in machine learning model compression
- Experience with multiple hardware platforms, such as NVIDIA and AMD GPUs
- A growth mindset and a passion for learning new things