Эта вакансия в архиве

Посмотреть похожие вакансии ↓
Company hidden
обновлено 29 дней назад

Senior Machine Learning Systems Engineer (Training Optimization)

Формат работы
onsite
Тип работы
fulltime
Грейд
senior
Английский
c1
Страна
China

Описание вакансии

Текст:
/

TL;DR

Senior Machine Learning Systems Engineer (AI): Leading efforts to scale and optimize the training system for our large-scale multimodal and foundation models with an accent on performance across compute, memory, and communication layers. Focus on debugging, profiling, and fine-tuning training workflows to unlock new levels of scalability.

Location: Beijing, China

Company

hirify.global's CORE team within the Generative AI supergroup invents foundational technologies that will power the future of AI-assisted design, building the technical core of hirify.global’s creative intelligence engine.

What you will do

  • Design, implement, and optimize large-scale machine learning systems for training and inference.
  • Improve all aspects of performance, including GPU utilization, communication overhead, and memory efficiency.
  • Partner with research and modeling teams to align systems with algorithmic needs.
  • Evaluate and apply best practices for distributed training using industry-leading frameworks.
  • Dive deep into low-level optimization, including custom CUDA or Triton kernels.
  • Debug, profile, and fine-tune training workflows to unlock new levels of scalability.

Requirements

  • Strong background in LLMs, multimodal AI, or diffusion models.
  • Proficiency in Python. Familiarity with a system programming language (e.g. C++ or Rust) is a plus.
  • Deep knowledge of PyTorch or JAX as well as libraries such as Megatron-LM, NeMo, or DeepSpeed.
  • Familiarity with common optimization techniques such as FSDP/ZeRO, gradient checkpointing, or low-precision data types.
  • Hands-on experience writing custom GPU kernels in CUDA or Triton.
  • Excellent communication and problem-solving skills, incl. full proficiency in English.

Culture & Benefits

  • Fast-paced, high-impact environment.
  • Work on challenges that stretch current boundaries.
  • Collaborate clearly across domains.