Назад
Company hidden
2 дня назад

Principal Machine Learning Engineer (AI)

Формат работы
remote (только China)
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
China
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Principal Machine Learning Engineer (AI): Building proactive AI systems and production-grade ML pipelines with an accent on model fine-tuning and inference system architecture. Focus on designing end-to-end training pipelines, scaling deployments under production constraints, and optimizing models for latency and reliability.

Location: Remote (based in China/Beijing)

Company

A technology company building proactive AI systems that understand context and facilitate long-term task execution.

What you will do

  • Build and own end-to-end ML pipelines including data, training, evaluation, inference, and deployment.
  • Fine-tune and adapt models using methods such as LoRA, QLoRA, SFT, DPO, and distillation.
  • Architect and operate scalable inference systems while balancing latency, cost, and reliability.
  • Design and maintain data systems for high-quality training data.
  • Partner with research leadership to implement evaluation pipelines for performance, safety, and bias.
  • Own production deployments with a focus on GPU memory efficiency and scaling policies.

Requirements

  • Strong background in deep learning and transformer-based architectures.
  • Hands-on experience training, fine-tuning, or deploying large-scale ML models in production.
  • Proficiency with modern ML frameworks like PyTorch or JAX.
  • Experience with distributed training and inference frameworks such as DeepSpeed, FSDP, or Ray.
  • Experience with GPU optimization including quantization and mixed precision.
  • Strong software engineering fundamentals to build maintainable, production-grade systems.

Nice to have

  • Experience with LLM inference frameworks like vLLM or TensorRT-LLM.
  • Contributions to open-source ML or systems libraries.
  • Background in scientific computing, compilers, or GPU kernels.
  • Experience with RLHF pipelines (PPO, DPO, ORPO).
  • Experience with large-scale data processing tools like Apache Arrow or Spark.

Culture & Benefits

  • High talent density team focused on rapid execution and collective decision-making.
  • Opportunity to work on complex AI challenges with global reach.
  • Collaborative environment striking a balance between research and high-quality production shipping.
  • Emphasis on individual judgment, independence, and technical ownership.

Hiring process

  • Evaluation by the technical team.
  • 3 to 4 interviews conducted via virtual meetings or onsite.
  • Prompt decision-making process.

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →