Company hidden
6 days ago

Senior Applied Scientist (AI/ML)

Work format: onsite
Employment type: fulltime
Grade: senior
English: b2
Country: China

Job description

TL;DR

Senior Applied Scientist (AI/ML): Build and optimize next-generation intelligent assistant infrastructure for Microsoft Copilot, with an emphasis on scalable training, inference optimization, and model compression for small and large language models (SLMs/LLMs). The role focuses on model efficiency, post-training techniques, and robust production deployment.

Location: Beijing, China. Employees are expected to work from the office at least four days per week.

Company

The team is driving the next generation of intelligent assistant infrastructure, powering Microsoft Copilot experiences across the enterprise.

What you will do

  • Design and implement efficient workflows for training, distillation, and fine-tuning SLMs/LLMs using techniques like LoRA, QLoRA, and instruction tuning.
  • Apply model compression strategies, including quantization (GPTQ, AWQ) and pruning, to reduce inference costs and improve latency.
  • Optimize LLM inference performance using frameworks like vLLM and TensorRT-LLM for scalable, low-latency deployment.
  • Build robust and scalable inference systems tailored to heterogeneous production environments, focusing on performance, cost-efficiency, and stability.
  • Develop evaluation datasets and metrics to assess model performance in real-world product scenarios.
  • Collaborate closely with product managers, engineers, and research scientists to translate business needs into impactful AI solutions.

Requirements

  • Bachelor’s degree in Statistics, Econometrics, Computer Science, or Electrical/Computer Engineering with 4+ years of related experience, or a Master’s degree with 3+ years, or a Doctorate with 1+ year.
  • Solid programming skills with hands-on experience managing large-scale data and machine learning pipelines.
  • Deep understanding of open-source ML frameworks such as PyTorch, vLLM, and TensorRT-LLM.
  • Solid knowledge of model optimization techniques, including quantization, pruning, and efficient inference.

Nice to have

  • Master’s degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or a related field with 6+ years of related experience, or a Doctorate in one of these fields with 3+ years of related experience.
  • 1+ year of experience optimizing LLM inference using frameworks like vLLM or TensorRT-LLM.
  • Practical experience in model compression and deployment within production systems.
  • Experience designing agentic AI systems, such as multi-agent orchestration, tool usage, planning, and reasoning.

Culture & Benefits

  • Microsoft’s mission is to empower every person and every organization on the planet to achieve more.
  • The team builds on values of growth mindset, innovation, and collaboration, fostering a culture of inclusion, respect, integrity, and accountability.
  • Microsoft is an equal opportunity employer, committed to diversity and inclusion.
  • Assistance is available during the application process for religious accommodations and for reasonable accommodations due to a disability.

Hiring process

  • Applications are accepted on an ongoing basis until the position is filled.
