Назад
Company hidden
6 дней назад

Senior Applied Scientist (AI)

Формат работы
onsite
Тип работы
fulltime
Грейд
senior
Английский
b2
Страна
China
Вакансия из списка Hirify.GlobalВакансия из Hirify Global, списка международных tech-компаний
Для мэтча и отклика нужен Plus

Мэтч & Сопровод

Для мэтча с этой вакансией нужен Plus

Описание вакансии

Текст:
/

TL;DR

Senior Applied Scientist (AI): Developing and optimizing scalable training and inference workflows for Small and Large Language Models with an accent on model compression, efficient deployment, and inference performance. Focus on designing advanced AI systems, optimizing LLM inference, and collaborating cross-functionally to integrate AI solutions into enterprise products.

Location: Suzhou, China, onsite

Company

hirify.global drives innovation in intelligent assistant infrastructure powering Microsoft Copilot experiences worldwide.

What you will do

  • Design and implement efficient training, distillation, and fine-tuning workflows for language models using techniques like LoRA and instruction tuning.
  • Apply model compression strategies such as quantization and pruning to improve inference cost and latency.
  • Optimize LLM inference performance with frameworks like vLLM and TensorRT-LLM for scalable deployment.
  • Build robust, scalable inference systems focused on performance, cost-efficiency, and stability.
  • Develop evaluation datasets and metrics to assess model performance in real-world scenarios.
  • Collaborate with product managers, engineers, and researchers to translate business needs into AI solutions.

Requirements

  • Location: Must be based in Suzhou, China for onsite work.
  • Bachelor’s degree with 4+ years or Master’s with 3+ years or Doctorate with 1+ year experience in relevant fields or equivalent.
  • Strong programming skills managing large-scale data and ML pipelines.
  • Deep knowledge of ML frameworks such as PyTorch, vLLM, and TensorRT-LLM.
  • Experience with model optimization techniques including quantization and pruning.
  • English: Proficient (B2) required.

Nice to have

  • Master’s or Doctorate with more extensive experience (6+ years or 3+ years respectively).
  • Experience optimizing LLM inference using vLLM or TRT-LLM.
  • Practical experience in model compression and production deployment.
  • Experience designing agentic AI systems with multi-agent orchestration and planning.

Culture & Benefits

  • Growth mindset and collaborative culture focused on respect, integrity, and accountability.
  • Work environment aligned with Microsoft’s mission to empower every person and organization.
  • Onsite work expectation with local law compliance.

Будьте осторожны: если вас просят войти в iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →