TL;DR
Senior Applied Scientist (AI): Develop and optimize scalable training and inference workflows for Small and Large Language Models, with an emphasis on model compression, efficient deployment, and inference performance. The role covers designing advanced AI systems, optimizing LLM inference, and collaborating cross-functionally to integrate AI solutions into enterprise products.
Location: Suzhou, China, onsite
Company
hirify.global drives innovation in intelligent assistant infrastructure powering Microsoft Copilot experiences worldwide.
What you will do
- Design and implement efficient training, distillation, and fine-tuning workflows for language models using techniques like LoRA and instruction tuning.
- Apply model compression strategies such as quantization and pruning to reduce inference cost and latency.
- Optimize LLM inference performance with frameworks like vLLM and TensorRT-LLM for scalable deployment.
- Build robust, scalable inference systems focused on performance, cost-efficiency, and stability.
- Develop evaluation datasets and metrics to assess model performance in real-world scenarios.
- Collaborate with product managers, engineers, and researchers to translate business needs into AI solutions.
Requirements
- Location: Must be based in Suzhou, China for onsite work.
- Bachelor’s degree with 4+ years of experience, Master’s with 3+ years, or Doctorate with 1+ year in a relevant field, or equivalent experience.
- Strong programming skills and experience managing large-scale data and ML pipelines.
- Deep knowledge of ML frameworks such as PyTorch, vLLM, and TensorRT-LLM.
- Experience with model optimization techniques including quantization and pruning.
- English: B2 proficiency required.
Nice to have
- Master’s degree with 6+ years of experience, or Doctorate with 3+ years.
- Experience optimizing LLM inference using vLLM or TensorRT-LLM.
- Practical experience in model compression and production deployment.
- Experience designing agentic AI systems with multi-agent orchestration and planning.
Culture & Benefits
- Growth mindset and collaborative culture focused on respect, integrity, and accountability.
- Work environment aligned with Microsoft’s mission to empower every person and organization.
- Onsite work is expected, in compliance with local law.