TL;DR
Member of Technical Staff – Post-Training (AI/LLM): Develop and optimize cutting-edge algorithms for post-training large language models (LLMs) and deploy them to users, with an emphasis on data collection, evaluation, and advanced reward modeling/RL techniques. The focus is on pushing model capabilities in reasoning, instruction following, math, code, and agentic tasks.
Location: Redmond, United States. Employees living within 50 miles (U.S.) of a designated Microsoft office are expected to work from that office at least four days a week.
Salary (base pay, USD per year): IC4: $119,800 – $234,700 (U.S. typical range), $158,400 – $258,000 (San Francisco Bay Area and New York City metropolitan area); IC5: $139,900 – $274,800 (U.S. typical range), $188,000 – $304,200 (San Francisco Bay Area and New York City metropolitan area).
Company
hirify.global is on a mission to develop cutting-edge algorithms for post-training large language models (LLMs) and ship those models to millions of users using Copilot every day.
What you will do
- Develop data collection, evaluation, and post-training methods for models.
- Design hypotheses and experiment plans for rapidly iterating on model performance.
- Contribute to all stages of the post-training process, including data acquisition and building model evaluations.
- Apply advanced reward modeling and RL techniques to improve post-training recipes.
- Take end-to-end ownership of projects.
Requirements
- Bachelor’s Degree in Computer Science, Machine Learning, Mathematics, or related technical discipline.
- 4+ years of technical engineering experience coding in languages such as C, C++, C#, Java, JavaScript, or Python.
- Experience with reward modeling, RL, or other post-training techniques.
- English: B2 level required.
Nice to have
- Master’s Degree in Computer Science or related technical field and 4+ years of technical engineering experience.
- Demonstrated experience in large-scale AI.
- Passion for conversational AI and its deployment.
- Strong written and verbal communication skills for cross-functional collaboration.
- Passion for learning new technologies and staying up to date with industry trends in AI.
Culture & Benefits
- Work within the startup-like Microsoft Superintelligence Team inside hirify.global.
- Opportunity to push the boundaries of AI toward Humanist Superintelligence.
- Work in a highly collaborative, fast-paced, and inclusive environment.
- Contribute to models reaching billions of users and creating positive impact.
- Focus on values of respect, integrity, and accountability.