TL;DR
Senior Machine Learning Research Engineer (AI): Designing and maintaining large-scale distributed training systems for LLMs and multi-modal models with an accent on performance optimization and efficient data pipelines. Focus on automating never-ending innovative discovery processes by building frontier scientific AI infrastructure.
Location: San Francisco, CA
Salary: $148,000 - $240,000
Company
hirify.global is a scientific superintelligence platform and autonomous lab applying AI to chemistry, materials, and life sciences to accelerate discovery.
What you will do
- Design and maintain large-scale distributed training infrastructure for LLMs and multi-modal models.
- Optimize training and optimization workflows including SFT, RL, and long-context processing.
- Orchestrate frontier LLMs alongside complex compute-intensive tools.
- Build scalable pipelines for data preprocessing and experiment orchestration.
- Develop system-level performance benchmarks and debugging utilities.
Requirements
- Proven experience with distributed ML training frameworks such as Megatron-LM, TorchTitan, DeepSpeed, or Ray.
- Strong software engineering skills with proficiency in Python.
- Deep understanding of large-scale model training techniques.
- Experience working in cloud or HPC environments.
Nice to have
- Contributions to C++ kernels.
- Prior work with scientific datasets or domain-specific modeling.
- Contributions to open-source ML frameworks.
Culture & Benefits
- Competitive base salary and bonus potential.
- Generous early-stage equity grants.
- Work at the intersection of AI research and scientific discovery.
- Equal opportunity employer committed to a diverse scientific team.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →