Research Engineer (AI Infrastructure)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Research Engineer (AI Infrastructure): Designing and building core systems for scalable, efficient training of large models with an accent on optimizing distributed training systems across thousands of GPUs. Focus on developing high-performance optimizations and reusable frameworks to enhance training reproducibility and reliability.
Location: This role is based in San Francisco, California.
Compensation: $350,000 - $475,000 USD.
Company
empowers humanity through advancing collaborative general intelligence.
What you will do
- Design, implement, and optimize distributed training systems for large-scale workloads.
- Develop high-performance optimizations to maximize throughput and efficiency.
- Create reusable frameworks to improve training reproducibility and scalability.
- Establish standards for reliability and security in systems.
- Collaborate with researchers and engineers to build scalable infrastructure.
- Publish and share learnings through documentation and open-source libraries.
Requirements
- Bachelor’s degree or equivalent experience in relevant fields.
- Strong engineering skills and ability to debug complex codebases.
- Understanding of deep learning frameworks like PyTorch and JAX.
- Experience in a collaborative environment with cross-functional teams.
- Initiative to work across different stacks and teams.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →