Staff Machine Learning Engineer (Simulation)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Staff Machine Learning Engineer (Simulation): Building scalable machine learning systems and simulation workflows for autonomous driving evaluation with an accent on RL algorithms, generative models, and human preference data. Focus on implementing novel RLHF paradigms, fine-tuning large-scale models, and optimizing for high-fidelity driving behaviors.
Location: Onsite in London, England
Salary: £150,000—£162,000 GBP
Company
Autonomous driving technology company powering fully autonomous ride-hail service with the Driver.
What you will do
- Build scalable systems for training and fine-tuning large-scale generative models to produce realistic driving behaviors.
- Lead implementation of novel RL algorithms, reward functions, and training paradigms for high-fidelity simulation.
- Develop Deep Learning and Generative AI (LLM/VLM) solutions to automate triaging, analysis, and anomaly detection in self-driving behavior.
- Oversee production and optimization of ML models assessing fleet vehicles traveling millions of miles.
- Monitor industry best practices to develop RLHF-based data collection and evaluation systems.
- Collaborate with Prediction, Planning, Research teams and leadership on strategic efforts.
Requirements
- M.S. or Ph.D. in Computer Science, Machine Learning, AI, or related field, or equivalent experience.
- 7+ years hands-on experience developing and applying ML models, focused on Reinforcement Learning.
- Expertise in deep learning, sequence modeling, and generative models.
- Strong publication record or impactful RL project delivery.
- Proficiency in Python and ML frameworks (JAX, TensorFlow).
- Experience with large-scale distributed training and data processing.
- Ability to lead complex technical projects from conception to completion.
Nice to have
- 10+ years in ML/RL research and application.
- Experience in autonomous vehicles, robotics, or simulation environments.
- Deep knowledge of state-of-the-art RL techniques including RLHF.
- Familiarity with large-scale simulation platforms and ML integration.
- Experience designing metrics for complex AI systems.
- Track record of technical leadership and cross-team innovation.
- Excellent communication skills.
Culture & Benefits
- Discretionary annual bonus program.
- Equity incentive plan.
- Generous company benefits program.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →