Lead Engineer, RL Scaling & Procedural Scenario Generation (AI)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Lead Engineer, RL Scaling & Procedural Scenario Generation (AI): Building scalable RL training pipelines and high-fidelity synthetic scenarios for sidewalk delivery robots with an accent on terrain intelligence and social navigation behaviors. Focus on designing procedural simulation environments, optimizing distributed RL systems, and mapping real-world failures into repeatable synthetic cases.
Location: Remote (Must be based in the USA or Canada)
Salary: $225,000 – $300,000 USD (Bay Area) / $190,000 – $230,000 USD (USA other) / $160,000 – $190,000 CAD (Canada)
Company
is reimagining city logistics using personable sidewalk robots to handle commercial deliveries while reducing street congestion.
What you will do
- Develop RL algorithms for terrain intelligence and social navigation behaviors.
- Design and optimize large-scale distributed RL training pipelines using GPU clusters and containerized workflows.
- Implement curriculum learning, domain randomization, and multi-agent RL strategies.
- Build procedural generation pipelines for synthetic environments and diverse long-tail edge cases.
- Collaborate with autonomy and safety teams to translate real-world failures into repeatable simulation cases.
- Optimize simulation performance for determinism, reproducibility, and real-time speed.
Requirements
- Master’s degree in Robotics, AI, Computer Science, Mathematics, or a related field.
- 7+ years of experience shipping transformer-based AI models for AV or robotics solutions at scale.
- 3+ years of technical leadership or architecture experience.
- Strong expertise in Reinforcement Learning (PPO, SAC, A3C, DQN) and distributed frameworks (Ray RLlib, PyTorch Distributed).
- Proficiency in Python and C++ for performance-critical simulation or graphics pipelines.
- Experience with simulation environments such as Isaac Sim, Unity, Unreal, CARLA, or Gazebo.
Nice to have
- Background in Generative AI (diffusion, LLMs) for scenario synthesis or environment creation.
- Experience with traffic simulation (SUMO) or sensor simulation (LiDAR, camera pipelines).
- Knowledge of CUDA, graphics engines, or physics modeling.
Culture & Benefits
- Opportunity to work with tech industry veterans in software, hardware, and design.
- Agile, diverse, and collaborative team environment.
- Direct impact on the future of urban robotic deliveries.
- Competitive compensation package including equity offers.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →