Company hidden
Posted 3 days ago

Lead Engineer, RL Scaling & Procedural Scenario Generation (AI)

Salary: $225,000 – $300,000
Work format: remote (USA only)
Employment type: full-time
Grade: lead
English: B2
Country: US/Canada

Job description

TL;DR

Lead Engineer, RL Scaling & Procedural Scenario Generation (AI): Build scalable RL training pipelines and high-fidelity synthetic scenarios for sidewalk delivery robots, with an emphasis on terrain intelligence and social navigation behaviors. The role focuses on designing procedural simulation environments, optimizing distributed RL systems, and mapping real-world failures into repeatable synthetic cases.

Location: Remote (Must be based in the USA or Canada)

Salary: $225,000 – $300,000 USD (Bay Area) / $190,000 – $230,000 USD (other US locations) / $160,000 – $190,000 CAD (Canada)

Company

The company (name hidden in this listing) is reimagining city logistics, using personable sidewalk robots to handle commercial deliveries while reducing street congestion.

What you will do

  • Develop RL algorithms for terrain intelligence and social navigation behaviors.
  • Design and optimize large-scale distributed RL training pipelines using GPU clusters and containerized workflows.
  • Implement curriculum learning, domain randomization, and multi-agent RL strategies.
  • Build procedural generation pipelines for synthetic environments and diverse long-tail edge cases.
  • Collaborate with autonomy and safety teams to translate real-world failures into repeatable simulation cases.
  • Optimize simulation performance for determinism, reproducibility, and real-time speed.

Requirements

  • Master’s degree in Robotics, AI, Computer Science, Mathematics, or a related field.
  • 7+ years of experience shipping transformer-based AI models for AV or robotics solutions at scale.
  • 3+ years of technical leadership or architecture experience.
  • Strong expertise in Reinforcement Learning (PPO, SAC, A3C, DQN) and distributed frameworks (Ray RLlib, PyTorch Distributed).
  • Proficiency in Python and C++ for performance-critical simulation or graphics pipelines.
  • Experience with simulation environments such as Isaac Sim, Unity, Unreal, CARLA, or Gazebo.

Nice to have

  • Background in Generative AI (diffusion, LLMs) for scenario synthesis or environment creation.
  • Experience with traffic simulation (SUMO) or sensor simulation (LiDAR, camera pipelines).
  • Knowledge of CUDA, graphics engines, or physics modeling.

Culture & Benefits

  • Opportunity to work with tech industry veterans in software, hardware, and design.
  • Agile, diverse, and collaborative team environment.
  • Direct impact on the future of urban robotic deliveries.
  • Competitive compensation package, including equity.
