On learning racing policies with reinforcement learning

Grzegorz Czechmanowski, Jan Wegrzynowski, Piotr Kicki, Krzysztof Walas

IDEAS NCBR
Institute of Robotics and Machine Intelligence, Poznan University of Technology
IROS 2025

A Reinforcement Learning Policy, Trained in Simulation, deployed Zero-Shot on a Real Track

Abstract

Fully autonomous vehicles promise enhanced safety and efficiency. However, ensuring reliable operation in challenging corner cases requires control algorithms capable of performing at the vehicle limits. We address this requirement by considering the task of autonomous racing and propose solving it by learning a racing policy using Reinforcement Learning (RL). Our approach leverages domain randomization, actuator dynamics modeling, and policy architecture design to enable reliable and safe zero-shot deployment on a real platform. Evaluated on the F1TENTH race car, our RL policy not only surpasses a state-of-the-art Model Predictive Control (MPC), but, to the best of our knowledge, also represents the first instance of an RL policy outperforming expert human drivers in RC racing. This work identifies the key factors driving this performance improvement, providing critical insights for the design of robust RL-based control strategies for autonomous vehicles.

Video Presentation