Minh Tri Huynh

SoftCTRL: Soft conservative KL-control of Transformer Reinforcement Learning for Autonomous Driving

Oct 30, 2024

Minh Tri Huynh, Duc Dung Nguyen

Abstract: In recent years, motion planning for urban self-driving vehicles (SDVs) has become a popular problem because of the complex interactions among road components. To tackle it, many methods rely on large-scale, human-sampled data processed through imitation learning (IL). Although effective, IL alone cannot adequately address safety and reliability concerns. Combining IL with reinforcement learning (RL) by adding the KL divergence between the RL and IL policies to the RL loss can alleviate IL's weaknesses, but it suffers from over-conservation caused by the covariate shift of IL. To address this limitation, we introduce a method that combines IL with RL using an implicit entropy-KL control, which offers a simple way to reduce this over-conservation. We evaluate on a range of challenging simulated urban scenarios from an unseen dataset; the results indicate that, although IL performs well on imitation tasks, our proposed method significantly improves robustness (over 17% reduction in failures) and generates human-like driving behavior.

* submitted to IEEE Open Journal of Intelligent Transportation Systems
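
The KL-regularized baseline that the abstract describes (adding the KL divergence between the RL and IL policies to the RL loss) can be illustrated with a minimal sketch. This is not the paper's implicit entropy-KL control, and it is not the authors' implementation. The discrete action space, the expected-advantage surrogate, the weight `beta`, and all function names are assumptions made for illustration only.

```python
import math


def softmax(logits):
    """Convert a list of logits into a probability distribution."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(
        pi * (math.log(pi + eps) - math.log(qi + eps))
        for pi, qi in zip(p, q)
    )


def kl_regularized_loss(rl_logits, il_logits, advantages, beta=0.1):
    """Hypothetical RL loss with a KL penalty toward a fixed IL policy.

    loss = -E_{a ~ pi_RL}[A(a)] + beta * KL(pi_RL || pi_IL)

    `beta` (an assumed hyperparameter) controls how strongly the RL
    policy is pulled toward the IL policy. A large `beta` keeps the RL
    policy close to IL, which is where over-conservative behavior can
    come from when the IL policy is unreliable under covariate shift.
    """
    pi_rl = softmax(rl_logits)
    pi_il = softmax(il_logits)
    # Policy-improvement term: negative expected advantage under pi_RL.
    policy_term = -sum(p * a for p, a in zip(pi_rl, advantages))
    kl = kl_divergence(pi_rl, pi_il)
    return policy_term + beta * kl, kl


if __name__ == "__main__":
    # Three hypothetical discrete actions, e.g. brake / keep / accelerate.
    loss, kl = kl_regularized_loss(
        rl_logits=[0.2, 1.0, -0.5],
        il_logits=[1.5, 0.1, -1.0],
        advantages=[-0.3, 0.8, 0.1],
        beta=0.5,
    )
    print(f"loss={loss:.4f}, kl={kl:.4f}")
```

In this sketch, raising `beta` increases the penalty for deviating from the IL policy, so the trade-off between task reward and staying close to the demonstrations is set by a single fixed weight.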