To address the coupling between control loops and the adaptive parameter-tuning problem in multi-input multi-output (MIMO) PID control systems, this paper proposes a self-adaptive LSAC-PID algorithm based on deep reinforcement learning (RL) and Lyapunov-based reward shaping. For complex and unknown mobile-robot control environments, an RL-based MIMO PID hybrid control strategy is first presented. According to the dynamic information and environmental feedback of the mobile robot, the RL agent outputs the optimal MIMO PID parameters in real time, without requiring a mathematical model or decoupling of the multiple control loops. Then, to improve the convergence speed of RL and the stability of the mobile robot, a Lyapunov-based reward-shaping soft actor-critic (LSAC) algorithm is proposed based on Lyapunov theory and the potential-based reward-shaping method. The convergence and optimality of the algorithm are proved with respect to the policy evaluation and policy improvement steps of soft policy iteration. In addition, for line-following robots, the region growing method is improved to handle forks and environmental interference. Comparison tests and cross-validation in both simulation and real environments demonstrate the good performance of the proposed LSAC-PID tuning algorithm.
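To make the hybrid control strategy described above concrete, the following is a minimal sketch of the RL-in-the-loop PID idea: at each control step, a policy maps the robot's observed state to a fresh set of MIMO PID gains, which a fixed-form PID law then applies, so no explicit decoupling of the loops is needed. The names used here (agent, env, tracking_error, act) are illustrative placeholders, not the paper's actual interfaces, and the reward shown is assumed to already include the Lyapunov-based shaping term.

```python
import numpy as np

class MIMOPID:
    """Incremental PID for n independent control loops (e.g. linear and angular velocity)."""
    def __init__(self, n_loops, dt):
        self.n, self.dt = n_loops, dt
        self.integral = np.zeros(n_loops)
        self.prev_err = np.zeros(n_loops)

    def step(self, error, gains):
        # gains: array of shape (n_loops, 3) -> [Kp, Ki, Kd] per loop,
        # supplied online by the RL agent instead of being fixed offline.
        kp, ki, kd = gains[:, 0], gains[:, 1], gains[:, 2]
        self.integral += error * self.dt
        deriv = (error - self.prev_err) / self.dt
        self.prev_err = error
        return kp * error + ki * self.integral + kd * deriv


def control_episode(env, agent, pid, steps=500):
    """Run one episode in which the agent re-tunes the PID gains at every step."""
    obs = env.reset()
    for _ in range(steps):
        gains = agent.act(obs).reshape(pid.n, 3)   # policy output = PID parameters
        error = env.tracking_error()               # e.g. line-following offset, heading error
        u = pid.step(error, gains)                 # PID law computes the actuation
        obs, reward, done, _ = env.step(u)         # reward assumed to include the shaping term
        if done:
            break
```

This sketch only illustrates the division of labor implied by the abstract: the RL policy handles adaptation and cross-loop coupling implicitly through its state-dependent gain output, while the PID structure handles the low-level actuation.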