Abstract: We present a computational framework for the synthesis of distributed control strategies for a heterogeneous team of robots operating in a partially observable environment. The goal is to cooperatively satisfy specifications given as Truncated Linear Temporal Logic (TLTL) formulas. Our approach formulates the synthesis problem as a stochastic game and employs a policy graph method to find a control strategy with memory for each agent. We construct the stochastic game on the product of the team transition system and a finite state automaton (FSA) that tracks the satisfaction of the TLTL formula. We use the quantitative semantics of TLTL as the reward of the game and further reshape it using the FSA to guide and accelerate the learning process. Simulation results demonstrate the efficacy of the proposed solution under demanding task specifications and the effectiveness of reward shaping in significantly accelerating learning.
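To make the product construction and FSA-based reward shaping concrete, the following is a minimal, self-contained sketch. All names, the toy transition system, the single-proposition task ("eventually goal"), and the potential-based shaping term are illustrative assumptions, not the paper's actual model: the paper treats a stochastic game with TLTL quantitative semantics, whereas this sketch uses a deterministic toy system with a sparse acceptance reward.

```python
from itertools import product

# Toy team transition system (illustrative; the paper's setting is a
# stochastic game over a heterogeneous robot team).
ts_states = ["s0", "s1", "s2"]
ts_trans = {
    ("s0", "a"): "s1",
    ("s1", "a"): "s2",
    ("s2", "a"): "s2",
}
# Atomic propositions holding at each state.
labels = {"s0": set(), "s1": set(), "s2": {"goal"}}

# Hand-built FSA for the task "eventually goal": q0 -> qf once 'goal' holds.
fsa_accept = {"qf"}

def fsa_step(q, props):
    """Advance the FSA on the proposition set observed at the new state."""
    if q == "q0" and "goal" in props:
        return "qf"
    return q

# Product states pair a system state with an FSA state, so the FSA
# component tracks progress toward satisfying the formula.
prod_states = list(product(ts_states, ["q0", "qf"]))

def prod_step(state, action):
    s, q = state
    s2 = ts_trans[(s, action)]
    return (s2, fsa_step(q, labels[s2]))

# FSA distance-to-acceptance used as a shaping potential: FSA states
# closer to acceptance get higher potential.
fsa_dist = {"qf": 0, "q0": 1}

def shaped_reward(state, action, gamma=0.99):
    """Sparse acceptance reward plus potential-based shaping
    F = gamma * phi(q') - phi(q), with phi(q) = -distance(q)."""
    _, q2 = prod_step(state, action)
    base = 1.0 if q2 in fsa_accept else 0.0
    phi = lambda q: -fsa_dist[q]
    return base + gamma * phi(q2) - phi(state[1])
```

Because the shaping term is potential-based, it densifies the reward signal (rewarding FSA progress at every step) without changing which policies are optimal, which is the mechanism behind the accelerated learning the abstract reports.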