Huanhui Cao

Safe Multi-Agent Reinforcement Learning through Decentralized Multiple Control Barrier Functions

Mar 23, 2021

Zhiyuan Cai, Huanhui Cao, Wenjie Lu, Lin Zhang, Hao Xiong

Abstract: Multi-Agent Reinforcement Learning (MARL) algorithms have shown impressive performance in simulation in recent years, but deploying MARL in real-world applications can raise safety problems. MARL with centralized shields was recently proposed and verified in safety games. However, centralized shielding approaches can be infeasible in real-world multi-agent applications that involve non-cooperative agents or communication delay. We therefore propose to combine MARL with decentralized Control Barrier Function (CBF) shields that rely only on locally available information. We establish a safe MARL framework with decentralized multiple CBFs and extend Multi-Agent Deep Deterministic Policy Gradient (MADDPG) to Multi-Agent Deep Deterministic Policy Gradient with decentralized multiple Control Barrier Functions (MADDPG-CBF). Using a collision-avoidance problem that includes not only cooperative agents but also obstacles, we demonstrate the construction of multiple CBFs with theoretical safety guarantees. Experimental results verify that the proposed safe MARL framework can guarantee the safety of the agents included in MARL.
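To make the idea of a decentralized CBF shield concrete, the following is a minimal sketch and not the paper's implementation. It assumes a 2D single-integrator agent (position x, velocity command u), static circular keep-out regions around locally observed neighbors and obstacles, and the barrier h(x) = ||x - o||^2 - d_safe^2 with the condition 2(x - o)·u >= -alpha * h. The function name `cbf_filter` and all parameters are hypothetical. A CBF shield is usually posed as a quadratic program; here cyclic half-space projection stands in for the QP solver, so the result satisfies the constraints when the loop converges but is not guaranteed to be the closest safe action.

```python
def cbf_filter(x, u_nom, obstacles, d_safe=1.0, alpha=1.0, iters=50, tol=1e-9):
    """Adjust a nominal 2D velocity command so every local CBF condition holds.

    x         -- agent position (px, py)
    u_nom     -- nominal action from the learned policy (ux, uy)
    obstacles -- positions of locally observed neighbors/obstacles (assumed static)
    """
    u = [float(u_nom[0]), float(u_nom[1])]
    for _ in range(iters):
        changed = False
        for o in obstacles:
            dx, dy = x[0] - o[0], x[1] - o[1]
            h = dx * dx + dy * dy - d_safe ** 2   # barrier value, safe when h >= 0
            a = (2.0 * dx, 2.0 * dy)              # gradient of h with respect to x
            b = -alpha * h                        # CBF condition: a . u >= b
            lhs = a[0] * u[0] + a[1] * u[1]
            norm_sq = a[0] ** 2 + a[1] ** 2
            if lhs < b - tol and norm_sq > 0.0:
                # Project u onto the half-space {u : a . u >= b}.
                step = (b - lhs) / norm_sq
                u[0] += step * a[0]
                u[1] += step * a[1]
                changed = True
        if not changed:
            break  # all constraints satisfied within tolerance
    return (u[0], u[1])


# An agent at the origin heading toward an obstacle at (2, 0) is slowed down.
safe_u = cbf_filter((0.0, 0.0), (1.0, 0.0), [(2.0, 0.0)])
```

Each agent would run such a filter on its own policy output using only its local observations, which is what makes the shield decentralized in this sketch; handling moving or non-cooperative neighbors would require additional terms that are not modeled here.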
