Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization

Feb 22, 2024

Tianying Ji, Yongyuan Liang, Yan Zeng, Yu Luo, Guowei Xu, Jiawei Guo, Ruijie Zheng, Furong Huang, Fuchun Sun, Huazhe Xu

Figure 1 for ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization

Figure 2 for ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization

Figure 3 for ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization

Figure 4 for ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization

Share this with someone who'll enjoy it:

Abstract:The varying significance of distinct primitive behaviors during the policy learning process has been overlooked by prior model-free RL algorithms. Leveraging this insight, we explore the causal relationship between different action dimensions and rewards to evaluate the significance of various primitive behaviors during training. We introduce a causality-aware entropy term that effectively identifies and prioritizes actions with high potential impacts for efficient exploration. Furthermore, to prevent excessive focus on specific primitive behaviors, we analyze the gradient dormancy phenomenon and introduce a dormancy-guided reset mechanism to further enhance the efficacy of our method. Our proposed algorithm, ACE: Off-policy Actor-critic with Causality-aware Entropy regularization, demonstrates a substantial performance advantage across 29 diverse continuous control tasks spanning 7 domains compared to model-free RL baselines, which underscores the effectiveness, versatility, and efficient sample efficiency of our approach. Benchmark results and videos are available at https://ace-rl.github.io/.

View paper on

Share this with someone who'll enjoy it:

Title:ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization

Paper and Code