Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Wensong Bai

Towards Optimal Randomized Strategies in Adversarial Example Game

Jun 29, 2023

Jiahao Xie, Chao Zhang, Weijie Liu, Wensong Bai, Hui Qian

Figure 1 for Towards Optimal Randomized Strategies in Adversarial Example Game

Figure 2 for Towards Optimal Randomized Strategies in Adversarial Example Game

Figure 3 for Towards Optimal Randomized Strategies in Adversarial Example Game

Abstract:The vulnerability of deep neural network models to adversarial example attacks is a practical challenge in many artificial intelligence applications. A recent line of work shows that the use of randomization in adversarial training is the key to find optimal strategies against adversarial example attacks. However, in a fully randomized setting where both the defender and the attacker can use randomized strategies, there are no efficient algorithm for finding such an optimal strategy. To fill the gap, we propose the first algorithm of its kind, called FRAT, which models the problem with a new infinite-dimensional continuous-time flow on probability distribution spaces. FRAT maintains a lightweight mixture of models for the defender, with flexibility to efficiently update mixing weights and model parameters at each iteration. Furthermore, FRAT utilizes lightweight sampling subroutines to construct a random strategy for the attacker. We prove that the continuous-time limit of FRAT converges to a mixed Nash equilibria in a zero-sum game formed by a defender and an attacker. Experimental results also demonstrate the efficiency of FRAT on CIFAR-10 and CIFAR-100 datasets.

* Extended version of paper https://doi.org/10.1609/aaai.v37i9.26247 which appeared in AAAI 2023

Via

Access Paper or Ask Questions

PACER: A Fully Push-forward-based Distributional Reinforcement Learning Algorithm

Jun 11, 2023

Wensong Bai, Chao Zhang, Yichao Fu, Lingwei Peng, Hui Qian, Bin Dai

Figure 1 for PACER: A Fully Push-forward-based Distributional Reinforcement Learning Algorithm

Figure 2 for PACER: A Fully Push-forward-based Distributional Reinforcement Learning Algorithm

Figure 3 for PACER: A Fully Push-forward-based Distributional Reinforcement Learning Algorithm

Figure 4 for PACER: A Fully Push-forward-based Distributional Reinforcement Learning Algorithm

Abstract:In this paper, we propose the first fully push-forward-based Distributional Reinforcement Learning algorithm, called Push-forward-based Actor-Critic EncourageR (PACER). Specifically, PACER establishes a stochastic utility value policy gradient theorem and simultaneously leverages the push-forward operator in the construction of both the actor and the critic. Moreover, based on maximum mean discrepancies (MMD), a novel sample-based encourager is designed to incentivize exploration. Experimental evaluations on various continuous control benchmarks demonstrate the superiority of our algorithm over the state-of-the-art.

Via

Access Paper or Ask Questions