Abstract: Reinforcement learning is a powerful tool for modeling decision-making processes. However, it relies on an exploration-exploitation trade-off that remains an open challenge for many tasks. In this work, we study neighboring-state-based, model-free exploration, guided by the intuition that, for an early-stage agent, considering actions derived from a bounded region of nearby states may lead to better exploratory actions. We propose two algorithms that choose exploratory actions based on a survey of nearby states, and find that one of our methods, $\rho$-explore, consistently outperforms the Double DQN baseline in a discrete environment by 49% in terms of Eval Reward Return.
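The abstract only sketches the idea of surveying nearby states, so the snippet below is a minimal, hypothetical illustration of that intuition rather than the paper's algorithm: neighbors are sampled from a bounded region of radius `rho` around the current state, each neighbor's greedy action under a Q-value function is recorded, and the action favored by the neighborhood is returned. The names `q_values_fn`, `rho`, and `n_samples` are assumptions introduced here for illustration.

```python
import numpy as np

def rho_explore_action(state, q_values_fn, n_actions, rho=0.1, n_samples=8, rng=None):
    """Pick an exploratory action by surveying states within radius `rho`
    of the current state and voting over their greedy actions.

    Hypothetical sketch of neighboring-state-based exploration; not the
    paper's exact procedure.
    """
    rng = rng or np.random.default_rng()
    votes = np.zeros(n_actions, dtype=int)
    for _ in range(n_samples):
        # Sample a neighboring state from a bounded region around the current state.
        neighbor = state + rng.uniform(-rho, rho, size=state.shape)
        # Record the greedy action the value estimate prefers at that neighbor.
        votes[int(np.argmax(q_values_fn(neighbor)))] += 1
    # Return the action most often preferred across the surveyed neighborhood.
    return int(np.argmax(votes))
```

In an ε-greedy-style loop, such a routine would presumably replace the uniformly random exploratory action during early training, while exploitation still follows the Double DQN greedy policy.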
Abstract: Evading adversarial example detection defenses requires finding adversarial examples that are simultaneously (a) misclassified by the model and (b) judged non-adversarial by the detector. We find that existing attacks that attempt to satisfy multiple simultaneous constraints often over-optimize against one constraint at the cost of satisfying another. We introduce Orthogonal Projected Gradient Descent, an improved attack technique that avoids this problem by orthogonalizing the gradients when running standard gradient-based attacks. We use our technique to evade four state-of-the-art detection defenses, reducing their accuracy to 0% while maintaining a 0% detection rate.
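The key operation named in the abstract is gradient orthogonalization: removing from one objective's gradient its component along the other objective's gradient, so that a step toward one constraint does not (to first order) undo progress on the other. The sketch below shows only that projection step; the function names and the suggested alternating schedule are assumptions for illustration, not the paper's exact recipe.

```python
import numpy as np

def orthogonalize(g_primary, g_secondary, eps=1e-12):
    """Return g_primary with its component along g_secondary removed,
    so a step in the returned direction leaves the secondary objective
    approximately unchanged to first order.

    Minimal sketch of the gradient-orthogonalization idea.
    """
    g_p = g_primary.ravel().astype(np.float64)
    g_s = g_secondary.ravel().astype(np.float64)
    # Project g_p onto g_s, then subtract that component.
    proj = ((g_p @ g_s) / (g_s @ g_s + eps)) * g_s
    return (g_p - proj).reshape(g_primary.shape)

# Hypothetical use inside a PGD loop: on alternating steps, follow the
# classification-loss gradient orthogonalized against the detector-loss
# gradient (and vice versa), then project the perturbation back onto the
# allowed epsilon ball.
```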