Arya Baburaj

Guided Adversarial Attack for Evaluating and Enhancing Adversarial Defenses

Nov 30, 2020

Gaurang Sriramanan, Sravanti Addepalli, Arya Baburaj, R. Venkatesh Babu

Abstract: Advances in the development of adversarial attacks have been fundamental to the progress of adversarial defense research. Efficient and effective attacks are crucial for reliable evaluation of defenses, and also for developing robust models. Adversarial attacks are often generated by maximizing standard losses, such as the cross-entropy loss or the maximum-margin loss, within a constraint set using Projected Gradient Descent (PGD). In this work, we introduce a relaxation term to the standard loss that finds more suitable gradient directions, increases attack efficacy, and leads to more efficient adversarial training. We propose Guided Adversarial Margin Attack (GAMA), which uses the function mapping of the clean image to guide the generation of adversaries, thereby resulting in stronger attacks. We evaluate our attack against multiple defenses and show improved performance when compared to existing attacks. Further, we propose Guided Adversarial Training (GAT), which achieves state-of-the-art performance amongst single-step defenses by using the proposed relaxation term for both attack generation and training.

* NeurIPS 2020 (Spotlight)
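
For readers unfamiliar with the baseline the abstract refers to, the following is a minimal sketch of a standard L-infinity PGD attack that maximizes the cross-entropy loss within a constraint set. It is not the paper's GAMA attack: the relaxation term is not specified in the abstract and is not shown here. The two-class linear model (`weights`), the input `x_clean`, and the values of `eps`, `step`, and `iters` are all assumptions chosen for illustration.

```python
import math


def softmax(logits):
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def logits_of(weights, x):
    # Linear model: one row of weights per class.
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]


def cross_entropy(weights, x, label):
    return -math.log(softmax(logits_of(weights, x))[label])


def input_gradient(weights, x, label):
    """Gradient of the cross-entropy loss with respect to the input x."""
    probs = softmax(logits_of(weights, x))
    probs[label] -= 1.0  # d(loss)/d(logits) = softmax - one_hot
    return [sum(probs[k] * weights[k][i] for k in range(len(weights)))
            for i in range(len(x))]


def sign(v):
    return (v > 0) - (v < 0)


def pgd_attack(weights, x_clean, label, eps=0.1, step=0.025, iters=10):
    """Standard PGD: signed-gradient ascent followed by projection."""
    x = list(x_clean)
    for _ in range(iters):
        grad = input_gradient(weights, x, label)
        # Ascent step on the loss.
        x = [xi + step * sign(g) for xi, g in zip(x, grad)]
        # Project onto the L-infinity ball of radius eps around x_clean.
        x = [min(max(xi, xc - eps), xc + eps) for xi, xc in zip(x, x_clean)]
        # Keep values in the valid input range [0, 1].
        x = [min(max(xi, 0.0), 1.0) for xi in x]
    return x


weights = [[1.0, -1.0], [-1.0, 1.0]]  # hypothetical 2-class linear model
x_clean = [0.6, 0.4]                   # hypothetical input, true label 0
x_adv = pgd_attack(weights, x_clean, label=0)
```

In this toy setting the perturbed input stays within `eps` of `x_clean` while its cross-entropy loss is higher than that of the clean input. A real evaluation would replace the linear model with a trained network and obtain gradients through an automatic differentiation library.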

Towards Achieving Adversarial Robustness by Enforcing Feature Consistency Across Bit Planes

Apr 01, 2020

Sravanti Addepalli, Vivek B. S., Arya Baburaj, Gaurang Sriramanan, R. Venkatesh Babu

Abstract: As humans, we inherently perceive images based on their predominant features, and ignore noise embedded within lower bit planes. In contrast, Deep Neural Networks are known to confidently misclassify images corrupted with meticulously crafted perturbations that are nearly imperceptible to the human eye. In this work, we attempt to address this problem by training networks to form coarse impressions based on the information in higher bit planes, and to use the lower bit planes only to refine their prediction. We demonstrate that, by imposing consistency on the representations learned across differently quantized images, the adversarial robustness of networks improves significantly when compared to a normally trained model. Present state-of-the-art defenses against adversarial attacks require the networks to be explicitly trained using adversarial samples that are computationally expensive to generate. While methods that use adversarial training continue to achieve the best results, this work paves the way towards achieving robustness without having to explicitly train on adversarial samples. The proposed approach is therefore faster, and also closer to the natural learning process in humans.

* CVPR 2020
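
The two ideas in the abstract, quantizing an image to its higher bit planes and penalizing disagreement between representations of differently quantized inputs, can be illustrated with a minimal sketch. This is an assumption-laden toy, not the paper's training procedure: the function names, the squared-distance penalty, and `toy_features` (a stand-in for a network's representation) are all hypothetical, and pixels are assumed to be 8-bit integers.

```python
def keep_high_bit_planes(pixels, keep, total_bits=8):
    """Zero out the lowest (total_bits - keep) bit planes of each pixel."""
    if not 0 <= keep <= total_bits:
        raise ValueError("keep must be between 0 and total_bits")
    full = (1 << total_bits) - 1              # e.g. 0b11111111 for 8 bits
    low = (1 << (total_bits - keep)) - 1      # bits to discard
    mask = full & ~low
    return [p & mask for p in pixels]


def consistency_penalty(features_a, features_b):
    """Squared L2 distance between two feature vectors (assumed penalty)."""
    return sum((a - b) ** 2 for a, b in zip(features_a, features_b))


def toy_features(pixels):
    """Hypothetical stand-in for a network: mean and max intensity in [0, 1]."""
    return [sum(pixels) / (255 * len(pixels)), max(pixels) / 255]


pixels = [200, 37, 255, 18]                    # hypothetical 8-bit pixel values
coarse = keep_high_bit_planes(pixels, keep=4)  # keep only the top 4 bit planes
penalty = consistency_penalty(toy_features(pixels), toy_features(coarse))
```

A training loop built on this idea would add a penalty of this kind to the usual classification loss, so that the model's representation changes little when the lower bit planes are removed.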
