Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Janani Suresh

First line of defense: A robust first layer mitigates adversarial attacks

Aug 21, 2024

Janani Suresh, Nancy Nayak, Sheetal Kalyani

Figure 1 for First line of defense: A robust first layer mitigates adversarial attacks

Figure 2 for First line of defense: A robust first layer mitigates adversarial attacks

Figure 3 for First line of defense: A robust first layer mitigates adversarial attacks

Figure 4 for First line of defense: A robust first layer mitigates adversarial attacks

Abstract:Adversarial training (AT) incurs significant computational overhead, leading to growing interest in designing inherently robust architectures. We demonstrate that a carefully designed first layer of the neural network can serve as an implicit adversarial noise filter (ANF). This filter is created using a combination of large kernel size, increased convolution filters, and a maxpool operation. We show that integrating this filter as the first layer in architectures such as ResNet, VGG, and EfficientNet results in adversarially robust networks. Our approach achieves higher adversarial accuracies than existing natively robust architectures without AT and is competitive with adversarial-trained architectures across a wide range of datasets. Supporting our findings, we show that (a) the decision regions for our method have better margins, (b) the visualized loss surfaces are smoother, (c) the modified peak signal-to-noise ratio (mPSNR) values at the output of the ANF are higher, (d) high-frequency components are more attenuated, and (e) architectures incorporating ANF exhibit better denoising in Gaussian noise compared to baseline architectures. Code for all our experiments are available at \url{https://github.com/janani-suresh-97/first-line-defence.git}.

Via

Access Paper or Ask Questions

Probabilistic learning rate scheduler with provable convergence

Jul 10, 2024

Dahlia Devapriya, Thulasi Tholeti, Janani Suresh, Sheetal Kalyani

Figure 1 for Probabilistic learning rate scheduler with provable convergence

Figure 2 for Probabilistic learning rate scheduler with provable convergence

Figure 3 for Probabilistic learning rate scheduler with provable convergence

Figure 4 for Probabilistic learning rate scheduler with provable convergence

Abstract:Learning rate schedulers have shown great success in speeding up the convergence of learning algorithms in practice. However, their convergence to a minimum has not been proven theoretically. This difficulty mainly arises from the fact that, while traditional convergence analysis prescribes to monotonically decreasing (or constant) learning rates, schedulers opt for rates that often increase and decrease through the training epochs. In this work, we aim to bridge the gap by proposing a probabilistic learning rate scheduler (PLRS), that does not conform to the monotonically decreasing condition, with provable convergence guarantees. In addition to providing detailed convergence proofs, we also show experimental results where the proposed PLRS performs competitively as other state-of-the-art learning rate schedulers across a variety of datasets and architectures.

Via

Access Paper or Ask Questions