Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:AUTOSPARSE: Towards Automated Sparse Training of Deep Neural Networks

Apr 14, 2023

Abhisek Kundu, Naveen K. Mellempudi, Dharma Teja Vooturi, Bharat Kaul, Pradeep Dubey

Figure 1 for AUTOSPARSE: Towards Automated Sparse Training of Deep Neural Networks

Figure 2 for AUTOSPARSE: Towards Automated Sparse Training of Deep Neural Networks

Figure 3 for AUTOSPARSE: Towards Automated Sparse Training of Deep Neural Networks

Figure 4 for AUTOSPARSE: Towards Automated Sparse Training of Deep Neural Networks

Share this with someone who'll enjoy it:

Abstract:Sparse training is emerging as a promising avenue for reducing the computational cost of training neural networks. Several recent studies have proposed pruning methods using learnable thresholds to efficiently explore the non-uniform distribution of sparsity inherent within the models. In this paper, we propose Gradient Annealing (GA), where gradients of masked weights are scaled down in a non-linear manner. GA provides an elegant trade-off between sparsity and accuracy without the need for additional sparsity-inducing regularization. We integrated GA with the latest learnable pruning methods to create an automated sparse training algorithm called AutoSparse, which achieves better accuracy and/or training/inference FLOPS reduction than existing learnable pruning methods for sparse ResNet50 and MobileNetV1 on ImageNet-1K: AutoSparse achieves (2x, 7x) reduction in (training,inference) FLOPS for ResNet50 on ImageNet at 80% sparsity. Finally, AutoSparse outperforms sparse-to-sparse SotA method MEST (uniform sparsity) for 80% sparse ResNet50 with similar accuracy, where MEST uses 12% more training FLOPS and 50% more inference FLOPS.

View paper on

Share this with someone who'll enjoy it:

Title:AUTOSPARSE: Towards Automated Sparse Training of Deep Neural Networks

Paper and Code