Title: LayerCollapse: Adaptive compression of neural networks

Nov 29, 2023

Soheil Zibakhsh Shabgahi, Mohammad Soheil Shariff, Farinaz Koushanfar

Abstract: Handling the ever-increasing scale of contemporary deep learning and transformer-based models poses a significant challenge. Although great strides have been made in model compression techniques such as model architecture search and knowledge distillation, the data and computational resources these techniques require remain a considerable hurdle. This paper introduces LayerCollapse, a novel adaptive model compression methodology. LayerCollapse works by eliminating non-linearities within the network and collapsing two consecutive fully connected layers into a single linear transformation. This simultaneously reduces both the number of layers and the parameter count, thereby improving model efficiency. We also introduce a compression-aware regularizer, which compresses the model in alignment with the dataset quality and model expressiveness, consequently reducing overfitting across tasks. Our results demonstrate LayerCollapse's effective compression and regularization capabilities on multiple fine-grained classification benchmarks, achieving up to 74% post-training compression with minimal accuracy loss. We compare this method with knowledge distillation on the same target network, showing a five-fold increase in computational efficiency and an 8% improvement in overall accuracy on the ImageNet dataset.
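
The collapsing step described in the abstract rests on a standard linear-algebra identity: once no non-linearity sits between two fully connected layers, W2 (W1 x + b1) + b2 equals (W2 W1) x + (W2 b1 + b2). The sketch below illustrates only that identity in plain Python. It is not the authors' implementation; the helper names (`matvec`, `matmul`, `collapse`, `two_layer`) and the layer sizes are assumptions chosen for illustration, and how the non-linearity is removed in the first place (the role of the regularizer) is not covered.

```python
import random

def matvec(W, x):
    """Multiply matrix W (a list of rows) by vector x."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]

def matmul(A, B):
    """Multiply matrices A (m x k) and B (k x n), both lists of rows."""
    cols = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in A]

def collapse(W1, b1, W2, b2):
    """Fold y = W2 (W1 x + b1) + b2 into a single layer y = W x + b.

    Exact only when there is no non-linearity between the two layers.
    """
    W = matmul(W2, W1)
    b = [u + v for u, v in zip(matvec(W2, b1), b2)]
    return W, b

def two_layer(W1, b1, W2, b2, x):
    """Reference forward pass through the two un-collapsed linear layers."""
    h = [u + v for u, v in zip(matvec(W1, x), b1)]
    return [u + v for u, v in zip(matvec(W2, h), b2)]

# Hypothetical sizes: a wide hidden layer between narrow input and output.
rng = random.Random(0)
d_in, d_hidden, d_out = 4, 16, 4
W1 = [[rng.uniform(-1, 1) for _ in range(d_in)] for _ in range(d_hidden)]
b1 = [rng.uniform(-1, 1) for _ in range(d_hidden)]
W2 = [[rng.uniform(-1, 1) for _ in range(d_hidden)] for _ in range(d_out)]
b2 = [rng.uniform(-1, 1) for _ in range(d_out)]
x = [rng.uniform(-1, 1) for _ in range(d_in)]

W, b = collapse(W1, b1, W2, b2)
y_two = two_layer(W1, b1, W2, b2, x)
y_one = [u + v for u, v in zip(matvec(W, x), b)]

# Largest element-wise gap between the two outputs (floating-point noise only).
max_diff = max(abs(p - q) for p, q in zip(y_two, y_one))

# Parameter counts (weights plus biases) before and after collapsing.
params_before = d_hidden * d_in + d_hidden + d_out * d_hidden + d_out
params_after = d_out * d_in + d_out
```

With these illustrative sizes the pair of layers holds 148 parameters and the collapsed layer holds 20. The saving depends on the hidden layer being wide relative to the input and output: the collapsed weight matrix has d_out × d_in entries, so a narrow bottleneck between two wide layers would grow rather than shrink.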
