Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Rohit Raj Rai

Application Specific Compression of Deep Learning Models

Sep 09, 2024

Rohit Raj Rai, Angana Borah, Amit Awekar

Figure 1 for Application Specific Compression of Deep Learning Models

Figure 2 for Application Specific Compression of Deep Learning Models

Figure 3 for Application Specific Compression of Deep Learning Models

Figure 4 for Application Specific Compression of Deep Learning Models

Abstract:Large Deep Learning models are compressed and deployed for specific applications. However, current Deep Learning model compression methods do not utilize the information about the target application. As a result, the compressed models are application agnostic. Our goal is to customize the model compression process to create a compressed model that will perform better for the target application. Our method, Application Specific Compression (ASC), identifies and prunes components of the large Deep Learning model that are redundant specifically for the given target application. The intuition of our work is to prune the parts of the network that do not contribute significantly to updating the data representation for the given application. We have experimented with the BERT family of models for three applications: Extractive QA, Natural Language Inference, and Paraphrase Identification. We observe that customized compressed models created using ASC method perform better than existing model compression methods and off-the-shelf compressed models.

* Accepted in the Proceedings of the 8th Joint International Conference on Data Science & Management of Data (12th ACM IKDD CODS and 30th COMAD) for the Short Research Paper track, 5 pages

Via

Access Paper or Ask Questions

Compressed models are NOT miniature versions of large models

Jul 18, 2024

Rohit Raj Rai, Rishant Pal, Amit Awekar

Abstract:Large neural models are often compressed before deployment. Model compression is necessary for many practical reasons, such as inference latency, memory footprint, and energy consumption. Compressed models are assumed to be miniature versions of corresponding large neural models. However, we question this belief in our work. We compare compressed models with corresponding large neural models using four model characteristics: prediction errors, data representation, data distribution, and vulnerability to adversarial attack. We perform experiments using the BERT-large model and its five compressed versions. For all four model characteristics, compressed models significantly differ from the BERT-large model. Even among compressed models, they differ from each other on all four model characteristics. Apart from the expected loss in model performance, there are major side effects of using compressed models to replace large neural models.

* Accepted at the 33rd ACM International Conference on Information and Knowledge Management (CIKM 2024) for the Short Research Paper track, 5 pages

Via

Access Paper or Ask Questions

Effect of dimensionality change on the bias of word embeddings

Dec 28, 2023

Rohit Raj Rai, Amit Awekar

Abstract:Word embedding methods (WEMs) are extensively used for representing text data. The dimensionality of these embeddings varies across various tasks and implementations. The effect of dimensionality change on the accuracy of the downstream task is a well-explored question. However, how the dimensionality change affects the bias of word embeddings needs to be investigated. Using the English Wikipedia corpus, we study this effect for two static (Word2Vec and fastText) and two context-sensitive (ElMo and BERT) WEMs. We have two observations. First, there is a significant variation in the bias of word embeddings with the dimensionality change. Second, there is no uniformity in how the dimensionality change affects the bias of word embeddings. These factors should be considered while selecting the dimensionality of word embeddings.

* Accepted for publication in the Young Research Symposium Track of ACM CODS-COMADS 2024. 2 pages

Via

Access Paper or Ask Questions