Ramit Pahwa

Model Blending for Text Classification

Aug 05, 2022

Ramit Pahwa

Abstract: Deep neural networks (DNNs) have proven successful in a wide variety of applications, such as speech recognition and synthesis, computer vision, machine translation, and game playing. However, existing DNN models are computationally expensive and memory intensive, which hinders their deployment on devices with limited memory or in applications with strict latency requirements. A natural approach is therefore to compress and accelerate deep networks without significantly degrading model performance, which we refer to as reducing complexity. In this work, we attempt to reduce the complexity of state-of-the-art LSTM models for natural language tasks such as text classification by distilling their knowledge into CNN-based models, thereby reducing inference time (latency) at test time.

* Master's thesis. arXiv admin note: text overlap with arXiv:1803.01271, arXiv:1710.09282 by other authors
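
The distillation idea in the abstract above can be illustrated with a minimal sketch. The abstract does not state which loss the thesis uses, so this assumes the widely used temperature-scaled soft-target formulation; the function names, `temperature`, and `alpha` are illustrative choices, not taken from the paper. The teacher logits would come from the LSTM and the student logits from the CNN.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, label,
                      temperature=2.0, alpha=0.5):
    """Blend of a soft-target term and a hard-label term (assumed form).

    soft term: KL(teacher || student) on temperature-softened distributions,
               scaled by temperature**2 as is conventional.
    hard term: cross-entropy of the student against the true label.
    """
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    kl = sum(pt * math.log(pt / ps)
             for pt, ps in zip(p_teacher, p_student) if pt > 0)
    hard = -math.log(softmax(student_logits)[label])
    return alpha * temperature ** 2 * kl + (1 - alpha) * hard
```

With `alpha=1.0` the loss depends only on how closely the student matches the teacher, so it is zero when the two sets of logits are identical.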

Data-Driven Compression of Convolutional Neural Networks

Nov 28, 2019

Ramit Pahwa, Manoj Ghuhan Arivazhagan, Ankur Garg, Siddarth Krishnamoorthy, Rohit Saxena, Sunav Choudhary

Abstract: Deploying trained convolutional neural networks (CNNs) to mobile devices is challenging because the deployed model must simultaneously be fast, lightweight, and accurate. Designing and training a CNN architecture that does well on all three metrics is highly non-trivial and can be very time-consuming if done by hand. One way to solve this problem is to compress the trained CNN models before deploying them to mobile devices. This work asks and answers three questions on compressing CNN models automatically: a) How can the trade-off between speed, memory, and accuracy be controlled during model compression? b) In practice, a deployed model may not see all classes and/or may not need to produce all class labels. Can this fact be used to improve the trade-off? c) How can the compression algorithm be scaled to execute within a reasonable amount of time for many deployments? The paper demonstrates that a model compression algorithm utilizing reinforcement learning with architecture search and knowledge distillation can answer these questions in the affirmative. Experimental results are provided for current state-of-the-art CNN model families for image feature extraction, such as VGG and ResNet, on the CIFAR datasets.

* 17 pages, 10 tables, 1 figure

LSTMs with Attention for Aggression Detection

Jul 16, 2018

Nishant Nikhil, Ramit Pahwa, Mehul Kumar Nirala, Rohan Khilnani

Abstract: In this paper, we describe the system submitted by team Nishnik for the shared task on Aggression Identification in Facebook posts and comments. Previous work has shown that LSTMs achieve remarkable performance on natural language processing tasks. We deploy an LSTM model with an attention unit over it. Our system ranks 6th and 4th in the Hindi subtask for Facebook comments and the subtask for generalized social media data, respectively, and 17th and 10th in the corresponding English subtasks.

* Accepted at the First Workshop on Trolling, Aggression and Cyberbullying at the 27th International Conference on Computational Linguistics (COLING 2018)
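
As a rough illustration of what "an attention unit over" an LSTM can look like, the sketch below pools a sequence of per-token hidden states into a single vector using dot-product attention. The abstract does not specify the attention form, so the dot-product scoring, the `query` vector (which would be a learned parameter), and the function name are all assumptions made for illustration.

```python
import math


def attention_pool(hidden_states, query):
    """Pool per-token hidden states into one context vector.

    hidden_states: list of equal-length vectors, one per token
                   (standing in for LSTM outputs).
    query:         vector of the same length (hypothetical learned parameter).
    Returns (weights, context), where weights sum to 1.
    """
    # Score each time step by its dot product with the query.
    scores = [sum(h_i * q_i for h_i, q_i in zip(h, query))
              for h in hidden_states]
    # Softmax over time steps, shifted by the max for numerical stability.
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    # Context vector: attention-weighted sum of the hidden states.
    dim = len(hidden_states[0])
    context = [sum(w * h[d] for w, h in zip(weights, hidden_states))
               for d in range(dim)]
    return weights, context
```

The context vector would then feed a classifier layer; with an all-zero query the weights are uniform and the result reduces to mean pooling.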
