Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jayesh Rajkumar Vachhani

Self Similarity Matrix based CNN Filter Pruning

Nov 03, 2022

S Rakshith, Jayesh Rajkumar Vachhani, Sourabh Vasant Gothe, Rishabh Khurana

Figure 1 for Self Similarity Matrix based CNN Filter Pruning

Figure 2 for Self Similarity Matrix based CNN Filter Pruning

Figure 3 for Self Similarity Matrix based CNN Filter Pruning

Figure 4 for Self Similarity Matrix based CNN Filter Pruning

Abstract:In recent years, most of the deep learning solutions are targeted to be deployed in mobile devices. This makes the need for development of lightweight models all the more imminent. Another solution is to optimize and prune regular deep learning models. In this paper, we tackle the problem of CNN model pruning with the help of Self-Similarity Matrix (SSM) computed from the 2D CNN filters. We propose two novel algorithms to rank and prune redundant filters which contribute similar activation maps to the output. One of the key features of our method is that there is no need of finetuning after training the model. Both the training and pruning process is completed simultaneously. We benchmark our method on two of the most popular CNN models - ResNet and VGG and record their performance on the CIFAR-10 dataset.

* Paper accepted in the 7th International Conference on Computer Vision & Image Processing (2022)

Via

Access Paper or Ask Questions

FONTNET: On-Device Font Understanding and Prediction Pipeline

Mar 30, 2021

Rakshith S, Rishabh Khurana, Vibhav Agarwal, Jayesh Rajkumar Vachhani, Guggilla Bhanodai

Figure 1 for FONTNET: On-Device Font Understanding and Prediction Pipeline

Figure 2 for FONTNET: On-Device Font Understanding and Prediction Pipeline

Figure 3 for FONTNET: On-Device Font Understanding and Prediction Pipeline

Figure 4 for FONTNET: On-Device Font Understanding and Prediction Pipeline

Abstract:Fonts are one of the most basic and core design concepts. Numerous use cases can benefit from an in depth understanding of Fonts such as Text Customization which can change text in an image while maintaining the Font attributes like style, color, size. Currently, Text recognition solutions can group recognized text based on line breaks or paragraph breaks, if the Font attributes are known multiple text blocks can be combined based on context in a meaningful manner. In this paper, we propose two engines: Font Detection Engine, which identifies the font style, color and size attributes of text in an image and a Font Prediction Engine, which predicts similar fonts for a query font. Major contributions of this paper are three-fold: First, we developed a novel CNN architecture for identifying font style of text in images. Second, we designed a novel algorithm for predicting similar fonts for a given query font. Third, we have optimized and deployed the entire engine On-Device which ensures privacy and improves latency in real time applications such as instant messaging. We achieve a worst case On-Device inference time of 30ms and a model size of 4.5MB for both the engines.

* Accepted for publication in IEEE ICASSP 2021: 46th IEEE International Conference on Acoustics, Speech, & Signal Processing

Via

Access Paper or Ask Questions