Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:On-Device Document Classification using multimodal features

Jan 06, 2021

Sugam Garg, Harichandana, Sumit Kumar

Figure 1 for On-Device Document Classification using multimodal features

Figure 2 for On-Device Document Classification using multimodal features

Figure 3 for On-Device Document Classification using multimodal features

Figure 4 for On-Device Document Classification using multimodal features

Share this with someone who'll enjoy it:

Abstract:From small screenshots to large videos, documents take up a bulk of space in a modern smartphone. Documents in a phone can accumulate from various sources, and with the high storage capacity of mobiles, hundreds of documents are accumulated in a short period. However, searching or managing documents remains an onerous task, since most search methods depend on meta-information or only text in a document. In this paper, we showcase that a single modality is insufficient for classification and present a novel pipeline to classify documents on-device, thus preventing any private user data transfer to server. For this task, we integrate an open-source library for Optical Character Recognition (OCR) and our novel model architecture in the pipeline. We optimise the model for size, a necessary metric for on-device inference. We benchmark our classification model with a standard multimodal dataset FOOD-101 and showcase competitive results with the previous State of the Art with 30% model compression.

* 8th ACM IKDD CODS and 26th COMAD 2-4 January 2021

View paper on

Share this with someone who'll enjoy it:

Title:On-Device Document Classification using multimodal features

Paper and Code