Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Vishal Agarwal

Broken News: Making Newspapers Accessible to Print-Impaired

Jun 23, 2022

Vishal Agarwal, Tanuja Ganu, Saikat Guha

Figure 1 for Broken News: Making Newspapers Accessible to Print-Impaired

Figure 2 for Broken News: Making Newspapers Accessible to Print-Impaired

Figure 3 for Broken News: Making Newspapers Accessible to Print-Impaired

Figure 4 for Broken News: Making Newspapers Accessible to Print-Impaired

Abstract:Accessing daily news content still remains a big challenge for people with print-impairment including blind and low-vision due to opacity of printed content and hindrance from online sources. In this paper, we present our approach for digitization of print newspaper into an accessible file format such as HTML. We use an ensemble of instance segmentation and detection framework for newspaper layout analysis and then OCR to recognize text elements such as headline and article text. Additionally, we propose EdgeMask loss function for Mask-RCNN framework to improve segmentation mask boundary and hence accuracy of downstream OCR task. Empirically, we show that our proposed loss function reduces the Word Error Rate (WER) of news article text by 32.5 %.

* Extended Abstract at Accessibility, Vision, and Autonomy Meet (CVPR 2022 Workshop)

Via

Access Paper or Ask Questions

Unsupervised Representation Learning of DNA Sequences

Jun 07, 2019

Vishal Agarwal, N Jayanth Kumar Reddy, Ashish Anand

Figure 1 for Unsupervised Representation Learning of DNA Sequences

Figure 2 for Unsupervised Representation Learning of DNA Sequences

Figure 3 for Unsupervised Representation Learning of DNA Sequences

Figure 4 for Unsupervised Representation Learning of DNA Sequences

Abstract:Recently several deep learning models have been used for DNA sequence based classification tasks. Often such tasks require long and variable length DNA sequences in the input. In this work, we use a sequence-to-sequence autoencoder model to learn a latent representation of a fixed dimension for long and variable length DNA sequences in an unsupervised manner. We evaluate both quantitatively and qualitatively the learned latent representation for a supervised task of splice site classification. The quantitative evaluation is done under two different settings. Our experiments show that these representations can be used as features or priors in closely related tasks such as splice site classification. Further, in our qualitative analysis, we use a model attribution technique Integrated Gradients to infer significant sequence signatures influencing the classification accuracy. We show the identified splice signatures resemble well with the existing knowledge.

* Accepted at 2019 ICML Workshop on Computational Biology

Via

Access Paper or Ask Questions

Deep Face Quality Assessment

Nov 11, 2018

Vishal Agarwal

Figure 1 for Deep Face Quality Assessment

Figure 2 for Deep Face Quality Assessment

Figure 3 for Deep Face Quality Assessment

Figure 4 for Deep Face Quality Assessment

Abstract:Face image quality is an important factor in facial recognition systems as its verification and recognition accuracy is highly dependent on the quality of image presented. Rejecting low quality images can significantly increase the accuracy of any facial recognition system. In this project, a simple approach is presented to train a deep convolutional neural network to perform end-to-end face image quality assessment. The work is done in 2 stages : First, generation of quality score label and secondly, training a deep convolutional neural network in a supervised manner to predict quality score between 0 and 1. The generation of quality labels is done by comparing the face image with a template of best quality images and then evaluating the normalized score based on the similarity.

* Course project report

Via

Access Paper or Ask Questions

An Interval Type-2 Fuzzy Approach to Automatic Generation for Histogram Specification

May 06, 2018

Vishal Agarwal, Diwanshu Jain, A. Vamshi Krishna Reddy, Frank Chung-Hoon Rhee

Figure 1 for An Interval Type-2 Fuzzy Approach to Automatic Generation for Histogram Specification

Figure 2 for An Interval Type-2 Fuzzy Approach to Automatic Generation for Histogram Specification

Figure 3 for An Interval Type-2 Fuzzy Approach to Automatic Generation for Histogram Specification

Figure 4 for An Interval Type-2 Fuzzy Approach to Automatic Generation for Histogram Specification

Abstract:Image enhancement plays an important role in several application in the field of computer vision and image processing. Histogram specification (HS) is one of the most widely used techniques for contrast enhancement of an image, which requires an appropriate probability density function for the transformation. In this paper, we propose a fuzzy method to find a suitable PDF automatically for histogram specification using interval type - 2 (IT2) fuzzy approach, based on the fuzzy membership values obtained from the histogram of input image. The proposed algorithm works in 5 stages which includes - symmetric Gaussian fitting on the histogram, extraction of IT2 fuzzy membership functions (MFs) and therefore, footprint of uncertainty (FOU), obtaining membership value (MV), generating PDF and application of HS. We have proposed 4 different methods to find membership values - point-wise method, center of weight method, area method, and karnik-mendel (KM) method. The framework is sensitive to local variations in the histogram and chooses the best PDF so as to improve contrast enhancement. Experimental validity of the methods used is illustrated by qualitative and quantitative analysis on several images using the image quality index - Average Information Content (AIC) or Entropy, and by comparison with the commonly used algorithms such as Histogram Equalization (HE), Recursive Mean-Separate Histogram Equalization (RMSHE) and Brightness Preserving Fuzzy Histogram Equalization (BPFHE). It has been found out that on an average, our algorithm improves the AIC index by 11.5% as compared to the index obtained by histogram equalisation.

Via

Access Paper or Ask Questions