Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Vikas Reddy

Building a Word Segmenter for Sanskrit Overnight

Feb 17, 2018

Vikas Reddy, Amrith Krishna, Vishnu Dutt Sharma, Prateek Gupta, Vineeth M R, Pawan Goyal

Figure 1 for Building a Word Segmenter for Sanskrit Overnight

Figure 2 for Building a Word Segmenter for Sanskrit Overnight

Figure 3 for Building a Word Segmenter for Sanskrit Overnight

Abstract:There is an abundance of digitised texts available in Sanskrit. However, the word segmentation task in such texts are challenging due to the issue of 'Sandhi'. In Sandhi, words in a sentence often fuse together to form a single chunk of text, where the word delimiter vanishes and sounds at the word boundaries undergo transformations, which is also reflected in the written text. Here, we propose an approach that uses a deep sequence to sequence (seq2seq) model that takes only the sandhied string as the input and predicts the unsandhied string. The state of the art models are linguistically involved and have external dependencies for the lexical and morphological analysis of the input. Our model can be trained "overnight" and be used for production. In spite of the knowledge lean approach, our system preforms better than the current state of the art by gaining a percentage increase of 16.79 % than the current state of the art.

* The work is accepted at LREC 2018, Miyazaki, Japan

Via

Access Paper or Ask Questions

Visualization Regularizers for Neural Network based Image Recognition

Jan 03, 2017

Biswajit Paria, Vikas Reddy, Anirban Santara, Pabitra Mitra

Figure 1 for Visualization Regularizers for Neural Network based Image Recognition

Figure 2 for Visualization Regularizers for Neural Network based Image Recognition

Figure 3 for Visualization Regularizers for Neural Network based Image Recognition

Figure 4 for Visualization Regularizers for Neural Network based Image Recognition

Abstract:The success of deep neural networks is mostly due their ability to learn meaningful features from the data. Features learned in the hidden layers of deep neural networks trained in computer vision tasks have been shown to be similar to mid-level vision features. We leverage this fact in this work and propose the visualization regularizer for image tasks. The proposed regularization technique enforces smoothness of the features learned by hidden nodes and turns out to be a special case of Tikhonov regularization. We achieve higher classification accuracy as compared to existing regularizers such as the L2 norm regularizer and dropout, on benchmark datasets without changing the training computational complexity.

Via

Access Paper or Ask Questions

MRF-based Background Initialisation for Improved Foreground Detection in Cluttered Surveillance Videos

Jun 19, 2014

Vikas Reddy, Conrad Sanderson, Andres Sanin, Brian C. Lovell

Figure 1 for MRF-based Background Initialisation for Improved Foreground Detection in Cluttered Surveillance Videos

Figure 2 for MRF-based Background Initialisation for Improved Foreground Detection in Cluttered Surveillance Videos

Figure 3 for MRF-based Background Initialisation for Improved Foreground Detection in Cluttered Surveillance Videos

Figure 4 for MRF-based Background Initialisation for Improved Foreground Detection in Cluttered Surveillance Videos

Abstract:Robust foreground object segmentation via background modelling is a difficult problem in cluttered environments, where obtaining a clear view of the background to model is almost impossible. In this paper, we propose a method capable of robustly estimating the background and detecting regions of interest in such environments. In particular, we propose to extend the background initialisation component of a recent patch-based foreground detection algorithm with an elaborate technique based on Markov Random Fields, where the optimal labelling solution is computed using iterated conditional modes. Rather than relying purely on local temporal statistics, the proposed technique takes into account the spatial continuity of the entire background. Experiments with several tracking algorithms on the CAVIAR dataset indicate that the proposed method leads to considerable improvements in object tracking accuracy, when compared to methods based on Gaussian mixture models and feature histograms.

* arXiv admin note: substantial text overlap with arXiv:1303.2465

Via

Access Paper or Ask Questions

Improved Anomaly Detection in Crowded Scenes via Cell-based Analysis of Foreground Speed, Size and Texture

Apr 03, 2013

Vikas Reddy, Conrad Sanderson, Brian C. Lovell

Figure 1 for Improved Anomaly Detection in Crowded Scenes via Cell-based Analysis of Foreground Speed, Size and Texture

Figure 2 for Improved Anomaly Detection in Crowded Scenes via Cell-based Analysis of Foreground Speed, Size and Texture

Figure 3 for Improved Anomaly Detection in Crowded Scenes via Cell-based Analysis of Foreground Speed, Size and Texture

Figure 4 for Improved Anomaly Detection in Crowded Scenes via Cell-based Analysis of Foreground Speed, Size and Texture

Abstract:A robust and efficient anomaly detection technique is proposed, capable of dealing with crowded scenes where traditional tracking based approaches tend to fail. Initial foreground segmentation of the input frames confines the analysis to foreground objects and effectively ignores irrelevant background dynamics. Input frames are split into non-overlapping cells, followed by extracting features based on motion, size and texture from each cell. Each feature type is independently analysed for the presence of an anomaly. Unlike most methods, a refined estimate of object motion is achieved by computing the optical flow of only the foreground pixels. The motion and size features are modelled by an approximated version of kernel density estimation, which is computationally efficient even for large training datasets. Texture features are modelled by an adaptively grown codebook, with the number of entries in the codebook selected in an online fashion. Experiments on the recently published UCSD Anomaly Detection dataset show that the proposed method obtains considerably better results than three recent approaches: MPPCA, social force, and mixture of dynamic textures (MDT). The proposed method is also several orders of magnitude faster than MDT, the next best performing method.

* IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 55-61, 2011

Via

Access Paper or Ask Questions

Improved Foreground Detection via Block-based Classifier Cascade with Probabilistic Decision Integration

Mar 18, 2013

Vikas Reddy, Conrad Sanderson, Brian C. Lovell

Figure 1 for Improved Foreground Detection via Block-based Classifier Cascade with Probabilistic Decision Integration

Figure 2 for Improved Foreground Detection via Block-based Classifier Cascade with Probabilistic Decision Integration

Figure 3 for Improved Foreground Detection via Block-based Classifier Cascade with Probabilistic Decision Integration

Figure 4 for Improved Foreground Detection via Block-based Classifier Cascade with Probabilistic Decision Integration

Abstract:Background subtraction is a fundamental low-level processing task in numerous computer vision applications. The vast majority of algorithms process images on a pixel-by-pixel basis, where an independent decision is made for each pixel. A general limitation of such processing is that rich contextual information is not taken into account. We propose a block-based method capable of dealing with noise, illumination variations and dynamic backgrounds, while still obtaining smooth contours of foreground objects. Specifically, image sequences are analysed on an overlapping block-by-block basis. A low-dimensional texture descriptor obtained from each block is passed through an adaptive classifier cascade, where each stage handles a distinct problem. A probabilistic foreground mask generation approach then exploits block overlaps to integrate interim block-level decisions into final pixel-level foreground segmentation. Unlike many pixel-based methods, ad-hoc post-processing of foreground masks is not required. Experiments on the difficult Wallflower and I2R datasets show that the proposed approach obtains on average better results (both qualitatively and quantitatively) than several prominent methods. We furthermore propose the use of tracking performance as an unbiased approach for assessing the practical usefulness of foreground segmentation methods, and show that the proposed approach leads to considerable improvements in tracking accuracy on the CAVIAR dataset.

* IEEE Transactions on Circuits and Systems for Video Technology, Vol. 23, No. 1, pp. 83-93, 2013

Via

Access Paper or Ask Questions

A Low-Complexity Algorithm for Static Background Estimation from Cluttered Image Sequences in Surveillance Contexts

Mar 11, 2013

Vikas Reddy, Conrad Sanderson, Brian C. Lovell

Figure 1 for A Low-Complexity Algorithm for Static Background Estimation from Cluttered Image Sequences in Surveillance Contexts

Figure 2 for A Low-Complexity Algorithm for Static Background Estimation from Cluttered Image Sequences in Surveillance Contexts

Figure 3 for A Low-Complexity Algorithm for Static Background Estimation from Cluttered Image Sequences in Surveillance Contexts

Figure 4 for A Low-Complexity Algorithm for Static Background Estimation from Cluttered Image Sequences in Surveillance Contexts

Abstract:For the purposes of foreground estimation, the true background model is unavailable in many practical circumstances and needs to be estimated from cluttered image sequences. We propose a sequential technique for static background estimation in such conditions, with low computational and memory requirements. Image sequences are analysed on a block-by-block basis. For each block location a representative set is maintained which contains distinct blocks obtained along its temporal line. The background estimation is carried out in a Markov Random Field framework, where the optimal labelling solution is computed using iterated conditional modes. The clique potentials are computed based on the combined frequency response of the candidate block and its neighbourhood. It is assumed that the most appropriate block results in the smoothest response, indirectly enforcing the spatial continuity of structures within a scene. Experiments on real-life surveillance videos demonstrate that the proposed method obtains considerably better background estimates (both qualitatively and quantitatively) than median filtering and the recently proposed "intervals of stable intensity" method. Further experiments on the Wallflower dataset suggest that the combination of the proposed method with a foreground segmentation algorithm results in improved foreground segmentation.

* EURASIP Journal on Image and Video Processing, 2011

Via

Access Paper or Ask Questions