Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

David Aha

U.S. Naval Research Laboratory

SPARCNN: SPAtially Related Convolutional Neural Networks

Aug 24, 2017

JT Turner, Kalyan Moy Gupta, David Aha

Figure 1 for SPARCNN: SPAtially Related Convolutional Neural Networks

Figure 2 for SPARCNN: SPAtially Related Convolutional Neural Networks

Figure 3 for SPARCNN: SPAtially Related Convolutional Neural Networks

Figure 4 for SPARCNN: SPAtially Related Convolutional Neural Networks

Abstract:The ability to accurately detect and classify objects at varying pixel sizes in cluttered scenes is crucial to many Navy applications. However, detection performance of existing state-of the-art approaches such as convolutional neural networks (CNNs) degrade and suffer when applied to such cluttered and multi-object detection tasks. We conjecture that spatial relationships between objects in an image could be exploited to significantly improve detection accuracy, an approach that had not yet been considered by any existing techniques (to the best of our knowledge) at the time the research was conducted. We introduce a detection and classification technique called Spatially Related Detection with Convolutional Neural Networks (SPARCNN) that learns and exploits a probabilistic representation of inter-object spatial configurations within images from training sets for more effective region proposals to use with state-of-the-art CNNs. Our empirical evaluation of SPARCNN on the VOC 2007 dataset shows that it increases classification accuracy by 8% when compared to a region proposal technique that does not exploit spatial relations. More importantly, we obtained a higher performance boost of 18.8% when task difficulty in the test set is increased by including highly obscured objects and increased image clutter.

* 6 pages, AIPR 2016 submission

Via

Access Paper or Ask Questions

Convolutional Architecture Exploration for Action Recognition and Image Classification

Dec 23, 2015

J. T. Turner, David Aha, Leslie Smith, Kalyan Moy Gupta

Figure 1 for Convolutional Architecture Exploration for Action Recognition and Image Classification

Figure 2 for Convolutional Architecture Exploration for Action Recognition and Image Classification

Figure 3 for Convolutional Architecture Exploration for Action Recognition and Image Classification

Figure 4 for Convolutional Architecture Exploration for Action Recognition and Image Classification

Abstract:Convolutional Architecture for Fast Feature Encoding (CAFFE) [11] is a software package for the training, classifying, and feature extraction of images. The UCF Sports Action dataset is a widely used machine learning dataset that has 200 videos taken in 720x480 resolution of 9 different sporting activities: diving, golf, swinging, kicking, lifting, horseback riding, running, skateboarding, swinging (various gymnastics), and walking. In this report we report on a caffe feature extraction pipeline of images taken from the videos of the UCF Sports Action dataset. A similar test was performed on overfeat, and results were inferior to caffe. This study is intended to explore the architecture and hyper parameters needed for effective static analysis of action in videos and classification over a variety of image datasets.

* 12 pages. 11 tables. 0 Images. Written Summer 2014

Via

Access Paper or Ask Questions

Semi-Supervised Collective Classification via Hybrid Label Regularization

Jun 27, 2012

Luke McDowell, David Aha

Figure 1 for Semi-Supervised Collective Classification via Hybrid Label Regularization

Figure 2 for Semi-Supervised Collective Classification via Hybrid Label Regularization

Figure 3 for Semi-Supervised Collective Classification via Hybrid Label Regularization

Figure 4 for Semi-Supervised Collective Classification via Hybrid Label Regularization

Abstract:Many classification problems involve data instances that are interlinked with each other, such as webpages connected by hyperlinks. Techniques for "collective classification" (CC) often increase accuracy for such data graphs, but usually require a fully-labeled training graph. In contrast, we examine how to improve the semi-supervised learning of CC models when given only a sparsely-labeled graph, a common situation. We first describe how to use novel combinations of classifiers to exploit the different characteristics of the relational features vs. the non-relational features. We also extend the ideas of "label regularization" to such hybrid classifiers, enabling them to leverage the unlabeled data to bias the learning process. We find that these techniques, which are efficient and easy to implement, significantly increase accuracy on three real datasets. In addition, our results explain conflicting findings from prior related studies.

* Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012)

Via

Access Paper or Ask Questions