Mehrsan Javan Roshtkhari

Deep Learning of Appearance Models for Online Object Tracking

Jul 09, 2016

Mengyao Zhai, Mehrsan Javan Roshtkhari, Greg Mori

Abstract: This paper introduces a novel deep-learning-based approach for vision-based single-target tracking. We address this problem by proposing a network architecture that takes the input video frames and directly computes the tracking score for any candidate target location by estimating the probability distributions of the positive and negative examples. This is achieved by combining a deep convolutional neural network with a Bayesian loss layer in a unified framework. To deal with the limited number of positive training examples, the network is pre-trained offline for a generic image feature representation and then fine-tuned in multiple steps. An online fine-tuning step is carried out at every frame to learn the appearance of the target. We adopt a two-stage iterative algorithm to adaptively update the network parameters and maintain a probability density for target/non-target regions. The tracker has been tested on the standard tracking benchmark, and the results indicate that the proposed solution achieves state-of-the-art tracking results.
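
The idea of scoring a candidate location from the densities of positive and negative examples can be illustrated with a minimal sketch. It is not the paper's network: it assumes each candidate is already reduced to a single scalar feature and that the target and non-target densities are 1-D Gaussians. All function names and sample values are hypothetical.

```python
import math

def gaussian_log_pdf(x, mean, var):
    """Log density of a 1-D Gaussian (assumed density model; var must be positive)."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mean) ** 2 / var)

def fit_gaussian(values, min_var=1e-6):
    """Estimate mean and variance from scalar features, flooring the variance."""
    mean = sum(values) / len(values)
    var = sum((v - mean) ** 2 for v in values) / len(values)
    return mean, max(var, min_var)

def tracking_score(feature, pos_params, neg_params):
    """Log-likelihood ratio: positive when the target density explains the feature better."""
    return gaussian_log_pdf(feature, *pos_params) - gaussian_log_pdf(feature, *neg_params)

def best_candidate(candidate_features, pos_params, neg_params):
    """Return the index of the highest-scoring candidate and all scores."""
    scores = [tracking_score(f, pos_params, neg_params) for f in candidate_features]
    best = max(range(len(scores)), key=scores.__getitem__)
    return best, scores

# Hypothetical features of target (positive) and background (negative) patches.
pos_params = fit_gaussian([0.9, 1.0, 1.1])
neg_params = fit_gaussian([-1.2, -1.0, -0.8, -1.0])

# Score three candidate locations in a new frame and pick the best one.
best_index, scores = best_candidate([-0.9, 0.1, 0.95], pos_params, neg_params)
```

In an online setting, the two densities would be re-estimated as new positive and negative samples arrive, which loosely mirrors the per-frame update described in the abstract.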

Deep Structured Models For Group Activity Recognition

Jun 12, 2015

Zhiwei Deng, Mengyao Zhai, Lei Chen, Yuhao Liu, Srikanth Muralidharan, Mehrsan Javan Roshtkhari, Greg Mori

Abstract: This paper presents a deep neural-network-based hierarchical graphical model for individual and group activity recognition in surveillance scenes. Deep networks are used to recognize the actions of individual people in a scene. Next, a neural-network-based hierarchical graphical model refines the predicted labels for each class by considering dependencies between the classes. This refinement step mimics a message-passing step similar to inference in a probabilistic graphical model. We show that this approach can be effective in group activity recognition, with the deep graphical model improving recognition rates over baseline methods.
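
The refinement step can be pictured with a toy sketch of one message-passing round. It is an illustration under stated assumptions, not the model from the paper: the compatibility matrix `compat` is hand-set rather than learned, every person sends messages to every other person, and the names `refine` and `compat` are hypothetical.

```python
import math

def softmax(scores):
    """Turn raw class scores into a probability distribution."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def refine(person_scores, compat, steps=1):
    """Refine per-person class beliefs with messages from all other people.

    person_scores: one list of raw class scores per person.
    compat[a][b]: assumed weight for how strongly class b in another
                  person supports class a in this person.
    """
    beliefs = [softmax(s) for s in person_scores]
    for _ in range(steps):
        updated = []
        for i, scores in enumerate(person_scores):
            others = [b for j, b in enumerate(beliefs) if j != i]
            refined = []
            for a, score in enumerate(scores):
                # Sum of compatibility-weighted beliefs from the other people.
                message = sum(
                    compat[a][b] * belief[b]
                    for belief in others
                    for b in range(len(belief))
                )
                refined.append(score + message)
            updated.append(softmax(refined))
        beliefs = updated
    return beliefs

def argmax(values):
    return max(range(len(values)), key=values.__getitem__)

# Two classes (0 and 1); people in the same scene tend to share a class.
compat = [[1.0, -1.0], [-1.0, 1.0]]
# Two confident people and one ambiguous person leaning toward class 1.
person_scores = [[2.0, 0.0], [2.0, 0.0], [0.0, 0.2]]

labels_before = [argmax(softmax(s)) for s in person_scores]
labels_after = [argmax(b) for b in refine(person_scores, compat)]
```

Here the ambiguous third person is pulled toward the class its neighbours agree on, which is the kind of dependency-driven correction the abstract describes.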

Discovering Human Interactions in Videos with Limited Data Labeling

Feb 12, 2015

Mehran Khodabandeh, Arash Vahdat, Guang-Tong Zhou, Hossein Hajimirsadeghi, Mehrsan Javan Roshtkhari, Greg Mori, Stephen Se

Abstract: We present a novel approach for discovering human interactions in videos. Activity understanding techniques usually require a large number of labeled examples, which are not available in many practical cases. Here, we focus on recovering semantically meaningful clusters of human-human and human-object interactions in an unsupervised fashion. A new iterative solution is introduced based on Maximum Margin Clustering (MMC), which also accepts user feedback to refine clusters. This is achieved by formulating the whole process as a unified constrained latent max-margin clustering problem. Extensive experiments have been carried out on three challenging datasets: Collective Activity, VIRAT, and UT-Interaction. Empirical results demonstrate that the proposed algorithm can efficiently discover perfect semantic clusters of human interactions with only a small amount of labeling effort.
