Synh Viet-Uyen Ha

CDN-MEDAL: Two-stage Density and Difference Approximation Framework for Motion Analysis

Jun 07, 2021

Synh Viet-Uyen Ha, Cuong Tien Nguyen, Hung Ngoc Phan, Nhat Minh Chung, Phuong Hoai Ha

Abstract: Background modeling is a promising research area in video analysis with a variety of video surveillance applications. Recent years have witnessed the proliferation of deep neural networks in motion analysis through effective learning-based approaches. However, these techniques provide only a limited description of the observed scenes' properties, since they learn a single-valued mapping that approximates the temporal conditional averages of the target background. On the other hand, statistical learning in imagery domains, notably Gaussian Mixture Models combined with a foreground extraction step, has become one of the most prevalent approaches because it adapts well to dynamic context transformation. In this work, we propose a novel two-stage method of change detection with two convolutional neural networks. The first architecture is grounded on unsupervised statistical learning of Gaussian mixtures to describe the scenes' salient features. The second implements a lightweight pipeline for foreground detection. Our two-stage framework contains approximately 3.5K parameters in total but still converges rapidly on intricate motion patterns. Our experiments on publicly available datasets show that the proposed networks not only generalize to regions of moving objects in unseen cases with promising results but are also competitive in efficiency and effectiveness for foreground segmentation.

* 14 pages, 5 figures, to be submitted to IEEE TCSVT
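
The abstract contrasts learned networks with classical Gaussian-mixture background modeling followed by a foreground extraction step. As a rough illustration of that classical two-stage idea only (not the CDN-MEDAL networks themselves), the sketch below keeps a small online mixture of Gaussians per grayscale pixel and then thresholds the difference between a frame and the estimated background. All names (`PixelGMM`, `BackgroundModel`, `foreground_mask`) and all parameter values (learning rate, initial variance, thresholds) are assumptions chosen for the example, not taken from the paper.

```python
import math


class PixelGMM:
    """Online mixture of up to k Gaussians for ONE grayscale pixel.

    Simplified Stauffer-Grimson-style update; hypothetical helper,
    not the paper's density-approximation network.
    """

    def __init__(self, k=3, lr=0.05, init_var=225.0, match_sigmas=2.5):
        self.k = k                        # maximum number of components
        self.lr = lr                      # learning rate (assumed value)
        self.init_var = init_var          # variance of a newly created component
        self.match_sigmas = match_sigmas  # match radius in standard deviations
        self.weights, self.means, self.vars = [], [], []

    def update(self, x):
        # Stage 1a: find the first component whose mean is close enough to x.
        match = None
        for i, (m, v) in enumerate(zip(self.means, self.vars)):
            if abs(x - m) <= self.match_sigmas * math.sqrt(v):
                match = i
                break
        if match is None:
            # No match: add a component, or replace the weakest one.
            if len(self.means) < self.k:
                self.weights.append(self.lr)
                self.means.append(float(x))
                self.vars.append(self.init_var)
            else:
                j = min(range(self.k), key=lambda i: self.weights[i])
                self.weights[j] = self.lr
                self.means[j] = float(x)
                self.vars[j] = self.init_var
        else:
            # Match: raise the matched weight, decay the others,
            # and move the matched mean and variance toward x.
            for i in range(len(self.weights)):
                hit = 1.0 if i == match else 0.0
                self.weights[i] = (1 - self.lr) * self.weights[i] + self.lr * hit
            d = x - self.means[match]
            self.means[match] += self.lr * d
            self.vars[match] = max(1.0, (1 - self.lr) * self.vars[match] + self.lr * d * d)
        total = sum(self.weights)
        self.weights = [w / total for w in self.weights]

    def background(self):
        # Stage 1b: the background is the mean of the most persistent,
        # least spread-out component (largest weight / std).
        if not self.means:
            return 0.0
        best = max(range(len(self.means)),
                   key=lambda i: self.weights[i] / math.sqrt(self.vars[i]))
        return self.means[best]


class BackgroundModel:
    """Grid of per-pixel mixtures for frames given as lists of rows."""

    def __init__(self, height, width):
        self.pixels = [[PixelGMM() for _ in range(width)] for _ in range(height)]

    def update(self, frame):
        for gmm_row, value_row in zip(self.pixels, frame):
            for gmm, value in zip(gmm_row, value_row):
                gmm.update(value)

    def background(self):
        return [[gmm.background() for gmm in row] for row in self.pixels]


def foreground_mask(frame, background, threshold=30.0):
    # Stage 2: a pixel is foreground when it differs strongly from the background.
    return [[abs(p - b) > threshold for p, b in zip(frame_row, bg_row)]
            for frame_row, bg_row in zip(frame, background)]
```

In use, one would call `BackgroundModel.update` on each incoming frame and pass the current frame together with `BackgroundModel.background()` to `foreground_mask`. The paper replaces both hand-written stages with compact convolutional networks; the fixed threshold here stands in for the learned difference-approximation stage purely for illustration.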
