Jingqing Lu

Time and Frequency Network for Human Action Detection in Videos

Mar 08, 2021

Changhai Li, Huawei Chen, Jingqing Lu, Yang Huang, Yingying Liu

Abstract: Most deep learning approaches to human action detection in videos rely on spatiotemporal features but neglect important features in the frequency domain. In this work, we propose TFNet, an end-to-end network that considers time and frequency features simultaneously. TFNet has two branches: a time branch, formed of a three-dimensional convolutional neural network (3D-CNN), which takes the image sequence as input to extract time features; and a frequency branch, which extracts frequency features from DCT coefficients through a two-dimensional convolutional neural network (2D-CNN). Finally, to obtain the action patterns, the two sets of features are deeply fused under an attention mechanism. Experimental results on the JHMDB51-21 and UCF101-24 datasets demonstrate that our approach achieves remarkable performance for frame-mAP.
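
The abstract does not say how the DCT coefficients fed to the frequency branch are obtained. As a rough illustration only, the sketch below computes a block-wise 2D DCT-II of a single grayscale frame with NumPy. The 8x8 block size (borrowed from JPEG), the orthonormal scaling, and the function names `dct_matrix` and `blockwise_dct` are assumptions made for this example, not details taken from the paper.

```python
import numpy as np


def dct_matrix(n: int = 8) -> np.ndarray:
    """Build the orthonormal DCT-II basis matrix of size n x n."""
    k = np.arange(n).reshape(-1, 1)  # frequency index (rows)
    i = np.arange(n).reshape(1, -1)  # sample index (columns)
    mat = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    mat[0, :] = np.sqrt(1.0 / n)     # DC row uses a different scale factor
    return mat


def blockwise_dct(frame: np.ndarray, block: int = 8) -> np.ndarray:
    """Return DCT coefficients shaped (H/block, W/block, block*block).

    Assumption: the frame is 2D (grayscale) and both dimensions are
    multiples of the block size.
    """
    h, w = frame.shape
    if h % block or w % block:
        raise ValueError("frame dimensions must be multiples of the block size")
    d = dct_matrix(block)
    # Split the frame into non-overlapping blocks: (H/b, W/b, b, b).
    blocks = frame.reshape(h // block, block, w // block, block).transpose(0, 2, 1, 3)
    # Separable 2D DCT of every block: D @ X @ D^T.
    coeffs = d @ blocks @ d.T
    # Flatten each block so its coefficients become channels.
    return coeffs.reshape(h // block, w // block, block * block)
```

Under these assumptions, a 16x16 frame yields a 2x2 grid of positions with 64 frequency channels each, a layout that a 2D-CNN could consume as a multi-channel image.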

* 8 pages, 6 figures
