Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:A Survey on Backbones for Deep Video Action Recognition

May 09, 2024

Zixuan Tang, Youjun Zhao, Yuhang Wen, Mengyuan Liu

Figure 1 for A Survey on Backbones for Deep Video Action Recognition

Figure 2 for A Survey on Backbones for Deep Video Action Recognition

Figure 3 for A Survey on Backbones for Deep Video Action Recognition

Share this with someone who'll enjoy it:

Abstract:Action recognition is a key technology in building interactive metaverses. With the rapid development of deep learning, methods in action recognition have also achieved great advancement. Researchers design and implement the backbones referring to multiple standpoints, which leads to the diversity of methods and encountering new challenges. This paper reviews several action recognition methods based on deep neural networks. We introduce these methods in three parts: 1) Two-Streams networks and their variants, which, specifically in this paper, use RGB video frame and optical flow modality as input; 2) 3D convolutional networks, which make efforts in taking advantage of RGB modality directly while extracting different motion information is no longer necessary; 3) Transformer-based methods, which introduce the model from natural language processing into computer vision and video understanding. We offer objective sights in this review and hopefully provide a reference for future research.

* This paper has been accepted by ICME workshop

View paper on

Share this with someone who'll enjoy it:

Title:A Survey on Backbones for Deep Video Action Recognition

Paper and Code