Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:DEAR: Depth-Enhanced Action Recognition

Aug 28, 2024

Sadegh Rahmaniboldaji, Filip Rybansky, Quoc Vuong, Frank Guerin, Andrew Gilbert

Figure 1 for DEAR: Depth-Enhanced Action Recognition

Figure 2 for DEAR: Depth-Enhanced Action Recognition

Share this with someone who'll enjoy it:

Abstract:Detecting actions in videos, particularly within cluttered scenes, poses significant challenges due to the limitations of 2D frame analysis from a camera perspective. Unlike human vision, which benefits from 3D understanding, recognizing actions in such environments can be difficult. This research introduces a novel approach integrating 3D features and depth maps alongside RGB features to enhance action recognition accuracy. Our method involves processing estimated depth maps through a separate branch from the RGB feature encoder and fusing the features to understand the scene and actions comprehensively. Using the Side4Video framework and VideoMamba, which employ CLIP and VisionMamba for spatial feature extraction, our approach outperformed our implementation of the Side4Video network on the Something-Something V2 dataset. Our code is available at: https://github.com/SadeghRahmaniB/DEAR

* 5 pages, 1 figure, 1 table, accepted at Human-inspired Computer Vision, ECCV

View paper on

Share this with someone who'll enjoy it:

Title:DEAR: Depth-Enhanced Action Recognition

Paper and Code