Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ahmad Sufril Azlan Mohmamed

ARN-LSTM: A Multi-Stream Attention-Based Model for Action Recognition with Temporal Dynamics

Nov 04, 2024

Chuanchuan Wang, Ahmad Sufril Azlan Mohmamed, Xiao Yang, Xiang Li

Figure 1 for ARN-LSTM: A Multi-Stream Attention-Based Model for Action Recognition with Temporal Dynamics

Figure 2 for ARN-LSTM: A Multi-Stream Attention-Based Model for Action Recognition with Temporal Dynamics

Figure 3 for ARN-LSTM: A Multi-Stream Attention-Based Model for Action Recognition with Temporal Dynamics

Figure 4 for ARN-LSTM: A Multi-Stream Attention-Based Model for Action Recognition with Temporal Dynamics

Abstract:This paper presents ARN-LSTM, a novel multi-stream action recognition model designed to address the challenge of simultaneously capturing spatial motion and temporal dynamics in action sequences. Traditional methods often focus solely on spatial or temporal features, limiting their ability to comprehend complex human activities fully. Our proposed model integrates joint, motion, and temporal information through a multi-stream fusion architecture. Specifically, it comprises a joint stream for extracting skeleton features, a temporal stream for capturing dynamic temporal features, and an ARN-LSTM block that utilizes Time-Distributed Long Short-Term Memory (TD-LSTM) layers followed by an Attention Relation Network (ARN) to model temporal relations. The outputs from these streams are fused in a fully connected layer to provide the final action prediction. Evaluations on the NTU RGB+D 60 and NTU RGB+D 120 datasets demonstrate the effectiveness of our model, achieving effective performance, particularly in group activity recognition.

Via

Access Paper or Ask Questions