Title: Pose-Based Two-Stream Relational Networks for Action Recognition in Videos

May 22, 2018

Wei Wang, Jinjin Zhang, Chenyang Si, Liang Wang

[Figures 1-4 for Pose-Based Two-Stream Relational Networks for Action Recognition in Videos]

Abstract: Pose-based action recognition has recently attracted growing attention because it outperforms traditional appearance-based methods. However, two problems remain. First, existing pose-based methods generally recognize human actions from captured 3D human poses, which are very difficult to obtain in real scenarios. Second, few pose-based methods model action-related objects when recognizing human-object interactions, in which objects play an important role. To address these problems, we propose a pose-based two-stream relational network (PSRN) for action recognition. In PSRN, one stream models the temporal dynamics of the targeted 2D human pose sequences, which are extracted directly from raw videos, and the other stream models the action-related objects from a randomly sampled video frame. Most importantly, instead of fusing the two streams at the class score layer as in prior work, we propose a pose-object relational network to model the relationship between human poses and action-related objects. We evaluate the proposed PSRN on two challenging benchmarks, Sub-JHMDB and PennAction. Experimental results show that PSRN achieves state-of-the-art performance on Sub-JHMDB (80.2%) and PennAction (98.1%). Our work opens a new direction for action recognition by combining 2D human poses extracted from raw video with image appearance.
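
The abstract contrasts fusing two streams at the class score layer with modeling pose-object relations directly. As a rough illustration of the second idea only, the sketch below pairs one pose feature with each object feature, passes every pair through a shared layer, sums the results, and maps the sum to class scores. This is a generic relation-network-style fusion and not the authors' PSRN architecture: the function names, feature sizes, single-layer networks, and random inputs are all assumptions made for the example.

```python
import random

def linear(x, w, b):
    """Dense layer on a plain list vector: returns w @ x + b."""
    return [sum(wi * xi for wi, xi in zip(row, x)) + bi for row, bi in zip(w, b)]

def relu(x):
    return [max(0.0, v) for v in x]

def init_layer(n_in, n_out, rng):
    """Random weights and zero biases; stands in for trained parameters."""
    w = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    b = [0.0] * n_out
    return w, b

def relational_fusion(pose_feat, object_feats, g_layer, f_layer):
    """Score classes from pose-object pairs rather than from each stream alone.

    Hypothetical helper: g_layer embeds each (pose, object) pair, the pair
    embeddings are summed, and f_layer maps the sum to class scores.
    """
    hidden = len(g_layer[1])
    pooled = [0.0] * hidden
    for obj in object_feats:
        pair = pose_feat + obj                 # concatenate the two feature lists
        rel = relu(linear(pair, *g_layer))     # shared layer applied to every pair
        pooled = [p + r for p, r in zip(pooled, rel)]
    return linear(pooled, *f_layer)

# All sizes below are hypothetical and chosen only to keep the example small.
POSE_DIM, OBJ_DIM, HIDDEN, NUM_CLASSES = 8, 6, 16, 10

rng = random.Random(0)
g_layer = init_layer(POSE_DIM + OBJ_DIM, HIDDEN, rng)
f_layer = init_layer(HIDDEN, NUM_CLASSES, rng)

# Stand-ins for a pooled pose-stream feature and five object-region features.
pose_feat = [rng.random() for _ in range(POSE_DIM)]
object_feats = [[rng.random() for _ in range(OBJ_DIM)] for _ in range(5)]

scores = relational_fusion(pose_feat, object_feats, g_layer, f_layer)
print(len(scores))
```

Because the pair embeddings are summed, the scores do not depend on the order in which object features are supplied. A score-level fusion would instead compute class scores from each stream separately and combine them afterwards.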
