Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Hierarchical Compositional Representations for Few-shot Action Recognition

Aug 19, 2022

Changzhen Li, Jie Zhang, Shuzhe Wu, Xin Jin, Shiguang Shan

Figure 1 for Hierarchical Compositional Representations for Few-shot Action Recognition

Figure 2 for Hierarchical Compositional Representations for Few-shot Action Recognition

Figure 3 for Hierarchical Compositional Representations for Few-shot Action Recognition

Figure 4 for Hierarchical Compositional Representations for Few-shot Action Recognition

Share this with someone who'll enjoy it:

Abstract:Recently action recognition has received more and more attention for its comprehensive and practical applications in intelligent surveillance and human-computer interaction. However, few-shot action recognition has not been well explored and remains challenging because of data scarcity. In this paper, we propose a novel hierarchical compositional representations (HCR) learning approach for few-shot action recognition. Specifically, we divide a complicated action into several sub-actions by carefully designed hierarchical clustering and further decompose the sub-actions into more fine-grained spatially attentional sub-actions (SAS-actions). Although there exist large differences between base classes and novel classes, they can share similar patterns in sub-actions or SAS-actions. Furthermore, we adopt the Earth Mover's Distance in the transportation problem to measure the similarity between video samples in terms of sub-action representations. It computes the optimal matching flows between sub-actions as distance metric, which is favorable for comparing fine-grained patterns. Extensive experiments show our method achieves the state-of-the-art results on HMDB51, UCF101 and Kinetics datasets.

View paper on

Share this with someone who'll enjoy it:

Title:Hierarchical Compositional Representations for Few-shot Action Recognition

Paper and Code