Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:A Grammatical Compositional Model for Video Action Detection

Oct 04, 2023

Zhijun Zhang, Xu Zou, Jiahuan Zhou, Sheng Zhong, Ying Wu

Figure 1 for A Grammatical Compositional Model for Video Action Detection

Figure 2 for A Grammatical Compositional Model for Video Action Detection

Figure 3 for A Grammatical Compositional Model for Video Action Detection

Figure 4 for A Grammatical Compositional Model for Video Action Detection

Share this with someone who'll enjoy it:

Abstract:Analysis of human actions in videos demands understanding complex human dynamics, as well as the interaction between actors and context. However, these interaction relationships usually exhibit large intra-class variations from diverse human poses or object manipulations, and fine-grained inter-class differences between similar actions. Thus the performance of existing methods is severely limited. Motivated by the observation that interactive actions can be decomposed into actor dynamics and participating objects or humans, we propose to investigate the composite property of them. In this paper, we present a novel Grammatical Compositional Model (GCM) for action detection based on typical And-Or graphs. Our model exploits the intrinsic structures and latent relationships of actions in a hierarchical manner to harness both the compositionality of grammar models and the capability of expressing rich features of DNNs. The proposed model can be readily embodied into a neural network module for efficient optimization in an end-to-end manner. Extensive experiments are conducted on the AVA dataset and the Something-Else task to demonstrate the superiority of our model, meanwhile the interpretability is enhanced through an inference parsing procedure.

View paper on

Share this with someone who'll enjoy it:

Title:A Grammatical Compositional Model for Video Action Detection

Paper and Code