Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Modeling long-term interactions to enhance action recognition

Apr 23, 2021

Alejandro Cartas, Petia Radeva, Mariella Dimiccoli

Figure 1 for Modeling long-term interactions to enhance action recognition

Figure 2 for Modeling long-term interactions to enhance action recognition

Figure 3 for Modeling long-term interactions to enhance action recognition

Figure 4 for Modeling long-term interactions to enhance action recognition

Share this with someone who'll enjoy it:

Abstract:In this paper, we propose a new approach to under-stand actions in egocentric videos that exploits the semantics of object interactions at both frame and temporal levels. At the frame level, we use a region-based approach that takes as input a primary region roughly corresponding to the user hands and a set of secondary regions potentially corresponding to the interacting objects and calculates the action score through a CNN formulation. This information is then fed to a Hierarchical LongShort-Term Memory Network (HLSTM) that captures temporal dependencies between actions within and across shots. Ablation studies thoroughly validate the proposed approach, showing in particular that both levels of the HLSTM architecture contribute to performance improvement. Furthermore, quantitative comparisons show that the proposed approach outperforms the state-of-the-art in terms of action recognition on standard benchmarks,without relying on motion information

* Accepted to the 25th International Conference on Pattern Recognition (ICPR), 2021

View paper on

Share this with someone who'll enjoy it:

Title:Modeling long-term interactions to enhance action recognition

Paper and Code