Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:ST-ABN: Visual Explanation Taking into Account Spatio-temporal Information for Video Recognition

Oct 29, 2021

Masahiro Mitsuhara, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi

Figure 1 for ST-ABN: Visual Explanation Taking into Account Spatio-temporal Information for Video Recognition

Figure 2 for ST-ABN: Visual Explanation Taking into Account Spatio-temporal Information for Video Recognition

Figure 3 for ST-ABN: Visual Explanation Taking into Account Spatio-temporal Information for Video Recognition

Figure 4 for ST-ABN: Visual Explanation Taking into Account Spatio-temporal Information for Video Recognition

Share this with someone who'll enjoy it:

Abstract:It is difficult for people to interpret the decision-making in the inference process of deep neural networks. Visual explanation is one method for interpreting the decision-making of deep learning. It analyzes the decision-making of 2D CNNs by visualizing an attention map that highlights discriminative regions. Visual explanation for interpreting the decision-making process in video recognition is more difficult because it is necessary to consider not only spatial but also temporal information, which is different from the case of still images. In this paper, we propose a visual explanation method called spatio-temporal attention branch network (ST-ABN) for video recognition. It enables visual explanation for both spatial and temporal information. ST-ABN acquires the importance of spatial and temporal information during network inference and applies it to recognition processing to improve recognition performance and visual explainability. Experimental results with Something-Something datasets V1 \& V2 demonstrated that ST-ABN enables visual explanation that takes into account spatial and temporal information simultaneously and improves recognition performance.

* 15 pages, 3 figures

View paper on

Share this with someone who'll enjoy it:

Title:ST-ABN: Visual Explanation Taking into Account Spatio-temporal Information for Video Recognition

Paper and Code