Title: Excitation Backprop for RNNs

Mar 08, 2018

Sarah Adel Bargal, Andrea Zunino, Donghyun Kim, Jianming Zhang, Vittorio Murino, Stan Sclaroff

Abstract: Deep models are state-of-the-art for many vision tasks including video action recognition and video captioning. Models are trained to caption or classify activity in videos, but little is known about the evidence used to make such decisions. Grounding decisions made by deep networks has been studied in spatial visual content, giving more insight into model predictions for images. However, such studies are relatively lacking for models of spatiotemporal visual content - videos. In this work, we devise a formulation that simultaneously grounds evidence in space and time, in a single pass, using top-down saliency. We visualize the spatiotemporal cues that contribute to a deep model's classification/captioning output using the model's internal representation. Based on these spatiotemporal cues, we are able to localize segments within a video that correspond with a specific action, or phrase from a caption, without explicitly optimizing/training for these tasks.

* IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018
* CVPR 2018 Camera Ready Version
