Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Self-Feedback DETR for Temporal Action Detection

Aug 21, 2023

Jihwan Kim, Miso Lee, Jae-Pil Heo

Figure 1 for Self-Feedback DETR for Temporal Action Detection

Figure 2 for Self-Feedback DETR for Temporal Action Detection

Figure 3 for Self-Feedback DETR for Temporal Action Detection

Figure 4 for Self-Feedback DETR for Temporal Action Detection

Share this with someone who'll enjoy it:

Abstract:Temporal Action Detection (TAD) is challenging but fundamental for real-world video applications. Recently, DETR-based models have been devised for TAD but have not performed well yet. In this paper, we point out the problem in the self-attention of DETR for TAD; the attention modules focus on a few key elements, called temporal collapse problem. It degrades the capability of the encoder and decoder since their self-attention modules play no role. To solve the problem, we propose a novel framework, Self-DETR, which utilizes cross-attention maps of the decoder to reactivate self-attention modules. We recover the relationship between encoder features by simple matrix multiplication of the cross-attention map and its transpose. Likewise, we also get the information within decoder queries. By guiding collapsed self-attention maps with the guidance map calculated, we settle down the temporal collapse of self-attention modules in the encoder and decoder. Our extensive experiments demonstrate that Self-DETR resolves the temporal collapse problem by keeping high diversity of attention over all layers.

* Accepted to ICCV 2023

View paper on

Share this with someone who'll enjoy it:

Title:Self-Feedback DETR for Temporal Action Detection

Paper and Code