Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Exploiting Long-Term Dependencies for Generating Dynamic Scene Graphs

Dec 18, 2021

Shengyu Feng, Subarna Tripathi, Hesham Mostafa, Marcel Nassar, Somdeb Majumdar

Figure 1 for Exploiting Long-Term Dependencies for Generating Dynamic Scene Graphs

Figure 2 for Exploiting Long-Term Dependencies for Generating Dynamic Scene Graphs

Figure 3 for Exploiting Long-Term Dependencies for Generating Dynamic Scene Graphs

Figure 4 for Exploiting Long-Term Dependencies for Generating Dynamic Scene Graphs

Share this with someone who'll enjoy it:

Abstract:Structured video representation in the form of dynamic scene graphs is an effective tool for several video understanding tasks. Compared to the task of scene graph generation from images, dynamic scene graph generation is more challenging due to the temporal dynamics of the scene and the inherent temporal fluctuations of predictions. We show that capturing long-term dependencies is the key to effective generation of dynamic scene graphs. We present the detect-track-recognize paradigm by constructing consistent long-term object tracklets from a video, followed by transformers to capture the dynamics of objects and visual relations. Experimental results demonstrate that our Dynamic Scene Graph Detection Transformer (DSG-DETR) outperforms state-of-the-art methods by a significant margin on the benchmark dataset Action Genome. We also perform ablation studies and validate the effectiveness of each component of the proposed approach.

View paper on

Share this with someone who'll enjoy it:

Title:Exploiting Long-Term Dependencies for Generating Dynamic Scene Graphs

Paper and Code