Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:SCANet: Scene Complexity Aware Network for Weakly-Supervised Video Moment Retrieval

Oct 08, 2023

Sunjae Yoon, Gwanhyeong Koo, Dahyun Kim, Chang D. Yoo

Figure 1 for SCANet: Scene Complexity Aware Network for Weakly-Supervised Video Moment Retrieval

Figure 2 for SCANet: Scene Complexity Aware Network for Weakly-Supervised Video Moment Retrieval

Figure 3 for SCANet: Scene Complexity Aware Network for Weakly-Supervised Video Moment Retrieval

Figure 4 for SCANet: Scene Complexity Aware Network for Weakly-Supervised Video Moment Retrieval

Share this with someone who'll enjoy it:

Abstract:Video moment retrieval aims to localize moments in video corresponding to a given language query. To avoid the expensive cost of annotating the temporal moments, weakly-supervised VMR (wsVMR) systems have been studied. For such systems, generating a number of proposals as moment candidates and then selecting the most appropriate proposal has been a popular approach. These proposals are assumed to contain many distinguishable scenes in a video as candidates. However, existing proposals of wsVMR systems do not respect the varying numbers of scenes in each video, where the proposals are heuristically determined irrespective of the video. We argue that the retrieval system should be able to counter the complexities caused by varying numbers of scenes in each video. To this end, we present a novel concept of a retrieval system referred to as Scene Complexity Aware Network (SCANet), which measures the `scene complexity' of multiple scenes in each video and generates adaptive proposals responding to variable complexities of scenes in each video. Experimental results on three retrieval benchmarks (i.e., Charades-STA, ActivityNet, TVR) achieve state-of-the-art performances and demonstrate the effectiveness of incorporating the scene complexity.

* 11 pages, Accepted in ICCV 2023

View paper on

Share this with someone who'll enjoy it:

Title:SCANet: Scene Complexity Aware Network for Weakly-Supervised Video Moment Retrieval

Paper and Code