Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Tracking Objects and Activities with Attention for Temporal Sentence Grounding

Feb 21, 2023

Zeyu Xiong, Daizong Liu, Pan Zhou, Jiahao Zhu

Figure 1 for Tracking Objects and Activities with Attention for Temporal Sentence Grounding

Figure 2 for Tracking Objects and Activities with Attention for Temporal Sentence Grounding

Figure 3 for Tracking Objects and Activities with Attention for Temporal Sentence Grounding

Figure 4 for Tracking Objects and Activities with Attention for Temporal Sentence Grounding

Share this with someone who'll enjoy it:

Abstract:Temporal sentence grounding (TSG) aims to localize the temporal segment which is semantically aligned with a natural language query in an untrimmed video.Most existing methods extract frame-grained features or object-grained features by 3D ConvNet or detection network under a conventional TSG framework, failing to capture the subtle differences between frames or to model the spatio-temporal behavior of core persons/objects. In this paper, we introduce a new perspective to address the TSG task by tracking pivotal objects and activities to learn more fine-grained spatio-temporal behaviors. Specifically, we propose a novel Temporal Sentence Tracking Network (TSTNet), which contains (A) a Cross-modal Targets Generator to generate multi-modal templates and search space, filtering objects and activities, and (B) a Temporal Sentence Tracker to track multi-modal targets for modeling the targets' behavior and to predict query-related segment. Extensive experiments and comparisons with state-of-the-arts are conducted on challenging benchmarks: Charades-STA and TACoS. And our TSTNet achieves the leading performance with a considerable real-time speed.

* accepted by ICASSP2023

View paper on

Share this with someone who'll enjoy it:

Title:Tracking Objects and Activities with Attention for Temporal Sentence Grounding

Paper and Code