Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Context-TAP: Tracking Any Point Demands Spatial Context Features

Jun 03, 2023

Weikang Bian, Zhaoyang Huang, Xiaoyu Shi, Yitong Dong, Yijin Li, Hongsheng Li

Figure 1 for Context-TAP: Tracking Any Point Demands Spatial Context Features

Figure 2 for Context-TAP: Tracking Any Point Demands Spatial Context Features

Figure 3 for Context-TAP: Tracking Any Point Demands Spatial Context Features

Figure 4 for Context-TAP: Tracking Any Point Demands Spatial Context Features

Share this with someone who'll enjoy it:

Abstract:We tackle the problem of Tracking Any Point (TAP) in videos, which specifically aims at estimating persistent long-term trajectories of query points in videos. Previous methods attempted to estimate these trajectories independently to incorporate longer image sequences, therefore, ignoring the potential benefits of incorporating spatial context features. We argue that independent video point tracking also demands spatial context features. To this end, we propose a novel framework Context-TAP, which effectively improves point trajectory accuracy by aggregating spatial context features in videos. Context-TAP contains two main modules: 1) a SOurse Feature Enhancement (SOFE) module, and 2) a TArget Feature Aggregation (TAFA) module. Context-TAP significantly improves PIPs all-sided, reducing 11.4% Average Trajectory Error of Occluded Points (ATE-Occ) on CroHD and increasing 11.8% Average Percentage of Correct Keypoint (A-PCK) on TAP-Vid-Kinectics. Demos are available at this $\href{https://wkbian.github.io/Projects/Context-TAP/}{webpage}$.

* Project Page: this $\href{https://wkbian.github.io/Projects/Context-TAP/}{webpage}$

View paper on

Share this with someone who'll enjoy it:

Title:Context-TAP: Tracking Any Point Demands Spatial Context Features

Paper and Code