Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:TAPVid-3D: A Benchmark for Tracking Any Point in 3D

Jul 08, 2024

Skanda Koppula, Ignacio Rocco, Yi Yang, Joe Heyward, João Carreira, Andrew Zisserman, Gabriel Brostow, Carl Doersch

Figure 1 for TAPVid-3D: A Benchmark for Tracking Any Point in 3D

Figure 2 for TAPVid-3D: A Benchmark for Tracking Any Point in 3D

Figure 3 for TAPVid-3D: A Benchmark for Tracking Any Point in 3D

Figure 4 for TAPVid-3D: A Benchmark for Tracking Any Point in 3D

Share this with someone who'll enjoy it:

Abstract:We introduce a new benchmark, TAPVid-3D, for evaluating the task of long-range Tracking Any Point in 3D (TAP-3D). While point tracking in two dimensions (TAP) has many benchmarks measuring performance on real-world videos, such as TAPVid-DAVIS, three-dimensional point tracking has none. To this end, leveraging existing footage, we build a new benchmark for 3D point tracking featuring 4,000+ real-world videos, composed of three different data sources spanning a variety of object types, motion patterns, and indoor and outdoor environments. To measure performance on the TAP-3D task, we formulate a collection of metrics that extend the Jaccard-based metric used in TAP to handle the complexities of ambiguous depth scales across models, occlusions, and multi-track spatio-temporal smoothness. We manually verify a large sample of trajectories to ensure correct video annotations, and assess the current state of the TAP-3D task by constructing competitive baselines using existing tracking models. We anticipate this benchmark will serve as a guidepost to improve our ability to understand precise 3D motion and surface deformation from monocular video. Code for dataset download, generation, and model evaluation is available at https://tapvid3d.github.io

View paper on

Share this with someone who'll enjoy it:

Title:TAPVid-3D: A Benchmark for Tracking Any Point in 3D

Paper and Code