Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:SpatialTracker: Tracking Any 2D Pixels in 3D Space

Apr 05, 2024

Yuxi Xiao, Qianqian Wang, Shangzhan Zhang, Nan Xue, Sida Peng, Yujun Shen, Xiaowei Zhou

Figure 1 for SpatialTracker: Tracking Any 2D Pixels in 3D Space

Figure 2 for SpatialTracker: Tracking Any 2D Pixels in 3D Space

Figure 3 for SpatialTracker: Tracking Any 2D Pixels in 3D Space

Figure 4 for SpatialTracker: Tracking Any 2D Pixels in 3D Space

Share this with someone who'll enjoy it:

Abstract:Recovering dense and long-range pixel motion in videos is a challenging problem. Part of the difficulty arises from the 3D-to-2D projection process, leading to occlusions and discontinuities in the 2D motion domain. While 2D motion can be intricate, we posit that the underlying 3D motion can often be simple and low-dimensional. In this work, we propose to estimate point trajectories in 3D space to mitigate the issues caused by image projection. Our method, named SpatialTracker, lifts 2D pixels to 3D using monocular depth estimators, represents the 3D content of each frame efficiently using a triplane representation, and performs iterative updates using a transformer to estimate 3D trajectories. Tracking in 3D allows us to leverage as-rigid-as-possible (ARAP) constraints while simultaneously learning a rigidity embedding that clusters pixels into different rigid parts. Extensive evaluation shows that our approach achieves state-of-the-art tracking performance both qualitatively and quantitatively, particularly in challenging scenarios such as out-of-plane rotation.

* Accepted to CVPR 2024 (selected as highlight paper). Project page: https://henry123-boy.github.io/SpaTracker/

View paper on

Share this with someone who'll enjoy it:

Title:SpatialTracker: Tracking Any 2D Pixels in 3D Space

Paper and Code