Tingyang Zhang

ProTracker: Probabilistic Integration for Robust and Accurate Point Tracking

Jan 06, 2025

Tingyang Zhang, Chen Wang, Zhiyang Dou, Qingzhe Gao, Jiahui Lei, Baoquan Chen, Lingjie Liu

Abstract: In this paper, we propose ProTracker, a novel framework for robust and accurate long-term dense tracking of arbitrary points in videos. The key idea of our method is to use probabilistic integration to refine multiple predictions from both optical flow and semantic features, yielding robust short-term and long-term tracking. Specifically, we integrate optical flow estimations in a probabilistic manner, producing smooth and accurate trajectories by maximizing the likelihood of each prediction. To effectively re-localize challenging points that disappear and reappear due to occlusion, we further incorporate long-term feature correspondence into our flow predictions for continuous trajectory generation. Extensive experiments show that ProTracker achieves state-of-the-art performance among unsupervised and self-supervised approaches, and even outperforms supervised methods on several benchmarks. Our code and model will be made publicly available upon publication.

* Project page: https://michaelszj.github.io/protracker
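
The abstract does not spell out how the flow predictions are integrated. As a minimal sketch of the general idea only, assume each prediction of a point's position is an isotropic 2D Gaussian with a known variance (this model, the function name `fuse_predictions`, and its inputs are illustrative assumptions, not the paper's formulation). Under that assumption, the position that maximizes the joint likelihood is the inverse-variance weighted mean:

```python
from typing import List, Tuple

Point = Tuple[float, float]


def fuse_predictions(points: List[Point], variances: List[float]) -> Tuple[Point, float]:
    """Fuse 2D position predictions modeled as isotropic Gaussians.

    Hypothetical helper for illustration; not taken from ProTracker.
    Maximizing the product of the Gaussian likelihoods gives the
    inverse-variance weighted mean. The fused variance is the inverse
    of the summed precisions.
    """
    if not points or len(points) != len(variances):
        raise ValueError("need exactly one variance per prediction")
    if any(v <= 0 for v in variances):
        raise ValueError("variances must be positive")

    # Precision (1 / variance) acts as the weight of each prediction.
    precisions = [1.0 / v for v in variances]
    total = sum(precisions)

    x = sum(w * p[0] for w, p in zip(precisions, points)) / total
    y = sum(w * p[1] for w, p in zip(precisions, points)) / total
    return (x, y), 1.0 / total


# Two predictions for the same point: the second is three times less certain,
# so the fused estimate stays closer to the first.
fused_point, fused_variance = fuse_predictions([(0.0, 0.0), (2.0, 2.0)], [1.0, 3.0])
```

A confident prediction (small variance) dominates the fused position, and the fused variance is never larger than the smallest input variance, which is one reason such fusion tends to produce smoother trajectories than any single estimate.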

BAGS: Building Animatable Gaussian Splatting from a Monocular Video with Diffusion Priors

Mar 18, 2024

Tingyang Zhang, Qingzhe Gao, Weiyu Li, Libin Liu, Baoquan Chen

Abstract: Animatable 3D reconstruction has significant applications across various fields, yet such assets are still created primarily by hand by artists. Recently, some studies have successfully constructed animatable 3D models from monocular videos. However, these approaches require sufficient view coverage of the object within the input video and typically incur significant time and computational costs for training and rendering. These limitations restrict their practical applications. In this work, we propose a method to build animatable 3D Gaussian Splatting from monocular video with diffusion priors. The 3D Gaussian representations significantly accelerate the training and rendering process, and the diffusion priors allow the method to learn 3D models from limited viewpoints. We also present a rigid regularization to enhance the utilization of the priors. We perform an extensive evaluation across various real-world videos, demonstrating superior performance compared to current state-of-the-art methods.

* https://talegqz.github.io/BAGS/
