Abstract: Lightweight creation of 3D digital avatars is a highly desirable but challenging task. With only sparse videos of a person under unknown illumination, we propose a method to create relightable and animatable neural avatars, which can be used to synthesize photorealistic images of humans under novel viewpoints, body poses, and lighting. The key challenge here is to disentangle the geometry, the material of the clothed body, and the lighting, which becomes more difficult due to the complex geometry and shadow changes caused by body motions. To solve this ill-posed problem, we propose novel techniques to better model the geometry and shadow changes. For geometry change modeling, we propose an invertible deformation field, which helps to solve the inverse skinning problem and leads to better geometry quality. To model the spatially and temporally varying shading cues, we propose a pose-aware part-wise light visibility network to estimate light occlusion. Extensive experiments on synthetic and real datasets show that our approach reconstructs high-quality geometry and generates realistic shadows under different body poses. Code and data are available at https://wenbin-lin.github.io/RelightableAvatar-page/.
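Below is a minimal PyTorch sketch of the pose-aware, part-wise light-visibility idea, not the released implementation: it assumes each body part owns a small MLP mapping (query point, light direction, that part's pose) to a per-part visibility term, and that a point is lit only if no part occludes it, so the final visibility is the product of the per-part terms. All class and parameter names here are illustrative.

import torch
import torch.nn as nn

class PartVisibilityMLP(nn.Module):
    """Hypothetical per-part MLP: (3D point, light direction, part pose) -> visibility in [0, 1]."""
    def __init__(self, pose_dim: int, hidden: int = 128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(3 + 3 + pose_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 1), nn.Sigmoid(),
        )

    def forward(self, x, light_dir, part_pose):
        return self.net(torch.cat([x, light_dir, part_pose], dim=-1))

class PartWiseVisibility(nn.Module):
    """Combine per-part predictions; multiplying them assumes independent occluders."""
    def __init__(self, num_parts: int, pose_dim: int):
        super().__init__()
        self.parts = nn.ModuleList(PartVisibilityMLP(pose_dim) for _ in range(num_parts))

    def forward(self, x, light_dir, part_poses):
        # x: (B, 3) query points, light_dir: (B, 3), part_poses: (num_parts, B, pose_dim)
        vis = torch.ones(x.shape[0], 1)
        for mlp, pose in zip(self.parts, part_poses):
            vis = vis * mlp(x, light_dir, pose)
        return vis

# Toy usage with random tensors, just to show the expected shapes.
model = PartWiseVisibility(num_parts=4, pose_dim=6)
x = torch.rand(8, 3); d = torch.randn(8, 3); poses = torch.rand(4, 8, 6)
print(model(x, d, poses).shape)  # torch.Size([8, 1])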
Abstract: Neural radiance fields (NeRF) achieve highly photo-realistic novel-view synthesis, but editing the scenes modeled by NeRF-based methods remains challenging, especially for dynamic scenes. We propose editable neural radiance fields that enable end-users to easily edit dynamic scenes and even support topological changes. Given an image sequence from a single camera as input, our network is trained fully automatically and models topologically varying dynamics using automatically selected surface key points. End-users can then edit the scene simply by dragging the key points to desired new positions. To achieve this, we propose a scene analysis method to detect and initialize key points by considering the dynamics in the scene, and a weighted key-point strategy to model topologically varying dynamics through joint optimization of key points and weights. Our method supports intuitive multi-dimensional (up to 3D) editing and can generate novel scenes that are unseen in the input sequence. Experiments demonstrate that our method achieves high-quality editing on various dynamic scenes and outperforms the state-of-the-art. We will release our code and captured data.
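The following sketch illustrates one plausible form of key-point-driven editing, not the paper's exact formulation: a query point is displaced by a blend of the user-dragged key-point displacements, where the blend combines a distance falloff with learnable per-key weights that can be jointly optimized. The class name, the Gaussian falloff, and the softmax blending are assumptions made for illustration.

import torch
import torch.nn as nn

class KeyPointDeformer(nn.Module):
    """Hypothetical weighted key-point warp: blend key-point displacements onto query points."""
    def __init__(self, num_keypoints: int):
        super().__init__()
        # Learnable per-key-point influence weights (could be optimized jointly with the field).
        self.key_weights = nn.Parameter(torch.zeros(num_keypoints))

    def forward(self, x, keypoints_src, keypoints_dst, sigma: float = 0.1):
        # x: (B, 3) query points; keypoints_*: (K, 3) key points before/after user dragging.
        d2 = ((x[:, None, :] - keypoints_src[None, :, :]) ** 2).sum(-1)   # (B, K) squared distances
        logits = -d2 / (2 * sigma ** 2) + self.key_weights[None, :]       # distance falloff + learned weights
        w = torch.softmax(logits, dim=-1)                                 # (B, K) blend weights
        disp = keypoints_dst - keypoints_src                              # (K, 3) user edit
        return x + w @ disp                                               # deformed query points

# Toy usage: drag one key point along z and warp some sample points.
deformer = KeyPointDeformer(num_keypoints=3)
src = torch.tensor([[0., 0., 0.], [1., 0., 0.], [0., 1., 0.]])
dst = src.clone(); dst[0, 2] += 0.5
pts = torch.rand(16, 3)
print(deformer(pts, src, dst).shape)  # torch.Size([16, 3])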
Abstract: RGBD-based real-time dynamic 3D reconstruction suffers from inaccurate inter-frame motion estimation, as errors may accumulate with online tracking. This problem is even more severe for single-view-based systems due to strong occlusions. Based on these observations, we propose OcclusionFusion, a novel method to calculate occlusion-aware 3D motion to guide the reconstruction. In our technique, the motion of visible regions is first estimated and then combined with temporal information to infer the motion of the occluded regions through a graph neural network that incorporates an LSTM module. Furthermore, our method computes the confidence of the estimated motion by modeling the network output with a probabilistic model, which down-weights untrustworthy motion estimates and enables robust tracking. Experimental results on public datasets and our own recorded data show that our technique outperforms existing single-view-based real-time methods by a large margin. By reducing motion errors, the proposed technique can handle long and challenging motion sequences. Please check out the project page for sequence results: https://wenbin-lin.github.io/OcclusionFusion.
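A minimal sketch of the kind of LSTM-plus-graph-network described above, under stated assumptions rather than the released code: per-node features of a motion graph are mixed by simple neighborhood averaging, an LSTM carries them across frames, and the head predicts a Gaussian (mean and log-variance) over each node's 3D motion so the variance can act as a confidence for tracking. The network name, feature layout, and single round of neighbor averaging are illustrative choices.

import torch
import torch.nn as nn

class OcclusionMotionNet(nn.Module):
    """Hypothetical per-node motion predictor with temporal (LSTM) state and uncertainty output."""
    def __init__(self, in_dim: int = 6, hidden: int = 64):
        super().__init__()
        self.encode = nn.Linear(in_dim, hidden)
        self.lstm = nn.LSTM(hidden, hidden, batch_first=True)
        self.head = nn.Linear(hidden, 6)  # 3D motion mean + 3D log-variance per node

    def forward(self, feats, adj, state=None):
        # feats: (N, in_dim) per-node features (e.g. observed motion of visible nodes + visibility flag)
        # adj:   (N, N) normalized adjacency of the motion graph
        h = torch.relu(self.encode(feats))
        h = adj @ h                                  # one round of neighbor averaging on the graph
        h, state = self.lstm(h.unsqueeze(1), state)  # carry per-node state across frames
        mean, log_var = self.head(h.squeeze(1)).chunk(2, dim=-1)
        return mean, log_var, state                  # low variance -> more trustworthy motion

# Toy usage: a 5-node graph tracked over two frames.
net = OcclusionMotionNet()
adj = torch.full((5, 5), 0.2)
state = None
for _ in range(2):
    mean, log_var, state = net(torch.randn(5, 6), adj, state)
print(mean.shape, log_var.shape)  # torch.Size([5, 3]) torch.Size([5, 3])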