Hao-Xiang Guo

LucidFusion: Generating 3D Gaussians with Arbitrary Unposed Images

Oct 21, 2024

Hao He, Yixun Liang, Luozhou Wang, Yuanhao Cai, Xinli Xu, Hao-Xiang Guo, Xiang Wen, Yingcong Chen

Abstract: Recent large reconstruction models have made notable progress in generating high-quality 3D objects from single images. However, these methods often struggle with controllability, as they lack information from multiple views, leading to incomplete or inconsistent 3D reconstructions. To address this limitation, we introduce LucidFusion, a flexible end-to-end feed-forward framework that leverages the Relative Coordinate Map (RCM). Unlike traditional methods that link images to the 3D world through pose, LucidFusion utilizes RCM to align geometric features coherently across different views, making it highly adaptable for 3D generation from arbitrary, unposed images. Furthermore, LucidFusion seamlessly integrates with the original single-image-to-3D pipeline, producing detailed 3D Gaussians at a resolution of $512 \times 512$, making it well-suited for a wide range of applications.

* 17 pages, 12 figures, project page: coming soon

Part123: Part-aware 3D Reconstruction from a Single-view Image

May 27, 2024

Anran Liu, Cheng Lin, Yuan Liu, Xiaoxiao Long, Zhiyang Dou, Hao-Xiang Guo, Ping Luo, Wenping Wang

Abstract: Recently, the emergence of diffusion models has opened up new opportunities for single-view reconstruction. However, all existing methods represent the target object as a closed mesh devoid of any structural information, thus neglecting the part-based structure of the reconstructed shape, which is crucial for many downstream applications. Moreover, the generated meshes usually suffer from heavy noise, non-smooth surfaces, and blurry textures, making it challenging to obtain satisfactory part segments using 3D segmentation techniques. In this paper, we present Part123, a novel framework for part-aware 3D reconstruction from a single-view image. We first use diffusion models to generate multiview-consistent images from a given image, and then leverage the Segment Anything Model (SAM), which demonstrates powerful generalization ability on arbitrary objects, to generate multiview segmentation masks. To effectively incorporate 2D part-based information into 3D reconstruction and handle inconsistency, we introduce contrastive learning into a neural rendering framework to learn a part-aware feature space based on the multiview segmentation masks. A clustering-based algorithm is also developed to automatically derive 3D part segmentation results from the reconstructed models. Experiments show that our method can generate 3D models with high-quality segmented parts on various objects. Compared to existing unstructured reconstruction methods, the part-aware 3D models from our method benefit some important applications, including feature-preserving reconstruction, primitive fitting, and 3D shape editing.

* Accepted to SIGGRAPH 2024 (conference track). Webpage: https://liuar0512.github.io/part123_official_page/

Implicit Conversion of Manifold B-Rep Solids by Neural Halfspace Representation

Sep 21, 2022

Hao-Xiang Guo, Yang Liu, Hao Pan, Baining Guo

Abstract: We present a novel implicit representation, the neural halfspace representation (NH-Rep), to convert manifold B-Rep solids to implicit representations. NH-Rep is a Boolean tree built on a set of implicit functions represented by a neural network, and the composite Boolean function is capable of representing solid geometry while preserving sharp features. We propose an efficient algorithm to extract the Boolean tree from a manifold B-Rep solid and devise a neural network-based optimization approach to compute the implicit functions. We demonstrate the high quality offered by our conversion algorithm on ten thousand manifold B-Rep CAD models that contain various curved patches including NURBS, and the superiority of our learning approach over other representative implicit conversion algorithms in terms of surface reconstruction, sharp feature preservation, signed distance field approximation, and robustness to various surface geometries, as well as a set of applications supported by NH-Rep.

* ACM Trans. Graph. 41, 4, Article 128 (July 2022), 13 pages
* Accepted to SIGGRAPH Asia 2022. Our supplemental material and code are available at https://guohaoxiang.github.io/projects/nhrep.html
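
The idea of a Boolean tree over implicit halfspace functions can be illustrated with a minimal sketch. This is not the authors' code: it swaps the learned neural implicit functions for analytic planes, and it assumes the sign convention that a function is negative inside the solid, so that an intersection becomes a `max` node and a union becomes a `min` node. The names `Leaf`, `Node`, `plane`, `box`, and `evaluate` are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple, Union

Point = Tuple[float, float, float]


@dataclass
class Leaf:
    # One implicit halfspace function (analytic here; a network output in NH-Rep).
    f: Callable[[Point], float]


@dataclass
class Node:
    # "max" intersects the children, "min" unites them (negative = inside).
    op: str
    children: List[Union["Leaf", "Node"]]


def evaluate(tree: Union[Leaf, Node], p: Point) -> float:
    """Evaluate the composite Boolean function at point p."""
    if isinstance(tree, Leaf):
        return tree.f(p)
    values = [evaluate(child, p) for child in tree.children]
    return max(values) if tree.op == "max" else min(values)


def plane(normal: Point, offset: float) -> Leaf:
    """Halfspace n . x - offset <= 0."""
    return Leaf(lambda p: sum(n * x for n, x in zip(normal, p)) - offset)


def box(lo: Point, hi: Point) -> Node:
    """Axis-aligned box as the intersection of six halfspaces."""
    return Node("max", [
        plane((1, 0, 0), hi[0]), plane((-1, 0, 0), -lo[0]),
        plane((0, 1, 0), hi[1]), plane((0, -1, 0), -lo[1]),
        plane((0, 0, 1), hi[2]), plane((0, 0, -1), -lo[2]),
    ])


# An L-shaped solid: the union of two boxes, which has a concave sharp edge.
l_shape = Node("min", [box((0, 0, 0), (2, 1, 1)), box((0, 0, 0), (1, 2, 1))])

inside = evaluate(l_shape, (1.5, 0.5, 0.5))   # negative: inside the first box
outside = evaluate(l_shape, (1.5, 1.5, 0.5))  # positive: in the notch of the L
```

Because `min` and `max` are not smooth where their arguments cross, the zero level set of the composite function keeps the sharp edges between patches, which is the property the abstract highlights.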

Semi-supervised 3D shape segmentation with multilevel consistency and part substitution

Apr 20, 2022

Chun-Yu Sun, Yu-Qi Yang, Hao-Xiang Guo, Peng-Shuai Wang, Xin Tong, Yang Liu, Heung-Yeung Shum

Abstract: The lack of fine-grained 3D shape segmentation data is the main obstacle to developing learning-based 3D segmentation techniques. We propose an effective semi-supervised method for learning 3D segmentations from a few labeled 3D shapes and a large amount of unlabeled 3D data. For the unlabeled data, we present a novel multilevel consistency loss to enforce consistency of network predictions between perturbed copies of a 3D shape at multiple levels: point-level, part-level, and hierarchical level. For the labeled data, we develop a simple yet effective part substitution scheme to augment the labeled 3D shapes with more structural variations to enhance training. Our method has been extensively validated on the task of 3D object semantic segmentation on PartNet and ShapeNetPart, and indoor scene semantic segmentation on ScanNet. It exhibits superior performance to existing semi-supervised and unsupervised pre-training 3D approaches. Our code and trained models are publicly available at https://github.com/isunchy/semi_supervised_3d_segmentation.

* Computational Visual Media 2022. Project page: https://isunchy.github.io/projects/semi_supervised_3d_segmentation.html

Deep Implicit Moving Least-Squares Functions for 3D Reconstruction

Apr 06, 2021

Shi-Lin Liu, Hao-Xiang Guo, Hao Pan, Peng-Shuai Wang, Xin Tong, Yang Liu

Abstract: Point sets are a flexible and lightweight representation widely used for 3D deep learning. However, their discrete nature prevents them from representing continuous and fine geometry, posing a major issue for learning-based shape generation. In this work, we turn the discrete point sets into smooth surfaces by introducing the well-known implicit moving least-squares (IMLS) surface formulation, which naturally defines locally implicit functions on point sets. We incorporate IMLS surface generation into deep neural networks to inherit both the flexibility of point sets and the high quality of implicit surfaces. Our IMLSNet predicts an octree structure as a scaffold for generating MLS points where needed and characterizes shape geometry with learned local priors. Furthermore, our implicit function evaluation is independent of the neural network once the MLS points are predicted, thus enabling fast runtime evaluation. Our experiments on 3D object reconstruction demonstrate that IMLSNets outperform state-of-the-art learning-based methods in terms of reconstruction quality and computational efficiency. Extensive ablation tests also validate our network design and loss functions.

* Accepted by CVPR 2021, Code: https://github.com/Andy97/DeepMLS
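
As a rough illustration of how an IMLS surface defines an implicit function on oriented points, the sketch below evaluates the classic formulation F(x) = Σ θ(‖x − p_i‖) n_i·(x − p_i) / Σ θ(‖x − p_i‖) with a Gaussian weight θ(d) = exp(−d²/σ²). It is a plain-Python sketch under stated assumptions, not the IMLSNet implementation: the function name `imls_value`, the fixed `sigma`, and the brute-force sum over all points are choices made here for brevity, and no octree or learned prior is involved.

```python
import math
from typing import Sequence, Tuple

Vec3 = Tuple[float, float, float]


def imls_value(x: Vec3, points: Sequence[Vec3], normals: Sequence[Vec3],
               sigma: float = 0.5) -> float:
    """Weighted average of signed point-to-tangent-plane distances.

    Assumes unit normals; the result is positive on the side the normals
    point to and zero on the reconstructed surface.
    """
    num = 0.0
    den = 0.0
    for p, n in zip(points, normals):
        d = [xi - pi for xi, pi in zip(x, p)]
        # Gaussian weight: nearby points dominate the average.
        w = math.exp(-sum(c * c for c in d) / sigma ** 2)
        num += w * sum(ni * ci for ni, ci in zip(n, d))
        den += w
    # Far from every point all weights can underflow; a real implementation
    # would restrict the sum to nearby points and guard against den == 0.
    return num / den


# Oriented samples of the plane z = 0, all with normal (0, 0, 1).
pts = [(float(i), float(j), 0.0) for i in range(-1, 2) for j in range(-1, 2)]
nrm = [(0.0, 0.0, 1.0)] * len(pts)

above = imls_value((0.2, 0.3, 0.4), pts, nrm)   # about 0.4, the height above the plane
below = imls_value((0.2, 0.3, -0.1), pts, nrm)  # about -0.1
```

Once the points and normals are fixed, evaluating the function needs no network pass, which matches the abstract's remark that runtime evaluation is independent of the neural network after the MLS points are predicted.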
