Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Haoran Zhou

CRA-PCN: Point Cloud Completion with Intra- and Inter-level Cross-Resolution Transformers

Jan 03, 2024

Yi Rong, Haoran Zhou, Lixin Yuan, Cheng Mei, Jiahao Wang, Tong Lu

Figure 1 for CRA-PCN: Point Cloud Completion with Intra- and Inter-level Cross-Resolution Transformers

Figure 2 for CRA-PCN: Point Cloud Completion with Intra- and Inter-level Cross-Resolution Transformers

Figure 3 for CRA-PCN: Point Cloud Completion with Intra- and Inter-level Cross-Resolution Transformers

Figure 4 for CRA-PCN: Point Cloud Completion with Intra- and Inter-level Cross-Resolution Transformers

Abstract:Point cloud completion is an indispensable task for recovering complete point clouds due to incompleteness caused by occlusion, limited sensor resolution, etc. The family of coarse-to-fine generation architectures has recently exhibited great success in point cloud completion and gradually became mainstream. In this work, we unveil one of the key ingredients behind these methods: meticulously devised feature extraction operations with explicit cross-resolution aggregation. We present Cross-Resolution Transformer that efficiently performs cross-resolution aggregation with local attention mechanisms. With the help of our recursive designs, the proposed operation can capture more scales of features than common aggregation operations, which is beneficial for capturing fine geometric characteristics. While prior methodologies have ventured into various manifestations of inter-level cross-resolution aggregation, the effectiveness of intra-level one and their combination has not been analyzed. With unified designs, Cross-Resolution Transformer can perform intra- or inter-level cross-resolution aggregation by switching inputs. We integrate two forms of Cross-Resolution Transformers into one up-sampling block for point generation, and following the coarse-to-fine manner, we construct CRA-PCN to incrementally predict complete shapes with stacked up-sampling blocks. Extensive experiments demonstrate that our method outperforms state-of-the-art methods by a large margin on several widely used benchmarks. Codes are available at https://github.com/EasyRy/CRA-PCN.

* Accepted to AAAI 2024

Via

Access Paper or Ask Questions

Interpretation on Multi-modal Visual Fusion

Aug 19, 2023

Hao Chen, Haoran Zhou, Yongjian Deng

Abstract:In this paper, we present an analytical framework and a novel metric to shed light on the interpretation of the multimodal vision community. Our approach involves measuring the proposed semantic variance and feature similarity across modalities and levels, and conducting semantic and quantitative analyses through comprehensive experiments. Specifically, we investigate the consistency and speciality of representations across modalities, evolution rules within each modality, and the collaboration logic used when optimizing a multi-modality model. Our studies reveal several important findings, such as the discrepancy in cross-modal features and the hybrid multi-modal cooperation rule, which highlights consistency and speciality simultaneously for complementary inference. Through our dissection and findings on multi-modal fusion, we facilitate a rethinking of the reasonability and necessity of popular multi-modal vision fusion strategies. Furthermore, our work lays the foundation for designing a trustworthy and universal multi-modal fusion model for a variety of tasks in the future.

* This version was under review since 2023/3/9

Via

Access Paper or Ask Questions

SeedFormer: Patch Seeds based Point Cloud Completion with Upsample Transformer

Jul 21, 2022

Haoran Zhou, Yun Cao, Wenqing Chu, Junwei Zhu, Tong Lu, Ying Tai, Chengjie Wang

Figure 1 for SeedFormer: Patch Seeds based Point Cloud Completion with Upsample Transformer

Figure 2 for SeedFormer: Patch Seeds based Point Cloud Completion with Upsample Transformer

Figure 3 for SeedFormer: Patch Seeds based Point Cloud Completion with Upsample Transformer

Figure 4 for SeedFormer: Patch Seeds based Point Cloud Completion with Upsample Transformer

Abstract:Point cloud completion has become increasingly popular among generation tasks of 3D point clouds, as it is a challenging yet indispensable problem to recover the complete shape of a 3D object from its partial observation. In this paper, we propose a novel SeedFormer to improve the ability of detail preservation and recovery in point cloud completion. Unlike previous methods based on a global feature vector, we introduce a new shape representation, namely Patch Seeds, which not only captures general structures from partial inputs but also preserves regional information of local patterns. Then, by integrating seed features into the generation process, we can recover faithful details for complete point clouds in a coarse-to-fine manner. Moreover, we devise an Upsample Transformer by extending the transformer structure into basic operations of point generators, which effectively incorporates spatial and semantic relationships between neighboring points. Qualitative and quantitative evaluations demonstrate that our method outperforms state-of-the-art completion networks on several benchmark datasets. Our code is available at https://github.com/hrzhou2/seedformer.

* Camera-ready, to be published in ECCV 2022, with supplementary material

Via

Access Paper or Ask Questions

AGConv: Adaptive Graph Convolution on 3D Point Clouds

Jun 09, 2022

Mingqiang Wei, Zeyong Wei, Haoran Zhou, Fei Hu, Huajian Si, Zhilei Chen, Zhe Zhu, Jingbo Qiu, Xuefeng Yan, Yanwen Guo(+2 more)

Figure 1 for AGConv: Adaptive Graph Convolution on 3D Point Clouds

Figure 2 for AGConv: Adaptive Graph Convolution on 3D Point Clouds

Figure 3 for AGConv: Adaptive Graph Convolution on 3D Point Clouds

Figure 4 for AGConv: Adaptive Graph Convolution on 3D Point Clouds

Abstract:Convolution on 3D point clouds is widely researched yet far from perfect in geometric deep learning. The traditional wisdom of convolution characterises feature correspondences indistinguishably among 3D points, arising an intrinsic limitation of poor distinctive feature learning. In this paper, we propose Adaptive Graph Convolution (AGConv) for wide applications of point cloud analysis. AGConv generates adaptive kernels for points according to their dynamically learned features. Compared with the solution of using fixed/isotropic kernels, AGConv improves the flexibility of point cloud convolutions, effectively and precisely capturing the diverse relations between points from different semantic parts. Unlike the popular attentional weight schemes, AGConv implements the adaptiveness inside the convolution operation instead of simply assigning different weights to the neighboring points. Extensive evaluations clearly show that our method outperforms state-of-the-arts of point cloud classification and segmentation on various benchmark datasets.Meanwhile, AGConv can flexibly serve more point cloud analysis approaches to boost their performance. To validate its flexibility and effectiveness, we explore AGConv-based paradigms of completion, denoising, upsampling, registration and circle extraction, which are comparable or even superior to their competitors. Our code is available at https://github.com/hrzhou2/AdaptConv-master.

* arXiv admin note: substantial text overlap with arXiv:2108.08035

Via

Access Paper or Ask Questions

Refine-Net: Normal Refinement Neural Network for Noisy Point Clouds

Mar 23, 2022

Haoran Zhou, Honghua Chen, Yingkui Zhang, Mingqiang Wei, Haoran Xie, Jun Wang, Tong Lu, Jing Qin, Xiao-Ping Zhang

Figure 1 for Refine-Net: Normal Refinement Neural Network for Noisy Point Clouds

Figure 2 for Refine-Net: Normal Refinement Neural Network for Noisy Point Clouds

Figure 3 for Refine-Net: Normal Refinement Neural Network for Noisy Point Clouds

Figure 4 for Refine-Net: Normal Refinement Neural Network for Noisy Point Clouds

Abstract:Point normal, as an intrinsic geometric property of 3D objects, not only serves conventional geometric tasks such as surface consolidation and reconstruction, but also facilitates cutting-edge learning-based techniques for shape analysis and generation. In this paper, we propose a normal refinement network, called Refine-Net, to predict accurate normals for noisy point clouds. Traditional normal estimation wisdom heavily depends on priors such as surface shapes or noise distributions, while learning-based solutions settle for single types of hand-crafted features. Differently, our network is designed to refine the initial normal of each point by extracting additional information from multiple feature representations. To this end, several feature modules are developed and incorporated into Refine-Net by a novel connection module. Besides the overall network architecture of Refine-Net, we propose a new multi-scale fitting patch selection scheme for the initial normal estimation, by absorbing geometry domain knowledge. Also, Refine-Net is a generic normal estimation framework: 1) point normals obtained from other methods can be further refined, and 2) any feature module related to the surface geometric structures can be potentially integrated into the framework. Qualitative and quantitative evaluations demonstrate the clear superiority of Refine-Net over the state-of-the-arts on both synthetic and real-scanned datasets. Our code is available at https://github.com/hrzhou2/refinenet.

* Accepted by TPAMI

Via

Access Paper or Ask Questions

Boundary Distribution Estimation to Precise Object Detection

Nov 02, 2021

Haoran Zhou, Hang Huang, Rui Zhao, Wei Wang, Qingguo Zhou

Figure 1 for Boundary Distribution Estimation to Precise Object Detection

Figure 2 for Boundary Distribution Estimation to Precise Object Detection

Figure 3 for Boundary Distribution Estimation to Precise Object Detection

Figure 4 for Boundary Distribution Estimation to Precise Object Detection

Abstract:In principal modern detectors, the task of object localization is implemented by the box subnet which concentrates on bounding box regression. The box subnet customarily predicts the position of the object by regressing box center position and scaling factors. Although this approach is frequently adopted, we observe that the result of localization remains defective, which makes the performance of the detector unsatisfactory. In this paper, we prove the flaws in the previous method through theoretical analysis and experimental verification and propose a novel solution to detect objects precisely. Rather than plainly focusing on center and size, our approach refines the edges of the bounding box on previous localization results by estimating the distribution at the boundary of the object. Experimental results have shown the potentiality and generalization of our proposed method.

Via

Access Paper or Ask Questions

Adaptive Graph Convolution for Point Cloud Analysis

Aug 19, 2021

Haoran Zhou, Yidan Feng, Mingsheng Fang, Mingqiang Wei, Jing Qin, Tong Lu

Figure 1 for Adaptive Graph Convolution for Point Cloud Analysis

Figure 2 for Adaptive Graph Convolution for Point Cloud Analysis

Figure 3 for Adaptive Graph Convolution for Point Cloud Analysis

Figure 4 for Adaptive Graph Convolution for Point Cloud Analysis

Abstract:Convolution on 3D point clouds that generalized from 2D grid-like domains is widely researched yet far from perfect. The standard convolution characterises feature correspondences indistinguishably among 3D points, presenting an intrinsic limitation of poor distinctive feature learning. In this paper, we propose Adaptive Graph Convolution (AdaptConv) which generates adaptive kernels for points according to their dynamically learned features. Compared with using a fixed/isotropic kernel, AdaptConv improves the flexibility of point cloud convolutions, effectively and precisely capturing the diverse relations between points from different semantic parts. Unlike popular attentional weight schemes, the proposed AdaptConv implements the adaptiveness inside the convolution operation instead of simply assigning different weights to the neighboring points. Extensive qualitative and quantitative evaluations show that our method outperforms state-of-the-art point cloud classification and segmentation approaches on several benchmark datasets. Our code is available at https://github.com/hrzhou2/AdaptConv-master.

* Camera-ready, to be published in ICCV 2021

Via

Access Paper or Ask Questions