Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Fuyang Yu

CUS3D :CLIP-based Unsupervised 3D Segmentation via Object-level Denoise

Sep 21, 2024

Fuyang Yu, Runze Tian, Zhen Wang, Xiaochuan Wang, Xiaohui Liang

Figure 1 for CUS3D :CLIP-based Unsupervised 3D Segmentation via Object-level Denoise

Figure 2 for CUS3D :CLIP-based Unsupervised 3D Segmentation via Object-level Denoise

Figure 3 for CUS3D :CLIP-based Unsupervised 3D Segmentation via Object-level Denoise

Figure 4 for CUS3D :CLIP-based Unsupervised 3D Segmentation via Object-level Denoise

Abstract:To ease the difficulty of acquiring annotation labels in 3D data, a common method is using unsupervised and open-vocabulary semantic segmentation, which leverage 2D CLIP semantic knowledge. In this paper, unlike previous research that ignores the ``noise'' raised during feature projection from 2D to 3D, we propose a novel distillation learning framework named CUS3D. In our approach, an object-level denosing projection module is designed to screen out the ``noise'' and ensure more accurate 3D feature. Based on the obtained features, a multimodal distillation learning module is designed to align the 3D feature with CLIP semantic feature space with object-centered constrains to achieve advanced unsupervised semantic segmentation. We conduct comprehensive experiments in both unsupervised and open-vocabulary segmentation, and the results consistently showcase the superiority of our model in achieving advanced unsupervised segmentation results and its effectiveness in open-vocabulary segmentation.

* 6 pages,3 figures

Via

Access Paper or Ask Questions