Title: Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting

Mar 22, 2024

Jun Guo, Xiaojian Ma, Yue Fan, Huaping Liu, Qing Li

Abstract: Open-vocabulary 3D scene understanding presents a significant challenge in computer vision, with wide-ranging applications in embodied agents and augmented reality systems. Previous approaches have adopted Neural Radiance Fields (NeRFs) to analyze 3D scenes. In this paper, we introduce Semantic Gaussians, a novel open-vocabulary scene understanding approach based on 3D Gaussian Splatting. Our key idea is distilling pre-trained 2D semantics into 3D Gaussians. We design a versatile projection approach that maps various 2D semantic features from pre-trained image encoders into a novel semantic component of 3D Gaussians, without the additional training required by NeRFs. We further build a 3D semantic network that directly predicts the semantic component from raw 3D Gaussians for fast inference. We explore several applications of Semantic Gaussians: semantic segmentation on ScanNet-20, where our approach attains a 4.2% mIoU and 4.0% mAcc improvement over prior open-vocabulary scene understanding counterparts; object part segmentation, scene editing, and spatial-temporal segmentation with better qualitative results over 2D and 3D baselines, highlighting its versatility and effectiveness on supporting diverse downstream tasks.
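To make the projection idea in the abstract concrete, here is a minimal NumPy sketch of lifting per-pixel 2D features onto 3D Gaussian centers and then labeling them. It is not the authors' implementation. Everything in it is an assumption made for illustration: the function names (`project_points`, `fuse_features`, `classify`), the pinhole camera model, nearest-pixel sampling, plain averaging across views, the absence of any occlusion test, and the premise that the 2D features share an embedding space with text embeddings.

```python
import numpy as np

def project_points(points_world, K, world_to_cam):
    """Project N x 3 world points with a pinhole camera; return (uv, depth)."""
    homo = np.concatenate([points_world, np.ones((len(points_world), 1))], axis=1)
    cam = (world_to_cam @ homo.T).T[:, :3]       # points in camera coordinates
    depth = cam[:, 2]
    safe = np.where(depth > 1e-6, depth, 1.0)    # avoid dividing by zero
    uv = (K @ cam.T).T[:, :2] / safe[:, None]    # perspective divide
    return uv, depth

def fuse_features(centers, views):
    """Average 2D features over every view in which a center projects in-bounds.

    Assumption: no occlusion handling; a real pipeline would need a depth check.
    Each view is a dict with 'features' (H x W x C), 'K' (3 x 3), 'world_to_cam' (4 x 4).
    """
    n = len(centers)
    dim = views[0]["features"].shape[-1]
    feat_sum = np.zeros((n, dim))
    count = np.zeros(n)
    for view in views:
        fmap = view["features"]
        h, w = fmap.shape[:2]
        uv, depth = project_points(centers, view["K"], view["world_to_cam"])
        px = np.round(uv).astype(int)            # nearest-pixel sampling
        visible = (
            (depth > 1e-6)
            & (px[:, 0] >= 0) & (px[:, 0] < w)
            & (px[:, 1] >= 0) & (px[:, 1] < h)
        )
        idx = np.nonzero(visible)[0]
        feat_sum[idx] += fmap[px[idx, 1], px[idx, 0]]
        count[idx] += 1
    fused = feat_sum / np.maximum(count, 1)[:, None]
    return fused, count

def classify(features, text_embeddings):
    """Assign each feature the index of the most cosine-similar text embedding."""
    f = features / (np.linalg.norm(features, axis=1, keepdims=True) + 1e-8)
    t = text_embeddings / (np.linalg.norm(text_embeddings, axis=1, keepdims=True) + 1e-8)
    return np.argmax(f @ t.T, axis=1)
```

Under these assumptions, `fuse_features` needs no gradient-based optimization: each Gaussian center simply collects the features of the pixels it lands on, which is one way to read the abstract's "without the additional training required by NeRFs". Centers that are never seen keep a zero feature and a count of 0, so callers can mask them out before calling `classify`.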

* Project page: see https://semantic-gaussians.github.io
