Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps

Oct 28, 2024

Yating Xu, Chen Li, Gim Hee Lee

Figure 1 for MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps

Figure 2 for MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps

Figure 3 for MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps

Figure 4 for MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps

Share this with someone who'll enjoy it:

Abstract:The key challenge of multi-view indoor 3D object detection is to infer accurate geometry information from images for precise 3D detection. Previous method relies on NeRF for geometry reasoning. However, the geometry extracted from NeRF is generally inaccurate, which leads to sub-optimal detection performance. In this paper, we propose MVSDet which utilizes plane sweep for geometry-aware 3D object detection. To circumvent the requirement for a large number of depth planes for accurate depth prediction, we design a probabilistic sampling and soft weighting mechanism to decide the placement of pixel features on the 3D volume. We select multiple locations that score top in the probability volume for each pixel and use their probability score to indicate the confidence. We further apply recent pixel-aligned Gaussian Splatting to regularize depth prediction and improve detection performance with little computation overhead. Extensive experiments on ScanNet and ARKitScenes datasets are conducted to show the superiority of our model. Our code is available at https://github.com/Pixie8888/MVSDet.

* Accepted by NeurIPS 2024

View paper on

Share this with someone who'll enjoy it:

Title:MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps

Paper and Code