Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Attention-based Multi-modal Fusion Network for Semantic Scene Completion

Apr 16, 2020

Siqi Li, Changqing Zou, Yipeng Li, Xibin Zhao, Yue Gao

Figure 1 for Attention-based Multi-modal Fusion Network for Semantic Scene Completion

Figure 2 for Attention-based Multi-modal Fusion Network for Semantic Scene Completion

Figure 3 for Attention-based Multi-modal Fusion Network for Semantic Scene Completion

Figure 4 for Attention-based Multi-modal Fusion Network for Semantic Scene Completion

Share this with someone who'll enjoy it:

Abstract:This paper presents an end-to-end 3D convolutional network named attention-based multi-modal fusion network (AMFNet) for the semantic scene completion (SSC) task of inferring the occupancy and semantic labels of a volumetric 3D scene from single-view RGB-D images. Compared with previous methods which use only the semantic features extracted from RGB-D images, the proposed AMFNet learns to perform effective 3D scene completion and semantic segmentation simultaneously via leveraging the experience of inferring 2D semantic segmentation from RGB-D images as well as the reliable depth cues in spatial dimension. It is achieved by employing a multi-modal fusion architecture boosted from 2D semantic segmentation and a 3D semantic completion network empowered by residual attention blocks. We validate our method on both the synthetic SUNCG-RGBD dataset and the real NYUv2 dataset and the results show that our method respectively achieves the gains of 2.5% and 2.6% on the synthetic SUNCG-RGBD dataset and the real NYUv2 dataset against the state-of-the-art method.

* Accepted by AAAI 2020

View paper on

Share this with someone who'll enjoy it:

Title:Attention-based Multi-modal Fusion Network for Semantic Scene Completion

Paper and Code