Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Mutual Information Regularization for Weakly-supervised RGB-D Salient Object Detection

Jun 06, 2023

Aixuan Li, Yuxin Mao, Jing Zhang, Yuchao Dai

Figure 1 for Mutual Information Regularization for Weakly-supervised RGB-D Salient Object Detection

Figure 2 for Mutual Information Regularization for Weakly-supervised RGB-D Salient Object Detection

Figure 3 for Mutual Information Regularization for Weakly-supervised RGB-D Salient Object Detection

Figure 4 for Mutual Information Regularization for Weakly-supervised RGB-D Salient Object Detection

Share this with someone who'll enjoy it:

Abstract:In this paper, we present a weakly-supervised RGB-D salient object detection model via scribble supervision. Specifically, as a multimodal learning task, we focus on effective multimodal representation learning via inter-modal mutual information regularization. In particular, following the principle of disentangled representation learning, we introduce a mutual information upper bound with a mutual information minimization regularizer to encourage the disentangled representation of each modality for salient object detection. Based on our multimodal representation learning framework, we introduce an asymmetric feature extractor for our multimodal data, which is proven more effective than the conventional symmetric backbone setting. We also introduce multimodal variational auto-encoder as stochastic prediction refinement techniques, which takes pseudo labels from the first training stage as supervision and generates refined prediction. Experimental results on benchmark RGB-D salient object detection datasets verify both effectiveness of our explicit multimodal disentangled representation learning method and the stochastic prediction refinement strategy, achieving comparable performance with the state-of-the-art fully supervised models. Our code and data are available at: https://github.com/baneitixiaomai/MIRV.

* IEEE Transactions on Circuits and Systems for Video Technology 2023

View paper on

Share this with someone who'll enjoy it:

Title:Mutual Information Regularization for Weakly-supervised RGB-D Salient Object Detection

Paper and Code