Title: MambaDFuse: A Mamba-based Dual-phase Model for Multi-modality Image Fusion

Apr 12, 2024

Zhe Li, Haiwei Pan, Kejia Zhang, Yuhua Wang, Fengming Yu

Abstract: Multi-modality image fusion (MMIF) aims to integrate complementary information from different modalities into a single fused image that comprehensively represents the imaging scene and facilitates downstream visual tasks. In recent years, significant progress has been made in MMIF tasks due to advances in deep neural networks. However, existing methods cannot extract modality-specific and modality-fused features effectively and efficiently, because they are constrained by the inherent local reductive bias (CNNs) or quadratic computational complexity (Transformers). To overcome this issue, we propose a Mamba-based Dual-phase Fusion (MambaDFuse) model. First, a dual-level feature extractor is designed to capture long-range features from single-modality images by extracting low-level and high-level features from CNN and Mamba blocks. Then, a dual-phase feature fusion module is proposed to obtain fusion features that combine complementary information from different modalities. It uses the channel exchange method for shallow fusion and the enhanced Multi-modal Mamba (M3) blocks for deep fusion. Finally, the fused image reconstruction module applies the inverse transformation of the feature extraction to generate the fused result. In extensive experiments, our approach achieves promising fusion results in infrared-visible image fusion and medical image fusion. Additionally, in a unified benchmark, MambaDFuse also demonstrates improved performance in downstream tasks such as object detection. Code with checkpoints will be available after the peer-review process.
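
The channel exchange step named in the abstract can be illustrated with a minimal sketch. The abstract does not state the exchange rule, so everything below is an assumption made for illustration: the function name `channel_exchange`, the `period` parameter, and the rule of swapping every `period`-th channel between the two modality feature stacks. It is not the authors' implementation, and it uses plain Python lists rather than a tensor library.

```python
from typing import List, Tuple

Channel = List[List[float]]   # one H x W feature map
Features = List[Channel]      # a stack of C channels


def channel_exchange(
    feat_a: Features, feat_b: Features, period: int = 2
) -> Tuple[Features, Features]:
    """Swap every `period`-th channel between two feature stacks.

    Hypothetical rule: channels whose index is a multiple of `period`
    are exchanged; all other channels stay with their own modality.
    Channels are passed through by reference, not copied.
    """
    if len(feat_a) != len(feat_b):
        raise ValueError("both modalities need the same number of channels")
    if period < 1:
        raise ValueError("period must be a positive integer")

    out_a: Features = []
    out_b: Features = []
    for idx, (ch_a, ch_b) in enumerate(zip(feat_a, feat_b)):
        if idx % period == 0:
            # Exchanged channel: each stream receives the other modality's map.
            out_a.append(ch_b)
            out_b.append(ch_a)
        else:
            # Untouched channel: each stream keeps its own map.
            out_a.append(ch_a)
            out_b.append(ch_b)
    return out_a, out_b


# Toy data: four 1 x 1 channels per modality, labelled by value.
infrared = [[[float(c)]] for c in range(4)]        # 0, 1, 2, 3
visible = [[[float(10 + c)]] for c in range(4)]    # 10, 11, 12, 13

mixed_ir, mixed_vis = channel_exchange(infrared, visible)
```

After the call, each output stack holds channels from both modalities, which is the sense in which a parameter-free exchange can act as a shallow fusion step before a learned deep fusion stage.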
