Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:IVGF: The Fusion-Guided Infrared and Visible General Framework

Sep 02, 2024

Fangcen Liu, Chenqiang Gao, Fang Chen, Pengcheng Li, Junjie Guo, Deyu Meng

Figure 1 for IVGF: The Fusion-Guided Infrared and Visible General Framework

Figure 2 for IVGF: The Fusion-Guided Infrared and Visible General Framework

Figure 3 for IVGF: The Fusion-Guided Infrared and Visible General Framework

Figure 4 for IVGF: The Fusion-Guided Infrared and Visible General Framework

Share this with someone who'll enjoy it:

Abstract:Infrared and visible dual-modality tasks such as semantic segmentation and object detection can achieve robust performance even in extreme scenes by fusing complementary information. Most current methods design task-specific frameworks, which are limited in generalization across multiple tasks. In this paper, we propose a fusion-guided infrared and visible general framework, IVGF, which can be easily extended to many high-level vision tasks. Firstly, we adopt the SOTA infrared and visible foundation models to extract the general representations. Then, to enrich the semantics information of these general representations for high-level vision tasks, we design the feature enhancement module and token enhancement module for feature maps and tokens, respectively. Besides, the attention-guided fusion module is proposed for effectively fusing by exploring the complementary information of two modalities. Moreover, we also adopt the cutout&mix augmentation strategy to conduct the data augmentation, which further improves the ability of the model to mine the regional complementary between the two modalities. Extensive experiments show that the IVGF outperforms state-of-the-art dual-modality methods in the semantic segmentation and object detection tasks. The detailed ablation studies demonstrate the effectiveness of each module, and another experiment explores the anti-missing modality ability of the proposed method in the dual-modality semantic segmentation task.

* 11 pages, 8 figures

View paper on

Share this with someone who'll enjoy it:

Title:IVGF: The Fusion-Guided Infrared and Visible General Framework

Paper and Code