Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Generative-based Fusion Mechanism for Multi-Modal Tracking

Sep 07, 2023

Zhangyong Tang, Tianyang Xu, Xuefeng Zhu, Xiao-Jun Wu, Josef Kittler

Figure 1 for Generative-based Fusion Mechanism for Multi-Modal Tracking

Figure 2 for Generative-based Fusion Mechanism for Multi-Modal Tracking

Figure 3 for Generative-based Fusion Mechanism for Multi-Modal Tracking

Figure 4 for Generative-based Fusion Mechanism for Multi-Modal Tracking

Share this with someone who'll enjoy it:

Abstract:Generative models (GMs) have received increasing research interest for their remarkable capacity to achieve comprehensive understanding. However, their potential application in the domain of multi-modal tracking has remained relatively unexplored. In this context, we seek to uncover the potential of harnessing generative techniques to address the critical challenge, information fusion, in multi-modal tracking. In this paper, we delve into two prominent GM techniques, namely, Conditional Generative Adversarial Networks (CGANs) and Diffusion Models (DMs). Different from the standard fusion process where the features from each modality are directly fed into the fusion block, we condition these multi-modal features with random noise in the GM framework, effectively transforming the original training samples into harder instances. This design excels at extracting discriminative clues from the features, enhancing the ultimate tracking performance. To quantitatively gauge the effectiveness of our approach, we conduct extensive experiments across two multi-modal tracking tasks, three baseline methods, and three challenging benchmarks. The experimental results demonstrate that the proposed generative-based fusion mechanism achieves state-of-the-art performance, setting new records on LasHeR and RGBD1K.

* 10 figures, 8 tables

View paper on

Share this with someone who'll enjoy it:

Title:Generative-based Fusion Mechanism for Multi-Modal Tracking

Paper and Code