Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Balanced Multimodal Learning via On-the-fly Gradient Modulation

Mar 29, 2022

Xiaokang Peng, Yake Wei, Andong Deng, Dong Wang, Di Hu

Figure 1 for Balanced Multimodal Learning via On-the-fly Gradient Modulation

Figure 2 for Balanced Multimodal Learning via On-the-fly Gradient Modulation

Figure 3 for Balanced Multimodal Learning via On-the-fly Gradient Modulation

Figure 4 for Balanced Multimodal Learning via On-the-fly Gradient Modulation

Share this with someone who'll enjoy it:

Abstract:Multimodal learning helps to comprehensively understand the world, by integrating different senses. Accordingly, multiple input modalities are expected to boost model performance, but we actually find that they are not fully exploited even when the multimodal model outperforms its uni-modal counterpart. Specifically, in this paper we point out that existing multimodal discriminative models, in which uniform objective is designed for all modalities, could remain under-optimized uni-modal representations, caused by another dominated modality in some scenarios, e.g., sound in blowing wind event, vision in drawing picture event, etc. To alleviate this optimization imbalance, we propose on-the-fly gradient modulation to adaptively control the optimization of each modality, via monitoring the discrepancy of their contribution towards the learning objective. Further, an extra Gaussian noise that changes dynamically is introduced to avoid possible generalization drop caused by gradient modulation. As a result, we achieve considerable improvement over common fusion methods on different multimodal tasks, and this simple strategy can also boost existing multimodal methods, which illustrates its efficacy and versatility. The source code is available at \url{https://github.com/GeWu-Lab/OGM-GE_CVPR2022}.

* Accepted by CVPR 2022 (ORAL)

View paper on

Share this with someone who'll enjoy it:

Title:Balanced Multimodal Learning via On-the-fly Gradient Modulation

Paper and Code