Title: Robust Temporal-Invariant Learning in Multimodal Disentanglement

Aug 30, 2024

Guoyang Xu, Junqi Xue, Zhenxi Song, Yuxin Liu, Zirui Wang, Min Zhang, Zhiguo Zhang

Abstract: Multimodal sentiment recognition aims to learn representations from different modalities to identify human emotions. However, previous works do not suppress the frame-level redundancy inherent in continuous time series, resulting in incomplete, noisy modality representations. To address this issue, we propose temporal-invariant learning, which minimizes the distributional differences between time steps to capture smoother time-series patterns, thereby enhancing the quality of the representations and the robustness of the model. To fully exploit the rich semantic information in textual knowledge, we propose a Text-Driven Fusion Module (TDFM). To guide cross-modal interactions, TDFM evaluates the correlations between different modalities through modality-invariant representations. Furthermore, we introduce a modality discriminator to disentangle modality-invariant and modality-specific subspaces. Experimental results on two public datasets demonstrate the superiority of our model.
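As a rough illustration of the temporal-invariant idea described in the abstract, the sketch below penalizes distributional differences between adjacent time steps. It is not taken from the paper or its repository: the abstract does not say which divergence is used, so the symmetric KL divergence over softmax-normalized frame features is an assumption, and the function names (`softmax`, `kl_divergence`, `temporal_invariance_penalty`) are hypothetical.

```python
import math
from typing import List, Sequence


def softmax(x: Sequence[float]) -> List[float]:
    """Turn a feature vector into a probability distribution."""
    m = max(x)  # subtract the max for numerical stability
    exps = [math.exp(v - m) for v in x]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p: Sequence[float], q: Sequence[float], eps: float = 1e-12) -> float:
    """KL(p || q), with a small epsilon to avoid log(0)."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def temporal_invariance_penalty(frames: Sequence[Sequence[float]]) -> float:
    """Mean symmetric KL divergence between adjacent time steps.

    ASSUMPTION: symmetric KL over softmax-normalized features is chosen
    here only for illustration; the paper may use a different measure.
    """
    if len(frames) < 2:
        return 0.0
    dists = [softmax(f) for f in frames]
    total = 0.0
    for p, q in zip(dists, dists[1:]):
        total += 0.5 * (kl_divergence(p, q) + kl_divergence(q, p))
    return total / (len(dists) - 1)


# A constant sequence incurs no penalty; a jittery one incurs a positive penalty.
smooth = [[1.0, 2.0, 3.0]] * 4
noisy = [[1.0, 2.0, 3.0], [3.0, -1.0, 0.0], [0.0, 5.0, -2.0]]
smooth_penalty = temporal_invariance_penalty(smooth)
noisy_penalty = temporal_invariance_penalty(noisy)
```

In a training setup, a term like this would typically be added to the task loss with a weighting coefficient, so that frame-level fluctuations are discouraged without forcing all time steps to collapse to a single representation.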

* 5 pages, 2 figures, this is the first version. The code is available at https://github.com/X-G-Y/RTIL
