Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Low Rank Fusion based Transformers for Multimodal Sequences

Jul 04, 2020

Saurav Sahay, Eda Okur, Shachi H Kumar, Lama Nachman

Figure 1 for Low Rank Fusion based Transformers for Multimodal Sequences

Figure 2 for Low Rank Fusion based Transformers for Multimodal Sequences

Figure 3 for Low Rank Fusion based Transformers for Multimodal Sequences

Figure 4 for Low Rank Fusion based Transformers for Multimodal Sequences

Share this with someone who'll enjoy it:

Abstract:Our senses individually work in a coordinated fashion to express our emotional intentions. In this work, we experiment with modeling modality-specific sensory signals to attend to our latent multimodal emotional intentions and vice versa expressed via low-rank multimodal fusion and multimodal transformers. The low-rank factorization of multimodal fusion amongst the modalities helps represent approximate multiplicative latent signal interactions. Motivated by the work of~\cite{tsai2019MULT} and~\cite{Liu_2018}, we present our transformer-based cross-fusion architecture without any over-parameterization of the model. The low-rank fusion helps represent the latent signal interactions while the modality-specific attention helps focus on relevant parts of the signal. We present two methods for the Multimodal Sentiment and Emotion Recognition results on CMU-MOSEI, CMU-MOSI, and IEMOCAP datasets and show that our models have lesser parameters, train faster and perform comparably to many larger fusion-based architectures.

* ACL 2020 workshop on Second Grand Challenge and Workshop on Multimodal Language

View paper on

Share this with someone who'll enjoy it:

Title:Low Rank Fusion based Transformers for Multimodal Sequences

Paper and Code