Title: How Alignment Helps Make the Most of Multimodal Data

May 14, 2024

Christian Arnold, Andreas Küpfer

Abstract: When studying political communication, combining the information from text, audio, and video signals promises to reflect the richness of human communication more comprehensively than confining the analysis to individual modalities alone. However, the heterogeneity, connectedness, and interaction of such multimodal data are challenging to model. We argue that aligning the respective modalities can be an essential step toward fully realizing the potential of multimodal data because it informs the model with human understanding. Exploring aligned modalities unlocks promising analytical leverage. First, it allows us to make the most of the information in the data, which, inter alia, opens the door to better-quality predictions. Second, it makes it possible to answer research questions that span multiple modalities with cross-modal queries. Finally, alignment addresses concerns about model interpretability. We illustrate the utility of this approach by analyzing how German MPs address members of the far-right AfD in their speeches and by predicting the tone of video advertising in the context of the 2020 US presidential race. Our paper offers important insights to anyone keen to analyze multimodal data effectively.

* Working Paper
