Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:AttentionViz: A Global View of Transformer Attention

May 04, 2023

Catherine Yeh, Yida Chen, Aoyu Wu, Cynthia Chen, Fernanda Viégas, Martin Wattenberg

Figure 1 for AttentionViz: A Global View of Transformer Attention

Figure 2 for AttentionViz: A Global View of Transformer Attention

Figure 3 for AttentionViz: A Global View of Transformer Attention

Figure 4 for AttentionViz: A Global View of Transformer Attention

Share this with someone who'll enjoy it:

Abstract:Transformer models are revolutionizing machine learning, but their inner workings remain mysterious. In this work, we present a new visualization technique designed to help researchers understand the self-attention mechanism in transformers that allows these models to learn rich, contextual relationships between elements of a sequence. The main idea behind our method is to visualize a joint embedding of the query and key vectors used by transformer models to compute attention. Unlike previous attention visualization techniques, our approach enables the analysis of global patterns across multiple input sequences. We create an interactive visualization tool, AttentionViz, based on these joint query-key embeddings, and use it to study attention mechanisms in both language and vision transformers. We demonstrate the utility of our approach in improving model understanding and offering new insights about query-key interactions through several application scenarios and expert feedback.

* 11 pages, 13 figures

View paper on

Share this with someone who'll enjoy it:

Title:AttentionViz: A Global View of Transformer Attention

Paper and Code