Title: Exploring the Residual Stream of Transformers

Dec 19, 2023

Zeping Yu, Kailai Yang, Zhiwei Liu, Sophia Ananiadou

Abstract: Transformer-based models have achieved great breakthroughs in recent years. However, many important questions about why these models produce such strong outputs remain unanswered. We do not know how to locate the important parameters that store the knowledge used to predict the next word, or whether these parameters sit in the same layer/module or in different ones. Moreover, we do not understand the mechanism that merges this knowledge into the final embedding for next-word prediction. In this paper, we explore the residual stream of transformers to increase interpretability. We find that the mechanism behind the residual connection is a direct addition function on before-softmax values, so the probabilities of tokens with larger before-softmax values increase. Moreover, we prove that using the log probability increase as a contribution score is reasonable, and based on this we can locate important parameters. In addition, we propose a method to analyze how previous layers affect upper layers by comparing inner products. The experimental results and a case study show that our research can increase the interpretability of transformer-based models. We will release our code at https://github.com/zepingyu0512/residualstream.
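The two central claims of the abstract (the residual connection acts as direct addition on before-softmax values, and the log probability increase can serve as a contribution score) can be illustrated with a toy sketch. This is not the authors' released code. It assumes a simplified model in which the before-softmax values are a plain linear map of the residual stream, with no final layer normalization, and the names `unembed`, `residual`, `update`, and `log_prob_increase` are hypothetical.

```python
import numpy as np

def log_softmax(logits):
    """Numerically stable log-softmax over a 1-D array of before-softmax values."""
    shifted = logits - np.max(logits)
    return shifted - np.log(np.sum(np.exp(shifted)))

def log_prob_increase(residual, update, unembed, token):
    """Change in log p(token) when `update` is added to the residual stream.

    Assumption: logits = unembed @ residual (no layer norm), so this is a
    simplified illustration rather than the paper's exact procedure.
    """
    before = log_softmax(unembed @ residual)[token]
    after = log_softmax(unembed @ (residual + update))[token]
    return after - before

# Toy setup: 4 tokens, 4-dimensional residual stream, identity unembedding.
unembed = np.eye(4)
residual = np.zeros(4)
update = np.array([0.0, 0.0, 1.0, 0.0])  # hypothetical layer output favoring token 2

# Adding to the residual stream adds directly to the before-softmax values.
assert np.allclose(unembed @ (residual + update),
                   unembed @ residual + unembed @ update)

# Contribution score of the update for every token in the toy vocabulary.
scores = [log_prob_increase(residual, update, unembed, t) for t in range(4)]
print(scores)
```

In this toy case only token 2 receives a larger before-softmax value, so it is the only token whose log probability rises; the other tokens lose probability mass through the softmax normalization.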
