Adi Zicher

What Are You Token About? Dense Retrieval as Distributions Over the Vocabulary

Dec 20, 2022

Ori Ram, Liat Bezalel, Adi Zicher, Yonatan Belinkov, Jonathan Berant, Amir Globerson

Abstract: Dual encoders are now the dominant architecture for dense retrieval. Yet, we have little understanding of how they represent text, and why this leads to good performance. In this work, we shed light on this question via distributions over the vocabulary. We propose to interpret the vector representations produced by dual encoders by projecting them into the model's vocabulary space. We show that the resulting distributions over vocabulary tokens are intuitive and contain rich semantic information. We find that this view can explain some of the failure cases of dense retrievers. For example, the inability of models to handle tail entities can be explained via a tendency of the token distributions to forget some of the tokens of those entities. We leverage this insight and propose a simple way to enrich query and passage representations with lexical information at inference time, and show that this significantly improves performance compared to the original model in out-of-domain settings.
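
The projection idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the five-token vocabulary, the one-hot `vocab_matrix`, and the helper `project_to_vocab` are all hypothetical, and a plain dot product followed by a softmax stands in for whatever projection the paper actually uses. It only shows how a dense vector can be turned into a distribution over vocabulary tokens and read off as its most probable tokens.

```python
import numpy as np

def project_to_vocab(vec, vocab_matrix, vocab, top_k=3):
    """Return the top_k (token, probability) pairs for a dense vector.

    Assumption: scores are dot products with per-token embeddings,
    normalised with a softmax. The paper's projection may differ.
    """
    logits = vocab_matrix @ vec               # one score per vocabulary token
    logits = logits - logits.max()            # shift for numerical stability
    probs = np.exp(logits) / np.exp(logits).sum()
    order = np.argsort(-probs, kind="stable")[:top_k]
    return [(vocab[i], float(probs[i])) for i in order]

# Hypothetical toy vocabulary with one-hot token embeddings (5 tokens, 8 dims).
vocab = ["paris", "france", "capital", "banana", "guitar"]
vocab_matrix = np.eye(len(vocab), 8)

# Stand-in for an encoder output: mostly "paris", partly "france".
query_vec = 2.0 * vocab_matrix[0] + 1.0 * vocab_matrix[1]

top_tokens = project_to_vocab(query_vec, vocab_matrix, vocab)
for token, prob in top_tokens:
    print(f"{token}: {prob:.3f}")
```

In this toy setting the highest-probability tokens are the ones the vector was built from. A token with low probability in such a distribution would correspond to the "forgotten" tokens that the abstract links to failures on tail entities.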
