Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Augmenting Zero-Shot Dense Retrievers with Plug-in Mixture-of-Memories

Feb 07, 2023

Suyu Ge, Chenyan Xiong, Corby Rosset, Arnold Overwijk, Jiawei Han, Paul Bennett

Figure 1 for Augmenting Zero-Shot Dense Retrievers with Plug-in Mixture-of-Memories

Figure 2 for Augmenting Zero-Shot Dense Retrievers with Plug-in Mixture-of-Memories

Figure 3 for Augmenting Zero-Shot Dense Retrievers with Plug-in Mixture-of-Memories

Figure 4 for Augmenting Zero-Shot Dense Retrievers with Plug-in Mixture-of-Memories

Share this with someone who'll enjoy it:

Abstract:In this paper we improve the zero-shot generalization ability of language models via Mixture-Of-Memory Augmentation (MoMA), a mechanism that retrieves augmentation documents from multiple information corpora ("external memories"), with the option to "plug in" new memory at inference time. We develop a joint learning mechanism that trains the augmentation component with latent labels derived from the end retrieval task, paired with hard negatives from the memory mixture. We instantiate the model in a zero-shot dense retrieval setting by augmenting a strong T5-based retriever with MoMA. Our model, MoMA, obtains strong zero-shot retrieval accuracy on the eighteen tasks included in the standard BEIR benchmark. It outperforms systems that seek generalization from increased model parameters and computation steps. Our analysis further illustrates the necessity of augmenting with mixture-of-memory for robust generalization, the benefits of augmentation learning, and how MoMA utilizes the plug-in memory at inference time without changing its parameters. We plan to open source our code.

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Augmenting Zero-Shot Dense Retrievers with Plug-in Mixture-of-Memories

Paper and Code