Title: ConTextual Mask Auto-Encoder for Dense Passage Retrieval

Aug 16, 2022

Xing Wu, Guangyuan Ma, Meng Lin, Zijia Lin, Zhongyuan Wang, Songlin Hu

Abstract: Dense passage retrieval aims to retrieve the passages relevant to a query from a large corpus based on dense representations (i.e., vectors) of the query and the passages. Recent studies have explored improving pre-trained language models to boost dense retrieval performance. This paper proposes CoT-MAE (ConTextual Masked Auto-Encoder), a simple yet effective generative pre-training method for dense passage retrieval. CoT-MAE employs an asymmetric encoder-decoder architecture that learns to compress sentence semantics into a dense vector through self-supervised and context-supervised masked auto-encoding. Specifically, self-supervised masked auto-encoding learns to model the semantics of the tokens inside a text span, and context-supervised masked auto-encoding learns to model the semantic correlation between text spans. We conduct experiments on large-scale passage retrieval benchmarks and show considerable improvements over strong baselines, demonstrating the high efficiency of CoT-MAE.
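The two ideas in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The function names, the `[MASK]` placeholder, and the masking ratios (`enc_ratio`, `dec_ratio`) are assumptions chosen for illustration, since the abstract does not specify them. The first part prepares one pre-training example from two neighboring text spans, masking each span independently; the second part ranks passages by the inner product of dense vectors, one common scoring choice for dense retrieval.

```python
import random

MASK = "[MASK]"  # hypothetical placeholder token, not taken from the paper


def mask_tokens(tokens, ratio, rng):
    """Replace about `ratio` of the tokens with MASK.

    Returns the masked sequence and a dict {position: original token}
    holding the reconstruction targets.
    """
    n_mask = min(len(tokens), max(1, round(len(tokens) * ratio)))
    positions = set(rng.sample(range(len(tokens)), n_mask))
    masked = [MASK if i in positions else tok for i, tok in enumerate(tokens)]
    labels = {i: tokens[i] for i in positions}
    return masked, labels


def make_training_pair(span_a, span_b, rng, enc_ratio=0.3, dec_ratio=0.45):
    """Build one example from two neighboring spans of the same document.

    Assumption: the encoder sees masked span_a (self-supervised targets),
    while a decoder would see masked span_b together with the encoder's
    dense vector for span_a (context-supervised targets). The ratios are
    illustrative defaults only.
    """
    enc_input, enc_labels = mask_tokens(span_a, enc_ratio, rng)
    dec_input, dec_labels = mask_tokens(span_b, dec_ratio, rng)
    return {
        "encoder_input": enc_input,
        "encoder_labels": enc_labels,
        "decoder_input": dec_input,
        "decoder_labels": dec_labels,
    }


def rank_passages(query_vec, passage_vecs):
    """Return passage indices sorted by inner product with the query, best first."""
    scores = [sum(q * p for q, p in zip(query_vec, vec)) for vec in passage_vecs]
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)


if __name__ == "__main__":
    rng = random.Random(0)
    span_a = "dense retrieval maps queries and passages to vectors".split()
    span_b = "relevance is then scored by comparing those vectors".split()
    example = make_training_pair(span_a, span_b, rng)
    print(example["encoder_input"])
    print(example["decoder_input"])
    print(rank_passages([1.0, 0.0], [[0.0, 1.0], [1.0, 0.0], [0.5, 0.5]]))
```

In an actual training setup, the masked sequences would be fed to Transformer models and the labels used for a masked-language-modeling loss; the sketch only shows the data preparation and the retrieval scoring step.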

* 11 pages
