Title: Learning Effective Representations for Retrieval Using Self-Distillation with Adaptive Relevance Margins

Jul 31, 2024

Lukas Gienapp, Niklas Deckers, Martin Potthast, Harrisen Scells

Abstract: Representation-based retrieval models, so-called biencoders, estimate the relevance of a document to a query by calculating the similarity of their respective embeddings. Current state-of-the-art biencoders are trained using an expensive training regime involving knowledge distillation from a teacher model and batch sampling. Instead of relying on a teacher model, we contribute a novel parameter-free loss function for self-supervision that exploits the pre-trained language modeling capabilities of the encoder model as a training signal, eliminating the need for batch sampling by performing implicit hard negative mining. We investigate the capabilities of our proposed approach through extensive ablation studies, demonstrating that self-distillation can match the effectiveness of teacher distillation using only 13.5% of the data, while offering a speedup in training time between 3x and 15x compared to parametrized losses. Code and data are made openly available.
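As a rough illustration of the biencoder scoring step the abstract describes, the sketch below ranks documents by the similarity of their embeddings to a query embedding. It makes several assumptions: cosine similarity stands in for the unspecified similarity function, the embeddings are hand-written toy vectors rather than encoder outputs, and the names `cosine_similarity` and `rank_documents` are hypothetical. It does not implement the paper's loss function, which the abstract does not specify.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def rank_documents(query_emb, doc_embs):
    """Return (doc_id, score) pairs sorted by descending similarity.

    In a real biencoder, query_emb and each entry of doc_embs would come
    from the encoder model; here they are assumed to be given.
    """
    scored = [
        (doc_id, cosine_similarity(query_emb, emb))
        for doc_id, emb in doc_embs.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy 2-dimensional embeddings (illustrative values only).
query = [1.0, 0.0]
docs = {
    "d1": [1.0, 0.0],  # same direction as the query
    "d2": [0.0, 1.0],  # orthogonal to the query
    "d3": [1.0, 1.0],  # partially aligned with the query
}

ranking = rank_documents(query, docs)
for doc_id, score in ranking:
    print(f"{doc_id}: {score:.3f}")
```

Because queries and documents are embedded independently, document embeddings can be computed once and reused across queries; only the similarity computation runs at query time.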

* 9 Pages, 4 Tables, 6 Figures
