Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Ditto: A Simple and Efficient Approach to Improve Sentence Embeddings

May 18, 2023

Qian Chen, Wen Wang, Qinglin Zhang, Siqi Zheng, Chong Deng, Hai Yu, Jiaqing Liu, Yukun Ma, Chong Zhang

Figure 1 for Ditto: A Simple and Efficient Approach to Improve Sentence Embeddings

Figure 2 for Ditto: A Simple and Efficient Approach to Improve Sentence Embeddings

Figure 3 for Ditto: A Simple and Efficient Approach to Improve Sentence Embeddings

Figure 4 for Ditto: A Simple and Efficient Approach to Improve Sentence Embeddings

Share this with someone who'll enjoy it:

Abstract:Prior studies diagnose the anisotropy problem in sentence representations from pre-trained language models, e.g., BERT, without fine-tuning. Our analysis reveals that the sentence embeddings from BERT suffer from a bias towards uninformative words, limiting the performance in semantic textual similarity (STS) tasks. To address this bias, we propose a simple and efficient unsupervised approach, Diagonal Attention Pooling (Ditto), which weights words with model-based importance estimations and computes the weighted average of word representations from pre-trained models as sentence embeddings. Ditto can be easily applied to any pre-trained language model as a postprocessing operation. Compared to prior sentence embedding approaches, Ditto does not add parameters nor requires any learning. Empirical evaluations demonstrate that our proposed Ditto can alleviate the anisotropy problem and improve various pre-trained models on STS tasks.

* 7 pages

View paper on

Share this with someone who'll enjoy it:

Title:Ditto: A Simple and Efficient Approach to Improve Sentence Embeddings

Paper and Code