Ali Forooghi

Whitening Not Recommended for Classification Tasks in LLMs

Jul 16, 2024

Ali Forooghi, Shaghayegh Sadeghi, Jianguo Lu

Abstract: Sentence embedding is a cornerstone of NLP. Whitening has been claimed to be an effective operation for improving the quality of embeddings obtained from Large Language Models (LLMs). However, we find that the efficacy of whitening is model-dependent and task-dependent. In particular, whitening degrades embeddings for classification tasks. This conclusion is supported by extensive experiments. We also explore a variety of whitening operations, including PCA, ZCA, PCA-Cor, ZCA-Cor, and Cholesky whitening. A by-product of our research is an embedding evaluation platform for LLMs called SentEval+.
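For readers unfamiliar with the operation, the following is a minimal NumPy sketch of three of the whitening variants named in the abstract (PCA, ZCA, and Cholesky). It is not the authors' implementation and does not reproduce SentEval+. The function name `whiten`, the `eps` stabilizer, and the choice of sample covariance are assumptions made for illustration. The correlation-based variants (PCA-Cor, ZCA-Cor) are omitted.

```python
import numpy as np

def whiten(X, method="pca", eps=1e-8):
    """Whiten the rows of X, shape (n_samples, n_features).

    Returns (Z, W), where Z = (X - mean) @ W.T has a sample covariance
    close to the identity. Illustrative sketch only; `eps` is an assumed
    stabilizer added to the eigenvalues (unused for Cholesky).
    """
    Xc = X - X.mean(axis=0, keepdims=True)      # center each feature
    cov = Xc.T @ Xc / (X.shape[0] - 1)          # sample covariance

    if method == "cholesky":
        # Factor the precision matrix: cov^{-1} = L L^T, then W = L^T.
        L = np.linalg.cholesky(np.linalg.inv(cov))
        W = L.T
    else:
        # Eigendecomposition of the symmetric covariance: cov = U diag(lam) U^T.
        lam, U = np.linalg.eigh(cov)
        scale = 1.0 / np.sqrt(lam + eps)
        if method == "pca":
            W = (U * scale).T                   # diag(lam)^{-1/2} U^T
        elif method == "zca":
            W = (U * scale) @ U.T               # U diag(lam)^{-1/2} U^T
        else:
            raise ValueError(f"unknown method: {method}")

    return Xc @ W.T, W
```

All three variants produce embeddings with (approximately) identity covariance; they differ only by a rotation of the whitened space. ZCA keeps the whitened vectors as close as possible to the original axes, while PCA aligns them with the principal components.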
