Frederick Ducatelle

A new approach for fine-tuning sentence transformers for intent classification and out-of-scope detection tasks

Oct 17, 2024

Tianyi Zhang, Atta Norouzian, Aanchan Mohan, Frederick Ducatelle

Abstract: In virtual assistant (VA) systems, it is important to reject or redirect user queries that fall outside the scope of the system. One of the most accurate approaches to out-of-scope (OOS) rejection is to combine it with the task of intent classification on in-scope queries, and to use methods based on the similarity of embeddings produced by transformer-based sentence encoders. Typically, such encoders are fine-tuned for the intent-classification task using cross-entropy loss. Recent work has shown that while this produces suitable embeddings for intent classification, it also tends to disperse in-scope embeddings over the full sentence embedding space. The in-scope embeddings can then overlap with OOS embeddings, which makes OOS rejection difficult. The problem is compounded when OOS data is unknown. To mitigate this issue, our work proposes to regularize the cross-entropy loss with an in-scope embedding reconstruction loss learned using an auto-encoder. Our method achieves a 1-4% improvement in the area under the precision-recall curve for rejecting OOS instances, without compromising intent classification performance.
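The regularized objective described in the abstract can be illustrated with a minimal sketch. It assumes the reconstruction term is a mean squared error between a sentence embedding and its auto-encoder reconstruction, and that the two terms are combined with a scalar weight. The abstract specifies neither, so the function names, the MSE form, and the `weight` parameter are all hypothetical and are not taken from the authors' implementation.

```python
import math


def cross_entropy(logits, label):
    """Cross-entropy of one example, given raw class logits and the true class index."""
    # Log-sum-exp with the max subtracted for numerical stability.
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_z - logits[label]


def reconstruction_loss(embedding, reconstruction):
    """Mean squared error between an embedding and its reconstruction (assumed form)."""
    return sum((e - r) ** 2 for e, r in zip(embedding, reconstruction)) / len(embedding)


def regularized_loss(logits, label, embedding, reconstruction, weight=0.1):
    """Cross-entropy plus a weighted reconstruction penalty.

    `weight` is a hypothetical hyperparameter; the abstract does not give a value.
    """
    return cross_entropy(logits, label) + weight * reconstruction_loss(embedding, reconstruction)


# Toy in-scope example: three intent classes and a four-dimensional embedding.
loss = regularized_loss(
    logits=[2.0, 0.5, -1.0],
    label=0,
    embedding=[0.2, -0.1, 0.4, 0.0],
    reconstruction=[0.1, -0.1, 0.5, 0.1],
)
print(f"regularized loss: {loss:.4f}")
```

In a real training loop the logits, embedding, and reconstruction would come from the sentence encoder, a classification head, and the auto-encoder. Here they are plain lists so that the arithmetic of the combined loss is easy to follow.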

* Appearing at Empirical Methods in Natural Language Processing 2024 - Industry Track
