Dataless text classification can classify documents into previously unseen labels by assigning a score to any document paired with a label description. While promising, it crucially relies on accurate descriptions of the label set for each downstream task. This reliance makes dataless classifiers highly sensitive to the choice of label descriptions and hinders the broader application of dataless classification in practice. In this paper, we ask the following question: how can we improve dataless text classification using the inputs of the downstream task dataset? Our primary solution is a clustering-based approach. Given a dataless classifier, our approach refines its set of predictions using k-means clustering. We demonstrate the broad applicability of our approach by improving two widely used classifier architectures: one that encodes text-category pairs with two independent encoders, and one that uses a single joint encoder. Experiments show that our approach consistently improves dataless classification across different datasets and makes the classifier more robust to the choice of label descriptions.
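To make the idea of k-means refinement concrete, the following is a minimal sketch of one way such a clustering-based post-processing step could look. It is illustrative only and not the paper's exact procedure: the inputs `doc_embeddings` (document representations of the downstream task inputs) and `dataless_scores` (per document-label scores from a dataless classifier) are assumed, and the default choice of one cluster per label is an assumption as well.

```python
# Illustrative sketch: refine per-document dataless scores with k-means clustering.
# Assumed (not specified by the abstract): `doc_embeddings` is an (n_docs, dim)
# array of document representations, and `dataless_scores` is an (n_docs, n_labels)
# array of scores from a dataless classifier.
import numpy as np
from sklearn.cluster import KMeans


def refine_with_kmeans(doc_embeddings, dataless_scores, n_clusters=None, seed=0):
    """Cluster the task's unlabeled inputs, then smooth the dataless
    classifier's predictions within each cluster (illustrative only)."""
    n_labels = dataless_scores.shape[1]
    k = n_clusters or n_labels  # one cluster per label by default (an assumption)
    cluster_ids = KMeans(n_clusters=k, n_init=10, random_state=seed).fit_predict(doc_embeddings)

    refined = np.empty(len(doc_embeddings), dtype=int)
    for c in range(k):
        members = cluster_ids == c
        if not members.any():
            continue
        # Give every document in the cluster the label with the highest
        # average dataless score over the cluster's members.
        refined[members] = dataless_scores[members].mean(axis=0).argmax()
    return refined
```

The intuition behind a sketch like this is that documents with similar representations should tend to share a label, so pooling the classifier's scores within a cluster can correct individual documents whose label descriptions were scored poorly.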