Title: Pointwise Paraphrase Appraisal is Potentially Problematic

Jun 05, 2020

Hannah Chen, Yangfeng Ji, David Evans

Abstract: The prevailing approach for training and evaluating paraphrase identification models is constructed as a binary classification problem: the model is given a pair of sentences, and is judged by how accurately it classifies pairs as either paraphrases or non-paraphrases. This pointwise evaluation method does not match the objective of most real-world applications well, so the goal of our work is to understand how models that perform well under pointwise evaluation may fail in practice, and to find better methods for evaluating paraphrase identification models. As a first step towards that goal, we show that although the standard way of fine-tuning BERT for paraphrase identification, pairing two sentences as one sequence, results in a model with state-of-the-art performance, that model may perform poorly on simple tasks like identifying pairs with two identical sentences. Moreover, we show that these models may even assign a pair of randomly selected sentences a higher paraphrase score than a pair of identical ones.

* ACL 2020 Student Research Workshop
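
The two failure modes named in the abstract (identical pairs not recognized as paraphrases, and random pairs scoring higher than identical ones) suggest a simple sanity check that can be run against any pairwise scorer. The following is a minimal sketch of such a check, not the authors' evaluation code: the function names, the `threshold` parameter, and the token-overlap scorer are all assumptions made for illustration, and the toy scorer merely stands in for a fine-tuned model.

```python
import random
from typing import Callable, Dict, List

# Assumed interface: any model mapping a sentence pair to a paraphrase
# score in [0, 1] can be plugged in here.
ScoreFn = Callable[[str, str], float]


def jaccard_score(a: str, b: str) -> float:
    """Toy stand-in scorer (not a BERT model): token-set overlap."""
    tokens_a, tokens_b = set(a.lower().split()), set(b.lower().split())
    if not tokens_a and not tokens_b:
        return 1.0
    return len(tokens_a & tokens_b) / len(tokens_a | tokens_b)


def identity_sanity_check(
    score_fn: ScoreFn,
    sentences: List[str],
    threshold: float = 0.5,  # hypothetical decision cut-off
    seed: int = 0,
) -> Dict[str, float]:
    """Compare scores of identical pairs against randomly drawn pairs."""
    if len(sentences) < 2:
        raise ValueError("need at least two sentences")
    rng = random.Random(seed)
    missed = 0      # identical pair scored below the threshold
    outranked = 0   # random pair scored above the identical pair
    for i, sentence in enumerate(sentences):
        identical = score_fn(sentence, sentence)
        # Draw a partner from a different position in the list.
        j = rng.randrange(len(sentences) - 1)
        if j >= i:
            j += 1
        randomized = score_fn(sentence, sentences[j])
        if identical < threshold:
            missed += 1
        if randomized > identical:
            outranked += 1
    n = len(sentences)
    return {
        "identical_missed": missed / n,
        "random_outranks_identical": outranked / n,
    }


if __name__ == "__main__":
    demo = [
        "the cat sat on the mat",
        "stock prices fell sharply today",
        "she plays the violin",
    ]
    print(identity_sanity_check(jaccard_score, demo))
```

A well-behaved scorer should drive both rates towards zero. To test a real model, `jaccard_score` would be replaced by a wrapper around that model's pair-scoring call.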
