Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:How Large Language Models are Transforming Machine-Paraphrased Plagiarism

Oct 07, 2022

Jan Philip Wahle, Terry Ruas, Frederic Kirstein, Bela Gipp

Figure 1 for How Large Language Models are Transforming Machine-Paraphrased Plagiarism

Figure 2 for How Large Language Models are Transforming Machine-Paraphrased Plagiarism

Figure 3 for How Large Language Models are Transforming Machine-Paraphrased Plagiarism

Figure 4 for How Large Language Models are Transforming Machine-Paraphrased Plagiarism

Share this with someone who'll enjoy it:

Abstract:The recent success of large language models for text generation poses a severe threat to academic integrity, as plagiarists can generate realistic paraphrases indistinguishable from original work. However, the role of large autoregressive transformers in generating machine-paraphrased plagiarism and their detection is still developing in the literature. This work explores T5 and GPT-3 for machine-paraphrase generation on scientific articles from arXiv, student theses, and Wikipedia. We evaluate the detection performance of six automated solutions and one commercial plagiarism detection software and perform a human study with 105 participants regarding their detection performance and the quality of generated examples. Our results suggest that large models can rewrite text humans have difficulty identifying as machine-paraphrased (53% mean acc.). Human experts rate the quality of paraphrases generated by GPT-3 as high as original texts (clarity 4.0/5, fluency 4.2/5, coherence 3.8/5). The best-performing detection model (GPT-3) achieves a 66% F1-score in detecting paraphrases.

View paper on

Share this with someone who'll enjoy it:

Title:How Large Language Models are Transforming Machine-Paraphrased Plagiarism

Paper and Code