Abstract: With their advanced capabilities, Large Language Models (LLMs) can generate highly convincing and contextually relevant fake news, which can contribute to the spread of misinformation. Although fake news detection for human-written text has been studied extensively, detecting LLM-generated fake news remains under-explored. This research measures how effectively detectors identify LLM-paraphrased fake news, in particular whether adding a paraphrase step to the detection pipeline helps or hinders detection. This study contributes: (1) we show that detectors struggle more to detect LLM-paraphrased fake news than human-written fake news; (2) we identify which models excel at which tasks (evading detection, paraphrasing to evade detection, and paraphrasing for semantic similarity); (3) via LIME explanations, we identify a possible reason for detection failures: sentiment shift; (4) we uncover a worrisome trend for paraphrase quality measurement: samples that exhibit a sentiment shift despite a high BERTScore; and (5) we provide a pair of datasets that augment existing datasets with paraphrase outputs and scores. The datasets are available on GitHub.
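To make contribution (4) concrete, the snippet below is a minimal illustrative sketch, not the paper's exact pipeline: it scores a hypothetical original/paraphrase pair with the bert-score library and compares sentiment labels from an off-the-shelf classifier, flagging the case where semantic similarity stays high while sentiment flips. The example texts, the 0.9 threshold, and the model choices are assumptions made here for illustration only.

```python
# Illustrative sketch: flag paraphrases that keep a high BERTScore
# yet flip sentiment relative to the original text.
from bert_score import score          # pip install bert-score
from transformers import pipeline     # pip install transformers

# Hypothetical example pair; in the paper such pairs would come from the augmented datasets.
original   = ["The new policy was praised by analysts as a major success."]
paraphrase = ["Analysts dismissed the new policy as a questionable move."]

# Semantic similarity via BERTScore (we keep only the F1 component).
_, _, f1 = score(paraphrase, original, lang="en", verbose=False)

# Sentiment labels from an off-the-shelf classifier.
clf = pipeline("sentiment-analysis")
orig_label = clf(original[0])[0]["label"]
para_label = clf(paraphrase[0])[0]["label"]

# High BERTScore plus a changed sentiment label is the worrisome case
# described in contribution (4); 0.9 is an arbitrary illustrative threshold.
if f1.item() > 0.9 and orig_label != para_label:
    print(f"Sentiment shift despite BERTScore F1 = {f1.item():.3f}")
```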
Abstract: This paper uses the BERT model, a transformer-based architecture, to solve Task 4A (Sentiment Analysis in Twitter, English) of SemEval-2017. BERT is a powerful large language model that performs well on classification tasks even when the amount of training data is small. For this experiment, we use the BERT\textsubscript{BASE} model, which has 12 hidden layers. This model achieves better accuracy, precision, recall, and F1 score than the Naive Bayes baseline model, and it performs better on the binary classification subtasks than on the multi-class classification subtasks. We also considered ethical issues throughout this experiment, as Twitter data contains personal and sensitive information. The dataset and code used in our experiment can be found in this GitHub repository.
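The sketch below shows, under assumptions about tooling, how a BERT\textsubscript{BASE} classifier of the kind described here could be set up with the Hugging Face transformers library; the checkpoint name, label count, and example tweet are illustrative and may differ from the paper's actual configuration and training setup.

```python
# Minimal sketch of a BERT-base sentiment classifier; hyperparameters and
# preprocessing in the paper's experiment may differ.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

# BERT-base (uncased), 12 hidden layers, with a classification head on top.
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=3)  # e.g. positive / neutral / negative

# Hypothetical tweet; real inputs would come from the SemEval-2017 Task 4A data.
inputs = tokenizer("I really enjoyed this movie!", return_tensors="pt",
                   truncation=True, max_length=128)

with torch.no_grad():
    logits = model(**inputs).logits
pred = logits.argmax(dim=-1).item()
print(f"Predicted class id: {pred}")  # meaningful only after fine-tuning
```

In practice the classification head would be fine-tuned on the Task 4A training tweets before evaluation; the untuned head shown here produces arbitrary predictions.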