Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jonathan Dodge

Fake News Detection After LLM Laundering: Measurement and Explanation

Jan 29, 2025

Rupak Kumar Das, Jonathan Dodge

Figure 1 for Fake News Detection After LLM Laundering: Measurement and Explanation

Figure 2 for Fake News Detection After LLM Laundering: Measurement and Explanation

Figure 3 for Fake News Detection After LLM Laundering: Measurement and Explanation

Figure 4 for Fake News Detection After LLM Laundering: Measurement and Explanation

Abstract:With their advanced capabilities, Large Language Models (LLMs) can generate highly convincing and contextually relevant fake news, which can contribute to disseminating misinformation. Though there is much research on fake news detection for human-written text, the field of detecting LLM-generated fake news is still under-explored. This research measures the efficacy of detectors in identifying LLM-paraphrased fake news, in particular, determining whether adding a paraphrase step in the detection pipeline helps or impedes detection. This study contributes: (1) Detectors struggle to detect LLM-paraphrased fake news more than human-written text, (2) We find which models excel at which tasks (evading detection, paraphrasing to evade detection, and paraphrasing for semantic similarity). (3) Via LIME explanations, we discovered a possible reason for detection failures: sentiment shift. (4) We discover a worrisome trend for paraphrase quality measurement: samples that exhibit sentiment shift despite a high BERTSCORE. (5) We provide a pair of datasets augmenting existing datasets with paraphrase outputs and scores. The dataset is available on GitHub

Via

Access Paper or Ask Questions

Demystifying Legalese: An Automated Approach for Summarizing and Analyzing Overlaps in Privacy Policies and Terms of Service

Apr 17, 2024

Shikha Soneji, Mitchell Hoesing, Sujay Koujalgi, Jonathan Dodge

Abstract:The complexities of legalese in terms and policy documents can bind individuals to contracts they do not fully comprehend, potentially leading to uninformed data sharing. Our work seeks to alleviate this issue by developing language models that provide automated, accessible summaries and scores for such documents, aiming to enhance user understanding and facilitate informed decisions. We compared transformer-based and conventional models during training on our dataset, and RoBERTa performed better overall with a remarkable 0.74 F1-score. Leveraging our best-performing model, RoBERTa, we highlighted redundancies and potential guideline violations by identifying overlaps in GDPR-required documents, underscoring the necessity for stricter GDPR compliance.

Via

Access Paper or Ask Questions

Experiments with Encoding Structured Data for Neural Networks

Feb 15, 2024

Sujay Nagesh Koujalgi, Jonathan Dodge

Figure 1 for Experiments with Encoding Structured Data for Neural Networks

Figure 2 for Experiments with Encoding Structured Data for Neural Networks

Figure 3 for Experiments with Encoding Structured Data for Neural Networks

Figure 4 for Experiments with Encoding Structured Data for Neural Networks

Abstract:The project's aim is to create an AI agent capable of selecting good actions in a game-playing domain called Battlespace. Sequential domains like Battlespace are important testbeds for planning problems, as such, the Department of Defense uses such domains for wargaming exercises. The agents we developed combine Monte Carlo Tree Search (MCTS) and Deep Q-Network (DQN) techniques in an effort to navigate the game environment, avoid obstacles, interact with adversaries, and capture the flag. This paper will focus on the encoding techniques we explored to present complex structured data stored in a Python class, a necessary precursor to an agent.

* 18 pages, 8 figures, 2 tables

Via

Access Paper or Ask Questions

Identifying Reasoning Flaws in Planning-Based RL Using Tree Explanations

Sep 28, 2021

Kin-Ho Lam, Zhengxian Lin, Jed Irvine, Jonathan Dodge, Zeyad T Shureih, Roli Khanna, Minsuk Kahng, Alan Fern

Figure 1 for Identifying Reasoning Flaws in Planning-Based RL Using Tree Explanations

Figure 2 for Identifying Reasoning Flaws in Planning-Based RL Using Tree Explanations

Figure 3 for Identifying Reasoning Flaws in Planning-Based RL Using Tree Explanations

Figure 4 for Identifying Reasoning Flaws in Planning-Based RL Using Tree Explanations

Abstract:Enabling humans to identify potential flaws in an agent's decision making is an important Explainable AI application. We consider identifying such flaws in a planning-based deep reinforcement learning (RL) agent for a complex real-time strategy game. In particular, the agent makes decisions via tree search using a learned model and evaluation function over interpretable states and actions. This gives the potential for humans to identify flaws at the level of reasoning steps in the tree, even if the entire reasoning process is too complex to understand. However, it is unclear whether humans will be able to identify such flaws due to the size and complexity of trees. We describe a user interface and case study, where a small group of AI experts and developers attempt to identify reasoning flaws due to inaccurate agent learning. Overall, the interface allowed the group to identify a number of significant flaws of varying types, demonstrating the promise of this approach.

Via

Access Paper or Ask Questions

Explaining Reinforcement Learning to Mere Mortals: An Empirical Study

Mar 22, 2019

Andrew Anderson, Jonathan Dodge, Amrita Sadarangani, Zoe Juozapaitis, Evan Newman, Jed Irvine, Souti Chattopadhyay, Alan Fern, Margaret Burnett

Figure 1 for Explaining Reinforcement Learning to Mere Mortals: An Empirical Study

Figure 2 for Explaining Reinforcement Learning to Mere Mortals: An Empirical Study

Figure 3 for Explaining Reinforcement Learning to Mere Mortals: An Empirical Study

Figure 4 for Explaining Reinforcement Learning to Mere Mortals: An Empirical Study

Abstract:We present a user study to investigate the impact of explanations on non-experts' understanding of reinforcement learning (RL) agents. We investigate both a common RL visualization, saliency maps (the focus of attention), and a more recent explanation type, reward-decomposition bars (predictions of future types of rewards). We designed a 124 participant, four-treatment experiment to compare participants' mental models of an RL agent in a simple Real-Time Strategy (RTS) game. Our results show that the combination of both saliency and reward bars were needed to achieve a statistically significant improvement in mental model score over the control. In addition, our qualitative analysis of the data reveals a number of effects for further study.

* 7 pages

Via

Access Paper or Ask Questions

Visualizing and Understanding Atari Agents

Sep 10, 2018

Sam Greydanus, Anurag Koul, Jonathan Dodge, Alan Fern

Figure 1 for Visualizing and Understanding Atari Agents

Figure 2 for Visualizing and Understanding Atari Agents

Figure 3 for Visualizing and Understanding Atari Agents

Figure 4 for Visualizing and Understanding Atari Agents

Abstract:While deep reinforcement learning (deep RL) agents are effective at maximizing rewards, it is often unclear what strategies they use to do so. In this paper, we take a step toward explaining deep RL agents through a case study using Atari 2600 environments. In particular, we focus on using saliency maps to understand how an agent learns and executes a policy. We introduce a method for generating useful saliency maps and use it to show 1) what strong agents attend to, 2) whether agents are making decisions for the right or wrong reasons, and 3) how agents evolve during learning. We also test our method on non-expert human subjects and find that it improves their ability to reason about these agents. Overall, our results show that saliency information can provide significant insight into an RL agent's decisions and learning behavior.

* ICML 2018 conference paper. Code: https://github.com/greydanus/visualize_atari Blog: https://greydanus.github.io/2017/11/01/visualize-atari/

Via

Access Paper or Ask Questions