Title: A Practical Examination of AI-Generated Text Detectors for Large Language Models

Dec 06, 2024

Brian Tufts, Xuandong Zhao, Lei Li

Abstract: The proliferation of large language models has raised growing concerns about their misuse, particularly in cases where AI-generated text is falsely attributed to human authors. Machine-generated content detectors claim to effectively identify such text under various conditions and from any language model. This paper critically evaluates these claims by assessing several popular detectors (RADAR, Wild, T5Sentinel, Fast-DetectGPT, GPTID, LogRank, Binoculars) on a range of domains, datasets, and models that these detectors have not previously encountered. We employ various prompting strategies to simulate adversarial attacks, demonstrating that even moderate efforts can significantly evade detection. We emphasize the importance of the true positive rate at a specific false positive rate (TPR@FPR) metric and demonstrate that these detectors perform poorly in certain settings, with TPR@0.01 as low as 0%. Our findings suggest that both trained and zero-shot detectors struggle to maintain high sensitivity while achieving a reasonable false positive rate.

* 8 pages. Submitted to ARR October cycle
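
The TPR@FPR metric named in the abstract can be illustrated with a minimal sketch. It assumes each detector returns a score where higher means "more likely AI-generated"; the function name `tpr_at_fpr` and the toy scores are hypothetical and do not come from the paper. The threshold is set from human-written texts so that the false positive rate stays at or below the target, and the true positive rate is then read off the AI-generated texts.

```python
def tpr_at_fpr(human_scores, ai_scores, target_fpr=0.01):
    """Fraction of AI texts flagged when the threshold is set so that at most
    `target_fpr` of human texts are flagged (a text is flagged if score > threshold).

    Assumption: higher score means "more likely AI-generated".
    """
    if not human_scores or not ai_scores:
        raise ValueError("both score lists must be non-empty")

    # Rank human scores from most to least AI-like.
    ranked = sorted(human_scores, reverse=True)

    # Number of human texts we may misclassify without exceeding the target FPR.
    allowed = int(target_fpr * len(ranked))

    # Only scores strictly above the threshold are flagged, so at most
    # `allowed` human texts end up as false positives.
    threshold = ranked[allowed] if allowed < len(ranked) else float("-inf")

    detected = sum(1 for score in ai_scores if score > threshold)
    return detected / len(ai_scores)


if __name__ == "__main__":
    # Made-up scores, for illustration only.
    human = [0.02, 0.10, 0.15, 0.30, 0.45, 0.60]
    ai = [0.20, 0.55, 0.70, 0.95]
    print(tpr_at_fpr(human, ai, target_fpr=0.01))
```

With a small human sample, a 1% target allows zero false positives, so the threshold sits at the highest human score. This is why a detector with a good average accuracy can still report a TPR@0.01 near zero: a single high-scoring human text pushes the threshold above most AI-generated texts.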
