Abuse Detection


They Said Memes Were Harmless-We Found the Ones That Hurt: Decoding Jokes, Symbols, and Cultural References

Add code
Feb 03, 2026
Viaarxiv icon

Benchmarking Large Language Models for Zero-shot and Few-shot Phishing URL Detection

Add code
Feb 02, 2026
Viaarxiv icon

MAGA-Bench: Machine-Augment-Generated Text via Alignment Detection Benchmark

Add code
Jan 08, 2026
Viaarxiv icon

NeXT-IMDL: Build Benchmark for NeXT-Generation Image Manipulation Detection & Localization

Add code
Dec 29, 2025
Viaarxiv icon

DualGuard: Dual-stream Large Language Model Watermarking Defense against Paraphrase and Spoofing Attack

Add code
Dec 18, 2025
Viaarxiv icon

BashArena: A Control Setting for Highly Privileged AI Agents

Add code
Dec 17, 2025
Figure 1 for BashArena: A Control Setting for Highly Privileged AI Agents
Figure 2 for BashArena: A Control Setting for Highly Privileged AI Agents
Figure 3 for BashArena: A Control Setting for Highly Privileged AI Agents
Figure 4 for BashArena: A Control Setting for Highly Privileged AI Agents
Viaarxiv icon

Soft Inductive Bias Approach via Explicit Reasoning Perspectives in Inappropriate Utterance Detection Using Large Language Models

Add code
Dec 09, 2025
Viaarxiv icon

OpenGuardrails: An Open-Source Context-Aware AI Guardrails Platform

Add code
Oct 22, 2025
Figure 1 for OpenGuardrails: An Open-Source Context-Aware AI Guardrails Platform
Figure 2 for OpenGuardrails: An Open-Source Context-Aware AI Guardrails Platform
Figure 3 for OpenGuardrails: An Open-Source Context-Aware AI Guardrails Platform
Figure 4 for OpenGuardrails: An Open-Source Context-Aware AI Guardrails Platform
Viaarxiv icon

A Graph Machine Learning Approach for Detecting Topological Patterns in Transactional Graphs

Add code
Sep 16, 2025
Viaarxiv icon

A Multi-Task Benchmark for Abusive Language Detection in Low-Resource Settings

Add code
May 17, 2025
Viaarxiv icon