Picture for Yun Shen

Yun Shen

When Understanding Becomes a Risk: Authenticity and Safety Risks in the Emerging Image Generation Paradigm

Add code
Mar 25, 2026
Viaarxiv icon

Understanding LLM Behavior When Encountering User-Supplied Harmful Content in Harmless Tasks

Add code
Mar 12, 2026
Viaarxiv icon

Benchmark of Benchmarks: Unpacking Influence and Code Repository Quality in LLM Safety Benchmarks

Add code
Mar 03, 2026
Viaarxiv icon

DSDR: Dual-Scale Diversity Regularization for Exploration in LLM Reasoning

Add code
Feb 23, 2026
Viaarxiv icon

JADES: A Universal Framework for Jailbreak Assessment via Decompositional Scoring

Add code
Aug 28, 2025
Viaarxiv icon

The Ripple Effect: On Unforeseen Complications of Backdoor Attacks

Add code
May 16, 2025
Viaarxiv icon

The Challenge of Identifying the Origin of Black-Box Large Language Models

Add code
Mar 06, 2025
Figure 1 for The Challenge of Identifying the Origin of Black-Box Large Language Models
Figure 2 for The Challenge of Identifying the Origin of Black-Box Large Language Models
Figure 3 for The Challenge of Identifying the Origin of Black-Box Large Language Models
Figure 4 for The Challenge of Identifying the Origin of Black-Box Large Language Models
Viaarxiv icon

Synthetic Artifact Auditing: Tracing LLM-Generated Synthetic Data Usage in Downstream Applications

Add code
Feb 02, 2025
Viaarxiv icon

Image-Perfect Imperfections: Safety, Bias, and Authenticity in the Shadow of Text-To-Image Model Evolution

Add code
Aug 30, 2024
Figure 1 for Image-Perfect Imperfections: Safety, Bias, and Authenticity in the Shadow of Text-To-Image Model Evolution
Figure 2 for Image-Perfect Imperfections: Safety, Bias, and Authenticity in the Shadow of Text-To-Image Model Evolution
Figure 3 for Image-Perfect Imperfections: Safety, Bias, and Authenticity in the Shadow of Text-To-Image Model Evolution
Figure 4 for Image-Perfect Imperfections: Safety, Bias, and Authenticity in the Shadow of Text-To-Image Model Evolution
Viaarxiv icon

Breaking Agents: Compromising Autonomous LLM Agents Through Malfunction Amplification

Add code
Jul 30, 2024
Viaarxiv icon