Picture for Tianyu Du

Tianyu Du

When Agents "Misremember" Collectively: Exploring the Mandela Effect in LLM-based Multi-Agent Systems

Add code
Jan 31, 2026
Viaarxiv icon

FraudShield: Knowledge Graph Empowered Defense for LLMs against Fraud Attacks

Add code
Jan 30, 2026
Viaarxiv icon

Bridging the Copyright Gap: Do Large Vision-Language Models Recognize and Respect Copyrighted Content?

Add code
Dec 26, 2025
Viaarxiv icon

The Eminence in Shadow: Exploiting Feature Boundary Ambiguity for Robust Backdoor Attacks

Add code
Dec 17, 2025
Viaarxiv icon

NeuroBreak: Unveil Internal Jailbreak Mechanisms in Large Language Models

Add code
Sep 04, 2025
Viaarxiv icon

Scalable Multi-Stage Influence Function for Large Language Models via Eigenvalue-Corrected Kronecker-Factored Parameterization

Add code
May 08, 2025
Viaarxiv icon

RAID: An In-Training Defense against Attribute Inference Attacks in Recommender Systems

Add code
Apr 15, 2025
Viaarxiv icon

Bridging the Gap Between Preference Alignment and Machine Unlearning

Add code
Apr 09, 2025
Figure 1 for Bridging the Gap Between Preference Alignment and Machine Unlearning
Figure 2 for Bridging the Gap Between Preference Alignment and Machine Unlearning
Figure 3 for Bridging the Gap Between Preference Alignment and Machine Unlearning
Figure 4 for Bridging the Gap Between Preference Alignment and Machine Unlearning
Viaarxiv icon

CopyrightMeter: Revisiting Copyright Protection in Text-to-image Models

Add code
Nov 20, 2024
Figure 1 for CopyrightMeter: Revisiting Copyright Protection in Text-to-image Models
Figure 2 for CopyrightMeter: Revisiting Copyright Protection in Text-to-image Models
Figure 3 for CopyrightMeter: Revisiting Copyright Protection in Text-to-image Models
Figure 4 for CopyrightMeter: Revisiting Copyright Protection in Text-to-image Models
Viaarxiv icon

HijackRAG: Hijacking Attacks against Retrieval-Augmented Large Language Models

Add code
Oct 30, 2024
Figure 1 for HijackRAG: Hijacking Attacks against Retrieval-Augmented Large Language Models
Figure 2 for HijackRAG: Hijacking Attacks against Retrieval-Augmented Large Language Models
Figure 3 for HijackRAG: Hijacking Attacks against Retrieval-Augmented Large Language Models
Figure 4 for HijackRAG: Hijacking Attacks against Retrieval-Augmented Large Language Models
Viaarxiv icon