Jinghuai Zhang

RogueMerge: Robust and Unified Attacks against LLM Model Merging

Jun 02, 2026

Token Inflation: How Dishonest Providers Can Overcharge for Large Language Model Usage

May 28, 2026

What-If World: A Causal Benchmark for General World Models in Embodied Scenarios

May 26, 2026

HIDBench: Benchmarking Large Language Models for Host-Based Intrusion Detection

May 20, 2026

ACIArena: Toward Unified Evaluation for Agent Cascading Injection

Apr 09, 2026

The Landscape of Prompt Injection Threats in LLM Agents: From Taxonomy to Analysis

Feb 11, 2026

When Agents "Misremember" Collectively: Exploring the Mandela Effect in LLM-based Multi-Agent Systems

Jan 31, 2026

FraudShield: Knowledge Graph Empowered Defense for LLMs against Fraud Attacks

Jan 30, 2026

Bridging the Copyright Gap: Do Large Vision-Language Models Recognize and Respect Copyrighted Content?

Dec 26, 2025

CollabEdit: Towards Non-destructive Collaborative Knowledge Editing

Oct 12, 2024