Picture for Kevin Zhu

Kevin Zhu

George Mason University

Broken Chains: The Cost of Incomplete Reasoning in LLMs

Add code
Feb 16, 2026
Viaarxiv icon

Weight space Detection of Backdoors in LoRA Adapters

Add code
Feb 16, 2026
Viaarxiv icon

ProMoral-Bench: Evaluating Prompting Strategies for Moral Reasoning and Safety in LLMs

Add code
Feb 05, 2026
Viaarxiv icon

A Few Bad Neurons: Isolating and Surgically Correcting Sycophancy

Add code
Jan 26, 2026
Viaarxiv icon

AMVICC: A Novel Benchmark for Cross-Modal Failure Mode Profiling for VLMs and IGMs

Add code
Jan 20, 2026
Viaarxiv icon

Zero-Shot Embedding Drift Detection: A Lightweight Defense Against Prompt Injections in LLMs

Add code
Jan 18, 2026
Viaarxiv icon

Interpretable Perturbation Modeling Through Biomedical Knowledge Graphs

Add code
Dec 31, 2025
Viaarxiv icon

When Less is More: 8-bit Quantization Improves Continual Learning in Large Language Models

Add code
Dec 22, 2025
Viaarxiv icon

Emergent Persuasion: Will LLMs Persuade Without Being Prompted?

Add code
Dec 20, 2025
Viaarxiv icon

Emergent World Beliefs: Exploring Transformers in Stochastic Games

Add code
Dec 18, 2025
Viaarxiv icon