Picture for Xuying Li

Xuying Li

Layer-Wise Perturbations via Sparse Autoencoders for Adversarial Text Generation

Add code
Aug 14, 2025
Viaarxiv icon

Targeting the Core: A Simple and Effective Method to Attack RAG-based Agents via Direct LLM Manipulation

Add code
Dec 05, 2024
Figure 1 for Targeting the Core: A Simple and Effective Method to Attack RAG-based Agents via Direct LLM Manipulation
Figure 2 for Targeting the Core: A Simple and Effective Method to Attack RAG-based Agents via Direct LLM Manipulation
Viaarxiv icon

Precision Knowledge Editing: Enhancing Safety in Large Language Models

Add code
Oct 02, 2024
Figure 1 for Precision Knowledge Editing: Enhancing Safety in Large Language Models
Figure 2 for Precision Knowledge Editing: Enhancing Safety in Large Language Models
Viaarxiv icon