Picture for Siyuan Cheng

Siyuan Cheng

SOFT: Selective Data Obfuscation for Protecting LLM Fine-tuning against Membership Inference Attacks

Add code
Jun 12, 2025
Viaarxiv icon

ChineseHarm-Bench: A Chinese Harmful Content Detection Benchmark

Add code
Jun 12, 2025
Viaarxiv icon

CENSOR: Defense Against Gradient Inversion via Orthogonal Subspace Bayesian Sampling

Add code
Jan 27, 2025
Viaarxiv icon

DIGIMON: Diagnosis and Mitigation of Sampling Skew for Reinforcement Learning based Meta-Planner in Robot Navigation

Add code
Sep 17, 2024
Viaarxiv icon

ROCAS: Root Cause Analysis of Autonomous Driving Accidents via Cyber-Physical Co-mutation

Add code
Sep 12, 2024
Figure 1 for ROCAS: Root Cause Analysis of Autonomous Driving Accidents via Cyber-Physical Co-mutation
Figure 2 for ROCAS: Root Cause Analysis of Autonomous Driving Accidents via Cyber-Physical Co-mutation
Figure 3 for ROCAS: Root Cause Analysis of Autonomous Driving Accidents via Cyber-Physical Co-mutation
Figure 4 for ROCAS: Root Cause Analysis of Autonomous Driving Accidents via Cyber-Physical Co-mutation
Viaarxiv icon

UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening

Add code
Jul 16, 2024
Figure 1 for UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening
Figure 2 for UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening
Figure 3 for UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening
Figure 4 for UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening
Viaarxiv icon

To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models

Add code
Jul 02, 2024
Viaarxiv icon

LOTUS: Evasive and Resilient Backdoor Attacks through Sub-Partitioning

Add code
Mar 25, 2024
Figure 1 for LOTUS: Evasive and Resilient Backdoor Attacks through Sub-Partitioning
Figure 2 for LOTUS: Evasive and Resilient Backdoor Attacks through Sub-Partitioning
Figure 3 for LOTUS: Evasive and Resilient Backdoor Attacks through Sub-Partitioning
Figure 4 for LOTUS: Evasive and Resilient Backdoor Attacks through Sub-Partitioning
Viaarxiv icon

InstructEdit: Instruction-based Knowledge Editing for Large Language Models

Add code
Feb 25, 2024
Figure 1 for InstructEdit: Instruction-based Knowledge Editing for Large Language Models
Figure 2 for InstructEdit: Instruction-based Knowledge Editing for Large Language Models
Figure 3 for InstructEdit: Instruction-based Knowledge Editing for Large Language Models
Figure 4 for InstructEdit: Instruction-based Knowledge Editing for Large Language Models
Viaarxiv icon

MIKE: A New Benchmark for Fine-grained Multimodal Entity Knowledge Editing

Add code
Feb 18, 2024
Viaarxiv icon