Siyuan Cheng

DIGIMON: Diagnosis and Mitigation of Sampling Skew for Reinforcement Learning based Meta-Planner in Robot Navigation

Sep 17, 2024
Via arXiv

ROCAS: Root Cause Analysis of Autonomous Driving Accidents via Cyber-Physical Co-mutation

Sep 12, 2024
Via arXiv

UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening

Jul 16, 2024
Via arXiv

To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models

Jul 02, 2024
Via arXiv

LOTUS: Evasive and Resilient Backdoor Attacks through Sub-Partitioning

Mar 25, 2024
Via arXiv

InstructEdit: Instruction-based Knowledge Editing for Large Language Models

Feb 25, 2024
Via arXiv

MIKE: A New Benchmark for Fine-grained Multimodal Entity Knowledge Editing

Feb 18, 2024
Via arXiv

Rapid Optimization for Jailbreaking LLMs via Subconscious Exploitation and Echopraxia

Feb 08, 2024
Via arXiv

A Comprehensive Study of Knowledge Editing for Large Language Models

Jan 09, 2024
Via arXiv

Make Them Spill the Beans! Coercive Knowledge Extraction from LLMs

Dec 08, 2023
Via arXiv