Jing Cui

Recent Advances in Attack and Defense Approaches of Large Language Models

Sep 05, 2024
Via arXiv

BadRL: Sparse Targeted Backdoor Attack Against Reinforcement Learning

Dec 19, 2023
Via arXiv