Yuanpu Cao

AdvI2I: Adversarial Image Attack on Image-to-Image Diffusion models

Oct 28, 2024

Adversarially Robust Industrial Anomaly Detection Through Diffusion Model

Aug 09, 2024

Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization

May 28, 2024

WordGame: Efficient & Effective LLM Jailbreak via Simultaneous Obfuscation in Query and Response

May 22, 2024

Federated Learning with Projected Trajectory Regularization

Dec 22, 2023

Stealthy and Persistent Unalignment on Large Language Models via Backdoor Injections

Nov 15, 2023

Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM

Sep 18, 2023

RLCard: A Toolkit for Reinforcement Learning in Card Games

Oct 10, 2019