Picture for Zhibo Wang

Zhibo Wang

Towards LLM Guardrails via Sparse Representation Steering

Add code
Mar 21, 2025
Viaarxiv icon

Can Small Language Models Reliably Resist Jailbreak Attacks? A Comprehensive Evaluation

Add code
Mar 09, 2025
Viaarxiv icon

DiffPatch: Generating Customizable Adversarial Patches using Diffusion Model

Add code
Dec 02, 2024
Viaarxiv icon

Hiding Faces in Plain Sight: Defending DeepFakes by Disrupting Face Detection

Add code
Dec 02, 2024
Viaarxiv icon

PointNCBW: Towards Dataset Ownership Verification for Point Clouds via Negative Clean-label Backdoor Watermark

Add code
Aug 10, 2024
Figure 1 for PointNCBW: Towards Dataset Ownership Verification for Point Clouds via Negative Clean-label Backdoor Watermark
Figure 2 for PointNCBW: Towards Dataset Ownership Verification for Point Clouds via Negative Clean-label Backdoor Watermark
Figure 3 for PointNCBW: Towards Dataset Ownership Verification for Point Clouds via Negative Clean-label Backdoor Watermark
Figure 4 for PointNCBW: Towards Dataset Ownership Verification for Point Clouds via Negative Clean-label Backdoor Watermark
Viaarxiv icon

Inferring turbulent velocity and temperature fields and their statistics from Lagrangian velocity measurements using physics-informed Kolmogorov-Arnold Networks

Add code
Jul 23, 2024
Viaarxiv icon

RedAgent: Red Teaming Large Language Models with Context-aware Autonomous Language Agent

Add code
Jul 23, 2024
Viaarxiv icon

Breaking Secure Aggregation: Label Leakage from Aggregated Gradients in Federated Learning

Add code
Jun 22, 2024
Viaarxiv icon

Textual Unlearning Gives a False Sense of Unlearning

Add code
Jun 19, 2024
Viaarxiv icon

Towards Real World Debiasing: A Fine-grained Analysis On Spurious Correlation

Add code
May 30, 2024
Figure 1 for Towards Real World Debiasing: A Fine-grained Analysis On Spurious Correlation
Figure 2 for Towards Real World Debiasing: A Fine-grained Analysis On Spurious Correlation
Figure 3 for Towards Real World Debiasing: A Fine-grained Analysis On Spurious Correlation
Figure 4 for Towards Real World Debiasing: A Fine-grained Analysis On Spurious Correlation
Viaarxiv icon