Picture for Zhibo Wang

Zhibo Wang

AFLoRA: Adaptive Federated Fine-Tuning of Large Language Models with Resource-Aware Low-Rank Adaption

Add code
May 30, 2025
Viaarxiv icon

Personalized Query Auto-Completion for Long and Short-Term Interests with Adaptive Detoxification Generation

Add code
May 27, 2025
Viaarxiv icon

PRUNE: A Patching Based Repair Framework for Certiffable Unlearning of Neural Networks

Add code
May 10, 2025
Viaarxiv icon

Towards LLM Guardrails via Sparse Representation Steering

Add code
Mar 21, 2025
Viaarxiv icon

Can Small Language Models Reliably Resist Jailbreak Attacks? A Comprehensive Evaluation

Add code
Mar 09, 2025
Viaarxiv icon

DiffPatch: Generating Customizable Adversarial Patches using Diffusion Model

Add code
Dec 02, 2024
Viaarxiv icon

Hiding Faces in Plain Sight: Defending DeepFakes by Disrupting Face Detection

Add code
Dec 02, 2024
Viaarxiv icon

PointNCBW: Towards Dataset Ownership Verification for Point Clouds via Negative Clean-label Backdoor Watermark

Add code
Aug 10, 2024
Figure 1 for PointNCBW: Towards Dataset Ownership Verification for Point Clouds via Negative Clean-label Backdoor Watermark
Figure 2 for PointNCBW: Towards Dataset Ownership Verification for Point Clouds via Negative Clean-label Backdoor Watermark
Figure 3 for PointNCBW: Towards Dataset Ownership Verification for Point Clouds via Negative Clean-label Backdoor Watermark
Figure 4 for PointNCBW: Towards Dataset Ownership Verification for Point Clouds via Negative Clean-label Backdoor Watermark
Viaarxiv icon

RedAgent: Red Teaming Large Language Models with Context-aware Autonomous Language Agent

Add code
Jul 23, 2024
Viaarxiv icon

Inferring turbulent velocity and temperature fields and their statistics from Lagrangian velocity measurements using physics-informed Kolmogorov-Arnold Networks

Add code
Jul 23, 2024
Viaarxiv icon