Picture for Liqiang Nie

Liqiang Nie

The VLLM Safety Paradox: Dual Ease in Jailbreak Attack and Defense

Add code
Nov 13, 2024
Viaarxiv icon

SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation

Add code
Oct 19, 2024
Viaarxiv icon

RA-BLIP: Multimodal Adaptive Retrieval-Augmented Bootstrapping Language-Image Pre-training

Add code
Oct 18, 2024
Figure 1 for RA-BLIP: Multimodal Adaptive Retrieval-Augmented Bootstrapping Language-Image Pre-training
Figure 2 for RA-BLIP: Multimodal Adaptive Retrieval-Augmented Bootstrapping Language-Image Pre-training
Figure 3 for RA-BLIP: Multimodal Adaptive Retrieval-Augmented Bootstrapping Language-Image Pre-training
Figure 4 for RA-BLIP: Multimodal Adaptive Retrieval-Augmented Bootstrapping Language-Image Pre-training
Viaarxiv icon

Preview-based Category Contrastive Learning for Knowledge Distillation

Add code
Oct 18, 2024
Viaarxiv icon

Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing

Add code
Oct 14, 2024
Figure 1 for Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing
Figure 2 for Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing
Figure 3 for Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing
Figure 4 for Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing
Viaarxiv icon

BadCM: Invisible Backdoor Attack Against Cross-Modal Learning

Add code
Oct 03, 2024
Viaarxiv icon

Video DataFlywheel: Resolving the Impossible Data Trinity in Video-Language Understanding

Add code
Sep 29, 2024
Viaarxiv icon

Unveil Benign Overfitting for Transformer in Vision: Training Dynamics, Convergence, and Generalization

Add code
Sep 28, 2024
Viaarxiv icon

DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture

Add code
Sep 05, 2024
Viaarxiv icon

Laser: Parameter-Efficient LLM Bi-Tuning for Sequential Recommendation with Collaborative Information

Add code
Sep 03, 2024
Viaarxiv icon