Picture for Liqiang Nie

Liqiang Nie

SILMM: Self-Improving Large Multimodal Models for Compositional Text-to-Image Generation

Add code
Dec 08, 2024
Viaarxiv icon

The VLLM Safety Paradox: Dual Ease in Jailbreak Attack and Defense

Add code
Nov 13, 2024
Viaarxiv icon

SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation

Add code
Oct 19, 2024
Figure 1 for SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation
Figure 2 for SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation
Figure 3 for SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation
Figure 4 for SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation
Viaarxiv icon

RA-BLIP: Multimodal Adaptive Retrieval-Augmented Bootstrapping Language-Image Pre-training

Add code
Oct 18, 2024
Figure 1 for RA-BLIP: Multimodal Adaptive Retrieval-Augmented Bootstrapping Language-Image Pre-training
Figure 2 for RA-BLIP: Multimodal Adaptive Retrieval-Augmented Bootstrapping Language-Image Pre-training
Figure 3 for RA-BLIP: Multimodal Adaptive Retrieval-Augmented Bootstrapping Language-Image Pre-training
Figure 4 for RA-BLIP: Multimodal Adaptive Retrieval-Augmented Bootstrapping Language-Image Pre-training
Viaarxiv icon

Preview-based Category Contrastive Learning for Knowledge Distillation

Add code
Oct 18, 2024
Viaarxiv icon

Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing

Add code
Oct 14, 2024
Figure 1 for Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing
Figure 2 for Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing
Figure 3 for Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing
Figure 4 for Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing
Viaarxiv icon

BadCM: Invisible Backdoor Attack Against Cross-Modal Learning

Add code
Oct 03, 2024
Viaarxiv icon

Video DataFlywheel: Resolving the Impossible Data Trinity in Video-Language Understanding

Add code
Sep 29, 2024
Viaarxiv icon

Unveil Benign Overfitting for Transformer in Vision: Training Dynamics, Convergence, and Generalization

Add code
Sep 28, 2024
Viaarxiv icon

DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture

Add code
Sep 05, 2024
Figure 1 for DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture
Figure 2 for DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture
Figure 3 for DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture
Figure 4 for DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture
Viaarxiv icon