Zi Huang

Text Speaks Louder than Vision: ASCII Art Reveals Textual Biases in Vision-Language Models

Apr 02, 2025

MIRAGE: Multimodal Immersive Reasoning and Guided Exploration for Red-Team Jailbreak Attacks

Mar 24, 2025

SCORE: Soft Label Compression-Centric Dataset Condensation via Coding Rate Optimization

Mar 18, 2025

Making Every Step Effective: Jailbreaking Large Vision-Language Models Through Hierarchical KV Equalization

Mar 14, 2025

SVIP: Semantically Contextualized Visual Patches for Zero-Shot Learning

Mar 13, 2025

StableFusion: Continual Video Retrieval via Frame Adaptation

Mar 13, 2025

TT-GaussOcc: Test-Time Compute for Self-Supervised Occupancy Prediction via Spatio-Temporal Gaussian Splatting

Mar 11, 2025

MAA: Meticulous Adversarial Attack against Vision-Language Pre-trained Models

Feb 12, 2025

GOLD: Graph Out-of-Distribution Detection via Implicit Adversarial Latent Generation

Feb 09, 2025

Lost in Edits? A $λ$-Compass for AIGC Provenance

Feb 05, 2025