Zaiquan Yang

Unified Energy for Invariant and Independent Decoding in Diffusion Language Models

Jun 08, 2026
Via arXiv

Visual Enhanced Depth Scaling for Multimodal Latent Reasoning

Apr 12, 2026
Via arXiv

Unleashing the Potential of Multimodal LLMs for Zero-Shot Spatio-Temporal Video Grounding

Sep 18, 2025
Via arXiv

Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension

Oct 02, 2024
Via arXiv

Learning Prototype via Placeholder for Zero-shot Recognition

Jul 29, 2022
Via arXiv

Prototypical Contrastive Language Image Pretraining

Jun 22, 2022
Via arXiv