Weili Guan

Do All Individual Layers Help? An Empirical Study of Task-Interfering Layers in Vision-Language Models

Feb 01, 2026
Via arXiv

ConLA: Contrastive Latent Action Learning from Human Videos for Robotic Manipulation

Jan 31, 2026
Via arXiv

CVeDRL: An Efficient Code Verifier via Difficulty-aware Reinforcement Learning

Jan 30, 2026
Via arXiv

StructAlign: Structured Cross-Modal Alignment for Continual Text-to-Video Retrieval

Jan 28, 2026
Via arXiv

IOTA: Corrective Knowledge-Guided Prompt Learning via Black-White Box Framework

Jan 28, 2026
Via arXiv

PersonalAlign: Hierarchical Implicit Intent Alignment for Personalized GUI Agent with Long-Term User-Centric Records

Jan 14, 2026
Via arXiv

From Bias to Balance: Exploring and Mitigating Spatial Bias in LVLMs

Sep 26, 2025
Via arXiv

Dual Knowledge-Enhanced Two-Stage Reasoner for Multimodal Dialog Systems

Sep 09, 2025
Via arXiv

Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts

Jun 12, 2025
Via arXiv

SplitLoRA: Balancing Stability and Plasticity in Continual Learning Through Gradient Space Splitting

May 29, 2025
Via arXiv