Picture for Xiu Su

Xiu Su

Do All Individual Layers Help? An Empirical Study of Task-Interfering Layers in Vision-Language Models

Add code
Feb 01, 2026
Viaarxiv icon

APEX: A Decoupled Memory-based Explorer for Asynchronous Aerial Object Goal Navigation

Add code
Jan 31, 2026
Viaarxiv icon

Inject Once Survive Later: Backdooring Vision-Language-Action Models to Persist Through Downstream Fine-tuning

Add code
Jan 31, 2026
Viaarxiv icon

ConLA: Contrastive Latent Action Learning from Human Videos for Robotic Manipulation

Add code
Jan 31, 2026
Viaarxiv icon

Learning to Accelerate Vision-Language-Action Models through Adaptive Visual Token Caching

Add code
Jan 31, 2026
Viaarxiv icon

LEGATO: Good Identity Unlearning Is Continuous

Add code
Jan 07, 2026
Viaarxiv icon

Calibrated Multimodal Representation Learning with Missing Modalities

Add code
Nov 15, 2025
Figure 1 for Calibrated Multimodal Representation Learning with Missing Modalities
Figure 2 for Calibrated Multimodal Representation Learning with Missing Modalities
Figure 3 for Calibrated Multimodal Representation Learning with Missing Modalities
Figure 4 for Calibrated Multimodal Representation Learning with Missing Modalities
Viaarxiv icon

DeCoP: Enhancing Self-Supervised Time Series Representation with Dependency Controlled Pre-training

Add code
Sep 18, 2025
Viaarxiv icon

ConCM: Consistency-Driven Calibration and Matching for Few-Shot Class-Incremental Learning

Add code
Jun 24, 2025
Viaarxiv icon

L-MTP: Leap Multi-Token Prediction Beyond Adjacent Context for Large Language Models

Add code
May 23, 2025
Viaarxiv icon