Picture for Long Chen

Long Chen

University of Kaiserslautern-Landau, MODE Collaboration

Fast-Slow Efficient Training for Multimodal Large Language Models via Visual Token Pruning

Add code
Feb 03, 2026
Viaarxiv icon

Bi-Anchor Interpolation Solver for Accelerating Generative Modeling

Add code
Jan 29, 2026
Viaarxiv icon

VILTA: A VLM-in-the-Loop Adversary for Enhancing Driving Policy Robustness

Add code
Jan 19, 2026
Viaarxiv icon

Heterogeneous Uncertainty-Guided Composed Image Retrieval with Fine-Grained Probabilistic Learning

Add code
Jan 16, 2026
Viaarxiv icon

V2P: Visual Attention Calibration for GUI Grounding via Background Suppression and Center Peaking

Add code
Jan 11, 2026
Viaarxiv icon

SparseOccVLA: Bridging Occupancy and Vision-Language Models via Sparse Queries for Unified 4D Scene Understanding and Planning

Add code
Jan 10, 2026
Viaarxiv icon

The RoboSense Challenge: Sense Anything, Navigate Anywhere, Adapt Across Platforms

Add code
Jan 08, 2026
Viaarxiv icon

MotionAdapter: Video Motion Transfer via Content-Aware Attention Customization

Add code
Jan 05, 2026
Viaarxiv icon

From Failure to Mastery: Generating Hard Samples for Tool-use Agents

Add code
Jan 04, 2026
Viaarxiv icon

AnyMS: Bottom-up Attention Decoupling for Layout-guided and Training-free Multi-subject Customization

Add code
Dec 29, 2025
Viaarxiv icon