Picture for Tao Chen

Tao Chen

IEEE Fellow

Scaling the Long Video Understanding of Multimodal Large Language Models via Visual Memory Mechanism

Add code
Mar 31, 2026
Viaarxiv icon

ForestPrune: High-ratio Visual Token Compression for Video Multimodal Large Language Models via Spatial-Temporal Forest Modeling

Add code
Mar 24, 2026
Viaarxiv icon

Revealing Domain-Spatiality Patterns for Configuration Tuning: Domain Knowledge Meets Fitness Landscapes

Add code
Mar 23, 2026
Viaarxiv icon

PEARL: Geometry Aligns Semantics for Training-Free Open-Vocabulary Semantic Segmentation

Add code
Mar 23, 2026
Viaarxiv icon

Beyond Quadratic: Linear-Time Change Detection with RWKV

Add code
Mar 20, 2026
Viaarxiv icon

Efficiency Follows Global-Local Decoupling

Add code
Mar 20, 2026
Viaarxiv icon

CurveStream: Boosting Streaming Video Understanding in MLLMs via Curvature-Aware Hierarchical Visual Memory Management

Add code
Mar 20, 2026
Viaarxiv icon

PCA-Seg: Revisiting Cost Aggregation for Open-Vocabulary Semantic and Part Segmentation

Add code
Mar 18, 2026
Viaarxiv icon

Joint beamforming and mode optimization for multi-functional STAR-RIS-aided integrated sensing and communication networks

Add code
Feb 18, 2026
Viaarxiv icon

BrainRVQ: A High-Fidelity EEG Foundation Model via Dual-Domain Residual Quantization and Hierarchical Autoregression

Add code
Feb 18, 2026
Viaarxiv icon