Tao Chen

IEEE Fellow

FCMBench-Video: Benchmarking Document Video Intelligence

Apr 28, 2026
Via arXiv

Can Multimodal Large Language Models Truly Understand Small Objects?

Apr 24, 2026
Via arXiv

Rethinking Cross-Domain Evaluation for Face Forgery Detection with Semantic Fine-grained Alignment and Mixture-of-Experts

Apr 23, 2026
Via arXiv

MLG-Stereo: ViT Based Stereo Matching with Multi-Stage Local-Global Enhancement

Apr 22, 2026
Via arXiv

Human-Machine Co-Boosted Bug Report Identification with Mutualistic Neural Active Learning

Apr 20, 2026
Via arXiv

Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges

Apr 15, 2026
Via arXiv

Scaling the Long Video Understanding of Multimodal Large Language Models via Visual Memory Mechanism

Mar 31, 2026
Via arXiv

ForestPrune: High-ratio Visual Token Compression for Video Multimodal Large Language Models via Spatial-Temporal Forest Modeling

Mar 24, 2026
Via arXiv

Revealing Domain-Spatiality Patterns for Configuration Tuning: Domain Knowledge Meets Fitness Landscapes

Mar 23, 2026
Via arXiv

PEARL: Geometry Aligns Semantics for Training-Free Open-Vocabulary Semantic Segmentation

Mar 23, 2026
Via arXiv