Xiaobin Hu

UniICL: Systematizing Unified Multimodal In-context Learning through a Capability-Oriented Taxonomy

Mar 25, 2026
Via arXiv

CLEAR: Context-Aware Learning with End-to-End Mask-Free Inference for Adaptive Video Subtitle Removal

Mar 23, 2026
Via arXiv

TheraAgent: Multi-Agent Framework with Self-Evolving Memory and Evidence-Calibrated Reasoning for PET Theranostics

Mar 14, 2026
Via arXiv

MedMASLab: A Unified Orchestration Framework for Benchmarking Multimodal Medical Multi-Agent Systems

Mar 10, 2026
Via arXiv

The Trinity of Consistency as a Defining Principle for General World Models

Feb 26, 2026
Via arXiv

Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models

Feb 02, 2026
Via arXiv

Dual Latent Memory for Visual Multi-agent System

Jan 31, 2026
Via arXiv

Large-Scale Multidimensional Knowledge Profiling of Scientific Literature

Jan 21, 2026
Via arXiv

M3CoTBench: Benchmark Chain-of-Thought of MLLMs in Medical Image Understanding

Jan 13, 2026
Via arXiv

Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning

Jan 11, 2026
Via arXiv