Xiangtai Li

Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology

Jul 10, 2025

Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning

Jul 02, 2025

Dense360: Dense Understanding from Omnidirectional Panoramas

Jun 17, 2025

UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions

Jun 16, 2025

Omni-AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented for Efficient Long Video Understanding

Jun 16, 2025

CyberV: Cybernetics for Test-time Scaling in Video Understanding

Jun 09, 2025

Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language Models

May 30, 2025

DiffDecompose: Layer-Wise Decomposition of Alpha-Composited Images via Diffusion Transformers

May 30, 2025

PixelThink: Towards Efficient Chain-of-Pixel Reasoning

May 29, 2025

Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model

May 29, 2025