Picture for Xiang Fei

Xiang Fei

Dolphin-v2: Universal Document Parsing via Scalable Anchor Prompting

Add code
Feb 05, 2026
Viaarxiv icon

Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization

Add code
Dec 31, 2025
Viaarxiv icon

Bag-of-Word-Groups (BoWG): A Robust and Efficient Loop Closure Detection Method Under Perceptual Aliasing

Add code
Oct 26, 2025
Viaarxiv icon

Post-Completion Learning for Language Models

Add code
Jul 27, 2025
Viaarxiv icon

Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting

Add code
May 20, 2025
Viaarxiv icon

Advancing Sequential Numerical Prediction in Autoregressive Models

Add code
May 19, 2025
Viaarxiv icon

WildDoc: How Far Are We from Achieving Comprehensive and Robust Document Understanding in the Wild?

Add code
May 16, 2025
Viaarxiv icon

MCTBench: Multimodal Cognition towards Text-Rich Visual Scenes Benchmark

Add code
Oct 15, 2024
Viaarxiv icon

ParGo: Bridging Vision-Language with Partial and Global Views

Add code
Aug 23, 2024
Viaarxiv icon