Picture for Zhongyuan Wang

Zhongyuan Wang

Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning

Add code
Mar 27, 2025
Viaarxiv icon

Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models

Add code
Mar 27, 2025
Viaarxiv icon

General Table Question Answering via Answer-Formula Joint Generation

Add code
Mar 16, 2025
Viaarxiv icon

ProJudge: A Multi-Modal Multi-Discipline Benchmark and Instruction-Tuning Dataset for MLLM-based Process Judges

Add code
Mar 09, 2025
Viaarxiv icon

An Empirical Study on Eliciting and Improving R1-like Reasoning Models

Add code
Mar 06, 2025
Viaarxiv icon

RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete

Add code
Feb 28, 2025
Viaarxiv icon

MapNav: A Novel Memory Representation via Annotated Semantic Maps for VLM-based Vision-and-Language Navigation

Add code
Feb 19, 2025
Viaarxiv icon

Virgo: A Preliminary Exploration on Reproducing o1-like MLLM

Add code
Jan 03, 2025
Viaarxiv icon

FriendsQA: A New Large-Scale Deep Video Understanding Dataset with Fine-grained Topic Categorization for Story Videos

Add code
Dec 22, 2024
Viaarxiv icon

Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems

Add code
Dec 12, 2024
Figure 1 for Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems
Figure 2 for Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems
Figure 3 for Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems
Figure 4 for Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems
Viaarxiv icon