Picture for Yudong Wang

Yudong Wang

JudgeRLVR: Judge First, Generate Second for Efficient Reasoning

Add code
Jan 13, 2026
Viaarxiv icon

MiMo-V2-Flash Technical Report

Add code
Jan 08, 2026
Viaarxiv icon

UltraEval-Audio: A Unified Framework for Comprehensive Evaluation of Audio Foundation Models

Add code
Jan 04, 2026
Viaarxiv icon

MiMo-Audio: Audio Language Models are Few-Shot Learners

Add code
Dec 29, 2025
Viaarxiv icon

GroundingME: Exposing the Visual Grounding Gap in MLLMs through Multi-Dimensional Evaluation

Add code
Dec 19, 2025
Viaarxiv icon

MiniCPM4: Ultra-Efficient LLMs on End Devices

Add code
Jun 09, 2025
Figure 1 for MiniCPM4: Ultra-Efficient LLMs on End Devices
Figure 2 for MiniCPM4: Ultra-Efficient LLMs on End Devices
Figure 3 for MiniCPM4: Ultra-Efficient LLMs on End Devices
Figure 4 for MiniCPM4: Ultra-Efficient LLMs on End Devices
Viaarxiv icon

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Add code
May 12, 2025
Viaarxiv icon

Ultra-FineWeb: Efficient Data Filtering and Verification for High-Quality LLM Training Data

Add code
May 08, 2025
Figure 1 for Ultra-FineWeb: Efficient Data Filtering and Verification for High-Quality LLM Training Data
Figure 2 for Ultra-FineWeb: Efficient Data Filtering and Verification for High-Quality LLM Training Data
Figure 3 for Ultra-FineWeb: Efficient Data Filtering and Verification for High-Quality LLM Training Data
Figure 4 for Ultra-FineWeb: Efficient Data Filtering and Verification for High-Quality LLM Training Data
Viaarxiv icon

Confidence v.s. Critique: A Decomposition of Self-Correction Capability for LLMs

Add code
Dec 27, 2024
Figure 1 for Confidence v.s. Critique: A Decomposition of Self-Correction Capability for LLMs
Figure 2 for Confidence v.s. Critique: A Decomposition of Self-Correction Capability for LLMs
Figure 3 for Confidence v.s. Critique: A Decomposition of Self-Correction Capability for LLMs
Figure 4 for Confidence v.s. Critique: A Decomposition of Self-Correction Capability for LLMs
Viaarxiv icon

PyBench: Evaluating LLM Agent on various real-world coding tasks

Add code
Jul 23, 2024
Figure 1 for PyBench: Evaluating LLM Agent on various real-world coding tasks
Figure 2 for PyBench: Evaluating LLM Agent on various real-world coding tasks
Figure 3 for PyBench: Evaluating LLM Agent on various real-world coding tasks
Figure 4 for PyBench: Evaluating LLM Agent on various real-world coding tasks
Viaarxiv icon