Picture for Xiang Yue

Xiang Yue

Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators

Add code
Mar 25, 2025
Viaarxiv icon

Overtrained Language Models Are Harder to Fine-Tune

Add code
Mar 24, 2025
Viaarxiv icon

VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search

Add code
Mar 13, 2025
Viaarxiv icon

ESPnet-SpeechLM: An Open Speech Language Model Toolkit

Add code
Feb 21, 2025
Viaarxiv icon

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Add code
Feb 20, 2025
Viaarxiv icon

Small Models Struggle to Learn from Strong Reasoners

Add code
Feb 17, 2025
Viaarxiv icon

Demystifying Long Chain-of-Thought Reasoning in LLMs

Add code
Feb 05, 2025
Viaarxiv icon

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Add code
Jan 30, 2025
Figure 1 for Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate
Figure 2 for Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate
Figure 3 for Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate
Figure 4 for Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate
Viaarxiv icon

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Add code
Jan 23, 2025
Figure 1 for Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos
Figure 2 for Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos
Figure 3 for Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos
Figure 4 for Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos
Viaarxiv icon

Aligning Instruction Tuning with Pre-training

Add code
Jan 16, 2025
Figure 1 for Aligning Instruction Tuning with Pre-training
Figure 2 for Aligning Instruction Tuning with Pre-training
Figure 3 for Aligning Instruction Tuning with Pre-training
Figure 4 for Aligning Instruction Tuning with Pre-training
Viaarxiv icon