Picture for Mingyang Song

Mingyang Song

FastCuRL: Curriculum Reinforcement Learning with Progressive Context Extension for Efficient Training R1-like Reasoning Models

Add code
Mar 21, 2025
Viaarxiv icon

From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Calibration

Add code
Mar 17, 2025
Viaarxiv icon

GRP: Goal-Reversed Prompting for Zero-Shot Evaluation with LLMs

Add code
Mar 08, 2025
Viaarxiv icon

PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models

Add code
Jan 07, 2025
Figure 1 for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models
Figure 2 for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models
Figure 3 for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models
Figure 4 for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models
Viaarxiv icon

A Survey of Query Optimization in Large Language Models

Add code
Dec 23, 2024
Viaarxiv icon

MiMoTable: A Multi-scale Spreadsheet Benchmark with Meta Operations for Table Reasoning

Add code
Dec 16, 2024
Viaarxiv icon

P/D-Serve: Serving Disaggregated Large Language Model at Scale

Add code
Aug 15, 2024
Viaarxiv icon

Mitigating Multilingual Hallucination in Large Vision-Language Models

Add code
Aug 01, 2024
Figure 1 for Mitigating Multilingual Hallucination in Large Vision-Language Models
Figure 2 for Mitigating Multilingual Hallucination in Large Vision-Language Models
Figure 3 for Mitigating Multilingual Hallucination in Large Vision-Language Models
Figure 4 for Mitigating Multilingual Hallucination in Large Vision-Language Models
Viaarxiv icon

SS-Bench: A Benchmark for Social Story Generation and Evaluation

Add code
Jun 22, 2024
Viaarxiv icon

Can Many-Shot In-Context Learning Help Long-Context LLM Judges? See More, Judge Better!

Add code
Jun 17, 2024
Figure 1 for Can Many-Shot In-Context Learning Help Long-Context LLM Judges? See More, Judge Better!
Figure 2 for Can Many-Shot In-Context Learning Help Long-Context LLM Judges? See More, Judge Better!
Figure 3 for Can Many-Shot In-Context Learning Help Long-Context LLM Judges? See More, Judge Better!
Figure 4 for Can Many-Shot In-Context Learning Help Long-Context LLM Judges? See More, Judge Better!
Viaarxiv icon