Picture for Jiaxi Song

Jiaxi Song

R-Bench: Graduate-level Multi-disciplinary Benchmarks for LLM & MLLM Complex Reasoning Evaluation

Add code
May 04, 2025
Viaarxiv icon

AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset

Add code
Apr 04, 2025
Figure 1 for AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset
Figure 2 for AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset
Figure 3 for AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset
Figure 4 for AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset
Viaarxiv icon