Picture for Yi Zong

Yi Zong

Training an LLM-as-a-Judge Model: Pipeline, Insights, and Practical Lessons

Add code
Feb 05, 2025
Viaarxiv icon

GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation

Add code
Feb 24, 2024
Viaarxiv icon

Evaluating the Performance of Large Language Models on GAOKAO Benchmark

Add code
May 23, 2023
Viaarxiv icon