Picture for Yiwei Qin

Yiwei Qin

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Add code
Nov 25, 2024
Viaarxiv icon

OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI

Add code
Jun 18, 2024
Figure 1 for OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
Figure 2 for OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
Figure 3 for OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
Figure 4 for OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
Viaarxiv icon

InFoBench: Evaluating Instruction Following Ability in Large Language Models

Add code
Jan 07, 2024
Viaarxiv icon

Searching for Effective Multilingual Fine-Tuning Methods: A Case Study in Summarization

Add code
Dec 12, 2022
Viaarxiv icon

T5Score: Discriminative Fine-tuning of Generative Evaluation Metrics

Add code
Dec 12, 2022
Viaarxiv icon