Picture for Diego Chicharro

Diego Chicharro

FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI

Add code
Nov 07, 2024
Figure 1 for FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI
Figure 2 for FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI
Figure 3 for FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI
Figure 4 for FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI
Viaarxiv icon