Matthew Barnett

FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI

Nov 07, 2024
Via arXiv

An Empirical Study of Scaling Laws for Transfer

Aug 30, 2024
Via arXiv

Chinchilla Scaling: A replication attempt

Apr 15, 2024
Via arXiv