Picture for Malcolm Hardy

Malcolm Hardy

BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices

Add code
Nov 20, 2024
Figure 1 for BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices
Figure 2 for BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices
Figure 3 for BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices
Figure 4 for BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices
Viaarxiv icon