Title: PutnamBench: Evaluating Neural Theorem-Provers on the Putnam Mathematical Competition

Jul 15, 2024

George Tsoukalas, Jasper Lee, John Jennings, Jimmy Xin, Michelle Ding, Michael Jennings, Amitayush Thakur, Swarat Chaudhuri

Abstract: We present PutnamBench, a new multilingual benchmark for evaluating the ability of neural theorem-provers to solve competition mathematics problems. PutnamBench consists of 1697 hand-constructed formalizations of 640 theorems sourced from the William Lowell Putnam Mathematical Competition, the premier undergraduate-level mathematics competition in North America. All the theorems have formalizations in Lean 4 and Isabelle; a substantial subset also has Coq formalizations. Proving the theorems requires significant problem-solving ability and proficiency in a broad range of topics taught in undergraduate mathematics courses. We use PutnamBench to evaluate several established neural and symbolic theorem-provers. These approaches can only solve a handful of the PutnamBench problems, establishing the benchmark as a difficult open challenge for research on neural theorem-proving. PutnamBench is available at https://github.com/trishullab/PutnamBench.
