Picture for Martin Riddell

Martin Riddell

P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains

Add code
Oct 11, 2024
Figure 1 for P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains
Figure 2 for P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains
Figure 3 for P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains
Figure 4 for P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains
Viaarxiv icon

Quantifying Contamination in Evaluating Code Generation Capabilities of Language Models

Add code
Mar 06, 2024
Viaarxiv icon

L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models

Add code
Oct 02, 2023
Viaarxiv icon

FOLIO: Natural Language Reasoning with First-Order Logic

Add code
Sep 02, 2022
Figure 1 for FOLIO: Natural Language Reasoning with First-Order Logic
Figure 2 for FOLIO: Natural Language Reasoning with First-Order Logic
Figure 3 for FOLIO: Natural Language Reasoning with First-Order Logic
Figure 4 for FOLIO: Natural Language Reasoning with First-Order Logic
Viaarxiv icon