Armando Solar-Lezama

Massachusetts Institute of Technology

Randomly Sampled Language Reasoning Problems Reveal Limits of LLMs

Jan 07, 2025

MathDSL: A Domain-Specific Language for Concise Mathematical Solutions Via Program Synthesis

Sep 26, 2024

When Do Skills Help Reinforcement Learning? A Theoretical Analysis of Temporal Abstractions

Jun 12, 2024

LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code

Mar 12, 2024

The Counterfeit Conundrum: Can Code Language Models Grasp the Nuances of Their Incorrect Generations?

Feb 29, 2024

CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution

Jan 05, 2024

LINC: A Neurosymbolic Approach for Logical Reasoning by Combining Language Models with First-Order Logic Provers

Oct 23, 2023

Learning a Hierarchical Planner from Humans in Multiple Generations

Oct 17, 2023

Exploring the MIT Mathematics and EECS Curriculum Using Large Language Models

Jun 24, 2023

Demystifying GPT Self-Repair for Code Generation

Jun 22, 2023