Armando Solar-Lezama

Massachusetts Institute of Technology

Challenges and Paths Towards AI for Software Engineering

Mar 28, 2025

MimeQA: Towards Socially-Intelligent Nonverbal Foundation Models

Feb 23, 2025

Randomly Sampled Language Reasoning Problems Reveal Limits of LLMs

Jan 07, 2025

MathDSL: A Domain-Specific Language for Concise Mathematical Solutions Via Program Synthesis

Sep 26, 2024

When Do Skills Help Reinforcement Learning? A Theoretical Analysis of Temporal Abstractions

Jun 12, 2024

LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code

Mar 12, 2024

The Counterfeit Conundrum: Can Code Language Models Grasp the Nuances of Their Incorrect Generations?

Feb 29, 2024

CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution

Jan 05, 2024

LINC: A Neurosymbolic Approach for Logical Reasoning by Combining Language Models with First-Order Logic Provers

Oct 23, 2023

Learning a Hierarchical Planner from Humans in Multiple Generations

Oct 17, 2023