Picture for Armando Solar-Lezama

Armando Solar-Lezama

Massachusetts Institute of Technology

Power Term Polynomial Algebra for Boolean Logic

Add code
Mar 14, 2026
Viaarxiv icon

Adaptive Problem Generation via Symbolic Representations

Add code
Feb 22, 2026
Viaarxiv icon

Challenges and Paths Towards AI for Software Engineering

Add code
Mar 28, 2025
Viaarxiv icon

MimeQA: Towards Socially-Intelligent Nonverbal Foundation Models

Add code
Feb 23, 2025
Viaarxiv icon

Randomly Sampled Language Reasoning Problems Reveal Limits of LLMs

Add code
Jan 07, 2025
Figure 1 for Randomly Sampled Language Reasoning Problems Reveal Limits of LLMs
Figure 2 for Randomly Sampled Language Reasoning Problems Reveal Limits of LLMs
Figure 3 for Randomly Sampled Language Reasoning Problems Reveal Limits of LLMs
Figure 4 for Randomly Sampled Language Reasoning Problems Reveal Limits of LLMs
Viaarxiv icon

MathDSL: A Domain-Specific Language for Concise Mathematical Solutions Via Program Synthesis

Add code
Sep 26, 2024
Viaarxiv icon

When Do Skills Help Reinforcement Learning? A Theoretical Analysis of Temporal Abstractions

Add code
Jun 12, 2024
Viaarxiv icon

LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code

Add code
Mar 12, 2024
Figure 1 for LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code
Figure 2 for LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code
Figure 3 for LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code
Figure 4 for LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code
Viaarxiv icon

The Counterfeit Conundrum: Can Code Language Models Grasp the Nuances of Their Incorrect Generations?

Add code
Feb 29, 2024
Viaarxiv icon

CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution

Add code
Jan 05, 2024
Viaarxiv icon