Picture for Ashish Sabharwal

Ashish Sabharwal

Shammie

ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning

Add code
Feb 03, 2025
Viaarxiv icon

Understanding the Logic of Direct Preference Alignment through Logic

Add code
Dec 23, 2024
Viaarxiv icon

SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories

Add code
Sep 11, 2024
Viaarxiv icon

AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents

Add code
Jul 26, 2024
Figure 1 for AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents
Figure 2 for AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents
Figure 3 for AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents
Figure 4 for AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents
Viaarxiv icon

Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions

Add code
Jul 21, 2024
Viaarxiv icon

DiscoveryBench: Towards Data-Driven Discovery with Large Language Models

Add code
Jul 01, 2024
Figure 1 for DiscoveryBench: Towards Data-Driven Discovery with Large Language Models
Figure 2 for DiscoveryBench: Towards Data-Driven Discovery with Large Language Models
Figure 3 for DiscoveryBench: Towards Data-Driven Discovery with Large Language Models
Figure 4 for DiscoveryBench: Towards Data-Driven Discovery with Large Language Models
Viaarxiv icon

The Illusion of State in State-Space Models

Add code
Apr 12, 2024
Viaarxiv icon

Transformers as Transducers

Add code
Apr 02, 2024
Viaarxiv icon

Data-driven Discovery with Large Generative Models

Add code
Feb 21, 2024
Viaarxiv icon

Leveraging Code to Improve In-context Learning for Semantic Parsing

Add code
Nov 16, 2023
Viaarxiv icon