
Ashish Sabharwal

Shammie

A Little Depth Goes a Long Way: The Expressive Power of Log-Depth Transformers

Mar 05, 2025
Via arXiv

ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning

Feb 03, 2025
Via arXiv

Understanding the Logic of Direct Preference Alignment through Logic

Dec 23, 2024
Via arXiv

SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories

Sep 11, 2024
Via arXiv

AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents

Jul 26, 2024
Via arXiv

Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions

Jul 21, 2024
Via arXiv

DiscoveryBench: Towards Data-Driven Discovery with Large Language Models

Jul 01, 2024
Via arXiv

The Illusion of State in State-Space Models

Apr 12, 2024
Via arXiv

Transformers as Transducers

Apr 02, 2024
Via arXiv

Data-driven Discovery with Large Generative Models

Feb 21, 2024
Via arXiv