Peter Clark

SimpleToM: Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs

Oct 17, 2024

SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories

Sep 11, 2024

DiscoveryBench: Towards Data-Driven Discovery with Large Language Models

Jul 01, 2024

Can Language Models Serve as Text-Based World Simulators?

Jun 10, 2024

DISCOVERYWORLD: A Virtual Environment for Developing and Evaluating Automated Scientific Discovery Agents

Jun 10, 2024

PDDLEGO: Iterative Planning in Textual Environments

May 30, 2024

Learning to Reason via Program Generation, Emulation, and Search

May 28, 2024

PROC2PDDL: Open-Domain Planning Representations from Texts

Feb 29, 2024

Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic

Feb 27, 2024

Data-driven Discovery with Large Generative Models

Feb 21, 2024