Arda Uzunoglu

WorldAPIs: The World Is Worth How Many APIs? A Thought Experiment

Jul 10, 2024
Via arXiv

PARADISE: Evaluating Implicit Planning Skills of Language Models with Procedural Warnings and Tips Dataset

Mar 06, 2024
Via arXiv