Picture for Peter Shaw

Peter Shaw

Bridging Kolmogorov Complexity and Deep Learning: Asymptotically Optimal Description Length Objectives for Transformers

Add code
Sep 26, 2025
Viaarxiv icon

AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories

Add code
Apr 11, 2025
Figure 1 for AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories
Figure 2 for AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories
Figure 3 for AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories
Figure 4 for AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories
Viaarxiv icon

ALTA: Compiler-Based Analysis of Transformers

Add code
Oct 23, 2024
Figure 1 for ALTA: Compiler-Based Analysis of Transformers
Figure 2 for ALTA: Compiler-Based Analysis of Transformers
Figure 3 for ALTA: Compiler-Based Analysis of Transformers
Figure 4 for ALTA: Compiler-Based Analysis of Transformers
Viaarxiv icon

BAGEL: Bootstrapping Agents by Guiding Exploration with Language

Add code
Mar 12, 2024
Figure 1 for BAGEL: Bootstrapping Agents by Guiding Exploration with Language
Figure 2 for BAGEL: Bootstrapping Agents by Guiding Exploration with Language
Figure 3 for BAGEL: Bootstrapping Agents by Guiding Exploration with Language
Figure 4 for BAGEL: Bootstrapping Agents by Guiding Exploration with Language
Viaarxiv icon

Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking

Add code
Dec 21, 2023
Figure 1 for Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking
Figure 2 for Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking
Figure 3 for Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking
Figure 4 for Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking
Viaarxiv icon

From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces

Add code
May 31, 2023
Figure 1 for From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces
Figure 2 for From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces
Figure 3 for From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces
Figure 4 for From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces
Viaarxiv icon

QUEST: A Retrieval Dataset of Entity-Seeking Queries with Implicit Set Operations

Add code
May 19, 2023
Figure 1 for QUEST: A Retrieval Dataset of Entity-Seeking Queries with Implicit Set Operations
Figure 2 for QUEST: A Retrieval Dataset of Entity-Seeking Queries with Implicit Set Operations
Figure 3 for QUEST: A Retrieval Dataset of Entity-Seeking Queries with Implicit Set Operations
Figure 4 for QUEST: A Retrieval Dataset of Entity-Seeking Queries with Implicit Set Operations
Viaarxiv icon

Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding

Add code
Oct 07, 2022
Figure 1 for Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding
Figure 2 for Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding
Figure 3 for Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding
Figure 4 for Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding
Viaarxiv icon

Generate-and-Retrieve: use your predictions to improve retrieval for semantic parsing

Add code
Sep 29, 2022
Figure 1 for Generate-and-Retrieve: use your predictions to improve retrieval for semantic parsing
Figure 2 for Generate-and-Retrieve: use your predictions to improve retrieval for semantic parsing
Figure 3 for Generate-and-Retrieve: use your predictions to improve retrieval for semantic parsing
Figure 4 for Generate-and-Retrieve: use your predictions to improve retrieval for semantic parsing
Viaarxiv icon

Evaluating the Impact of Model Scale for Compositional Generalization in Semantic Parsing

Add code
May 24, 2022
Figure 1 for Evaluating the Impact of Model Scale for Compositional Generalization in Semantic Parsing
Figure 2 for Evaluating the Impact of Model Scale for Compositional Generalization in Semantic Parsing
Figure 3 for Evaluating the Impact of Model Scale for Compositional Generalization in Semantic Parsing
Figure 4 for Evaluating the Impact of Model Scale for Compositional Generalization in Semantic Parsing
Viaarxiv icon