Picture for Dylan Ashley

Dylan Ashley

Agent-as-a-Judge: Evaluate Agents with Agents

Add code
Oct 14, 2024
Figure 1 for Agent-as-a-Judge: Evaluate Agents with Agents
Figure 2 for Agent-as-a-Judge: Evaluate Agents with Agents
Figure 3 for Agent-as-a-Judge: Evaluate Agents with Agents
Figure 4 for Agent-as-a-Judge: Evaluate Agents with Agents
Viaarxiv icon

The Languini Kitchen: Enabling Language Modelling Research at Different Scales of Compute

Add code
Sep 20, 2023
Figure 1 for The Languini Kitchen: Enabling Language Modelling Research at Different Scales of Compute
Figure 2 for The Languini Kitchen: Enabling Language Modelling Research at Different Scales of Compute
Figure 3 for The Languini Kitchen: Enabling Language Modelling Research at Different Scales of Compute
Figure 4 for The Languini Kitchen: Enabling Language Modelling Research at Different Scales of Compute
Viaarxiv icon

Back to Square One: Superhuman Performance in Chutes and Ladders Through Deep Neural Networks and Tree Search

Add code
Apr 01, 2021
Figure 1 for Back to Square One: Superhuman Performance in Chutes and Ladders Through Deep Neural Networks and Tree Search
Figure 2 for Back to Square One: Superhuman Performance in Chutes and Ladders Through Deep Neural Networks and Tree Search
Figure 3 for Back to Square One: Superhuman Performance in Chutes and Ladders Through Deep Neural Networks and Tree Search
Figure 4 for Back to Square One: Superhuman Performance in Chutes and Ladders Through Deep Neural Networks and Tree Search
Viaarxiv icon