Picture for Edward Grefenstette

Edward Grefenstette

Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models

Add code
Nov 19, 2024
Figure 1 for Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models
Figure 2 for Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models
Figure 3 for Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models
Figure 4 for Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models
Viaarxiv icon

Debating with More Persuasive LLMs Leads to More Truthful Answers

Add code
Feb 15, 2024
Viaarxiv icon

Leading the Pack: N-player Opponent Shaping

Add code
Dec 26, 2023
Figure 1 for Leading the Pack: N-player Opponent Shaping
Figure 2 for Leading the Pack: N-player Opponent Shaping
Figure 3 for Leading the Pack: N-player Opponent Shaping
Figure 4 for Leading the Pack: N-player Opponent Shaping
Viaarxiv icon

Scaling Opponent Shaping to High Dimensional Games

Add code
Dec 19, 2023
Figure 1 for Scaling Opponent Shaping to High Dimensional Games
Figure 2 for Scaling Opponent Shaping to High Dimensional Games
Figure 3 for Scaling Opponent Shaping to High Dimensional Games
Figure 4 for Scaling Opponent Shaping to High Dimensional Games
Viaarxiv icon

H-GAP: Humanoid Control with a Generalist Planner

Add code
Dec 05, 2023
Figure 1 for H-GAP: Humanoid Control with a Generalist Planner
Figure 2 for H-GAP: Humanoid Control with a Generalist Planner
Figure 3 for H-GAP: Humanoid Control with a Generalist Planner
Figure 4 for H-GAP: Humanoid Control with a Generalist Planner
Viaarxiv icon

minimax: Efficient Baselines for Autocurricula in JAX

Add code
Nov 23, 2023
Viaarxiv icon

Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks

Add code
Nov 21, 2023
Viaarxiv icon

Understanding the Effects of RLHF on LLM Generalisation and Diversity

Add code
Oct 10, 2023
Viaarxiv icon

Finetuning from Offline Reinforcement Learning: Challenges, Trade-offs and Practical Solutions

Add code
Mar 30, 2023
Viaarxiv icon

Optimal Transport for Offline Imitation Learning

Add code
Mar 24, 2023
Figure 1 for Optimal Transport for Offline Imitation Learning
Figure 2 for Optimal Transport for Offline Imitation Learning
Figure 3 for Optimal Transport for Offline Imitation Learning
Figure 4 for Optimal Transport for Offline Imitation Learning
Viaarxiv icon