Picture for Edward Grefenstette

Edward Grefenstette

Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models

Add code
Nov 19, 2024
Viaarxiv icon

Debating with More Persuasive LLMs Leads to More Truthful Answers

Add code
Feb 15, 2024
Viaarxiv icon

Leading the Pack: N-player Opponent Shaping

Add code
Dec 26, 2023
Viaarxiv icon

Scaling Opponent Shaping to High Dimensional Games

Add code
Dec 19, 2023
Viaarxiv icon

H-GAP: Humanoid Control with a Generalist Planner

Add code
Dec 05, 2023
Viaarxiv icon

minimax: Efficient Baselines for Autocurricula in JAX

Add code
Nov 23, 2023
Viaarxiv icon

Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks

Add code
Nov 21, 2023
Viaarxiv icon

Understanding the Effects of RLHF on LLM Generalisation and Diversity

Add code
Oct 10, 2023
Viaarxiv icon

Finetuning from Offline Reinforcement Learning: Challenges, Trade-offs and Practical Solutions

Add code
Mar 30, 2023
Viaarxiv icon

Optimal Transport for Offline Imitation Learning

Add code
Mar 24, 2023
Viaarxiv icon