Picture for Tim Pearce

Tim Pearce

Scaling Laws for Pre-training Agents and World Models

Add code
Nov 07, 2024
Viaarxiv icon

Reconciling Kaplan and Chinchilla Scaling Laws

Add code
Jun 12, 2024
Viaarxiv icon

Diffusion for World Modeling: Visual Details Matter in Atari

Add code
May 20, 2024
Figure 1 for Diffusion for World Modeling: Visual Details Matter in Atari
Figure 2 for Diffusion for World Modeling: Visual Details Matter in Atari
Figure 3 for Diffusion for World Modeling: Visual Details Matter in Atari
Figure 4 for Diffusion for World Modeling: Visual Details Matter in Atari
Viaarxiv icon

C-GAIL: Stabilizing Generative Adversarial Imitation Learning with Control Theory

Add code
Feb 26, 2024
Viaarxiv icon

Coalitional Bargaining via Reinforcement Learning: An Application to Collaborative Vehicle Routing

Add code
Oct 26, 2023
Viaarxiv icon

Fair collaborative vehicle routing: A deep multi-agent reinforcement learning approach

Add code
Oct 26, 2023
Viaarxiv icon

TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play

Add code
Feb 21, 2023
Viaarxiv icon

Imitating Human Behaviour with Diffusion Models

Add code
Jan 25, 2023
Viaarxiv icon

Censored Quantile Regression Neural Networks

Add code
May 26, 2022
Figure 1 for Censored Quantile Regression Neural Networks
Figure 2 for Censored Quantile Regression Neural Networks
Figure 3 for Censored Quantile Regression Neural Networks
Figure 4 for Censored Quantile Regression Neural Networks
Viaarxiv icon

Bayesian Autoencoders: Analysing and Fixing the Bernoulli likelihood for Out-of-Distribution Detection

Add code
Jul 28, 2021
Figure 1 for Bayesian Autoencoders: Analysing and Fixing the Bernoulli likelihood for Out-of-Distribution Detection
Figure 2 for Bayesian Autoencoders: Analysing and Fixing the Bernoulli likelihood for Out-of-Distribution Detection
Figure 3 for Bayesian Autoencoders: Analysing and Fixing the Bernoulli likelihood for Out-of-Distribution Detection
Figure 4 for Bayesian Autoencoders: Analysing and Fixing the Bernoulli likelihood for Out-of-Distribution Detection
Viaarxiv icon