Picture for Benjamin Ellis

Benjamin Ellis

Beyond the Boundaries of Proximal Policy Optimization

Add code
Nov 01, 2024
Viaarxiv icon

CURATe: Benchmarking Personalised Alignment of Conversational AI Assistants

Add code
Oct 28, 2024
Viaarxiv icon

Simplifying Deep Temporal Difference Learning

Add code
Jul 05, 2024
Viaarxiv icon

Policy-Guided Diffusion

Add code
Apr 09, 2024
Viaarxiv icon

Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning

Add code
Feb 26, 2024
Viaarxiv icon

JaxMARL: Multi-Agent RL Environments in JAX

Add code
Nov 20, 2023
Viaarxiv icon

Trust-Region-Free Policy Optimization for Stochastic Policies

Add code
Feb 15, 2023
Viaarxiv icon

SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning

Add code
Dec 14, 2022
Viaarxiv icon

Generalization in Cooperative Multi-Agent Systems

Add code
Jan 31, 2022
Figure 1 for Generalization in Cooperative Multi-Agent Systems
Figure 2 for Generalization in Cooperative Multi-Agent Systems
Figure 3 for Generalization in Cooperative Multi-Agent Systems
Figure 4 for Generalization in Cooperative Multi-Agent Systems
Viaarxiv icon