Joshua Romoff

Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play

Nov 28, 2023
Via arXiv

Improving Intrinsic Exploration by Creating Stationary Objectives

Nov 03, 2023
Via arXiv

Learning Computational Efficient Bots with Costly Features

Aug 18, 2023
Via arXiv

Direct Behavior Specification via Constrained Reinforcement Learning

Jan 19, 2022
Via arXiv

Graph augmented Deep Reinforcement Learning in the GameRLand3D environment

Dec 22, 2021
Via arXiv

Deep Reinforcement Learning for Navigation in AAA Video Games

Nov 09, 2020
Via arXiv

TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?

Jul 06, 2020
Via arXiv

Towards the Systematic Reporting of the Energy and Carbon Footprints of Machine Learning

Jan 31, 2020
Via arXiv

Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning

Jun 09, 2019
Via arXiv

Separating value functions across time-scales

Feb 08, 2019
Via arXiv