Picture for Michael Bowling

Michael Bowling

A Method for Evaluating Hyperparameter Sensitivity in Reinforcement Learning

Add code
Dec 10, 2024
Figure 1 for A Method for Evaluating Hyperparameter Sensitivity in Reinforcement Learning
Figure 2 for A Method for Evaluating Hyperparameter Sensitivity in Reinforcement Learning
Figure 3 for A Method for Evaluating Hyperparameter Sensitivity in Reinforcement Learning
Figure 4 for A Method for Evaluating Hyperparameter Sensitivity in Reinforcement Learning
Viaarxiv icon

Real-Time Recurrent Learning using Trace Units in Reinforcement Learning

Add code
Sep 02, 2024
Figure 1 for Real-Time Recurrent Learning using Trace Units in Reinforcement Learning
Figure 2 for Real-Time Recurrent Learning using Trace Units in Reinforcement Learning
Figure 3 for Real-Time Recurrent Learning using Trace Units in Reinforcement Learning
Figure 4 for Real-Time Recurrent Learning using Trace Units in Reinforcement Learning
Viaarxiv icon

Meta-Gradient Search Control: A Method for Improving the Efficiency of Dyna-style Planning

Add code
Jun 27, 2024
Viaarxiv icon

Beyond Optimism: Exploration With Partially Observable Rewards

Add code
Jun 20, 2024
Viaarxiv icon

Monitored Markov Decision Processes

Add code
Feb 13, 2024
Figure 1 for Monitored Markov Decision Processes
Figure 2 for Monitored Markov Decision Processes
Figure 3 for Monitored Markov Decision Processes
Figure 4 for Monitored Markov Decision Processes
Viaarxiv icon

Assessing the Interpretability of Programmatic Policies with Large Language Models

Add code
Nov 12, 2023
Figure 1 for Assessing the Interpretability of Programmatic Policies with Large Language Models
Figure 2 for Assessing the Interpretability of Programmatic Policies with Large Language Models
Figure 3 for Assessing the Interpretability of Programmatic Policies with Large Language Models
Figure 4 for Assessing the Interpretability of Programmatic Policies with Large Language Models
Viaarxiv icon

TacticAI: an AI assistant for football tactics

Add code
Oct 17, 2023
Viaarxiv icon

Proper Laplacian Representation Learning

Add code
Oct 16, 2023
Figure 1 for Proper Laplacian Representation Learning
Figure 2 for Proper Laplacian Representation Learning
Figure 3 for Proper Laplacian Representation Learning
Figure 4 for Proper Laplacian Representation Learning
Viaarxiv icon

Targeted Search Control in AlphaZero for Effective Policy Improvement

Add code
Feb 28, 2023
Figure 1 for Targeted Search Control in AlphaZero for Effective Policy Improvement
Figure 2 for Targeted Search Control in AlphaZero for Effective Policy Improvement
Figure 3 for Targeted Search Control in AlphaZero for Effective Policy Improvement
Figure 4 for Targeted Search Control in AlphaZero for Effective Policy Improvement
Viaarxiv icon

Settling the Reward Hypothesis

Add code
Dec 20, 2022
Figure 1 for Settling the Reward Hypothesis
Figure 2 for Settling the Reward Hypothesis
Viaarxiv icon