
Michael Bowling

A Method for Evaluating Hyperparameter Sensitivity in Reinforcement Learning

Dec 10, 2024
Via arXiv

Real-Time Recurrent Learning using Trace Units in Reinforcement Learning

Sep 02, 2024
Via arXiv

Meta-Gradient Search Control: A Method for Improving the Efficiency of Dyna-style Planning

Jun 27, 2024
Via arXiv

Beyond Optimism: Exploration With Partially Observable Rewards

Jun 20, 2024
Via arXiv

Monitored Markov Decision Processes

Feb 13, 2024
Via arXiv

Assessing the Interpretability of Programmatic Policies with Large Language Models

Nov 12, 2023
Via arXiv

TacticAI: an AI assistant for football tactics

Oct 17, 2023
Via arXiv

Proper Laplacian Representation Learning

Oct 16, 2023
Via arXiv

Targeted Search Control in AlphaZero for Effective Policy Improvement

Feb 28, 2023
Via arXiv

Settling the Reward Hypothesis

Dec 20, 2022
Via arXiv