Ronald Parr

Duke University

Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy

Jul 10, 2024
Via arXiv

Amazing Things Come From Having Many Good Models

Jul 10, 2024
Via arXiv

An Optimal Tightness Bound for the Simulation Lemma

Jun 24, 2024
Via arXiv

A Path to Simpler Models Starts With Noise

Oct 30, 2023
Via arXiv

Fitted Q-Learning for Relational Domains

Jun 10, 2020
Via arXiv

Proceedings of the Twenty-Third Conference on Uncertainty in Artificial Intelligence

Aug 28, 2014
Via arXiv

Greedy Algorithms for Sparse Reinforcement Learning

Jun 27, 2012
Via arXiv