Picture for Matthijs T. J. Spaan

Matthijs T. J. Spaan

Positive Experience Reflection for Agents in Interactive Text Environments

Add code
Nov 04, 2024
Viaarxiv icon

Training on more Reachable Tasks for Generalisation in Reinforcement Learning

Add code
Oct 04, 2024
Viaarxiv icon

Pessimistic Iterative Planning for Robust POMDPs

Add code
Aug 16, 2024
Viaarxiv icon

Explore-Go: Leveraging Exploration for Generalisation in Deep Reinforcement Learning

Add code
Jun 12, 2024
Viaarxiv icon

Value Improved Actor Critic Algorithms

Add code
Jun 03, 2024
Viaarxiv icon

Exploring LLMs as a Source of Targeted Synthetic Textual Data to Minimize High Confidence Misclassifications

Add code
Apr 02, 2024
Viaarxiv icon

When Do Off-Policy and On-Policy Policy Gradient Methods Align?

Add code
Feb 19, 2024
Viaarxiv icon

Reinforcement Learning by Guided Safe Exploration

Add code
Jul 26, 2023
Viaarxiv icon

Diverse Projection Ensembles for Distributional Reinforcement Learning

Add code
Jun 12, 2023
Viaarxiv icon

The Role of Diverse Replay for Generalisation in Reinforcement Learning

Add code
Jun 09, 2023
Viaarxiv icon