Picture for Steffen Udluft

Steffen Udluft

TEA: Trajectory Encoding Augmentation for Robust and Transferable Policies in Offline Reinforcement Learning

Add code
Nov 28, 2024
Viaarxiv icon

Neural-ANOVA: Model Decomposition for Interpretable Machine Learning

Add code
Aug 22, 2024
Viaarxiv icon

Why long model-based rollouts are no reason for bad Q-value estimates

Add code
Jul 16, 2024
Viaarxiv icon

Model-based Offline Quantum Reinforcement Learning

Add code
Apr 14, 2024
Viaarxiv icon

Learning Control Policies for Variable Objectives from Offline Data

Add code
Aug 11, 2023
Viaarxiv icon

Automatic Trade-off Adaptation in Offline RL

Add code
Jun 16, 2023
Viaarxiv icon

Safe Policy Improvement Approaches and their Limitations

Add code
Aug 01, 2022
Figure 1 for Safe Policy Improvement Approaches and their Limitations
Figure 2 for Safe Policy Improvement Approaches and their Limitations
Figure 3 for Safe Policy Improvement Approaches and their Limitations
Figure 4 for Safe Policy Improvement Approaches and their Limitations
Viaarxiv icon

Quantum Policy Iteration via Amplitude Estimation and Grover Search -- Towards Quantum Advantage for Reinforcement Learning

Add code
Jun 09, 2022
Figure 1 for Quantum Policy Iteration via Amplitude Estimation and Grover Search -- Towards Quantum Advantage for Reinforcement Learning
Figure 2 for Quantum Policy Iteration via Amplitude Estimation and Grover Search -- Towards Quantum Advantage for Reinforcement Learning
Figure 3 for Quantum Policy Iteration via Amplitude Estimation and Grover Search -- Towards Quantum Advantage for Reinforcement Learning
Figure 4 for Quantum Policy Iteration via Amplitude Estimation and Grover Search -- Towards Quantum Advantage for Reinforcement Learning
Viaarxiv icon

User-Interactive Offline Reinforcement Learning

Add code
May 21, 2022
Figure 1 for User-Interactive Offline Reinforcement Learning
Figure 2 for User-Interactive Offline Reinforcement Learning
Figure 3 for User-Interactive Offline Reinforcement Learning
Figure 4 for User-Interactive Offline Reinforcement Learning
Viaarxiv icon

Safe Policy Improvement Approaches on Discrete Markov Decision Processes

Add code
Jan 28, 2022
Figure 1 for Safe Policy Improvement Approaches on Discrete Markov Decision Processes
Figure 2 for Safe Policy Improvement Approaches on Discrete Markov Decision Processes
Figure 3 for Safe Policy Improvement Approaches on Discrete Markov Decision Processes
Figure 4 for Safe Policy Improvement Approaches on Discrete Markov Decision Processes
Viaarxiv icon