Picture for Gellért Weisz

Gellért Weisz

Trajectory Data Suffices for Statistically Efficient Learning in Offline RL with Linear $q^π$-Realizability and Concentrability

Add code
May 27, 2024
Viaarxiv icon

Online RL in Linearly $q^π$-Realizable MDPs Is as Easy as in Linear MDPs If You Learn What to Ignore

Add code
Oct 11, 2023
Viaarxiv icon

Optimistic Natural Policy Gradient: a Simple Efficient Policy Optimization Framework for Online RL

Add code
May 18, 2023
Figure 1 for Optimistic Natural Policy Gradient: a Simple Efficient Policy Optimization Framework for Online RL
Viaarxiv icon

Exponential Hardness of Reinforcement Learning with Linear Function Approximation

Add code
Feb 25, 2023
Figure 1 for Exponential Hardness of Reinforcement Learning with Linear Function Approximation
Viaarxiv icon

Confident Approximate Policy Iteration for Efficient Local Planning in $q^π$-realizable MDPs

Add code
Oct 27, 2022
Viaarxiv icon

TensorPlan and the Few Actions Lower Bound for Planning in MDPs under Linear Realizability of Optimal Value Functions

Add code
Oct 05, 2021
Figure 1 for TensorPlan and the Few Actions Lower Bound for Planning in MDPs under Linear Realizability of Optimal Value Functions
Figure 2 for TensorPlan and the Few Actions Lower Bound for Planning in MDPs under Linear Realizability of Optimal Value Functions
Viaarxiv icon

LeapsAndBounds: A Method for Approximately Optimal Algorithm Configuration

Add code
Jul 02, 2018
Figure 1 for LeapsAndBounds: A Method for Approximately Optimal Algorithm Configuration
Figure 2 for LeapsAndBounds: A Method for Approximately Optimal Algorithm Configuration
Figure 3 for LeapsAndBounds: A Method for Approximately Optimal Algorithm Configuration
Viaarxiv icon

Sample Efficient Deep Reinforcement Learning for Dialogue Systems with Large Action Spaces

Add code
Feb 11, 2018
Figure 1 for Sample Efficient Deep Reinforcement Learning for Dialogue Systems with Large Action Spaces
Figure 2 for Sample Efficient Deep Reinforcement Learning for Dialogue Systems with Large Action Spaces
Figure 3 for Sample Efficient Deep Reinforcement Learning for Dialogue Systems with Large Action Spaces
Figure 4 for Sample Efficient Deep Reinforcement Learning for Dialogue Systems with Large Action Spaces
Viaarxiv icon