Picture for Volodymyr Tkachuk

Volodymyr Tkachuk

Trajectory Data Suffices for Statistically Efficient Learning in Offline RL with Linear $q^π$-Realizability and Concentrability

Add code
May 27, 2024
Viaarxiv icon

Regret Minimization via Saddle Point Optimization

Add code
Mar 15, 2024
Viaarxiv icon

Efficient Planning in Combinatorial Action Spaces with Applications to Cooperative Multi-Agent Reinforcement Learning

Add code
Feb 08, 2023
Viaarxiv icon

The Effect of Q-function Reuse on the Total Regret of Tabular, Model-Free, Reinforcement Learning

Add code
Mar 07, 2021
Figure 1 for The Effect of Q-function Reuse on the Total Regret of Tabular, Model-Free, Reinforcement Learning
Figure 2 for The Effect of Q-function Reuse on the Total Regret of Tabular, Model-Free, Reinforcement Learning
Viaarxiv icon