Alex Olshevsky

MDP Geometry, Normalization and Value Free Solvers

Jul 09, 2024

Tree Search for Simultaneous Move Games via Equilibrium Approximation

Jun 14, 2024

On Value Iteration Convergence in Connected MDPs

Jun 13, 2024

Sample Complexity of the Linear Quadratic Regulator: A Reinforcement Learning Lens

Apr 18, 2024

One-Shot Averaging for Distributed TD Under Markov Sampling

Mar 13, 2024

Convex SGD: Generalization Without Early Stopping

Jan 08, 2024

On the Performance of Temporal Difference Learning With Neural Networks

Dec 08, 2023

Distributed TD(0) with Almost No Communication

May 25, 2023

Closing the gap between SVRG and TD-SVRG with Gradient Splitting

Nov 29, 2022

A Small Gain Analysis of Single Timescale Actor Critic

Mar 08, 2022