Picture for Alex Olshevsky

Alex Olshevsky

Sample Complexity of Linear Quadratic Regulator Without Initial Stability

Add code
Feb 20, 2025
Viaarxiv icon

Analysis of Value Iteration Through Absolute Probability Sequences

Add code
Feb 05, 2025
Viaarxiv icon

MDP Geometry, Normalization and Value Free Solvers

Add code
Jul 09, 2024
Viaarxiv icon

Tree Search for Simultaneous Move Games via Equilibrium Approximation

Add code
Jun 14, 2024
Figure 1 for Tree Search for Simultaneous Move Games via Equilibrium Approximation
Figure 2 for Tree Search for Simultaneous Move Games via Equilibrium Approximation
Figure 3 for Tree Search for Simultaneous Move Games via Equilibrium Approximation
Figure 4 for Tree Search for Simultaneous Move Games via Equilibrium Approximation
Viaarxiv icon

On Value Iteration Convergence in Connected MDPs

Add code
Jun 13, 2024
Figure 1 for On Value Iteration Convergence in Connected MDPs
Viaarxiv icon

Sample Complexity of the Linear Quadratic Regulator: A Reinforcement Learning Lens

Add code
Apr 18, 2024
Viaarxiv icon

One-Shot Averaging for Distributed TD Under Markov Sampling

Add code
Mar 13, 2024
Viaarxiv icon

Convex SGD: Generalization Without Early Stopping

Add code
Jan 08, 2024
Viaarxiv icon

On the Performance of Temporal Difference Learning With Neural Networks

Add code
Dec 08, 2023
Viaarxiv icon

Distributed TD(0) with Almost No Communication

Add code
May 25, 2023
Figure 1 for Distributed TD(0) with Almost No Communication
Figure 2 for Distributed TD(0) with Almost No Communication
Viaarxiv icon