Picture for Vivek S. Borkar

Vivek S. Borkar

A Concentration Bound for TD with Function Approximation

Add code
Dec 16, 2023
Viaarxiv icon

Approximation of Convex Envelope Using Reinforcement Learning

Add code
Nov 24, 2023
Viaarxiv icon

Decentralised Q-Learning for Multi-Agent Markov Decision Processes with a Satisfiability Criterion

Add code
Nov 21, 2023
Viaarxiv icon

Actor-Critic or Critic-Actor? A Tale of Two Time Scales

Add code
Oct 10, 2022
Figure 1 for Actor-Critic or Critic-Actor? A Tale of Two Time Scales
Figure 2 for Actor-Critic or Critic-Actor? A Tale of Two Time Scales
Figure 3 for Actor-Critic or Critic-Actor? A Tale of Two Time Scales
Figure 4 for Actor-Critic or Critic-Actor? A Tale of Two Time Scales
Viaarxiv icon

A Concentration Bound for LSPE($λ$)

Add code
Nov 04, 2021
Viaarxiv icon

Concentration of Contractive Stochastic Approximation and Reinforcement Learning

Add code
Jun 27, 2021
Viaarxiv icon

Dynamic social learning under graph constraints

Add code
Jul 08, 2020
Figure 1 for Dynamic social learning under graph constraints
Figure 2 for Dynamic social learning under graph constraints
Figure 3 for Dynamic social learning under graph constraints
Viaarxiv icon

Whittle index based Q-learning for restless bandits with average reward

Add code
Apr 29, 2020
Figure 1 for Whittle index based Q-learning for restless bandits with average reward
Figure 2 for Whittle index based Q-learning for restless bandits with average reward
Figure 3 for Whittle index based Q-learning for restless bandits with average reward
Figure 4 for Whittle index based Q-learning for restless bandits with average reward
Viaarxiv icon

Vector Field Guidance for Convoy Monitoring Using Elliptical Orbits

Add code
Sep 13, 2017
Figure 1 for Vector Field Guidance for Convoy Monitoring Using Elliptical Orbits
Figure 2 for Vector Field Guidance for Convoy Monitoring Using Elliptical Orbits
Figure 3 for Vector Field Guidance for Convoy Monitoring Using Elliptical Orbits
Figure 4 for Vector Field Guidance for Convoy Monitoring Using Elliptical Orbits
Viaarxiv icon

Gradient Estimation with Simultaneous Perturbation and Compressive Sensing

Add code
Jul 26, 2016
Figure 1 for Gradient Estimation with Simultaneous Perturbation and Compressive Sensing
Figure 2 for Gradient Estimation with Simultaneous Perturbation and Compressive Sensing
Figure 3 for Gradient Estimation with Simultaneous Perturbation and Compressive Sensing
Figure 4 for Gradient Estimation with Simultaneous Perturbation and Compressive Sensing
Viaarxiv icon