Picture for Raghuram Bharadwaj Diddigi

Raghuram Bharadwaj Diddigi

Neural Network Compatible Off-Policy Natural Actor-Critic Algorithm

Add code
Oct 19, 2021
Figure 1 for Neural Network Compatible Off-Policy Natural Actor-Critic Algorithm
Figure 2 for Neural Network Compatible Off-Policy Natural Actor-Critic Algorithm
Figure 3 for Neural Network Compatible Off-Policy Natural Actor-Critic Algorithm
Figure 4 for Neural Network Compatible Off-Policy Natural Actor-Critic Algorithm
Viaarxiv icon

Attention Actor-Critic algorithm for Multi-Agent Constrained Co-operative Reinforcement Learning

Add code
Jan 07, 2021
Figure 1 for Attention Actor-Critic algorithm for Multi-Agent Constrained Co-operative Reinforcement Learning
Figure 2 for Attention Actor-Critic algorithm for Multi-Agent Constrained Co-operative Reinforcement Learning
Figure 3 for Attention Actor-Critic algorithm for Multi-Agent Constrained Co-operative Reinforcement Learning
Figure 4 for Attention Actor-Critic algorithm for Multi-Agent Constrained Co-operative Reinforcement Learning
Viaarxiv icon

A Convergent Off-Policy Temporal Difference Algorithm

Add code
Nov 13, 2019
Figure 1 for A Convergent Off-Policy Temporal Difference Algorithm
Figure 2 for A Convergent Off-Policy Temporal Difference Algorithm
Figure 3 for A Convergent Off-Policy Temporal Difference Algorithm
Figure 4 for A Convergent Off-Policy Temporal Difference Algorithm
Viaarxiv icon

Solution of Two-Player Zero-Sum Game by Successive Relaxation

Add code
Jun 16, 2019
Figure 1 for Solution of Two-Player Zero-Sum Game by Successive Relaxation
Viaarxiv icon

Second Order Value Iteration in Reinforcement Learning

Add code
May 10, 2019
Figure 1 for Second Order Value Iteration in Reinforcement Learning
Viaarxiv icon

Successive Over Relaxation Q-Learning

Add code
Mar 15, 2019
Figure 1 for Successive Over Relaxation Q-Learning
Figure 2 for Successive Over Relaxation Q-Learning
Figure 3 for Successive Over Relaxation Q-Learning
Figure 4 for Successive Over Relaxation Q-Learning
Viaarxiv icon

An Online Sample Based Method for Mode Estimation using ODE Analysis of Stochastic Approximation Algorithms

Add code
Feb 11, 2019
Figure 1 for An Online Sample Based Method for Mode Estimation using ODE Analysis of Stochastic Approximation Algorithms
Figure 2 for An Online Sample Based Method for Mode Estimation using ODE Analysis of Stochastic Approximation Algorithms
Viaarxiv icon

Novel Sensor Scheduling Scheme for Intruder Tracking in Energy Efficient Sensor Networks

Add code
Feb 24, 2018
Figure 1 for Novel Sensor Scheduling Scheme for Intruder Tracking in Energy Efficient Sensor Networks
Figure 2 for Novel Sensor Scheduling Scheme for Intruder Tracking in Energy Efficient Sensor Networks
Viaarxiv icon

A unified decision making framework for supply and demand management in microgrid networks

Add code
Nov 14, 2017
Figure 1 for A unified decision making framework for supply and demand management in microgrid networks
Figure 2 for A unified decision making framework for supply and demand management in microgrid networks
Figure 3 for A unified decision making framework for supply and demand management in microgrid networks
Figure 4 for A unified decision making framework for supply and demand management in microgrid networks
Viaarxiv icon

Multi-Agent Q-Learning for Minimizing Demand-Supply Power Deficit in Microgrids

Add code
Aug 28, 2017
Figure 1 for Multi-Agent Q-Learning for Minimizing Demand-Supply Power Deficit in Microgrids
Figure 2 for Multi-Agent Q-Learning for Minimizing Demand-Supply Power Deficit in Microgrids
Viaarxiv icon