Ramki Gummadi

Satisficing Exploration for Deep Reinforcement Learning

Jul 16, 2024
Via arXiv

Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation

May 31, 2024
Via arXiv

A Parametric Class of Approximate Gradient Updates for Policy Optimization

Jun 17, 2022
Via arXiv

Characterizing the Gap Between Actor-Critic and Policy Gradient

Jun 13, 2021
Via arXiv

Variational Rejection Sampling

Apr 05, 2018
Via arXiv