Picture for A. Rupam Mahmood

A. Rupam Mahmood

Deep Policy Gradient Methods Without Batch Updates, Target Networks, or Replay Buffers

Add code
Nov 22, 2024
Viaarxiv icon

Streaming Deep Reinforcement Learning Finally Works

Add code
Oct 18, 2024
Viaarxiv icon

Revisiting Sparse Rewards for Goal-Reaching Reinforcement Learning

Add code
Jul 08, 2024
Viaarxiv icon

Weight Clipping for Deep Continual and Reinforcement Learning

Add code
Jul 01, 2024
Figure 1 for Weight Clipping for Deep Continual and Reinforcement Learning
Figure 2 for Weight Clipping for Deep Continual and Reinforcement Learning
Figure 3 for Weight Clipping for Deep Continual and Reinforcement Learning
Figure 4 for Weight Clipping for Deep Continual and Reinforcement Learning
Viaarxiv icon

Revisiting Constant Negative Rewards for Goal-Reaching Tasks in Robot Learning

Add code
Jun 29, 2024
Viaarxiv icon

More Efficient Randomized Exploration for Reinforcement Learning via Approximate Sampling

Add code
Jun 18, 2024
Viaarxiv icon

Revisiting Scalable Hessian Diagonal Approximations for Applications in Reinforcement Learning

Add code
Jun 05, 2024
Viaarxiv icon

Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation

Add code
May 31, 2024
Viaarxiv icon

Addressing Loss of Plasticity and Catastrophic Forgetting in Continual Learning

Add code
Mar 31, 2024
Viaarxiv icon

MaDi: Learning to Mask Distractions for Generalization in Visual Deep Reinforcement Learning

Add code
Dec 23, 2023
Viaarxiv icon