Shalabh Bhatnagar

Gradient-Driven 3D Segmentation and Affordance Transfer in Gaussian Splatting Using 2D Masks

Sep 18, 2024
Via arXiv

Critic-Actor for Average Reward MDPs with Function Approximation: A Finite-Time Analysis

Feb 02, 2024
Via arXiv

Approximate Linear Programming and Decentralized Policy Improvement in Cooperative Multi-agent Markov Decision Processes

Nov 20, 2023
Via arXiv

Finite Time Analysis of Constrained Actor Critic and Constrained Natural Actor Critic Algorithms

Oct 25, 2023
Via arXiv

The Reinforce Policy Gradient Algorithm Revisited

Oct 08, 2023
Via arXiv

Off-Policy Average Reward Actor-Critic with Deterministic Policy Search

May 20, 2023
Via arXiv

A Framework for Provably Stable and Consistent Training of Deep Feedforward Networks

May 20, 2023
Via arXiv

A Cubic-regularized Policy Newton Algorithm for Reinforcement Learning

Apr 21, 2023
Via arXiv

n-Step Temporal Difference Learning with Optimal n

Mar 13, 2023
Via arXiv

Generalized Simultaneous Perturbation Stochastic Approximation with Reduced Estimator Bias

Dec 20, 2022
Via arXiv