Picture for Mudit Gaur

Mudit Gaur

On The Global Convergence Of Online RLHF With Neural Parametrization

Add code
Oct 21, 2024
Viaarxiv icon

Closing the Gap: Achieving Global Convergence (Last Iterate) of Actor-Critic under Markovian Sampling with Neural Network Parametrization

Add code
May 06, 2024
Viaarxiv icon

On the Global Convergence of Natural Actor-Critic with Two-layer Neural Network Parametrization

Add code
Jun 18, 2023
Viaarxiv icon

On the Global Convergence of Fitted Q-Iteration with Two-layer Neural Network Parametrization

Add code
Nov 14, 2022
Viaarxiv icon