Vaneet Aggarwal

On The Global Convergence Of Online RLHF With Neural Parametrization

Oct 21, 2024

Dynamic Obstacle Avoidance through Uncertainty-Based Adaptive Planning with Diffusion

Sep 25, 2024

Last-Iterate Convergence of General Parameterized Policies in Constrained MDPs

Aug 21, 2024

A Scalable Quantum Non-local Neural Network for Image Classification

Jul 26, 2024

An Accelerated Multi-level Monte Carlo Approach for Average Reward Reinforcement Learning with General Policy Parametrization

Jul 26, 2024

Variational Offline Multi-agent Skill Discovery

May 26, 2024

Sample-Efficient Constrained Reinforcement Learning with General Parameterization

May 17, 2024

Stochastic Q-learning for Large Discrete Action Spaces

May 16, 2024

Federated Combinatorial Multi-Agent Multi-Armed Bandits

May 09, 2024

Closing the Gap: Achieving Global Convergence (Last Iterate) of Actor-Critic under Markovian Sampling with Neural Network Parametrization

May 06, 2024