Picture for Niao He

Niao He

Can RLHF be More Efficient with Imperfect Reward Models? A Policy Coverage Perspective

Add code
Feb 26, 2025
Viaarxiv icon

Poincaré Inequality for Local Log-Polyak-Lojasiewicz Measures : Non-asymptotic Analysis in Low-temperature Regime

Add code
Feb 12, 2025
Viaarxiv icon

On the Crucial Role of Initialization for Matrix Factorization

Add code
Oct 24, 2024
Figure 1 for On the Crucial Role of Initialization for Matrix Factorization
Figure 2 for On the Crucial Role of Initialization for Matrix Factorization
Figure 3 for On the Crucial Role of Initialization for Matrix Factorization
Figure 4 for On the Crucial Role of Initialization for Matrix Factorization
Viaarxiv icon

Implicit Regularization of Sharpness-Aware Minimization for Scale-Invariant Problems

Add code
Oct 18, 2024
Viaarxiv icon

From Gradient Clipping to Normalization for Heavy Tailed SGD

Add code
Oct 17, 2024
Viaarxiv icon

Exploiting Approximate Symmetry for Efficient Multi-Agent Reinforcement Learning

Add code
Aug 27, 2024
Viaarxiv icon

Multi-level Monte-Carlo Gradient Methods for Stochastic Optimization with Biased Oracles

Add code
Aug 20, 2024
Viaarxiv icon

Independent Policy Mirror Descent for Markov Potential Games: Scaling to Large Number of Players

Add code
Aug 15, 2024
Viaarxiv icon

Complexity of Minimizing Projected-Gradient-Dominated Functions with Stochastic First-order Oracles

Add code
Aug 03, 2024
Figure 1 for Complexity of Minimizing Projected-Gradient-Dominated Functions with Stochastic First-order Oracles
Viaarxiv icon

Learning to Steer Markovian Agents under Model Uncertainty

Add code
Jul 14, 2024
Viaarxiv icon