Picture for Maryam Fazel

Maryam Fazel

Extragradient Preference Optimization (EGPO): Beyond Last-Iterate Convergence for Nash Learning from Human Feedback

Add code
Mar 11, 2025
Viaarxiv icon

Keeping up with dynamic attackers: Certifying robustness to adaptive online data poisoning

Add code
Feb 23, 2025
Viaarxiv icon

Finite Sample Identification of Partially Observed Bilinear Dynamical Systems

Add code
Jan 13, 2025
Viaarxiv icon

Hybrid Preference Optimization for Alignment: Provably Faster Convergence Rates by Combining Offline Preferences with Online Exploration

Add code
Dec 13, 2024
Viaarxiv icon

Dual Approximation Policy Optimization

Add code
Oct 02, 2024
Figure 1 for Dual Approximation Policy Optimization
Figure 2 for Dual Approximation Policy Optimization
Figure 3 for Dual Approximation Policy Optimization
Figure 4 for Dual Approximation Policy Optimization
Viaarxiv icon

Toward Global Convergence of Gradient EM for Over-Parameterized Gaussian Mixture Models

Add code
Jun 29, 2024
Viaarxiv icon

Offline Multi-task Transfer RL with Representational Penalization

Add code
Feb 19, 2024
Viaarxiv icon

Learning Optimal Tax Design in Nonatomic Congestion Games

Add code
Feb 12, 2024
Viaarxiv icon

Initializing Services in Interactive ML Systems for Diverse Users

Add code
Dec 19, 2023
Viaarxiv icon

A/B Testing and Best-arm Identification for Linear Bandits with Robustness to Non-stationarity

Add code
Jul 27, 2023
Viaarxiv icon