Picture for Maryam Fazel

Maryam Fazel

Sub-optimality of the Separation Principle for Quadratic Control from Bilinear Observations

Add code
Apr 15, 2025
Viaarxiv icon

Gating is Weighting: Understanding Gated Linear Attention through In-context Learning

Add code
Apr 06, 2025
Viaarxiv icon

Extragradient Preference Optimization (EGPO): Beyond Last-Iterate Convergence for Nash Learning from Human Feedback

Add code
Mar 11, 2025
Viaarxiv icon

Keeping up with dynamic attackers: Certifying robustness to adaptive online data poisoning

Add code
Feb 23, 2025
Viaarxiv icon

Finite Sample Identification of Partially Observed Bilinear Dynamical Systems

Add code
Jan 13, 2025
Viaarxiv icon

Hybrid Preference Optimization for Alignment: Provably Faster Convergence Rates by Combining Offline Preferences with Online Exploration

Add code
Dec 13, 2024
Viaarxiv icon

Dual Approximation Policy Optimization

Add code
Oct 02, 2024
Figure 1 for Dual Approximation Policy Optimization
Figure 2 for Dual Approximation Policy Optimization
Figure 3 for Dual Approximation Policy Optimization
Figure 4 for Dual Approximation Policy Optimization
Viaarxiv icon

Toward Global Convergence of Gradient EM for Over-Parameterized Gaussian Mixture Models

Add code
Jun 29, 2024
Viaarxiv icon

Offline Multi-task Transfer RL with Representational Penalization

Add code
Feb 19, 2024
Viaarxiv icon

Learning Optimal Tax Design in Nonatomic Congestion Games

Add code
Feb 12, 2024
Viaarxiv icon