Picture for Asuman Ozdaglar

Asuman Ozdaglar

Last-Iterate Convergence of Payoff-Based Independent Learning in Zero-Sum Stochastic Games

Add code
Sep 02, 2024
Viaarxiv icon

A Policy-Gradient Approach to Solving Imperfect-Information Games with Iterate Convergence

Add code
Aug 01, 2024
Figure 1 for A Policy-Gradient Approach to Solving Imperfect-Information Games with Iterate Convergence
Figure 2 for A Policy-Gradient Approach to Solving Imperfect-Information Games with Iterate Convergence
Figure 3 for A Policy-Gradient Approach to Solving Imperfect-Information Games with Iterate Convergence
Viaarxiv icon

Finite-Sample Guarantees for Best-Response Learning Dynamics in Zero-Sum Matrix Games

Add code
Jul 29, 2024
Viaarxiv icon

LiteEFG: An Efficient Python Library for Solving Extensive-form Games

Add code
Jul 29, 2024
Viaarxiv icon

A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback

Add code
May 20, 2024
Viaarxiv icon

Uniformly Stable Algorithms for Adversarial Training and Beyond

Add code
May 03, 2024
Viaarxiv icon

Principled RLHF from Heterogeneous Feedback via Personalization and Preference Aggregation

Add code
Apr 30, 2024
Figure 1 for Principled RLHF from Heterogeneous Feedback via Personalization and Preference Aggregation
Figure 2 for Principled RLHF from Heterogeneous Feedback via Personalization and Preference Aggregation
Viaarxiv icon

Do LLM Agents Have Regret? A Case Study in Online Learning and Games

Add code
Mar 25, 2024
Viaarxiv icon

Matching of Users and Creators in Two-Sided Markets with Departures

Add code
Jan 17, 2024
Viaarxiv icon

Two-Timescale Q-Learning with Function Approximation in Zero-Sum Stochastic Games

Add code
Dec 08, 2023
Viaarxiv icon