Minbiao Han

No-Regret Learning of Nash Equilibrium for Black-Box Games via Gaussian Processes

May 14, 2024

Minbiao Han, Fengxue Zhang, Yuxin Chen

Abstract: This paper investigates the challenge of learning in black-box games, where the underlying utility function is unknown to all of the agents. While there is an extensive body of literature on the theoretical analysis of algorithms for computing the Nash equilibrium with complete information about the game, studies on Nash equilibrium in black-box games are less common. In this paper, we focus on learning the Nash equilibrium when the only available information about an agent's payoff comes in the form of empirical queries. We provide a no-regret learning algorithm that utilizes Gaussian processes to identify the equilibrium in such games. Our approach not only ensures a theoretical convergence rate but also demonstrates effectiveness across a diverse collection of games through experimental validation.

* 40th Conference on Uncertainty in Artificial Intelligence (UAI 2024)

Learning in Online Principal-Agent Interactions: The Power of Menus

Dec 28, 2023

Minbiao Han, Michael Albert, Haifeng Xu

Abstract: We study a ubiquitous learning challenge in online principal-agent problems during which the principal learns the agent's private information from the agent's revealed preferences in historical interactions. This paradigm includes important special cases such as pricing and contract design, which have been widely studied in recent literature. However, existing work considers the case where the principal can only choose a single strategy at every round to interact with the agent and then observe the agent's revealed preference through their actions. In this paper, we extend this line of study to allow the principal to offer a menu of strategies to the agent and learn additionally from observing the agent's selection from the menu. We provide a thorough investigation of several online principal-agent problem settings and characterize their sample complexities, accompanied by the corresponding algorithms we have developed. We instantiate this paradigm in several important design problems, including Stackelberg (security) games, contract design, and information design. Finally, we also explore the connection between our findings and existing results about online learning in Stackelberg games, and we offer a solution that can overcome a key hard instance of Peng et al. (2019).

* AAAI Conference on Artificial Intelligence (AAAI 2024)

On the Effect of Defections in Federated Learning and How to Prevent Them

Nov 28, 2023

Minbiao Han, Kumar Kshitij Patel, Han Shao, Lingxiao Wang

Abstract: Federated learning is a machine learning protocol that enables a large population of agents to collaborate over multiple rounds to produce a single consensus model. There are several federated learning applications where agents may choose to defect permanently, essentially withdrawing from the collaboration, if they are content with their instantaneous model in that round. This work demonstrates the detrimental impact of such defections on the final model's robustness and ability to generalize. We also show that current federated optimization algorithms fail to disincentivize these harmful defections. We introduce a novel optimization algorithm with theoretical guarantees to prevent defections while ensuring asymptotic convergence to an effective solution for all participating agents. We also provide numerical experiments to corroborate our findings and demonstrate the effectiveness of our algorithm.

A Data-Centric Online Market for Machine Learning: From Discovery to Pricing

Oct 27, 2023

Minbiao Han, Jonathan Light, Steven Xia, Sainyam Galhotra, Raul Castro Fernandez, Haifeng Xu

Abstract: Data fuels machine learning (ML): rich, high-quality training data is essential to the success of ML. However, important challenges remain in transforming ML from a race among a few large corporations into an accessible technology that serves the data analysis requests of numerous ordinary users. One gap we observed is that many ML users can benefit from new data that other data owners possess, whereas these data owners sit on piles of data without knowing who can benefit from it. This gap creates the opportunity for building an online market that can automatically connect supply with demand. While online matching markets are prevalent (e.g., ride-hailing systems), designing a data-centric market for ML poses many unprecedented challenges. This paper develops new techniques to tackle two core challenges in designing such a market: (a) to efficiently match demand with supply, we design an algorithm to automatically discover useful data for any ML task from a pool of thousands of datasets, achieving high-quality matching between ML models and data; (b) to encourage market participation of ML users without much ML expertise, we design a new pricing mechanism for selling data-augmented ML models. Furthermore, our market is designed to be API-compatible with existing online ML markets like Vertex AI and Sagemaker, making it easy to use while providing better results due to joint data and model search. We envision that the synergy of our data and model discovery algorithm and pricing mechanism will be an important step towards building a new data-centric online market that serves ML users effectively.
