Picture for Michael I. Jordan

Michael I. Jordan

Reduced-Rank Multi-objective Policy Learning and Optimization

Add code
Apr 29, 2024
Viaarxiv icon

Collaborative Heterogeneous Causal Inference Beyond Meta-analysis

Add code
Apr 24, 2024
Viaarxiv icon

Data-Adaptive Tradeoffs among Multiple Risks in Distribution-Free Prediction

Add code
Mar 28, 2024
Figure 1 for Data-Adaptive Tradeoffs among Multiple Risks in Distribution-Free Prediction
Figure 2 for Data-Adaptive Tradeoffs among Multiple Risks in Distribution-Free Prediction
Figure 3 for Data-Adaptive Tradeoffs among Multiple Risks in Distribution-Free Prediction
Figure 4 for Data-Adaptive Tradeoffs among Multiple Risks in Distribution-Free Prediction
Viaarxiv icon

AutoEval Done Right: Using Synthetic Data for Model Evaluation

Add code
Mar 09, 2024
Figure 1 for AutoEval Done Right: Using Synthetic Data for Model Evaluation
Figure 2 for AutoEval Done Right: Using Synthetic Data for Model Evaluation
Figure 3 for AutoEval Done Right: Using Synthetic Data for Model Evaluation
Viaarxiv icon

Incentivized Learning in Principal-Agent Bandit Games

Add code
Mar 06, 2024
Figure 1 for Incentivized Learning in Principal-Agent Bandit Games
Figure 2 for Incentivized Learning in Principal-Agent Bandit Games
Figure 3 for Incentivized Learning in Principal-Agent Bandit Games
Figure 4 for Incentivized Learning in Principal-Agent Bandit Games
Viaarxiv icon

Iterative Data Smoothing: Mitigating Reward Overfitting and Overoptimization in RLHF

Add code
Jan 29, 2024
Viaarxiv icon

Towards Optimal Statistical Watermarking

Add code
Dec 13, 2023
Viaarxiv icon

A Quadratic Speedup in Finding Nash Equilibria of Quantum Zero-Sum Games

Add code
Nov 17, 2023
Viaarxiv icon

Adaptive, Doubly Optimal No-Regret Learning in Strongly Monotone and Exp-Concave Games with Gradient Feedback

Add code
Oct 24, 2023
Viaarxiv icon

A Specialized Semismooth Newton Method for Kernel-Based Optimal Transport

Add code
Oct 21, 2023
Viaarxiv icon