Picture for Boxiang Lyu

Boxiang Lyu

Model-based Offline RL via Robust Value-Aware Model Learning with Implicitly Differentiable Adaptive Weighting

Add code
Mar 09, 2026
Viaarxiv icon

An Instrumental Value for Data Production and its Application to Data Pricing

Add code
Dec 24, 2024
Viaarxiv icon

Traversing Pareto Optimal Policies: Provably Efficient Multi-Objective Reinforcement Learning

Add code
Jul 24, 2024
Figure 1 for Traversing Pareto Optimal Policies: Provably Efficient Multi-Objective Reinforcement Learning
Figure 2 for Traversing Pareto Optimal Policies: Provably Efficient Multi-Objective Reinforcement Learning
Figure 3 for Traversing Pareto Optimal Policies: Provably Efficient Multi-Objective Reinforcement Learning
Figure 4 for Traversing Pareto Optimal Policies: Provably Efficient Multi-Objective Reinforcement Learning
Viaarxiv icon

Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning

Add code
Jul 10, 2024
Figure 1 for Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning
Viaarxiv icon

Addressing Budget Allocation and Revenue Allocation in Data Market Environments Using an Adaptive Sampling Algorithm

Add code
Jun 05, 2023
Figure 1 for Addressing Budget Allocation and Revenue Allocation in Data Market Environments Using an Adaptive Sampling Algorithm
Figure 2 for Addressing Budget Allocation and Revenue Allocation in Data Market Environments Using an Adaptive Sampling Algorithm
Figure 3 for Addressing Budget Allocation and Revenue Allocation in Data Market Environments Using an Adaptive Sampling Algorithm
Figure 4 for Addressing Budget Allocation and Revenue Allocation in Data Market Environments Using an Adaptive Sampling Algorithm
Viaarxiv icon

Pairwise Ranking Losses of Click-Through Rates Prediction for Welfare Maximization in Ad Auctions

Add code
Jun 01, 2023
Figure 1 for Pairwise Ranking Losses of Click-Through Rates Prediction for Welfare Maximization in Ad Auctions
Figure 2 for Pairwise Ranking Losses of Click-Through Rates Prediction for Welfare Maximization in Ad Auctions
Figure 3 for Pairwise Ranking Losses of Click-Through Rates Prediction for Welfare Maximization in Ad Auctions
Figure 4 for Pairwise Ranking Losses of Click-Through Rates Prediction for Welfare Maximization in Ad Auctions
Viaarxiv icon

A Reinforcement Learning Approach in Multi-Phase Second-Price Auction Design

Add code
Oct 19, 2022
Figure 1 for A Reinforcement Learning Approach in Multi-Phase Second-Price Auction Design
Viaarxiv icon

One Policy is Enough: Parallel Exploration with a Single Policy is Minimax Optimal for Reward-Free Reinforcement Learning

Add code
May 31, 2022
Viaarxiv icon

Pessimism meets VCG: Learning Dynamic Mechanism Design via Offline Reinforcement Learning

Add code
May 05, 2022
Viaarxiv icon

Personalized Federated Learning with Multiple Known Clusters

Add code
Apr 28, 2022
Figure 1 for Personalized Federated Learning with Multiple Known Clusters
Figure 2 for Personalized Federated Learning with Multiple Known Clusters
Figure 3 for Personalized Federated Learning with Multiple Known Clusters
Figure 4 for Personalized Federated Learning with Multiple Known Clusters
Viaarxiv icon