Picture for Lei Ying

Lei Ying

Joint Optimal Transport and Embedding for Network Alignment

Add code
Feb 26, 2025
Viaarxiv icon

Achieving O(1/N) Optimality Gap in Restless Bandits through Diffusion Approximation

Add code
Oct 19, 2024
Viaarxiv icon

Zeroth-Order Policy Gradient for Reinforcement Learning from Human Feedback without Reward Inference

Add code
Sep 25, 2024
Viaarxiv icon

Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence

Add code
May 23, 2024
Figure 1 for Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence
Figure 2 for Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence
Figure 3 for Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence
Figure 4 for Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence
Viaarxiv icon

Learning-Based Pricing and Matching for Two-Sided Queues

Add code
Mar 17, 2024
Figure 1 for Learning-Based Pricing and Matching for Two-Sided Queues
Figure 2 for Learning-Based Pricing and Matching for Two-Sided Queues
Figure 3 for Learning-Based Pricing and Matching for Two-Sided Queues
Figure 4 for Learning-Based Pricing and Matching for Two-Sided Queues
Viaarxiv icon

Cost Aware Best Arm Identification

Add code
Feb 26, 2024
Viaarxiv icon

Safe Reinforcement Learning with Instantaneous Constraints: The Role of Aggressive Exploration

Add code
Dec 22, 2023
Figure 1 for Safe Reinforcement Learning with Instantaneous Constraints: The Role of Aggressive Exploration
Figure 2 for Safe Reinforcement Learning with Instantaneous Constraints: The Role of Aggressive Exploration
Figure 3 for Safe Reinforcement Learning with Instantaneous Constraints: The Role of Aggressive Exploration
Figure 4 for Safe Reinforcement Learning with Instantaneous Constraints: The Role of Aggressive Exploration
Viaarxiv icon

Model-Free, Regret-Optimal Best Policy Identification in Online CMDPs

Add code
Sep 27, 2023
Figure 1 for Model-Free, Regret-Optimal Best Policy Identification in Online CMDPs
Figure 2 for Model-Free, Regret-Optimal Best Policy Identification in Online CMDPs
Figure 3 for Model-Free, Regret-Optimal Best Policy Identification in Online CMDPs
Figure 4 for Model-Free, Regret-Optimal Best Policy Identification in Online CMDPs
Viaarxiv icon

Fast and Regret Optimal Best Arm Identification: Fundamental Limits and Low-Complexity Algorithms

Add code
Sep 01, 2023
Viaarxiv icon

Reconstructing Graph Diffusion History from a Single Snapshot

Add code
Jun 04, 2023
Viaarxiv icon