Picture for Zhengling Qi

Zhengling Qi

Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement Learning

Add code
Dec 08, 2024
Viaarxiv icon

A Tale of Two Cities: Pessimism and Opportunism in Offline Dynamic Pricing

Add code
Nov 12, 2024
Viaarxiv icon

Learning Robust Treatment Rules for Censored Data

Add code
Aug 17, 2024
Viaarxiv icon

Distributional Off-policy Evaluation with Bellman Residual Minimization

Add code
Feb 02, 2024
Viaarxiv icon

Robust Offline Policy Evaluation and Optimization with Heavy-Tailed Rewards

Add code
Oct 28, 2023
Viaarxiv icon

Off-policy Evaluation in Doubly Inhomogeneous Environments

Add code
Jun 14, 2023
Viaarxiv icon

A Policy Gradient Method for Confounded POMDPs

Add code
May 26, 2023
Figure 1 for A Policy Gradient Method for Confounded POMDPs
Figure 2 for A Policy Gradient Method for Confounded POMDPs
Figure 3 for A Policy Gradient Method for Confounded POMDPs
Figure 4 for A Policy Gradient Method for Confounded POMDPs
Viaarxiv icon

Sequential Knockoffs for Variable Selection in Reinforcement Learning

Add code
Mar 24, 2023
Viaarxiv icon

Personalized Pricing with Invalid Instrumental Variables: Identification, Estimation, and Policy Learning

Add code
Feb 24, 2023
Figure 1 for Personalized Pricing with Invalid Instrumental Variables: Identification, Estimation, and Policy Learning
Figure 2 for Personalized Pricing with Invalid Instrumental Variables: Identification, Estimation, and Policy Learning
Figure 3 for Personalized Pricing with Invalid Instrumental Variables: Identification, Estimation, and Policy Learning
Figure 4 for Personalized Pricing with Invalid Instrumental Variables: Identification, Estimation, and Policy Learning
Viaarxiv icon

PASTA: Pessimistic Assortment Optimization

Add code
Feb 08, 2023
Viaarxiv icon