Picture for Jiheng Zhang

Jiheng Zhang

Minimax Optimality in Contextual Dynamic Pricing with General Valuation Models

Add code
Jun 24, 2024
Viaarxiv icon

RL in Markov Games with Independent Function Approximation: Improved Sample Complexity Bound under the Local Access Model

Add code
Mar 20, 2024
Viaarxiv icon

Stochastic Graph Bandit Learning with Side-Observations

Add code
Aug 29, 2023
Viaarxiv icon

Provably Efficient Learning in Partially Observable Contextual Bandit

Add code
Aug 07, 2023
Viaarxiv icon

Debiasing Recommendation by Learning Identifiable Latent Confounders

Add code
Feb 10, 2023
Viaarxiv icon

Single-Trajectory Distributionally Robust Reinforcement Learning

Add code
Jan 27, 2023
Figure 1 for Single-Trajectory Distributionally Robust Reinforcement Learning
Figure 2 for Single-Trajectory Distributionally Robust Reinforcement Learning
Figure 3 for Single-Trajectory Distributionally Robust Reinforcement Learning
Figure 4 for Single-Trajectory Distributionally Robust Reinforcement Learning
Viaarxiv icon

Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation

Add code
Sep 29, 2022
Figure 1 for Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation
Figure 2 for Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation
Viaarxiv icon

Dual Instrumental Method for Confounded Kernelized Bandits

Add code
Sep 07, 2022
Figure 1 for Dual Instrumental Method for Confounded Kernelized Bandits
Figure 2 for Dual Instrumental Method for Confounded Kernelized Bandits
Figure 3 for Dual Instrumental Method for Confounded Kernelized Bandits
Figure 4 for Dual Instrumental Method for Confounded Kernelized Bandits
Viaarxiv icon

On Private Online Convex Optimization: Optimal Algorithms in $\ell_p$-Geometry and High Dimensional Contextual Bandits

Add code
Jun 16, 2022
Figure 1 for On Private Online Convex Optimization: Optimal Algorithms in $\ell_p$-Geometry and High Dimensional Contextual Bandits
Figure 2 for On Private Online Convex Optimization: Optimal Algorithms in $\ell_p$-Geometry and High Dimensional Contextual Bandits
Figure 3 for On Private Online Convex Optimization: Optimal Algorithms in $\ell_p$-Geometry and High Dimensional Contextual Bandits
Figure 4 for On Private Online Convex Optimization: Optimal Algorithms in $\ell_p$-Geometry and High Dimensional Contextual Bandits
Viaarxiv icon

Generalized Linear Bandits with Local Differential Privacy

Add code
Jun 07, 2021
Figure 1 for Generalized Linear Bandits with Local Differential Privacy
Figure 2 for Generalized Linear Bandits with Local Differential Privacy
Figure 3 for Generalized Linear Bandits with Local Differential Privacy
Viaarxiv icon