Picture for Jiheng Zhang

Jiheng Zhang

Make Optimization Once and for All with Fine-grained Guidance

Add code
Mar 14, 2025
Viaarxiv icon

Parameter-Adaptive Dynamic Pricing

Add code
Mar 02, 2025
Viaarxiv icon

Minimax Optimality in Contextual Dynamic Pricing with General Valuation Models

Add code
Jun 24, 2024
Viaarxiv icon

RL in Markov Games with Independent Function Approximation: Improved Sample Complexity Bound under the Local Access Model

Add code
Mar 20, 2024
Viaarxiv icon

Stochastic Graph Bandit Learning with Side-Observations

Add code
Aug 29, 2023
Viaarxiv icon

Provably Efficient Learning in Partially Observable Contextual Bandit

Add code
Aug 07, 2023
Viaarxiv icon

Debiasing Recommendation by Learning Identifiable Latent Confounders

Add code
Feb 10, 2023
Viaarxiv icon

Single-Trajectory Distributionally Robust Reinforcement Learning

Add code
Jan 27, 2023
Figure 1 for Single-Trajectory Distributionally Robust Reinforcement Learning
Figure 2 for Single-Trajectory Distributionally Robust Reinforcement Learning
Figure 3 for Single-Trajectory Distributionally Robust Reinforcement Learning
Figure 4 for Single-Trajectory Distributionally Robust Reinforcement Learning
Viaarxiv icon

Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation

Add code
Sep 29, 2022
Figure 1 for Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation
Figure 2 for Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation
Viaarxiv icon

Dual Instrumental Method for Confounded Kernelized Bandits

Add code
Sep 07, 2022
Figure 1 for Dual Instrumental Method for Confounded Kernelized Bandits
Figure 2 for Dual Instrumental Method for Confounded Kernelized Bandits
Figure 3 for Dual Instrumental Method for Confounded Kernelized Bandits
Figure 4 for Dual Instrumental Method for Confounded Kernelized Bandits
Viaarxiv icon