Ping-Chun Hsieh

Diminishing Exploration: A Minimalist Approach to Piecewise Stationary Multi-Armed Bandits

Oct 08, 2024

Diffusion-Reward Adversarial Imitation Learning

May 25, 2024

Image Deraining via Self-supervised Reinforcement Learning

Mar 27, 2024

Offline Imitation of Badminton Player Behavior via Experiential Contexts and Brownian Motion

Mar 19, 2024

PPO-Clip Attains Global Optimality: Towards Deeper Understandings of Clipping

Dec 19, 2023

Accelerated Policy Gradient: On the Nesterov Momentum for Reinforcement Learning

Oct 18, 2023

Value-Biased Maximum Likelihood Estimation for Model-based Reinforcement Learning in Discounted Linear MDPs

Oct 17, 2023

Towards Human-Like RL: Taming Non-Naturalistic Behavior in Deep RL via Adaptive Behavioral Costs in 3D Games

Sep 27, 2023

Coordinate Ascent for Off-Policy RL with Global Convergence Guarantees

Dec 10, 2022

Q-Pensieve: Boosting Sample Efficiency of Multi-Objective RL Through Memory Sharing of Q-Snapshots

Dec 06, 2022