Ping-Chun Hsieh

Efficient Action-Constrained Reinforcement Learning via Acceptance-Rejection Method and Augmented MDPs

Mar 17, 2025
Via arXiv

Imitation Learning of Correlated Policies in Stackelberg Games

Mar 11, 2025
Via arXiv

Plan2Align: Predictive Planning Based Test-Time Preference Alignment in Paragraph-Level Machine Translation

Feb 28, 2025
Via arXiv

Enhancing Offline Model-Based RL via Active Model Selection: A Bayesian Optimization Perspective

Feb 17, 2025
Via arXiv

Diminishing Exploration: A Minimalist Approach to Piecewise Stationary Multi-Armed Bandits

Oct 08, 2024
Via arXiv

Diffusion-Reward Adversarial Imitation Learning

May 25, 2024
Via arXiv

Image Deraining via Self-supervised Reinforcement Learning

Mar 27, 2024
Via arXiv

Offline Imitation of Badminton Player Behavior via Experiential Contexts and Brownian Motion

Mar 19, 2024
Via arXiv

PPO-Clip Attains Global Optimality: Towards Deeper Understandings of Clipping

Dec 19, 2023
Via arXiv

Accelerated Policy Gradient: On the Nesterov Momentum for Reinforcement Learning

Oct 18, 2023
Via arXiv