Picture for Ping-Chun Hsieh

Ping-Chun Hsieh

Plan2Align: Predictive Planning Based Test-Time Preference Alignment in Paragraph-Level Machine Translation

Add code
Feb 28, 2025
Viaarxiv icon

Enhancing Offline Model-Based RL via Active Model Selection: A Bayesian Optimization Perspective

Add code
Feb 17, 2025
Viaarxiv icon

Diminishing Exploration: A Minimalist Approach to Piecewise Stationary Multi-Armed Bandits

Add code
Oct 08, 2024
Figure 1 for Diminishing Exploration: A Minimalist Approach to Piecewise Stationary Multi-Armed Bandits
Figure 2 for Diminishing Exploration: A Minimalist Approach to Piecewise Stationary Multi-Armed Bandits
Figure 3 for Diminishing Exploration: A Minimalist Approach to Piecewise Stationary Multi-Armed Bandits
Figure 4 for Diminishing Exploration: A Minimalist Approach to Piecewise Stationary Multi-Armed Bandits
Viaarxiv icon

Diffusion-Reward Adversarial Imitation Learning

Add code
May 25, 2024
Figure 1 for Diffusion-Reward Adversarial Imitation Learning
Figure 2 for Diffusion-Reward Adversarial Imitation Learning
Figure 3 for Diffusion-Reward Adversarial Imitation Learning
Figure 4 for Diffusion-Reward Adversarial Imitation Learning
Viaarxiv icon

Image Deraining via Self-supervised Reinforcement Learning

Add code
Mar 27, 2024
Figure 1 for Image Deraining via Self-supervised Reinforcement Learning
Figure 2 for Image Deraining via Self-supervised Reinforcement Learning
Figure 3 for Image Deraining via Self-supervised Reinforcement Learning
Figure 4 for Image Deraining via Self-supervised Reinforcement Learning
Viaarxiv icon

Offline Imitation of Badminton Player Behavior via Experiential Contexts and Brownian Motion

Add code
Mar 19, 2024
Viaarxiv icon

PPO-Clip Attains Global Optimality: Towards Deeper Understandings of Clipping

Add code
Dec 19, 2023
Viaarxiv icon

Accelerated Policy Gradient: On the Nesterov Momentum for Reinforcement Learning

Add code
Oct 18, 2023
Viaarxiv icon

Value-Biased Maximum Likelihood Estimation for Model-based Reinforcement Learning in Discounted Linear MDPs

Add code
Oct 17, 2023
Viaarxiv icon

Towards Human-Like RL: Taming Non-Naturalistic Behavior in Deep RL via Adaptive Behavioral Costs in 3D Games

Add code
Sep 27, 2023
Viaarxiv icon