Picture for Chengqian Gao

Chengqian Gao

Hard-Thresholding Meets Evolution Strategies in Reinforcement Learning

Add code
May 02, 2024
Viaarxiv icon

Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation

Add code
Oct 19, 2022
Figure 1 for Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation
Figure 2 for Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation
Figure 3 for Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation
Figure 4 for Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation
Viaarxiv icon

Value Penalized Q-Learning for Recommender Systems

Add code
Oct 15, 2021
Figure 1 for Value Penalized Q-Learning for Recommender Systems
Figure 2 for Value Penalized Q-Learning for Recommender Systems
Figure 3 for Value Penalized Q-Learning for Recommender Systems
Figure 4 for Value Penalized Q-Learning for Recommender Systems
Viaarxiv icon