Picture for Xiyue Peng

Xiyue Peng

Enhancing Safety in Reinforcement Learning with Human Feedback via Rectified Policy Optimization

Add code
Oct 25, 2024
Viaarxiv icon

Adversarially Trained Actor Critic for offline CMDPs

Add code
Jan 01, 2024
Viaarxiv icon