Picture for Longxiang He

Longxiang He

FOSP: Fine-tuning Offline Safe Policy through World Models

Add code
Jul 06, 2024
Viaarxiv icon

AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization

Add code
May 28, 2024
Viaarxiv icon

DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning

Add code
Oct 09, 2023
Viaarxiv icon