Beyond Reward: Offline Preference-guided Policy Optimization

Add code
May 25, 2023
Figure 1 for Beyond Reward: Offline Preference-guided Policy Optimization
Figure 2 for Beyond Reward: Offline Preference-guided Policy Optimization
Figure 3 for Beyond Reward: Offline Preference-guided Policy Optimization
Figure 4 for Beyond Reward: Offline Preference-guided Policy Optimization

Share this with someone who'll enjoy it:

View paper onarxiv iconopen_review iconOpenReview

Share this with someone who'll enjoy it: