Picture for Simon Holk

Simon Holk

PREDILECT: Preferences Delineated with Zero-Shot Language-based Reasoning in Reinforcement Learning

Add code
Feb 23, 2024
Viaarxiv icon