Picture for Annie S Chen

Annie S Chen

RLVF: Learning from Verbal Feedback without Overgeneralization

Add code
Feb 16, 2024
Viaarxiv icon