Picture for Raphaël Baur

Raphaël Baur

Mapping out the Space of Human Feedback for Reinforcement Learning: A Conceptual Framework

Add code
Nov 18, 2024
Viaarxiv icon

RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback

Add code
Aug 08, 2023
Viaarxiv icon