Picture for Alex Hutcheson

Alex Hutcheson

PERL: Parameter Efficient Reinforcement Learning from Human Feedback

Add code
Mar 15, 2024
Viaarxiv icon