Alex Ahern

Reinforced Self-Training (ReST) for Language Modeling

Aug 21, 2023
Via arXiv