Picture for Lotte Weerts

Lotte Weerts

Reinforced Self-Training (ReST) for Language Modeling

Add code
Aug 21, 2023
Viaarxiv icon