Picture for Oscar R. Navarrete-Parra

Oscar R. Navarrete-Parra

Aligning a medium-size GPT model in English to a small closed domain in Spanish using reinforcement learning

Add code
Apr 03, 2023
Viaarxiv icon