Alexander G. Padula

Exploring RL-based LLM Training for Formal Language Tasks with Programmed Rewards

Oct 22, 2024
Via arXiv