Alexander G. Padula

Exploring RL-based LLM Training for Formal Language Tasks with Programmed Rewards

Oct 22, 2024
Via arXiv