Aligning Language Models Using Follow-up Likelihood as Reward Signal

Add code
Sep 20, 2024
Figure 1 for Aligning Language Models Using Follow-up Likelihood as Reward Signal
Figure 2 for Aligning Language Models Using Follow-up Likelihood as Reward Signal
Figure 3 for Aligning Language Models Using Follow-up Likelihood as Reward Signal
Figure 4 for Aligning Language Models Using Follow-up Likelihood as Reward Signal

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: