Picture for Guohua Tang

Guohua Tang

Aligning Language Models Using Follow-up Likelihood as Reward Signal

Add code
Sep 20, 2024
Figure 1 for Aligning Language Models Using Follow-up Likelihood as Reward Signal
Figure 2 for Aligning Language Models Using Follow-up Likelihood as Reward Signal
Figure 3 for Aligning Language Models Using Follow-up Likelihood as Reward Signal
Figure 4 for Aligning Language Models Using Follow-up Likelihood as Reward Signal
Viaarxiv icon

SoftDedup: an Efficient Data Reweighting Method for Speeding Up Language Model Pre-training

Add code
Jul 09, 2024
Viaarxiv icon

TS-Align: A Teacher-Student Collaborative Framework for Scalable Iterative Finetuning of Large Language Models

Add code
May 30, 2024
Viaarxiv icon

xDial-Eval: A Multilingual Open-Domain Dialogue Evaluation Benchmark

Add code
Oct 13, 2023
Viaarxiv icon