Picture for Nan He

Nan He

SoftDedup: an Efficient Data Reweighting Method for Speeding Up Language Model Pre-training

Add code
Jul 09, 2024
Viaarxiv icon

TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language Modeling Likewise

Add code
Oct 31, 2023
Viaarxiv icon