Same Pre-training Loss, Better Downstream: Implicit Bias Matters for Language Models

Oct 25, 2022

Paper available on arXiv and OpenReview.
