SADT: Combining Sharpness-Aware Minimization with Self-Distillation for Improved Model Generalization

Add code
Nov 01, 2022

Share this with someone who'll enjoy it:

View paper onarxiv iconopen_review iconOpenReview

Share this with someone who'll enjoy it: