GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language Model

Add code
Jun 11, 2023
Figure 1 for GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language Model
Figure 2 for GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language Model
Figure 3 for GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language Model
Figure 4 for GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language Model

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: