Picture for Guoqing Jiang

Guoqing Jiang

SKDBERT: Compressing BERT via Stochastic Knowledge Distillation

Add code
Nov 29, 2022
Viaarxiv icon

Understanding Why Neural Networks Generalize Well Through GSNR of Parameters

Add code
Feb 24, 2020
Figure 1 for Understanding Why Neural Networks Generalize Well Through GSNR of Parameters
Figure 2 for Understanding Why Neural Networks Generalize Well Through GSNR of Parameters
Figure 3 for Understanding Why Neural Networks Generalize Well Through GSNR of Parameters
Figure 4 for Understanding Why Neural Networks Generalize Well Through GSNR of Parameters
Viaarxiv icon