Huixin Chen

Batch Normalization Is Blind to the First and Second Derivatives of the Loss

Jun 02, 2022
Via arXiv