Picture for Ieshan Vaidya

Ieshan Vaidya

MixUp Training Leads to Reduced Overfitting and Improved Calibration for the Transformer Architecture

Add code
Feb 22, 2021
Figure 1 for MixUp Training Leads to Reduced Overfitting and Improved Calibration for the Transformer Architecture
Figure 2 for MixUp Training Leads to Reduced Overfitting and Improved Calibration for the Transformer Architecture
Figure 3 for MixUp Training Leads to Reduced Overfitting and Improved Calibration for the Transformer Architecture
Figure 4 for MixUp Training Leads to Reduced Overfitting and Improved Calibration for the Transformer Architecture
Viaarxiv icon