Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Generalization to translation shifts: a study in architectures and augmentations

Jul 05, 2022

Suriya Gunasekar

Figure 1 for Generalization to translation shifts: a study in architectures and augmentations

Figure 2 for Generalization to translation shifts: a study in architectures and augmentations

Figure 3 for Generalization to translation shifts: a study in architectures and augmentations

Figure 4 for Generalization to translation shifts: a study in architectures and augmentations

Share this with someone who'll enjoy it:

Abstract:We provide a detailed evaluation of various image classification architectures (convolutional, vision transformer, and fully connected MLP networks) and data augmentation techniques towards generalization to large spacial translation shifts. We make the following observations: (a) In the absence of data augmentation, all architectures, including convolutional networks suffer degradation in performance when evaluated on translated test distributions. Understandably, both the in-distribution accuracy as well as degradation to shifts is significantly worse for non-convolutional architectures. (b) Across all architectures, even a minimal augmentation of $4$ pixel random crop improves the robustness of performance to much larger magnitude shifts of up to $1/4$ of image size ($8$-$16$ pixels) in the test data -- suggesting a form of meta generalization from augmentation. For non-convolutional architectures, while the absolute accuracy is still low, we see dramatic improvements in robustness to large translation shifts. (c) With sufficiently advanced augmentation ($4$ pixel crop+RandAugmentation+Erasing+MixUp) pipeline all architectures can be trained to have competitive performance, both in terms of in-distribution accuracy as well as generalization to large translation shifts.

View paper on

Share this with someone who'll enjoy it:

Title:Generalization to translation shifts: a study in architectures and augmentations

Paper and Code