Perceptual distances between images, as measured in the space of pre-trained deep features, have outperformed prior low-level, pixel-based metrics at assessing image similarity. While the capabilities of older and less accurate models such as AlexNet and VGG to capture perceptual similarity are well known, modern and more accurate models are less studied. First, we observe a surprising inverse correlation between ImageNet accuracy and Perceptual Scores of modern networks such as ResNets, EfficientNets, and Vision Transformers: that is, better classifiers achieve worse Perceptual Scores. Then, we perform a large-scale study and examine the ImageNet accuracy/Perceptual Score relationship as we vary depth, width, number of training steps, weight decay, label smoothing, and dropout. Higher accuracy improves Perceptual Score up to a certain point, but we uncover a Pareto frontier between accuracy and Perceptual Score in the mid-to-high accuracy regime. We explore this relationship further using distortion invariance, spatial frequency sensitivity, and alternative perceptual functions. Interestingly, we discover shallow ResNets, trained only on ImageNet for fewer than 5 epochs, whose emergent Perceptual Score matches that of the best prior networks trained directly on supervised human perceptual judgements.
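To make the notion of a perceptual distance in pre-trained deep feature space concrete, the sketch below computes an unweighted, LPIPS-style distance between two images. The choice of backbone (torchvision AlexNet), the tapped ReLU layers, and the helper names are illustrative assumptions, not the paper's exact Perceptual Score pipeline (which evaluates feature distances against human perceptual judgements); LPIPS additionally learns per-channel linear weights on top of such features.

```python
# A minimal sketch (not the paper's implementation) of a deep perceptual
# distance: compare two images in the feature space of a pre-trained classifier.
import torch
import torchvision.models as models
import torchvision.transforms.functional as TF


def _preprocess(img):
    """PIL image or HxWx3 uint8 array -> normalized [1, 3, H, W] tensor."""
    x = TF.to_tensor(img).unsqueeze(0)  # values in [0, 1]
    return TF.normalize(x, mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225])


def deep_perceptual_distance(img_a, img_b, layer_ids=(1, 4, 7, 9, 11)):
    """Unweighted LPIPS-style distance using AlexNet conv features.

    layer_ids index the ReLU outputs of AlexNet's `features` stack; this
    particular choice is an assumption for illustration.
    """
    net = models.alexnet(weights=models.AlexNet_Weights.IMAGENET1K_V1).features.eval()
    dist = torch.zeros(())
    with torch.no_grad():
        x, y = _preprocess(img_a), _preprocess(img_b)
        for i, layer in enumerate(net):
            x, y = layer(x), layer(y)
            if i in layer_ids:
                # Unit-normalize each spatial feature vector across channels,
                # then average the squared difference over space and channels.
                xn = x / (x.norm(dim=1, keepdim=True) + 1e-10)
                yn = y / (y.norm(dim=1, keepdim=True) + 1e-10)
                dist = dist + ((xn - yn) ** 2).mean()
    return dist.item()
```

Identical inputs give a distance near zero, and larger values indicate greater dissimilarity in feature space; swapping the AlexNet backbone for a more accurate classifier in this kind of distance is exactly the comparison the abstract describes.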