The appearance of the double-descent risk phenomenon has attracted growing interest in the machine learning and statistics communities, as it challenges the long-standing intuition behind the U-shaped train-test error curve. Motivated by Rissanen's minimum description length (MDL), Balasubramanian's Occam's Razor, and Amari's information geometry, we investigate how the logarithm of the model volume, $\log V$, extends the intuition behind the AIC and BIC model selection criteria. We find that for the particular model classes of isotropic linear regression, statistical lattices, and the stochastic perceptron unit, the $\log V$ term can be decomposed into a sum of distinct components. These components extend the notion of model complexity inherent in AIC and BIC, and are driven by new, albeit intuitive, notions of (i) model richness and (ii) model distinguishability. Our theoretical analysis helps explain how the double-descent phenomenon can manifest, and why generalization error does not necessarily continue to grow with increasing model dimensionality.
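For reference, a minimal sketch of how the $\log V$ term typically enters in the MDL/Bayesian-razor setting (notation here is illustrative and standard, not necessarily the paper's own conventions): for a $d$-parameter model $\mathcal{M}$ fit to $N$ i.i.d. samples $D$, with $g(\theta)$ the Fisher information metric and $\hat{\theta}$ the maximum-likelihood estimate, the stochastic complexity expands to leading order as
\[
  -\log P(D \mid \mathcal{M})
  \;\approx\;
  -\log p(D \mid \hat{\theta})
  \;+\; \frac{d}{2}\log\frac{N}{2\pi}
  \;+\; \log V,
  \qquad
  V \;=\; \int_{\Theta} \sqrt{\det g(\theta)}\, \mathrm{d}\theta ,
\]
up to a further curvature correction involving the observed Fisher information, omitted here. The first two terms recover a BIC-style $\tfrac{d}{2}\log N$ penalty, while $\log V$ is the parameterization-invariant volume of the model manifold whose decomposition is the subject of this work.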