Using information-theoretic concepts to understand and explore the inner organization of deep neural networks (DNNs) remains a significant challenge. Recently, the concept of the information plane, coupled with the famed information bottleneck principle, began to shed light on the analysis of multilayer perceptrons (MLPs). In prior work, we provided in-depth insight into stacked autoencoders (SAEs) using a novel matrix-based R\'enyi's $\alpha$-entropy functional, enabling for the first time an analysis of the dynamics of learning via information flow in real-world scenarios involving complex network architectures and large data. Despite the great potential of these past works, several open questions remain when applying information-theoretic concepts to understand convolutional neural networks (CNNs), including, for instance, the accurate estimation of information quantities among multiple variables and the variety of training methodologies. By extending the matrix-based R\'enyi's $\alpha$-entropy functional to the multivariate scenario and introducing the partial information decomposition (PID) framework, this paper presents a systematic method to analyze CNN training using information theory. Our results validate two fundamental data processing inequalities in CNNs and also reveal some fundamental issues embedded in their training phase.
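For context, the following is a minimal sketch of the matrix-based R\'enyi's $\alpha$-entropy estimator referenced above and of a Hadamard-product joint (multivariate) extension; the symbols $N$, $\kappa$, $K$, $A$, $B$, and $\lambda_i$ are introduced here only for illustration and are not defined in this abstract. The estimator works directly on Gram matrices of samples, avoiding explicit density estimation, which is what makes it applicable to high-dimensional layer activations.

% Sketch: matrix-based R\'enyi's \alpha-order entropy from a normalized Gram matrix,
% plus a multivariate joint entropy via Hadamard products (illustrative notation).
\[
  A_{ij} \;=\; \frac{1}{N}\,\frac{K_{ij}}{\sqrt{K_{ii}\,K_{jj}}},
  \qquad K_{ij} = \kappa(\mathbf{x}_i,\mathbf{x}_j),
\]
\[
  S_\alpha(A) \;=\; \frac{1}{1-\alpha}\,\log_2\!\big(\operatorname{tr}(A^{\alpha})\big)
              \;=\; \frac{1}{1-\alpha}\,\log_2\!\Big(\sum_{i=1}^{N}\lambda_i(A)^{\alpha}\Big),
\]
\[
  S_\alpha(A_1,\dots,A_k) \;=\;
  S_\alpha\!\left(\frac{A_1 \circ \cdots \circ A_k}{\operatorname{tr}(A_1 \circ \cdots \circ A_k)}\right),
  \qquad
  I_\alpha(A;B) \;=\; S_\alpha(A) + S_\alpha(B) - S_\alpha(A,B),
\]
% where \lambda_i(A) denotes the i-th eigenvalue of A and \circ is the Hadamard product.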