Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Expand-and-Cluster: Exact Parameter Recovery of Neural Networks

Apr 25, 2023

Flavio Martinelli, Berfin Simsek, Johanni Brea, Wulfram Gerstner

Figure 1 for Expand-and-Cluster: Exact Parameter Recovery of Neural Networks

Figure 2 for Expand-and-Cluster: Exact Parameter Recovery of Neural Networks

Figure 3 for Expand-and-Cluster: Exact Parameter Recovery of Neural Networks

Figure 4 for Expand-and-Cluster: Exact Parameter Recovery of Neural Networks

Share this with someone who'll enjoy it:

Abstract:Can we recover the hidden parameters of an Artificial Neural Network (ANN) by probing its input-output mapping? We propose a systematic method, called `Expand-and-Cluster' that needs only the number of hidden layers and the activation function of the probed ANN to identify all network parameters. In the expansion phase, we train a series of student networks of increasing size using the probed data of the ANN as a teacher. Expansion stops when a minimal loss is consistently reached in student networks of a given size. In the clustering phase, weight vectors of the expanded students are clustered, which allows structured pruning of superfluous neurons in a principled way. We find that an overparameterization of a factor four is sufficient to reliably identify the minimal number of neurons and to retrieve the original network parameters in $80\%$ of tasks across a family of 150 toy problems of variable difficulty. Furthermore, a teacher network trained on MNIST data can be identified with less than $5\%$ overhead in the neuron number. Thus, while direct training of a student network with a size identical to that of the teacher is practically impossible because of the non-convex loss function, training with mild overparameterization followed by clustering and structured pruning correctly identifies the target network.

* Preprint: 15 pages, 6 figures. Appendix: 7 pages, 5 figures

View paper on

Share this with someone who'll enjoy it:

Title:Expand-and-Cluster: Exact Parameter Recovery of Neural Networks

Paper and Code