Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Padé Activation Units: End-to-end Learning of Flexible Activation Functions in Deep Networks

Jul 15, 2019

Alejandro Molina, Patrick Schramowski, Kristian Kersting

Figure 1 for Padé Activation Units: End-to-end Learning of Flexible Activation Functions in Deep Networks

Figure 2 for Padé Activation Units: End-to-end Learning of Flexible Activation Functions in Deep Networks

Figure 3 for Padé Activation Units: End-to-end Learning of Flexible Activation Functions in Deep Networks

Figure 4 for Padé Activation Units: End-to-end Learning of Flexible Activation Functions in Deep Networks

Share this with someone who'll enjoy it:

Abstract:The performance of deep network learning strongly depends on the choice of the non-linear activation function associated with each neuron. However, deciding on the best activation is non-trivial and the choice depends on the architecture, hyper-parameters, and even on the dataset. Typically these activations are fixed by hand before training. Here, we demonstrate how to eliminate the reliance on first picking fixed activation functions by using flexible parametric rational functions instead. The resulting Pad\'e Activation Units (PAUs) can both approximate common activation functions and also learn new ones while providing compact representations. Our empirical evidence shows that end-to-end learning deep networks with PAUs can increase the predictive performance and reduce the training time of common deep architectures. Moreover, PAUs pave the way to approximations with provable robustness. The source code can be found at https://github.com/ml-research/pau

* 12 Pages, 6 Figures

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Padé Activation Units: End-to-end Learning of Flexible Activation Functions in Deep Networks

Paper and Code