Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ludmil Zikatanov

Neural networks with trainable matrix activation functions

Oct 06, 2021

Yuwen Li, Zhengqi Liu, Ludmil Zikatanov

Figure 1 for Neural networks with trainable matrix activation functions

Figure 2 for Neural networks with trainable matrix activation functions

Figure 3 for Neural networks with trainable matrix activation functions

Figure 4 for Neural networks with trainable matrix activation functions

Abstract:The training process of neural networks usually optimize weights and bias parameters of linear transformations, while nonlinear activation functions are pre-specified and fixed. This work develops a systematic approach to constructing matrix activation functions whose entries are generalized from ReLU. The activation is based on matrix-vector multiplications using only scalar multiplications and comparisons. The proposed activation functions depend on parameters that are trained along with the weights and bias vectors. Neural networks based on this approach are simple and efficient and are shown to be robust in numerical experiments.

Via

Access Paper or Ask Questions