Abstract: Product unit neural networks (PUNNs) are powerful representational models with a strong theoretical basis, but they have proven difficult to train with gradient-based optimizers. We present windowed product unit neural networks (WPUNNs), a simple method for using the product as a nonlinearity in a neural network. Windowing the product tames the complex gradient surface and enables WPUNNs to learn effectively, solving the problems faced by PUNNs. WPUNNs insert product layers between traditional sum layers, capturing the representational power of product units while using the product itself as the nonlinearity. We find that this method performs as well as traditional nonlinearities such as ReLU on the MNIST dataset. We also demonstrate that WPUNNs generalize the gated units of recurrent neural networks, yielding results comparable to LSTM networks.
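As a concrete illustration, the sketch below shows one way a windowed product nonlinearity could sit between sum layers: activations from an ordinary dense layer are multiplied within small, non-overlapping windows. The window size, NumPy interface, and layer shapes are illustrative assumptions, not details taken from the abstract.

```python
# A minimal sketch of a windowed product layer, assuming non-overlapping
# windows of fixed width; the window size and the surrounding dense layer
# are illustrative choices, not specifics from the WPUNN paper.
import numpy as np

def windowed_product(x, window_size=2):
    """Multiply activations within each non-overlapping window.

    x has shape (batch, features); features must be divisible by window_size.
    The product acts as the nonlinearity between ordinary sum (dense) layers.
    """
    batch, features = x.shape
    assert features % window_size == 0
    windows = x.reshape(batch, features // window_size, window_size)
    return windows.prod(axis=2)

# Example: a dense (sum) layer followed by the windowed product nonlinearity.
rng = np.random.default_rng(0)
inputs = rng.normal(size=(4, 8))           # batch of 4, 8 features
weights = rng.normal(size=(8, 8)) * 0.1    # ordinary sum layer
hidden = inputs @ weights
activated = windowed_product(hidden, window_size=2)
print(activated.shape)                     # (4, 4)
```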
Abstract: We present a deep learning architecture for learning fuzzy logic expressions. Our model uses an innovative, parameterized, differentiable activation function that can learn a number of logical operations by gradient descent. This activation function allows a neural network to determine the relationships between its input variables and provides insight into the logical significance of learned network parameters. We provide a theoretical basis for this parameterization and demonstrate its effectiveness and utility by successfully applying our model to five classification problems from the UCI Machine Learning Repository.
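The abstract does not give the exact parameterization, so the sketch below is only an illustrative stand-in: a single learnable parameter blends a product t-norm (fuzzy AND) with its dual probabilistic sum (fuzzy OR), and the learned value indicates which logical relationship a unit has settled on. The function soft_logic and the parameter beta are hypothetical names introduced here, not taken from the paper.

```python
# Illustrative sketch only: the paper's parameterization is not given in the
# abstract, so this blends a product t-norm (fuzzy AND) with the probabilistic
# sum (fuzzy OR) through a single learnable parameter beta.
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def soft_logic(a, b, beta):
    """Differentiable interpolation between fuzzy AND and fuzzy OR.

    a and b are membership values in [0, 1]; beta is an unconstrained
    parameter. sigmoid(beta) near 0 recovers AND (a*b); near 1 recovers OR.
    """
    g = sigmoid(beta)
    fuzzy_and = a * b
    fuzzy_or = a + b - a * b
    return (1.0 - g) * fuzzy_and + g * fuzzy_or

# With beta strongly negative the unit behaves as AND, strongly positive as OR,
# so the learned value of beta hints at the logical relationship in the data.
print(soft_logic(0.9, 0.8, beta=-10.0))  # ~0.72 (AND-like)
print(soft_logic(0.9, 0.8, beta=+10.0))  # ~0.98 (OR-like)
```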
Abstract: We present a neural network technique for the analysis and extrapolation of time-series data called Neural Decomposition (ND). Units with a sinusoidal activation function are used to perform a Fourier-like decomposition of training samples into a sum of sinusoids, augmented by units with nonperiodic activation functions to capture linear trends and other nonperiodic components. We show how careful weight initialization can be combined with regularization to form a simple model that generalizes well. Our method generalizes effectively on the Mackey-Glass series, a dataset of unemployment rates as reported by the U.S. Bureau of Labor Statistics, a time-series of monthly international airline passenger counts, the monthly ozone concentration in downtown Los Angeles, and an unevenly sampled time-series of oxygen isotope measurements from a cave in north India. We find that ND outperforms popular time-series forecasting techniques, including LSTM, echo state networks, ARIMA, SARIMA, SVR with a radial basis function kernel, and Gashler and Ashmore's model.
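A minimal sketch of the underlying signal model follows, assuming the commonly cited form x(t) = sum_k a_k sin(w_k t + phi_k) + g(t) with a linear g(t). Only the Fourier-like frequency initialization is shown; the paper's weight initialization details, regularization, and additional nonperiodic units are omitted.

```python
# A minimal sketch of the Neural Decomposition signal model, assuming
# x(t) = sum_k a_k * sin(w_k * t + phi_k) + g(t) with a linear g(t);
# training by gradient descent and regularization are omitted here.
import numpy as np

def nd_forward(t, amplitudes, frequencies, phases, slope, intercept):
    """Evaluate a sum of sinusoids plus a linear nonperiodic component."""
    t = np.asarray(t, dtype=float).reshape(-1, 1)               # (samples, 1)
    sinusoids = amplitudes * np.sin(frequencies * t + phases)   # (samples, K)
    return sinusoids.sum(axis=1) + slope * t.ravel() + intercept

# Fourier-like frequency initialization over N training samples: w_k = 2*pi*k/N.
N, K = 128, 8
frequencies = 2.0 * np.pi * np.arange(1, K + 1) / N
amplitudes = np.zeros(K)   # trained by gradient descent in the full model
phases = np.zeros(K)
prediction = nd_forward(np.arange(N), amplitudes, frequencies, phases, 0.01, 0.5)
print(prediction.shape)    # (128,)
```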
Abstract: We present the soft exponential activation function for artificial neural networks, which continuously interpolates between logarithmic, linear, and exponential functions. This activation function is simple, differentiable, and parameterized so that it can be trained along with the rest of the network. We hypothesize that soft exponential has the potential to improve neural network learning, as it can exactly calculate many natural operations that typical neural networks can only approximate, including addition, multiplication, inner product, distance, polynomials, and sinusoids.
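For reference, the sketch below evaluates the piecewise form commonly given for soft exponential: a shifted logarithm for negative alpha, the identity at alpha = 0, and a shifted exponential for positive alpha, so a single trainable parameter moves continuously among the three regimes.

```python
# A short sketch of the soft exponential activation in its commonly given
# piecewise form: logarithmic for alpha < 0, identity at alpha = 0, and
# exponential for alpha > 0, so alpha interpolates between the regimes.
import numpy as np

def soft_exponential(x, alpha):
    """Evaluate the soft exponential activation elementwise."""
    x = np.asarray(x, dtype=float)
    if alpha < 0.0:
        return -np.log(1.0 - alpha * (x + alpha)) / alpha
    if alpha == 0.0:
        return x
    return (np.exp(alpha * x) - 1.0) / alpha + alpha

# alpha = -1 gives ln(x), alpha = 0 the identity, alpha = 1 gives exp(x);
# because the function is differentiable in alpha, the parameter can be
# trained along with the network weights.
print(soft_exponential(1.0, -1.0))  # 0.0
print(soft_exponential(1.0, 0.0))   # 1.0
print(soft_exponential(1.0, 1.0))   # ~2.718
```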