Abstract: We present a deep learning model for finding human-understandable connections between input features. Our approach uses a parameterized, differentiable activation function grounded in the theory of nilpotent fuzzy logic and multi-criteria decision-making (MCDM). The learnable parameter carries semantic meaning, indicating the level of compensation between input features, and the network determines its value by gradient descent. We demonstrate the utility and effectiveness of the model by successfully applying it to classification problems from the UCI Machine Learning Repository.
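To make the idea concrete, the sketch below shows one way a nilpotent-logic combiner with a learnable compensation parameter can be expressed; the clipped Łukasiewicz form and the single parameter nu are our own illustration, not necessarily the paper's exact operator family.

```python
import torch
import torch.nn as nn

class NilpotentGate(nn.Module):
    """Learnable nilpotent-style combiner for inputs in [0, 1].

    nu = 0 recovers the Lukasiewicz AND, max(0, x + y - 1);
    nu = 1 recovers the Lukasiewicz OR,  min(1, x + y);
    values in between give compensative (averaging-like) behavior.
    """
    def __init__(self):
        super().__init__()
        self.nu = nn.Parameter(torch.tensor(0.5))  # learned by gradient descent

    def forward(self, x, y):
        # Hard clamping has zero gradient outside [0, 1]; in practice a
        # sigmoid-like "cutting" function is often substituted to keep
        # the unit differentiable everywhere.
        return torch.clamp(x + y - 1.0 + self.nu, 0.0, 1.0)
```

After training, reading off nu tells you whether the unit settled on conjunctive, disjunctive, or compensative behavior, which is what makes the learned relationship human-readable.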
Abstract: Product unit neural networks (PUNNs) are powerful representational models with a strong theoretical basis, but they have proven difficult to train with gradient-based optimizers. We present windowed product unit neural networks (WPUNNs), a simple method for leveraging the product as a nonlinearity in a neural network. Windowing the product tames the complex gradient surface and enables WPUNNs to learn effectively, solving the problems faced by PUNNs. WPUNNs use product layers between traditional sum layers, capturing the representational power of product units while using the product itself as a nonlinearity. We find that this method works as well as traditional nonlinearities such as ReLU on the MNIST dataset, and we demonstrate that WPUNNs can also generalize gated units in recurrent neural networks, yielding results comparable to LSTM networks.
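The windowing idea can be sketched in a few lines: instead of multiplying all inputs to a unit (which makes gradients explode or vanish), only a small group of activations is multiplied together. The window size and non-overlapping grouping below are our own assumptions for illustration.

```python
import torch

def windowed_product(x, window=2):
    """Product-as-nonlinearity over non-overlapping windows.

    x: tensor of shape (batch, features), with features divisible by
    `window`. Each group of `window` consecutive activations is
    multiplied together, so the feature count shrinks by that factor.
    """
    batch, feats = x.shape
    return x.view(batch, feats // window, window).prod(dim=2)
```

Interleaving this with ordinary linear (sum) layers yields the sum-product-sum stack the abstract describes, with the product step standing in for ReLU or tanh.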
Abstract: We present a deep learning architecture for learning fuzzy logic expressions. Our model uses an innovative, parameterized, differentiable activation function that can learn a number of logical operations by gradient descent. This activation function allows a neural network to determine the relationships between its input variables and provides insight into the logical significance of learned network parameters. We provide a theoretical basis for this parameterization and demonstrate its effectiveness and utility by successfully applying our model to five classification problems from the UCI Machine Learning Repository.
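As a toy demonstration of learning a logical operation by gradient descent, the loop below fits the illustrative clipped parameterization from the first sketch to the truth table of OR; the parameterization is ours, chosen only to show the mechanism.

```python
import torch

# Truth table for OR; the unit f(x, y) = clamp(x + y - 1 + nu, 0, 1)
# should drive nu toward 1, i.e. settle on disjunction.
X = torch.tensor([[0., 0.], [0., 1.], [1., 0.], [1., 1.]])
t = torch.tensor([0., 1., 1., 1.])

nu = torch.tensor(0.5, requires_grad=True)
opt = torch.optim.SGD([nu], lr=0.5)
for _ in range(200):
    y = torch.clamp(X[:, 0] + X[:, 1] - 1.0 + nu, 0.0, 1.0)
    loss = ((y - t) ** 2).mean()
    opt.zero_grad()
    loss.backward()
    opt.step()
print(nu.item())  # approaches 1.0: the unit has learned OR
```

Replacing the targets with those of AND drives nu toward 0 instead, so the learned parameter value directly names the operation.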
Abstract: Advances in deep learning over the last decade have led to a flurry of research in the application of deep artificial neural networks to robotic systems, with at least thirty papers published on the subject between 2014 and the present. This review discusses the applications, benefits, and limitations of deep learning vis-à-vis physical robotic systems, using contemporary research as exemplars. It is intended to communicate recent advances to the wider robotics community and inspire additional interest in and application of deep learning in robotics.
Abstract: We present a neural network technique for the analysis and extrapolation of time-series data called Neural Decomposition (ND). Units with a sinusoidal activation function are used to perform a Fourier-like decomposition of training samples into a sum of sinusoids, augmented by units with nonperiodic activation functions to capture linear trends and other nonperiodic components. We show how careful weight initialization can be combined with regularization to form a simple model that generalizes well. Our method generalizes effectively on the Mackey-Glass series, a dataset of unemployment rates as reported by the U.S. Bureau of Labor Statistics, a time-series of monthly international airline passengers, the monthly ozone concentration in downtown Los Angeles, and an unevenly sampled time-series of oxygen isotope measurements from a cave in north India. We find that ND outperforms popular time-series forecasting techniques including LSTM, echo state networks, ARIMA, SARIMA, SVR with a radial basis function, and Gashler and Ashmore's model.
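Structurally, the model is a sum of sinusoid units plus nonperiodic units. The pared-down sketch below captures that shape; the unit count, the careful initialization, and the regularization that the method actually depends on are omitted here.

```python
import math
import torch
import torch.nn as nn

class TinyND(nn.Module):
    """Sum of sinusoids plus a linear unit for nonperiodic trend
    (a minimal illustration of the structure, not the full ND model)."""
    def __init__(self, n_sinusoids=8):
        super().__init__()
        k = torch.arange(1, n_sinusoids + 1, dtype=torch.float32)
        self.freq = nn.Parameter(2.0 * math.pi * k)   # one unit per harmonic
        self.phase = nn.Parameter(torch.zeros(n_sinusoids))
        self.amp = nn.Parameter(torch.zeros(n_sinusoids))
        self.slope = nn.Parameter(torch.zeros(1))      # linear trend
        self.bias = nn.Parameter(torch.zeros(1))

    def forward(self, t):  # t: (batch, 1) time values in [0, 1]
        waves = self.amp * torch.sin(t * self.freq + self.phase)
        return waves.sum(dim=-1, keepdim=True) + self.slope * t + self.bias
```

Because every parameter (frequency, phase, amplitude, trend) is trained by gradient descent, the decomposition can drift away from the rigid Fourier grid to fit the data.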
Abstract: We present the soft exponential activation function for artificial neural networks, which continuously interpolates among logarithmic, linear, and exponential functions. This activation function is simple, differentiable, and parameterized so that it can be trained along with the rest of the network. We hypothesize that soft exponential has the potential to improve neural network learning, as it can exactly calculate many natural operations that typical neural networks can only approximate, including addition, multiplication, inner product, distance, polynomials, and sinusoids.
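The piecewise closed form usually given for this function is transcribed below; treat the exact expression as our rendering rather than a verbatim quote of the paper.

```python
import torch

def soft_exponential(alpha, x):
    """Interpolates among logarithm (alpha < 0), identity (alpha = 0),
    and exponential (alpha > 0); alpha is a trainable per-neuron
    parameter in the paper, shown here as a plain scalar for clarity."""
    if alpha == 0.0:
        return x
    if alpha > 0.0:
        return (torch.exp(alpha * x) - 1.0) / alpha + alpha
    # alpha < 0: defined where 1 - alpha * (x + alpha) > 0
    return -torch.log(1.0 - alpha * (x + alpha)) / alpha
```

The interpolation is what lets a network compute multiplication exactly: take logs of the inputs, sum them, then exponentiate, each step expressible with a different value of alpha.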
Abstract: A minimalistic cognitive architecture called MANIC is presented. The MANIC architecture requires only three function-approximating models and one state machine. Even with so few major components, it is theoretically sufficient to achieve functional equivalence with all other cognitive architectures, and it can be practically trained. Instead of seeking to transfer architectural inspiration from biology into artificial intelligence, MANIC seeks to minimize novelty and follow the most well-established constructs that have evolved within various sub-fields of data science. From this perspective, MANIC offers an alternate approach to a long-standing objective of artificial intelligence. This paper provides a theoretical analysis of the MANIC architecture.
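One plausible way to organize "three function-approximating models and one state machine" is sketched below; the component names and interfaces are our own guesses for illustration, and the paper's actual decomposition may differ.

```python
class TransitionModel:       # function approximator: beliefs, action -> next beliefs
    def predict(self, beliefs, action):
        raise NotImplementedError

class ObservationModel:      # function approximator: beliefs -> expected observation
    def predict(self, beliefs):
        raise NotImplementedError

class ContentmentModel:      # function approximator: beliefs -> scalar desirability
    def predict(self, beliefs):
        raise NotImplementedError

class PlanningStateMachine:  # the single state machine: roll plans forward, pick the best
    def __init__(self, transition, contentment):
        self.transition = transition
        self.contentment = contentment

    def choose(self, beliefs, candidate_plans):
        def score(plan):
            b = beliefs
            for action in plan:
                b = self.transition.predict(b, action)
            return self.contentment.predict(b)
        return max(candidate_plans, key=score)
```

Each model is an opaque learner (any regressor would do), which is consistent with the stated goal of minimizing novelty.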
Abstract: We present a method for training a deep neural network containing sinusoidal activation functions to fit time-series data. Weights are initialized using a fast Fourier transform, then trained with regularization to improve generalization. A simple dynamic parameter tuning method adjusts both the learning rate and the regularization term, so that training is both stable and efficient. We show how deeper layers can be utilized to model the observed sequence with a sparser set of sinusoid units, and how non-uniform regularization can improve generalization by shifting weight toward simpler units. The method is demonstrated on time-series problems to show that it leads to effective extrapolation of nonlinear trends.
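One plausible form of the FFT-based initialization is sketched below: each frequency bin of the training series seeds one sinusoid unit's frequency, amplitude, and phase. The exact bookkeeping (and the handling of the Nyquist bin, skipped here) is our assumption.

```python
import numpy as np

def fft_init(series):
    """Seed sinusoid units from the FFT of a uniformly sampled series:
    one unit per frequency bin, with amplitude and phase taken from
    the corresponding complex coefficient."""
    n = len(series)
    coeffs = np.fft.rfft(series)
    freqs = 2.0 * np.pi * np.arange(len(coeffs)) / n  # radians per time step
    amps = 2.0 * np.abs(coeffs) / n                   # bin magnitude -> amplitude
    amps[0] /= 2.0                                    # DC bin is not doubled
    phases = np.angle(coeffs) + np.pi / 2.0           # cos(theta) = sin(theta + pi/2)
    return freqs, amps, phases                        # initial w, a, phi per unit
```

Starting from this initialization, the network already closely reproduces the training window, so gradient descent only has to adjust the decomposition toward something that extrapolates.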
Abstract: Many data mining and data analysis techniques operate on dense matrices or complete tables of data. Real-world data sets, however, often contain unknown values. Even classification algorithms designed to operate with missing values often exhibit degraded accuracy. One approach to handling missing values is to fill in (impute) the missing values. In this paper, we present a technique for unsupervised learning called Unsupervised Backpropagation (UBP), which trains a multi-layer perceptron to fit the manifold sampled by a set of observed point-vectors. We evaluate UBP on the task of imputing missing values in datasets, and show that UBP predicts missing values with significantly lower sum-squared error than other collaborative filtering and imputation techniques. We also demonstrate, with 24 datasets and 9 supervised learning algorithms, that classification accuracy is usually higher when randomly withheld values are imputed using UBP rather than with other methods.
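The core mechanism can be sketched compactly: latent input vectors and network weights are learned jointly so that the network's outputs match the observed entries, after which missing entries are read off the outputs. The single-phase training loop below is our simplification; the paper trains in stages.

```python
import torch
import torch.nn as nn

def ubp_impute(X, mask, latent_dim=4, steps=5000, lr=0.01):
    """X: (n, d) float tensor, arbitrary values where mask == 0 (missing);
    mask: (n, d) float tensor, 1.0 for observed entries."""
    n, d = X.shape
    Z = torch.randn(n, latent_dim, requires_grad=True)   # one latent vector per row
    net = nn.Sequential(nn.Linear(latent_dim, 16), nn.Tanh(), nn.Linear(16, d))
    opt = torch.optim.Adam([Z] + list(net.parameters()), lr=lr)
    for _ in range(steps):
        pred = net(Z)
        loss = ((pred - X) ** 2 * mask).sum() / mask.sum()  # error on observed only
        opt.zero_grad()
        loss.backward()
        opt.step()
    with torch.no_grad():
        pred = net(Z)
    return torch.where(mask.bool(), X, pred)             # fill in missing entries
```

Because the error is computed only over observed entries, the latent vectors are free to settle wherever the manifold best explains each row.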