Marcelino Martínez-Sober

Empirical study of the modulus as activation function in computer vision applications

Jan 15, 2023

Iván Vallés-Pérez, Emilio Soria-Olivas, Marcelino Martínez-Sober, Antonio J. Serrano-López, Joan Vila-Francés, Juan Gómez-Sanchís

Abstract: In this work we propose a new non-monotonic activation function: the modulus. Most reported research on nonlinearities focuses on monotonic functions. We empirically demonstrate that, on computer vision tasks, models using the modulus activation function generalize better than with other nonlinearities: up to a 15% accuracy increase on CIFAR100 and 4% on CIFAR10, relative to the best of the benchmark activations tested. With the proposed activation function the vanishing gradient and dying neuron problems disappear, because the derivative of the activation function is always 1 or -1. The simplicity of the proposed function and its derivative makes this solution especially suitable for TinyML and hardware applications.

* Accepted at Engineering Applications of AI
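
As a rough illustration of the claim in the abstract above, the modulus activation and its derivative can be sketched in a few lines of plain Python. The function names, the ReLU comparison, and the choice of derivative at exactly x = 0 (where |x| is not differentiable) are assumptions of this sketch, not details taken from the paper.

```python
def modulus(x: float) -> float:
    """Modulus activation: f(x) = |x| (non-monotonic around zero)."""
    return abs(x)


def modulus_grad(x: float) -> float:
    """Derivative of |x|: -1 for x < 0, +1 for x > 0.

    |x| has no derivative at x = 0; returning +1.0 there is an
    assumption of this sketch, not something the abstract specifies.
    """
    return -1.0 if x < 0 else 1.0


def relu_grad(x: float) -> float:
    """Derivative of ReLU, shown only for comparison: 0 for x <= 0."""
    return 1.0 if x > 0 else 0.0


# Hypothetical pre-activation values, chosen for illustration.
inputs = [-2.0, -0.5, 0.5, 2.0]

activations = [modulus(x) for x in inputs]
mod_grads = [modulus_grad(x) for x in inputs]
relu_grads = [relu_grad(x) for x in inputs]
```

For negative inputs the ReLU gradient is zero, so no error signal flows back through that unit, whereas the modulus gradient keeps magnitude 1 for every input. This is the property the abstract ties to the absence of vanishing gradients and dying neurons.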

Approaching sales forecasting using recurrent neural networks and transformers

Apr 16, 2022

Iván Vallés-Pérez, Emilio Soria-Olivas, Marcelino Martínez-Sober, Antonio J. Serrano-López, Juan Gómez-Sanchís, Fernando Mateo

Abstract: Accurate and fast demand forecasting is one of the hot topics in supply chain, as it enables the precise execution of the corresponding downstream processes (inbound and outbound planning, inventory placement, network planning, etc.). We develop three alternatives to tackle the problem of forecasting customer sales at day/store/item level using deep learning techniques and the Corporación Favorita data set, published as part of a Kaggle competition. Our empirical results show that good performance can be achieved by using a simple sequence-to-sequence architecture with minimal data preprocessing effort. Additionally, we describe a training trick for making the model more time independent and hence improving generalization over time. The proposed solution achieves an RMSLE of around 0.54, which is competitive with other more specific solutions to the problem proposed in the Kaggle competition.

* Accepted for publication in Expert Systems with Applications
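
For readers unfamiliar with the metric quoted in the abstract above, RMSLE (root mean squared logarithmic error) is commonly defined as the square root of the mean squared difference between log(1 + prediction) and log(1 + actual). The sketch below follows that standard definition; the function name and the sample values are hypothetical, and nothing here reproduces the authors' evaluation code.

```python
import math
from typing import Sequence


def rmsle(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Root mean squared logarithmic error for non-negative values."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    if any(v < 0 for v in y_true) or any(v < 0 for v in y_pred):
        raise ValueError("RMSLE is only defined here for non-negative values")
    # log1p(v) computes log(1 + v), which stays finite when sales are zero.
    squared_log_errors = [
        (math.log1p(p) - math.log1p(t)) ** 2
        for t, p in zip(y_true, y_pred)
    ]
    return math.sqrt(sum(squared_log_errors) / len(squared_log_errors))


# Hypothetical daily unit sales for one store/item pair.
actual = [0.0, 3.0, 12.0, 7.0]
forecast = [1.0, 2.0, 10.0, 9.0]
score = rmsle(actual, forecast)
```

Because the error is taken on a log scale, the metric penalizes relative rather than absolute deviations, which suits sales series where item volumes differ by orders of magnitude.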

End-to-end Keyword Spotting using Xception-1d

Oct 09, 2021

Iván Vallés-Pérez, Juan Gómez-Sanchis, Marcelino Martínez-Sober, Joan Vila-Francés, Antonio J. Serrano-López, Emilio Soria-Olivas

Abstract: The field of conversational agents is growing fast and there is an increasing need for algorithms that enhance natural interaction. In this work we show how we achieved state-of-the-art results in the Keyword Spotting field by adapting and tweaking the Xception algorithm, which achieved outstanding results in several computer vision tasks. We obtained about 96% accuracy when classifying audio clips belonging to 35 different categories, beating human annotation at the most complex tasks proposed.

* In proceedings of ESANN 2021 conference. 5 pages + references
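
Xception is built around depthwise separable convolutions, so a one-dimensional variant for audio would plausibly rely on their 1D analogue. The abstract above does not describe the architecture, so the following is only a minimal pure-Python sketch of a depthwise separable 1D convolution (valid padding, stride 1, no bias). All names, shapes, and values are assumptions for illustration, not the authors' implementation.

```python
from typing import List

Signal = List[List[float]]  # layout: [channels][time steps]


def depthwise_conv1d(x: Signal, kernels: List[List[float]]) -> Signal:
    """Filter each channel with its own kernel (cross-correlation form)."""
    out = []
    for channel, kernel in zip(x, kernels):
        k = len(kernel)
        out.append([
            sum(channel[t + i] * kernel[i] for i in range(k))
            for t in range(len(channel) - k + 1)
        ])
    return out


def pointwise_conv1d(x: Signal, weights: List[List[float]]) -> Signal:
    """1x1 convolution: mix channels at each time step.

    weights[o][c] is the weight from input channel c to output channel o.
    """
    steps = len(x[0])
    return [
        [sum(w * x[c][t] for c, w in enumerate(row)) for t in range(steps)]
        for row in weights
    ]


def separable_conv1d(x: Signal, kernels: List[List[float]],
                     weights: List[List[float]]) -> Signal:
    """Depthwise step followed by pointwise step."""
    return pointwise_conv1d(depthwise_conv1d(x, kernels), weights)


# Hypothetical 2-channel signal with 4 time steps.
x = [[1.0, 2.0, 3.0, 4.0], [0.0, 1.0, 0.0, 1.0]]
kernels = [[1.0, 1.0], [1.0, -1.0]]              # one kernel per channel
weights = [[1.0, 1.0], [0.5, 0.0], [0.0, 2.0]]   # 3 output channels
y = separable_conv1d(x, kernels, weights)
```

Splitting the operation into a per-channel filter and a 1x1 channel mix uses fewer weights than a full convolution with the same kernel size and channel counts, which is the usual motivation for this design.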
