Leonardo Trujillo

Highlights of Semantics in Multi-objective Genetic Programming

Jun 13, 2022

Edgar Galván, Leonardo Trujillo, Fergal Stapleton

Abstract: Semantics is a growing area of research in Genetic Programming (GP) and refers to the behavioural output of a GP individual when executed. This research expands upon the current understanding of semantics by proposing a new approach, Semantic-based Distance as an additional criteriOn (SDO), in the so far little-researched area of semantics in Multi-objective GP (MOGP). Our work included an extensive analysis of GP in terms of performance and diversity metrics, using two additional semantic-based approaches, namely Semantic Similarity-based Crossover (SCC) and Semantic-based Crowding Distance (SCD). Each approach is integrated into two evolutionary multi-objective (EMO) frameworks, the Non-dominated Sorting Genetic Algorithm II (NSGA-II) and the Strength Pareto Evolutionary Algorithm 2 (SPEA2), and the three semantic approaches are rigorously compared with the canonical forms of NSGA-II and SPEA2. Using highly unbalanced binary classification datasets, we demonstrated that the newly proposed SDO approach consistently generated more non-dominated solutions, with better diversity and improved hypervolume results.

* Accepted at GECCO '22 Companion, July 9-13, 2022, Boston, MA, USA, 2 pages, 1 figure. This Hot-off-the-Press paper summarises "Semantics in Multi-objective Genetic Programming" by Edgar Galván, Leonardo Trujillo and Fergal Stapleton, published in the journal Applied Soft Computing, 2022, https://doi.org/10.1016/j.asoc.2021.108143 [arXiv:2105.02944]
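
The abstract above reports results in terms of non-dominated solutions and hypervolume. As a rough illustration of the hypervolume measure only, here is a minimal Python sketch for two objectives that are both maximised. The function name `hypervolume_2d`, the reference point `(0, 0)` and the sample points are assumptions made for this example, not values taken from the paper.

```python
def hypervolume_2d(points, ref=(0.0, 0.0)):
    """Area dominated by `points` and bounded below by `ref` (maximisation).

    Assumes every point is at least as good as `ref` in both objectives.
    """
    # Sweep from the largest first objective to the smallest.
    ordered = sorted(set(points), key=lambda p: (-p[0], -p[1]))
    area, best_y = 0.0, ref[1]
    for x, y in ordered:
        if y > best_y:
            # Only the strip above the current best second objective is new.
            area += (x - ref[0]) * (y - best_y)
            best_y = y
        # Otherwise the point is dominated and adds nothing.
    return area


# Hypothetical objective vectors, e.g. accuracy on each of the two classes.
solutions = [(0.9, 0.2), (0.7, 0.6), (0.2, 0.9), (0.5, 0.5)]
hv = hypervolume_2d(solutions)
```

A larger value means the set of solutions covers more of the objective space, which is why it is commonly used to compare fronts.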

GSGP-CUDA -- a CUDA framework for Geometric Semantic Genetic Programming

Jun 08, 2021

Leonardo Trujillo, Jose Manuel Muñoz Contreras, Daniel E Hernandez, Mauro Castelli, Juan J Tapia

Abstract: Geometric Semantic Genetic Programming (GSGP) is a state-of-the-art machine learning method based on evolutionary computation. GSGP performs search operations directly at the level of program semantics, which can be done more efficiently than operating at the syntax level like most GP systems. Efficient implementations of GSGP in C++ exploit this fact, but not to its full potential. This paper presents GSGP-CUDA, the first CUDA implementation of GSGP and the most efficient, which exploits the intrinsic parallelism of GSGP using GPUs. Results show speedups greater than 1,000X relative to the state-of-the-art sequential implementation.

* 14 pages, 3 figures
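
To make "search operations directly at the level of program semantics" concrete, the sketch below applies the standard geometric semantic crossover and mutation to semantics vectors (one output per fitness case) in plain Python. It is not the GSGP-CUDA code: the function names, the step size and the use of uniform random numbers in place of the outputs of random trees are assumptions for illustration.

```python
import random


def gs_crossover(sem_a, sem_b, sem_r):
    """Offspring semantics: per-case convex combination of two parents.

    `sem_r` stands in for the outputs of a random program bounded to [0, 1].
    """
    return [r * a + (1.0 - r) * b for a, b, r in zip(sem_a, sem_b, sem_r)]


def gs_mutation(sem, sem_r1, sem_r2, step):
    """Offspring semantics: parent plus a scaled difference of two random programs."""
    return [s + step * (r1 - r2) for s, r1, r2 in zip(sem, sem_r1, sem_r2)]


def random_semantics(n_cases, rng):
    """Assumed stand-in for evaluating a random tree on every fitness case."""
    return [rng.random() for _ in range(n_cases)]


rng = random.Random(0)
parent_a = [1.0, 2.0, 3.0, 4.0]
parent_b = [3.0, 2.0, 1.0, 0.0]
child = gs_crossover(parent_a, parent_b, random_semantics(4, rng))
mutant = gs_mutation(child, random_semantics(4, rng),
                     random_semantics(4, rng), step=0.1)
```

Each output element depends only on the matching elements of the inputs, so every fitness case can be computed independently. That is the kind of data parallelism a GPU can exploit, although how GSGP-CUDA maps it onto kernels is not described in the abstract.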

Semantics in Multi-objective Genetic Programming

May 06, 2021

Edgar Galván, Leonardo Trujillo, Fergal Stapleton

Abstract: Semantics has become a key topic of research in Genetic Programming (GP). Semantics refers to the outputs (behaviour) of a GP individual when it is run on a data set. The majority of works that focus on semantic diversity in single-objective GP indicate that it is highly beneficial in evolutionary search. Surprisingly, very little research has been conducted on semantics in Multi-objective GP (MOGP). In this work we make a leap beyond our understanding of semantics in MOGP and propose SDO: Semantic-based Distance as an additional criteriOn. This naturally encourages semantic diversity in MOGP. To do so, we find a pivot in the less dense region of the first Pareto front (most promising front). This is then used to compute a distance between the pivot and every individual in the population. The resulting distance is then used as an additional criterion to be optimised to favour semantic diversity. We also use two other semantic-based methods as baselines, called Semantic Similarity-based Crossover and Semantic-based Crowding Distance. Furthermore, we use NSGA-II and SPEA2 for comparison. We use highly unbalanced binary classification problems and consistently show that our proposed SDO approach produces more non-dominated solutions and better diversity, leading to statistically significantly better results than the other four methods, using the hypervolume as the evaluation measure.

* 30 pages, 4 figures, 10 tables, journal article
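
The SDO steps described in the abstract (first Pareto front, pivot in its least dense region, distance from the pivot to every individual) can be sketched as follows. This is a minimal illustration, not the authors' implementation. It assumes that both objectives are maximised, that density is measured with NSGA-II style crowding distance, that the pivot is the interior front member with the largest crowding distance, and that semantic distance counts the fitness cases whose outputs differ by more than a threshold. All names, data and the threshold value are hypothetical.

```python
def dominates(a, b):
    """True if `a` is no worse than `b` everywhere and better somewhere."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))


def first_front(objs):
    """Indices of individuals not dominated by any other individual."""
    return [i for i, a in enumerate(objs)
            if not any(dominates(b, a) for j, b in enumerate(objs) if j != i)]


def crowding_distance(objs, front):
    """NSGA-II style crowding distance for the members of `front`."""
    dist = {i: 0.0 for i in front}
    for m in range(len(objs[0])):
        order = sorted(front, key=lambda i: objs[i][m])
        span = objs[order[-1]][m] - objs[order[0]][m]
        dist[order[0]] = dist[order[-1]] = float("inf")
        if span == 0:
            continue
        for k in range(1, len(order) - 1):
            gap = objs[order[k + 1]][m] - objs[order[k - 1]][m]
            dist[order[k]] += gap / span
    return dist


def pick_pivot(objs, front):
    """Assumption: the least crowded interior member of the front is the pivot."""
    dist = crowding_distance(objs, front)
    interior = [i for i in front if dist[i] != float("inf")]
    return max(interior, key=lambda i: dist[i]) if interior else front[0]


def semantic_distance(sem_p, sem_q, threshold):
    """Assumption: count the fitness cases where the outputs clearly differ."""
    return sum(1 for p, q in zip(sem_p, sem_q) if abs(p - q) > threshold)


# Hypothetical population: two objectives and four fitness cases each.
objs = [(0.9, 0.2), (0.7, 0.6), (0.6, 0.65), (0.2, 0.9), (0.5, 0.5)]
sems = [[0.1, 0.9, 0.4, 0.3], [0.2, 0.8, 0.5, 0.3], [0.2, 0.7, 0.5, 0.1],
        [0.9, 0.1, 0.5, 0.8], [0.2, 0.7, 0.5, 0.1]]

front = first_front(objs)
pivot = pick_pivot(objs, front)
distances = [semantic_distance(sems[pivot], s, threshold=0.15) for s in sems]
# The distance becomes one more criterion next to the original objectives.
extended = [o + (d,) for o, d in zip(objs, distances)]
```

In this sketch the extended tuples would then be handed to the usual non-dominated sorting step, so individuals that behave differently from the pivot gain an advantage.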

Plotting time: On the usage of CNNs for time series classification

Feb 08, 2021

Nuno M. Rodrigues, João E. Batista, Leonardo Trujillo, Bernardo Duarte, Mario Giacobini, Leonardo Vanneschi, Sara Silva

Abstract: We present a novel approach for time series classification where we represent time series data as plot images and feed them to a simple CNN, outperforming several state-of-the-art methods. We propose a simple and highly replicable way of plotting the time series, and feed these images as input to a non-optimized shallow CNN, without any normalization or residual connections. These representations are no more than default line plots of the time series data, where the only pre-processing applied is to reduce the number of white pixels in the image. We compare our method with different state-of-the-art methods specialized in time series classification on two real-world non-public datasets, as well as 98 datasets of the UCR dataset collection. The results show that our approach is very promising, achieving the best results on both real-world datasets and matching or beating the best state-of-the-art methods on six UCR datasets. We argue that, if a simple naive design like ours can obtain such good results, it is worth further exploring the capabilities of using image representations of time series data, along with more powerful CNNs, for classification and other related tasks.
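
The plotting and white-pixel reduction steps can be pictured with a small sketch. This is not the authors' pipeline: it assumes a binary image (1 for ink, 0 for white), a hand-rolled line rasteriser in place of a plotting library, and a crop to the bounding box of the ink as one possible way to reduce white pixels. The function names, canvas size and sample series are hypothetical.

```python
def rasterise(series, height=16, margin=4):
    """Draw `series` as a connected line on a white canvas with margins."""
    lo, hi = min(series), max(series)
    scale = (height - 1) / (hi - lo) if hi > lo else 0.0
    levels = [round((v - lo) * scale) for v in series]
    canvas_h = height + 2 * margin
    canvas_w = len(series) + 2 * margin
    image = [[0] * canvas_w for _ in range(canvas_h)]
    prev = levels[0]
    for x, level in enumerate(levels):
        # Fill the vertical run between neighbours so the line stays connected.
        for y in range(min(prev, level), max(prev, level) + 1):
            image[canvas_h - 1 - margin - y][margin + x] = 1
        prev = level
    return image


def crop_to_ink(image):
    """Drop all-white borders by cropping to the bounding box of the ink."""
    rows = [y for y, row in enumerate(image) if any(row)]
    cols = [x for x in range(len(image[0])) if any(row[x] for row in image)]
    return [row[cols[0]:cols[-1] + 1] for row in image[rows[0]:rows[-1] + 1]]


series = [0.0, 1.0, 0.5, 2.0, 1.5, 0.2]
image = rasterise(series)      # 24 x 14 canvas, mostly white
cropped = crop_to_ink(image)   # 16 x 6 image with the same ink
```

The cropped grid is the kind of input that would be passed to the CNN as a single-channel image.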
