Abstract:Symbolic regression (SR) methods attempt to learn mathematical expressions that approximate the behavior of an observed system. However, when dealing with multivariate systems, they often fail to identify the functional form that explains the relationship between each variable and the system's response. To begin to address this, we propose an explainable neural SR method that generates univariate symbolic skeletons aimed at explaining how each variable influences the system's response. We analyze multiple artificially generated data sets in which one input variable varies while the others are held fixed, so that the relationship between each input variable and the response is modeled separately. The responses of these artificial data sets are estimated using a regression neural network (NN). Finally, the multiple sets of input-response pairs are processed by a pre-trained Multi-Set Transformer that solves a problem we term Multi-Set Skeleton Prediction and outputs a univariate symbolic skeleton. These skeletons thus serve as explanations of the function approximated by the regression NN. Experimental results demonstrate that this method learns skeleton expressions that match the underlying functions and outperforms two GP-based and two neural SR methods.
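A minimal sketch of the data-generation step described above, assuming hypothetical stand-ins (regression_nn, build_sets_for_variable) rather than the actual implementation; the pre-trained Multi-Set Transformer is only indicated in a comment:

```python
# Sketch: build the multiple input-response sets that a pre-trained
# Multi-Set Transformer would consume. regression_nn is a toy stand-in
# for the trained regression NN that approximates the observed system.
import numpy as np

rng = np.random.default_rng(0)

def regression_nn(X):
    # Stand-in for the trained regression NN (two input variables assumed).
    return np.sin(X[:, 0]) + 0.5 * X[:, 1] ** 2

def build_sets_for_variable(var_idx, n_sets=5, n_points=200, n_vars=2):
    """Vary one input variable while the others stay fixed (one fixed
    configuration per set); the NN supplies the estimated responses."""
    sets = []
    for _ in range(n_sets):
        fixed = rng.uniform(-1, 1, size=n_vars)       # fixed values for the other variables
        X = np.tile(fixed, (n_points, 1))
        X[:, var_idx] = np.linspace(-5, 5, n_points)  # sweep the analyzed variable
        sets.append((X[:, var_idx].copy(), regression_nn(X)))
    return sets

sets_x0 = build_sets_for_variable(var_idx=0)
# Each (x, y) set in sets_x0 would then be passed jointly to the pre-trained
# Multi-Set Transformer, which outputs a univariate skeleton such as sin(x) + c.
```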
Abstract:In Precision Agriculture, the utilization of management zones (MZs) that take into account within-field variability facilitates effective fertilizer management. This approach enables the optimization of nitrogen (N) rates to maximize crop yield production and enhance agronomic use efficiency. However, existing works often neglect responsivity to fertilizer as a factor influencing MZ determination. To address this gap, we present an MZ clustering method based on fertilizer responsivity. We build on the premise that the responsivity of a given site to the fertilizer rate is described by the shape of its corresponding N fertilizer-yield response (N-response) curve. Thus, we generate N-response curves for all sites within the field using a convolutional neural network (CNN). The shape of the approximated N-response curves is then characterized using functional principal component analysis. Subsequently, a counterfactual explanation (CFE) method is applied to discern the impact of various variables on MZ membership. The genetic algorithm-based CFE solves a multi-objective optimization problem and aims to identify the minimum combination of features needed to alter a site's cluster assignment. Results from two yield prediction datasets indicate that the features with the greatest influence on MZ membership are associated with terrain characteristics that either facilitate or impede fertilizer runoff, such as terrain slope or topographic aspect.
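As an illustration of the curve-characterization and clustering steps, the following sketch applies a discretized functional PCA to toy N-response curves and clusters the resulting scores; the CNN that approximates the curves and the CFE analysis are assumed to exist elsewhere, and all names and parameter values here are illustrative:

```python
# Sketch: characterize approximated N-response curves with a discretized
# functional PCA and cluster the shape scores into management zones.
import numpy as np
from sklearn.decomposition import PCA
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
n_sites, n_rates = 500, 20
nitrogen_rates = np.linspace(0, 150, n_rates)

# Placeholder for the CNN-generated curves: one yield-vs-N curve per site.
curves = 40 + rng.uniform(0.1, 0.6, (n_sites, 1)) * np.sqrt(nitrogen_rates)

# Discretized functional PCA: the leading scores summarize curve shape.
scores = PCA(n_components=3).fit_transform(curves)

# Cluster the shape descriptors into management zones.
mz_labels = KMeans(n_clusters=4, n_init=10, random_state=0).fit_predict(scores)
```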
Abstract:Response curves exhibit the magnitude of the response of a sensitive system to a varying stimulus. However, the response of such systems may be sensitive to multiple stimuli (i.e., input features) that are not necessarily independent. As a consequence, the shape of the response curves generated for a selected input feature (referred to as the "active feature") may depend on the values of the other input features (referred to as "passive features"). In this work, we consider the case of systems whose response is approximated using regression neural networks. We propose to use counterfactual explanations (CFEs) to identify the features with the highest relevance to the shape of response curves generated by neural network black boxes. CFEs are generated by a genetic algorithm-based approach that solves a multi-objective optimization problem. In particular, given a response curve generated for an active feature, a CFE finds the minimum combination of passive features that need to be modified to alter the shape of the response curve. We tested our method on a synthetic dataset with 1-D inputs and two crop yield prediction datasets with 2-D inputs. The relevance ranking of features and feature combinations obtained on the synthetic dataset coincided with the analysis of the equation used to generate the problem. Results obtained on the yield prediction datasets revealed that the impact of passive features on fertilizer responsivity depends on the terrain characteristics of each field.
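The sketch below illustrates how a response curve can be generated for an active feature while passive features are held fixed, and how a counterfactual change to a passive feature alters the curve shape; `model` is a hypothetical stand-in for the trained regression NN, and the genetic algorithm itself is omitted:

```python
# Sketch: sweep the "active" feature while "passive" features stay fixed,
# then measure how much a counterfactual change to the passive features
# alters the resulting response curve.
import numpy as np

def model(X):
    return np.tanh(X[:, 0]) * (1.0 + 0.8 * X[:, 1])   # toy black box

def response_curve(model, active_idx, feature_values, grid):
    X = np.tile(feature_values, (len(grid), 1))
    X[:, active_idx] = grid                            # sweep the active feature
    return model(X)

grid = np.linspace(-3, 3, 100)
original = response_curve(model, 0, np.array([0.0, 0.2]), grid)
counterfactual = response_curve(model, 0, np.array([0.0, 1.5]), grid)

# Shape change (normalized L2 distance) that a GA-based CFE would try to induce
# while modifying as few passive features as possible.
shape_change = np.linalg.norm(original - counterfactual) / np.linalg.norm(original)
```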
Abstract:Accurate uncertainty quantification is necessary to enhance the reliability of deep learning models in real-world applications. In the case of regression tasks, prediction intervals (PIs) should be provided along with the deterministic predictions of deep learning models. Such PIs are useful or "high-quality" only if they are sufficiently narrow while capturing most of the probability density. In this paper, we present a method to learn prediction intervals for regression-based neural networks automatically, in addition to the conventional target predictions. In particular, we train two companion neural networks: one with a single output, the target estimate, and another with two outputs, the upper and lower bounds of the corresponding PI. Our main contribution is the design of a loss function for the PI-generation network that takes into account the output of the target-estimation network and has two optimization objectives: minimizing the mean prediction interval width and ensuring PI integrity using constraints that implicitly maximize the prediction interval probability coverage. Both objectives are balanced within the loss function using a self-adaptive coefficient. Furthermore, we apply a Monte Carlo-based approach to evaluate the model uncertainty in the learned PIs. Experiments using a synthetic dataset, six benchmark datasets, and a real-world crop yield prediction dataset showed that our method maintained the nominal probability coverage and produced narrower PIs than those generated by three state-of-the-art neural-network-based methods, without detriment to its target estimation accuracy.
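The following is an illustrative approximation (not the exact loss proposed here) of a PI loss with the two stated objectives, written in PyTorch under assumed tensor shapes; `lam` plays the role of the self-adaptive coefficient:

```python
# Sketch of a PI-style loss: penalize mean interval width and, via a coefficient
# `lam`, penalize targets that fall outside [lower, upper] or intervals that
# exclude the companion network's target estimate.
import torch

def pi_loss(lower, upper, y_true, y_pred, lam):
    width = (upper - lower).mean()                     # objective 1: narrow PIs
    below = torch.relu(lower - y_true)                 # violation below the interval
    above = torch.relu(y_true - upper)                 # violation above the interval
    miss_target = torch.relu(lower - y_pred) + torch.relu(y_pred - upper)
    coverage_penalty = (below + above + miss_target).mean()
    return width + lam * coverage_penalty              # objective 2: PI integrity

# In a self-adaptive scheme, `lam` would be updated during training, e.g.
# increased when empirical coverage drops below the nominal level.
```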
Abstract:Crop yield prediction is one of the tasks of Precision Agriculture that can be automated based on multi-source periodic observations of the fields. We tackle the yield prediction problem using a Convolutional Neural Network (CNN) trained on data that combines radar satellite imagery and on-ground information. We present a CNN architecture called Hyper3DNetReg that takes a multi-channel input image and outputs a two-dimensional raster in which each pixel represents the predicted yield value of the corresponding input pixel. We utilize radar data acquired from the Sentinel-1 satellites, while the on-ground data correspond to a set of six raster features: nitrogen rate applied, precipitation, slope, elevation, topographic position index (TPI), and aspect. We use data collected during the early stage of the winter wheat growing season (March) to predict yield values during the harvest season (August). We present experiments over four fields of winter wheat and show that our proposed methodology outperforms five compared methods: multiple linear regression, an ensemble of feedforward networks using AdaBoost, a stacked autoencoder, and two other CNN architectures.
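As a sketch of the input-output contract described above (multi-channel patch in, 2-D yield raster out), the following minimal fully convolutional regressor is illustrative only and is not the actual Hyper3DNetReg architecture; the channel count is an assumption:

```python
# Sketch: a tiny fully convolutional regressor mapping a multi-channel input
# patch to a 2-D raster of per-pixel yield predictions.
import torch
import torch.nn as nn

class TinyYieldCNN(nn.Module):
    def __init__(self, in_channels: int):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_channels, 32, kernel_size=3, padding=1), nn.ReLU(),
            nn.Conv2d(32, 32, kernel_size=3, padding=1), nn.ReLU(),
            nn.Conv2d(32, 1, kernel_size=1),            # one yield value per pixel
        )

    def forward(self, x):
        return self.net(x).squeeze(1)                   # (N, H, W) yield raster

# Example: assumed 8 channels (Sentinel-1 backscatter plus the six on-ground rasters).
x = torch.randn(4, 8, 32, 32)                           # (batch, channels, H, W)
yield_map = TinyYieldCNN(in_channels=8)(x)              # (4, 32, 32)
```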
Abstract:In recent years, Hyperspectral Imaging (HSI) has become a powerful source of reliable data in applications such as remote sensing, agriculture, and biomedicine. However, hyperspectral images are highly data-dense and often benefit from methods that reduce the number of spectral bands while retaining the most useful information for a specific application. We propose a novel band selection method that selects a reduced set of wavelengths, obtained from an HSI system, in the context of image classification. Our approach consists of two main steps: the first uses a filter-based approach to find relevant spectral bands based on a collinearity analysis between a band and its neighbors. This analysis helps to remove redundant bands and dramatically reduces the search space. The second step applies a wrapper-based approach that selects bands from the reduced set based on their information entropy values and trains a compact Convolutional Neural Network (CNN) to evaluate the performance of the current selection. We present classification results obtained with our method and compare them to those of other feature selection methods on two hyperspectral image datasets. Additionally, we use the original hyperspectral data cube to simulate the process of using actual filters in a multispectral imager. We show that our method produces results that are more suitable for a multispectral sensor design.
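A rough sketch of the collinearity filter and the entropy ranking, with illustrative thresholds, bin counts, and a random toy data cube standing in for real hyperspectral data; the wrapper's compact CNN is only indicated in a comment:

```python
# Sketch: drop bands nearly collinear with their spectral neighbors
# (high inter-band correlation), then rank the survivors by entropy.
import numpy as np

def filter_collinear_bands(cube, corr_threshold=0.98):
    """cube: (H, W, B) hyperspectral image. Returns indices of retained bands."""
    H, W, B = cube.shape
    flat = cube.reshape(-1, B).astype(float)
    keep = [0]
    for b in range(1, B):
        r = np.corrcoef(flat[:, keep[-1]], flat[:, b])[0, 1]
        if abs(r) < corr_threshold:          # not redundant with the last kept band
            keep.append(b)
    return keep

def band_entropy(band, bins=64):
    hist, _ = np.histogram(band, bins=bins)
    p = hist / hist.sum()
    p = p[p > 0]
    return -np.sum(p * np.log2(p))

cube = np.random.rand(64, 64, 200)           # toy data cube
candidates = filter_collinear_bands(cube)
ranked = sorted(candidates, key=lambda b: band_entropy(cube[:, :, b]), reverse=True)
# The wrapper step would then train a compact CNN on subsets drawn from `ranked`
# and keep the subset with the best classification performance.
```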
Abstract:Estimating the parameters of a system from indirect measurements of that system is a problem that occurs in many fields of engineering, known as the inverse problem. It also arises in underwater acoustics, especially in media that are not sufficiently transparent. In those cases, identifying the shape of objects using only acoustic signals is challenging because it relies on echoes produced by objects whose densities differ from that of the medium. In general, these echoes are difficult to interpret since the information they carry is usually noisy and redundant. In this paper, we propose a convolutional neural network model with an encoder-decoder configuration to estimate both the localization and the shape of objects that produce reflected signals. This model allows us to obtain a 2D velocity model. The model was trained with data generated by the finite-difference method, and it achieved 98.58% in the intersection-over-union metric, 75.88% in precision, and 64.69% in sensitivity.
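The sketch below shows an encoder-decoder CNN with the same input-output structure (echo traces in, 2-D velocity model out); the layer sizes and trace dimensions are assumptions and do not reproduce the architecture described here:

```python
# Sketch: encoder-decoder CNN mapping recorded echo traces (treated as a
# 2-D time-by-receiver image) to a 2-D velocity model of the same size.
import torch
import torch.nn as nn

class EchoToVelocity(nn.Module):
    def __init__(self):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv2d(1, 16, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(),
        )
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(32, 16, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(16, 1, 4, stride=2, padding=1),
        )

    def forward(self, x):
        return self.decoder(self.encoder(x))

traces = torch.randn(2, 1, 128, 64)          # (batch, 1, time samples, receivers)
velocity_map = EchoToVelocity()(traces)      # (2, 1, 128, 64) velocity model
```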
Abstract:Segmenting clouds in high-resolution satellite images is an arduous and challenging task due to the many types of geographies and clouds a satellite can capture. Therefore, it needs to be automated and optimized, especially for those who regularly process large amounts of satellite images, such as governmental institutions. In that sense, the contribution of this work is twofold: we present the CloudPeru2 dataset, consisting of 22,400 images of 512x512 pixels and their respective hand-drawn cloud masks, and we propose an end-to-end cloud segmentation method using a Convolutional Neural Network (CNN) based on the DeepLab v3+ architecture. The results over the test set achieved an accuracy of 96.62%, a precision of 96.46%, a specificity of 98.53%, and a sensitivity of 96.72%, which are superior to those of the compared methods.
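As a sketch of the segmentation setup, the snippet below instantiates torchvision's DeepLab v3 model (torchvision does not ship v3+, so this is a close stand-in, not the proposed network) for binary cloud masks on 512x512 patches:

```python
# Sketch: a DeepLab-style network producing a per-pixel cloud / not-cloud mask.
import torch
from torchvision.models.segmentation import deeplabv3_resnet50

model = deeplabv3_resnet50(weights=None, num_classes=2)   # cloud / not-cloud
model.eval()

images = torch.randn(2, 3, 512, 512)                      # 512x512 patches
with torch.no_grad():
    logits = model(images)["out"]                          # (2, 2, 512, 512)
mask = logits.argmax(dim=1)                                # per-pixel cloud mask
```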