Abstract:Machine learning methods have been remarkably successful at extracting essential information from data across a wide range of application areas. An exciting and relatively recent development is the uptake of machine learning in the natural sciences, where the major goal is to obtain novel scientific insights and discoveries from observational or simulated data. A prerequisite for obtaining a scientific outcome is domain knowledge, which is needed not only to gain explainability but also to enhance scientific consistency. In this article we review explainable machine learning in view of applications in the natural sciences and discuss three core elements that we identify as relevant in this context: transparency, interpretability, and explainability. With respect to these core elements, we survey recent scientific works incorporating machine learning, paying particular attention to the way explainable machine learning is used in their respective application areas.
Abstract:Few-shot learning is a technique for learning a model from a very small amount of labeled training data by transferring knowledge from relevant tasks. In this paper, we propose a few-shot learning method for wearable-sensor-based human activity recognition, a task that seeks to infer high-level human activities from low-level sensor inputs. Because human-generated activity data are costly to obtain and activity modes share widespread similarities, it can be more efficient to borrow information from existing activity recognition models than to collect more data and train a new model from scratch when only a few samples are available for model training. The proposed few-shot human activity recognition method uses a deep learning model for feature extraction and classification, with knowledge transfer performed via model parameter transfer. To alleviate negative transfer, we propose a metric of cross-domain class-wise relevance so that knowledge of higher relevance is assigned a larger weight during knowledge transfer. Promising results from extensive experiments demonstrate the advantages of the proposed approach.
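As a rough illustration of relevance-weighted parameter transfer of the kind described above (this is a minimal sketch, not the paper's method: the cosine-similarity relevance, the use of class-mean features, and the helper names are assumptions), one might warm-start a target classifier from the source classifier's class-wise parameters:

```python
import numpy as np

def cosine_relevance(source_means, target_means):
    """Cross-domain class-wise relevance as cosine similarity of class-mean features
    (illustrative stand-in for the relevance metric proposed in the paper)."""
    s = source_means / np.linalg.norm(source_means, axis=1, keepdims=True)
    t = target_means / np.linalg.norm(target_means, axis=1, keepdims=True)
    return t @ s.T                      # shape: (n_target_classes, n_source_classes)

def transfer_classifier_weights(source_weights, relevance):
    """Initialize target-class weights as a relevance-weighted mixture of source-class
    weights, so that more relevant source knowledge gets a larger weight in transfer."""
    relevance = np.clip(relevance, 0.0, None)
    relevance /= relevance.sum(axis=1, keepdims=True) + 1e-12
    return relevance @ source_weights   # shape: (n_target_classes, feature_dim)

# toy example: 5 source activity classes, 3 target classes, 16-dim deep features
rng = np.random.default_rng(0)
src_means, tgt_means = rng.normal(size=(5, 16)), rng.normal(size=(3, 16))
src_w = rng.normal(size=(5, 16))
tgt_w0 = transfer_classifier_weights(src_w, cosine_relevance(src_means, tgt_means))
print(tgt_w0.shape)  # (3, 16) -- warm start before fine-tuning on the few labeled samples
```

Fine-tuning on the few available labeled target samples would then proceed from this warm start.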
Abstract:Self-taught learning is a technique that uses a large amount of unlabeled data as source samples to improve task performance on target samples. Compared with other transfer learning techniques, self-taught learning can be applied to a broader set of scenarios owing to its loose restrictions on the source data. However, knowledge transferred from source samples that are not sufficiently related to the target domain may negatively influence the target learner, which is referred to as negative transfer. In this paper, we propose a metric for the relevance between a source sample and the target samples. More specifically, both source and target samples are reconstructed through a single-layer autoencoder, with a linear relationship between source and target samples simultaneously enforced. An l_{2,1}-norm sparsity constraint is imposed on the transformation matrix to identify the source samples relevant to the target domain. Source-domain samples deemed relevant are assigned pseudo-labels reflecting their relevance to the target-domain samples and are combined with the target samples to provide an expanded training set for classifier training. Local data structures are also preserved during source sample selection through spectral graph analysis. Promising results from extensive experiments demonstrate the advantages of the proposed approach.
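One objective consistent with this description reads as follows (the specific terms, trade-off weights, and the tied encoder/decoder are assumptions made for illustration, not the paper's exact formulation):

\min_{W, A} \; \|X_t - W^{\top}\sigma(W X_t)\|_F^2 + \|X_s - W^{\top}\sigma(W X_s)\|_F^2 + \alpha \|X_s A - X_t\|_F^2 + \beta \|A\|_{2,1},

where X_s and X_t collect source and target samples as columns, \sigma(\cdot) is the activation of the single-layer autoencoder with weight matrix W, and A is the transformation matrix enforcing the linear relationship between source and target samples. The l_{2,1} penalty drives entire rows of A to zero, so the norms of the surviving rows score how relevant each source sample is to the target domain; an additional graph-Laplacian regularizer would encode the spectral-graph preservation of local data structure.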
Abstract:Feature selection is a dimensionality reduction technique that selects a subset of representative features from high-dimensional data by eliminating irrelevant and redundant features. Recently, feature selection combined with sparse learning has attracted significant attention due to its outstanding performance compared with traditional feature selection methods, which ignore correlations between features. These works first map the data onto a low-dimensional subspace and then select features by imposing a sparsity constraint on the transformation matrix. However, they are restricted by design to linear data transformations, a potential drawback given that the underlying correlation structures of data are often non-linear. To leverage a more sophisticated embedding, we propose an unsupervised feature selection approach that uses a single-layer autoencoder in a joint framework of feature selection and manifold learning. More specifically, we enforce column sparsity on the weight matrix connecting the input layer and the hidden layer, as in previous work. Additionally, we incorporate spectral graph analysis of the projected data into the learning process so that local data geometry is preserved from the original data space to the low-dimensional feature space. Extensive experiments are conducted on image, audio, text, and biological data. The promising experimental results validate the superiority of the proposed method.
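A joint objective of the kind described could be written as follows (\alpha and \beta are placeholder trade-off weights and the untied decoder is an assumption; this is a sketch of the general form, not the paper's exact objective):

\min_{W_1, W_2} \; \|X - W_2\,\sigma(W_1 X)\|_F^2 + \alpha \sum_{j=1}^{d} \|(W_1)_{:,j}\|_2 + \beta\, \mathrm{tr}\big(\sigma(W_1 X)\, L\, \sigma(W_1 X)^{\top}\big),

where X holds the d-dimensional samples as columns, W_1 and W_2 are the encoder and decoder weights, the middle term is the column-wise l_{2,1} norm that zeroes out entire columns of W_1 (each column corresponds to one input feature, so the surviving columns select the features), and L is the graph Laplacian built on the original data, so that the trace term preserves local geometry in the hidden representation.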
Abstract:This paper proposes an out-of-sample extension framework for a global manifold learning algorithm (Isomap) that uses temporal information about the out-of-sample points to make the embedding more robust to noise and artifacts. Given a set of noise-free training data and its embedding, the proposed framework extends the embedding to a noisy time series. This is achieved by adding a spatio-temporal compactness term to the optimization objective of the embedding. To the best of our knowledge, this is the first method for out-of-sample extension of manifold embeddings that leverages the timing information available for the extension set. Experimental results demonstrate that our out-of-sample extension algorithm yields a more robust and accurate embedding of sequentially ordered image data in the presence of various types of noise and artifacts than other timing-aware embeddings. Additionally, we show that an out-of-sample extension framework based on the proposed algorithm outperforms the state of the art in eye-gaze estimation.
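One plausible form of the extended objective for a noisy frame x_t is given below (the one-step smoothness penalty and its weight \lambda are assumptions about how the spatio-temporal compactness term enters; this is a sketch, not the paper's exact formulation):

y_t = \arg\min_{y} \; \sum_{i=1}^{n} \big( \|y - y_i\|_2 - \hat{d}_G(x_t, x_i) \big)^2 + \lambda \,\|y - y_{t-1}\|_2^2,

where \{y_i\} are the embeddings of the noise-free training points, \hat{d}_G(x_t, x_i) is the estimated geodesic distance from the noisy frame to the i-th training point (as in standard out-of-sample Isomap), and the second term ties consecutive embedded frames of the time series together, which is what makes the extension robust to per-frame noise and artifacts.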
Abstract:This paper proposes a new hyperspectral unmixing method for nonlinearly mixed hyperspectral data that uses a semantic representation in a semi-supervised fashion, assuming the availability of a spectral reference library. Existing semi-supervised unmixing algorithms select the members of an endmember library that are present at each pixel; most such methods assume a linear mixing model. However, those methods fail in the presence of nonlinear mixing among the observed spectra. To address this issue, we develop an endmember selection method using a recently proposed semantic spectral representation obtained via a non-homogeneous hidden Markov chain (NHMC) model applied to a wavelet transform of the spectra. The semantic representation can encode spectrally discriminative features for any observed spectrum and, therefore, our proposed method can perform endmember selection without any assumption on the mixing model. Experimental results show that, in the presence of sufficiently nonlinear mixing, our proposed method outperforms dictionary-based sparse unmixing approaches based on linear models.
Abstract:Sparse modeling has been widely and successfully used in many applications such as computer vision, machine learning, and pattern recognition. Alongside those applications, significant research has studied the theoretical limits and algorithm design of convex relaxations in sparse modeling. However, theoretical analyses of non-negative versions of sparse modeling are limited in the literature to either a noiseless setting or scenarios with a specific statistical noise model such as Gaussian noise. This paper studies the performance of non-negative sparse modeling in a more general scenario where the observed signals have an unknown arbitrary distortion, focusing in particular on non-negativity-constrained, L1-penalized least squares, and gives an exact bound under which this problem recovers the correct signal elements. We pose two conditions that guarantee correct signal recovery: the minimum coefficient condition (MCC) and the nonlinearity vs. subset coherence condition (NSCC). The former defines the minimum weight for each of the correct atoms present in the signal, and the latter defines the tolerable deviation from the linear model relative to the positive subset coherence (PSC), a novel type of "coherence" metric. We provide rigorous performance guarantees based on these conditions and experimentally verify their precise predictive power in a hyperspectral data unmixing application.
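The problem studied above can be illustrated with a small numerical sketch (the dictionary, support, and distortion below are entirely synthetic, and scikit-learn's Lasso with positive=True is used as a generic off-the-shelf solver for non-negativity-constrained, L1-penalized least squares):

```python
import numpy as np
from sklearn.linear_model import Lasso

# Toy instance of non-negativity constrained, L1-penalized least squares:
#   minimize_{x >= 0}  0.5 * ||y - A x||_2^2 + lam * ||x||_1
# (sklearn's Lasso minimizes (1/(2n)) * ||y - A x||_2^2 + alpha * ||x||_1,
#  with positive=True enforcing the non-negativity constraint;
#  the small synthetic distortion below is illustrative.)
rng = np.random.default_rng(0)
A = np.abs(rng.normal(size=(100, 30)))              # non-negative dictionary (e.g., spectral library)
x_true = np.zeros(30)
x_true[[3, 11, 25]] = [0.6, 0.3, 0.1]               # true atoms and their weights
y = A @ x_true + 0.01 * rng.standard_normal(100)    # observation with an arbitrary small distortion

model = Lasso(alpha=1e-3, positive=True, fit_intercept=False, max_iter=10000).fit(A, y)
support = np.flatnonzero(model.coef_ > 1e-6)
print("recovered support:", support)                # ideally {3, 11, 25}
```

When conditions of the MCC/NSCC type hold, the recovered support matches the true one despite the distortion.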
Abstract:We consider the problem of selecting an optimal mask for an image manifold, i.e., choosing a subset of image pixels that preserves the manifold's geometric structure present in the original data. Such masking implements a form of compressive sensing via emerging imaging sensor platforms whose power expense grows with the number of pixels acquired. Our goal is for the manifold learned from masked images to resemble its full-image counterpart as closely as possible. More precisely, we show that one can indeed accurately learn an image manifold without having to consider a large majority of the image pixels. In doing so, we consider two masking methods that preserve the local and global geometric structure of the manifold, respectively. In each case, the process of finding the optimal masking pattern can be cast as a binary integer program, which is computationally expensive to solve but can be approximated by a fast greedy algorithm. Numerical experiments show that the relevant manifold structure is preserved through the data-dependent masking process, even for modest mask sizes.
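A fast greedy heuristic in the spirit of the greedy approximation mentioned above might look as follows (this is a sketch under assumptions: the criterion of matching rescaled pairwise squared distances is an illustrative stand-in for the paper's actual geometric-structure objective, and all names are hypothetical):

```python
import numpy as np

def greedy_mask(images, n_keep):
    """Greedily pick pixels so that pairwise squared distances computed from the
    kept pixels (rescaled to be unbiased) match the full-image pairwise distances.
    Illustrative surrogate for the binary-integer-program mask selection."""
    X = images.reshape(len(images), -1)                          # (n_images, n_pixels)
    per_pixel = (X[:, None, :] - X[None, :, :]) ** 2             # per-pixel distance contributions
    target = per_pixel.sum(axis=-1)                              # full pairwise squared distances
    n_pixels = X.shape[1]
    mask, current = [], np.zeros_like(target)
    for k in range(1, n_keep + 1):
        scale = n_pixels / k                                     # rescale partial sums
        errs = [np.inf if p in mask else
                np.linalg.norm(scale * (current + per_pixel[:, :, p]) - target)
                for p in range(n_pixels)]
        best = int(np.argmin(errs))
        mask.append(best)
        current += per_pixel[:, :, best]
    return np.array(mask)

# toy usage: 20 random 8x8 "images", keep 12 of the 64 pixels
imgs = np.random.default_rng(0).normal(size=(20, 8, 8))
print(greedy_mask(imgs, 12))
```

Each greedy step costs one pass over the remaining pixels, which is what makes the heuristic fast compared with solving the binary integer program exactly.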
Abstract:Hyperspectral signature classification is a quantitative analysis approach for hyperspectral imagery that detects and classifies the constituent materials of the scene at the pixel level. The classification procedure can operate directly on the hyperspectral data or on features extracted from the corresponding hyperspectral signatures, such as information about the signature's energy or shape. In this paper, we describe a technique that applies non-homogeneous hidden Markov chain (NHMC) models to hyperspectral signature classification. The basic idea is to use statistical models (such as the NHMC) to characterize wavelet coefficients that capture the semantics (i.e., structural information) of the spectrum at multiple levels. Experimental results show that the approach based on NHMC models can outperform relevant existing approaches in classification tasks.
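A minimal pipeline sketch of such wavelet-based signature classification is given below; note that it substitutes raw wavelet-coefficient magnitudes for the NHMC hidden-state labels used in the paper, and that the synthetic two-material spectra, the wavelet choice, and the classifier are illustrative assumptions:

```python
import numpy as np
import pywt
from sklearn.svm import SVC
from sklearn.model_selection import cross_val_score

def wavelet_features(spectra, scales=np.arange(1, 9)):
    """Multi-scale wavelet features for hyperspectral signatures.
    The paper models these coefficients with an NHMC and uses the hidden state
    labels as 'semantic' features; here we simply flatten the coefficient
    magnitudes as a stand-in."""
    feats = []
    for s in spectra:                              # s: 1-D reflectance spectrum
        coefs, _ = pywt.cwt(s, scales, 'mexh')     # (n_scales, n_bands)
        feats.append(np.abs(coefs).ravel())
    return np.asarray(feats)

# toy usage with synthetic spectra of two "materials"
rng = np.random.default_rng(0)
bands = np.linspace(0, 1, 200)
class0 = np.exp(-((bands - 0.3) ** 2) / 0.01) + 0.02 * rng.standard_normal((50, 200))
class1 = np.exp(-((bands - 0.7) ** 2) / 0.02) + 0.02 * rng.standard_normal((50, 200))
X = wavelet_features(np.vstack([class0, class1]))
y = np.r_[np.zeros(50), np.ones(50)]
print(cross_val_score(SVC(), X, y, cv=5).mean())   # cross-validated classification accuracy
```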
Abstract:The emergence of low-cost sensor architectures for diverse modalities has made it possible to deploy sensor arrays that capture a single event from a large number of vantage points and in multiple modalities. In many scenarios, these sensors acquire very high-dimensional data such as audio signals, images, and video. To cope with such high-dimensional data, we typically rely on low-dimensional models. Manifold models are particularly powerful, capturing the structure of high-dimensional data when it is governed by a low-dimensional set of parameters. However, these models typically do not take into account dependencies among multiple sensors. We thus propose a new joint manifold framework for data ensembles that exploits such dependencies. We show that simple algorithms can exploit the joint manifold structure to improve their performance on standard signal processing applications. Additionally, recent results concerning dimensionality reduction for manifolds enable us to formulate a network-scalable data compression scheme that uses random projections of the sensed data. This scheme efficiently fuses the data from all sensors through the addition of such projections, regardless of the data modalities and dimensions.
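The addition-based fusion can be made concrete with a small sketch (the number of sensors, the per-sensor dimensions, and the Gaussian projections below are arbitrary choices for illustration): summing each sensor's random projection is algebraically identical to taking a single random projection of the concatenated joint-manifold sample.

```python
import numpy as np

rng = np.random.default_rng(0)
J, M = 4, 64                       # number of sensors, projection dimension
dims = [1000, 2048, 500, 1200]     # per-sensor data dimensions (illustrative)

# each sensor draws its own random projection and transmits a length-M vector
signals = [rng.normal(size=n) for n in dims]
Phis = [rng.normal(size=(M, n)) / np.sqrt(M) for n in dims]
fused = sum(Phi @ x for Phi, x in zip(Phis, signals))   # fusion by simple addition

# the sum equals one random projection of the concatenated (joint-manifold) sample
Phi_joint = np.hstack(Phis)
x_joint = np.concatenate(signals)
print(np.allclose(fused, Phi_joint @ x_joint))          # True
```

Because each sensor transmits only a length-M vector and fusion is a simple in-network sum, the scheme scales with the number of sensors regardless of their individual data dimensions.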