Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ryan P. Browne

One Line To Rule Them All: Generating LO-Shot Soft-Label Prototypes

Feb 15, 2021

Ilia Sucholutsky, Nam-Hwui Kim, Ryan P. Browne, Matthias Schonlau

Figure 1 for One Line To Rule Them All: Generating LO-Shot Soft-Label Prototypes

Figure 2 for One Line To Rule Them All: Generating LO-Shot Soft-Label Prototypes

Figure 3 for One Line To Rule Them All: Generating LO-Shot Soft-Label Prototypes

Figure 4 for One Line To Rule Them All: Generating LO-Shot Soft-Label Prototypes

Abstract:Increasingly large datasets are rapidly driving up the computational costs of machine learning. Prototype generation methods aim to create a small set of synthetic observations that accurately represent a training dataset but greatly reduce the computational cost of learning from it. Assigning soft labels to prototypes can allow increasingly small sets of prototypes to accurately represent the original training dataset. Although foundational work on `less than one'-shot learning has proven the theoretical plausibility of learning with fewer than one observation per class, developing practical algorithms for generating such prototypes remains an unexplored territory. We propose a novel, modular method for generating soft-label prototypical lines that still maintains representational accuracy even when there are fewer prototypes than the number of classes in the data. In addition, we propose the Hierarchical Soft-Label Prototype k-Nearest Neighbor classification algorithm based on these prototypical lines. We show that our method maintains high classification accuracy while greatly reducing the number of prototypes required to represent a dataset, even when working with severely imbalanced and difficult data. Our code is available at https://github.com/ilia10000/SLkNN.

* 8 pages

Via

Access Paper or Ask Questions

Asymmetric Clusters and Outliers: Mixtures of Multivariate Contaminated Shifted Asymmetric Laplace Distributions

Apr 24, 2018

Katherine Morris, Antonio Punzo, Paul D. McNicholas, Ryan P. Browne

Figure 1 for Asymmetric Clusters and Outliers: Mixtures of Multivariate Contaminated Shifted Asymmetric Laplace Distributions

Figure 2 for Asymmetric Clusters and Outliers: Mixtures of Multivariate Contaminated Shifted Asymmetric Laplace Distributions

Figure 3 for Asymmetric Clusters and Outliers: Mixtures of Multivariate Contaminated Shifted Asymmetric Laplace Distributions

Figure 4 for Asymmetric Clusters and Outliers: Mixtures of Multivariate Contaminated Shifted Asymmetric Laplace Distributions

Abstract:Mixtures of multivariate contaminated shifted asymmetric Laplace distributions are developed for handling asymmetric clusters in the presence of outliers (also referred to as bad points herein). In addition to the parameters of the related non-contaminated mixture, for each (asymmetric) cluster, our model has one parameter controlling the proportion of outliers and one specifying the degree of contamination. Crucially, these parameters do not have to be specified a priori, adding a flexibility to our approach that is absent from other approaches such as trimming. Moreover, each observation is given a posterior probability of belonging to a particular cluster, and of being an outlier or not; advantageously, this allows for the automatic detection of outliers. An expectation-conditional maximization algorithm is outlined for parameter estimation and various implementation issues are discussed. The behaviour of the proposed model is investigated, and compared with well-established finite mixtures, on artificial and real data.

Via

Access Paper or Ask Questions

Multivariate response and parsimony for Gaussian cluster-weighted models

Feb 26, 2016

Utkarsh J. Dang, Antonio Punzo, Paul D. McNicholas, Salvatore Ingrassia, Ryan P. Browne

Figure 1 for Multivariate response and parsimony for Gaussian cluster-weighted models

Figure 2 for Multivariate response and parsimony for Gaussian cluster-weighted models

Figure 3 for Multivariate response and parsimony for Gaussian cluster-weighted models

Figure 4 for Multivariate response and parsimony for Gaussian cluster-weighted models

Abstract:A family of parsimonious Gaussian cluster-weighted models is presented. This family concerns a multivariate extension to cluster-weighted modelling that can account for correlations between multivariate responses. Parsimony is attained by constraining parts of an eigen-decomposition imposed on the component covariance matrices. A sufficient condition for identifiability is provided and an expectation-maximization algorithm is presented for parameter estimation. Model performance is investigated on both synthetic and classical real data sets and compared with some popular approaches. Finally, accounting for linear dependencies in the presence of a linear regression structure is shown to offer better performance, vis-\`{a}-vis clustering, over existing methodologies.

Via

Access Paper or Ask Questions

A Mixture of Generalized Hyperbolic Factor Analyzers

May 22, 2015

Cristina Tortora, Paul D. McNicholas, Ryan P. Browne

Figure 1 for A Mixture of Generalized Hyperbolic Factor Analyzers

Figure 2 for A Mixture of Generalized Hyperbolic Factor Analyzers

Figure 3 for A Mixture of Generalized Hyperbolic Factor Analyzers

Abstract:Model-based clustering imposes a finite mixture modelling structure on data for clustering. Finite mixture models assume that the population is a convex combination of a finite number of densities, the distribution within each population is a basic assumption of each particular model. Among all distributions that have been tried, the generalized hyperbolic distribution has the advantage that is a generalization of several other methods, such as the Gaussian distribution, the skew t-distribution, etc. With specific parameters, it can represent either a symmetric or a skewed distribution. While its inherent flexibility is an advantage in many ways, it means the estimation of more parameters than its special and limiting cases. The aim of this work is to propose a mixture of generalized hyperbolic factor analyzers to introduce parsimony and extend the method to high dimensional data. This work can be seen as an extension of the mixture of factor analyzers model to generalized hyperbolic mixtures. The performance of our generalized hyperbolic factor analyzers is illustrated on real data, where it performs favourably compared to its Gaussian analogue.

Via

Access Paper or Ask Questions

Model Based Clustering of High-Dimensional Binary Data

Apr 27, 2014

Yang Tang, Ryan P. Browne, Paul D. McNicholas

Figure 1 for Model Based Clustering of High-Dimensional Binary Data

Figure 2 for Model Based Clustering of High-Dimensional Binary Data

Figure 3 for Model Based Clustering of High-Dimensional Binary Data

Figure 4 for Model Based Clustering of High-Dimensional Binary Data

Abstract:We propose a mixture of latent trait models with common slope parameters (MCLT) for model-based clustering of high-dimensional binary data, a data type for which few established methods exist. Recent work on clustering of binary data, based on a $d$-dimensional Gaussian latent variable, is extended by incorporating common factor analyzers. Accordingly, our approach facilitates a low-dimensional visual representation of the clusters. We extend the model further by the incorporation of random block effects. The dependencies in each block are taken into account through block-specific parameters that are considered to be random variables. A variational approximation to the likelihood is exploited to derive a fast algorithm for determining the model parameters. Our approach is demonstrated on real and simulated data.

Via

Access Paper or Ask Questions

Parsimonious Shifted Asymmetric Laplace Mixtures

Nov 01, 2013

Brian C. Franczak, Paul D. McNicholas, Ryan P. Browne, Paula M. Murray

Figure 1 for Parsimonious Shifted Asymmetric Laplace Mixtures

Figure 2 for Parsimonious Shifted Asymmetric Laplace Mixtures

Figure 3 for Parsimonious Shifted Asymmetric Laplace Mixtures

Figure 4 for Parsimonious Shifted Asymmetric Laplace Mixtures

Abstract:A family of parsimonious shifted asymmetric Laplace mixture models is introduced. We extend the mixture of factor analyzers model to the shifted asymmetric Laplace distribution. Imposing constraints on the constitute parts of the resulting decomposed component scale matrices leads to a family of parsimonious models. An explicit two-stage parameter estimation procedure is described, and the Bayesian information criterion and the integrated completed likelihood are compared for model selection. This novel family of models is applied to real data, where it is compared to its Gaussian analogue within clustering and classification paradigms.

Via

Access Paper or Ask Questions

Mixtures of Common Skew-t Factor Analyzers

Aug 30, 2013

Paula M. Murray, Paul D. McNicholas, Ryan P. Browne

Figure 1 for Mixtures of Common Skew-t Factor Analyzers

Figure 2 for Mixtures of Common Skew-t Factor Analyzers

Figure 3 for Mixtures of Common Skew-t Factor Analyzers

Figure 4 for Mixtures of Common Skew-t Factor Analyzers

Abstract:A mixture of common skew-t factor analyzers model is introduced for model-based clustering of high-dimensional data. By assuming common component factor loadings, this model allows clustering to be performed in the presence of a large number of mixture components or when the number of dimensions is too large to be well-modelled by the mixtures of factor analyzers model or a variant thereof. Furthermore, assuming that the component densities follow a skew-t distribution allows robust clustering of skewed data. The alternating expectation-conditional maximization algorithm is employed for parameter estimation. We demonstrate excellent clustering performance when our model is applied to real and simulated data.This paper marks the first time that skewed common factors have been used.

Via

Access Paper or Ask Questions

Constrained Optimization for a Subset of the Gaussian Parsimonious Clustering Models

Jun 25, 2013

Ryan P. Browne, Sanjeena Subedi, Paul McNicholas

Figure 1 for Constrained Optimization for a Subset of the Gaussian Parsimonious Clustering Models

Figure 2 for Constrained Optimization for a Subset of the Gaussian Parsimonious Clustering Models

Figure 3 for Constrained Optimization for a Subset of the Gaussian Parsimonious Clustering Models

Figure 4 for Constrained Optimization for a Subset of the Gaussian Parsimonious Clustering Models

Abstract:The expectation-maximization (EM) algorithm is an iterative method for finding maximum likelihood estimates when data are incomplete or are treated as being incomplete. The EM algorithm and its variants are commonly used for parameter estimation in applications of mixture models for clustering and classification. This despite the fact that even the Gaussian mixture model likelihood surface contains many local maxima and is singularity riddled. Previous work has focused on circumventing this problem by constraining the smallest eigenvalue of the component covariance matrices. In this paper, we consider constraining the smallest eigenvalue, the largest eigenvalue, and both the smallest and largest within the family setting. Specifically, a subset of the GPCM family is considered for model-based clustering, where we use a re-parameterized version of the famous eigenvalue decomposition of the component covariance matrices. Our approach is illustrated using various experiments with simulated and real data.

Via

Access Paper or Ask Questions

Mixtures of Shifted Asymmetric Laplace Distributions

Dec 21, 2012

Brian C. Franczak, Ryan P. Browne, Paul D. McNicholas

Figure 1 for Mixtures of Shifted Asymmetric Laplace Distributions

Figure 2 for Mixtures of Shifted Asymmetric Laplace Distributions

Figure 3 for Mixtures of Shifted Asymmetric Laplace Distributions

Figure 4 for Mixtures of Shifted Asymmetric Laplace Distributions

Abstract:A mixture of shifted asymmetric Laplace distributions is introduced and used for clustering and classification. A variant of the EM algorithm is developed for parameter estimation by exploiting the relationship with the general inverse Gaussian distribution. This approach is mathematically elegant and relatively computationally straightforward. Our novel mixture modelling approach is demonstrated on both simulated and real data to illustrate clustering and classification applications. In these analyses, our mixture of shifted asymmetric Laplace distributions performs favourably when compared to the popular Gaussian approach. This work, which marks an important step in the non-Gaussian model-based clustering and classification direction, concludes with discussion as well as suggestions for future work.

Via

Access Paper or Ask Questions