Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Cetin Savkli

Random Subspace Mixture Models for Interpretable Anomaly Detection

Aug 13, 2021

Cetin Savkli, Catherine Schwartz

Figure 1 for Random Subspace Mixture Models for Interpretable Anomaly Detection

Figure 2 for Random Subspace Mixture Models for Interpretable Anomaly Detection

Figure 3 for Random Subspace Mixture Models for Interpretable Anomaly Detection

Figure 4 for Random Subspace Mixture Models for Interpretable Anomaly Detection

Abstract:We present a new subspace-based method to construct probabilistic models for high-dimensional data and highlight its use in anomaly detection. The approach is based on a statistical estimation of probability density using densities of random subspaces combined with geometric averaging. In selecting random subspaces, equal representation of each attribute is used to ensure correct statistical limits. Gaussian mixture models (GMMs) are used to create the probability densities for each subspace with techniques included to mitigate singularities allowing for the ability to handle both numerical and categorial attributes. The number of components for each GMM is determined automatically through Bayesian information criterion to prevent overfitting. The proposed algorithm attains competitive AUC scores compared with prominent algorithms against benchmark anomaly detection datasets with the added benefits of being simple, scalable, and interpretable.

* The 23rd International Conference on Artificial Intelligence, July 26-29, 2021, USA
* 10 pages

Via

Access Paper or Ask Questions

Novel Edge and Density Metrics for Link Cohesion

Mar 06, 2020

Cetin Savkli, Catherine Schwartz, Amanda Galante, Jonathan Cohen

Figure 1 for Novel Edge and Density Metrics for Link Cohesion

Figure 2 for Novel Edge and Density Metrics for Link Cohesion

Figure 3 for Novel Edge and Density Metrics for Link Cohesion

Figure 4 for Novel Edge and Density Metrics for Link Cohesion

Abstract:We present a new metric of link cohesion for measuring the strength of edges in complex, highly connected graphs. Link cohesion accounts for local small hop connections and associated node degrees and can be used to support edge scoring and graph simplification. We also present a novel graph density measure to estimate the average cohesion across nodes. Link cohesion and the density measure are employed to demonstrate community detection through graph sparsification by maximizing graph density. Link cohesion is also shown to be loosely correlated with edge betweenness centrality.

Via

Access Paper or Ask Questions

GALILEO: A Generalized Low-Entropy Mixture Model

Aug 24, 2017

Cetin Savkli, Jeffrey Lin, Philip Graff, Matthew Kinsey

Figure 1 for GALILEO: A Generalized Low-Entropy Mixture Model

Figure 2 for GALILEO: A Generalized Low-Entropy Mixture Model

Figure 3 for GALILEO: A Generalized Low-Entropy Mixture Model

Figure 4 for GALILEO: A Generalized Low-Entropy Mixture Model

Abstract:We present a new method of generating mixture models for data with categorical attributes. The keys to this approach are an entropy-based density metric in categorical space and annealing of high-entropy/low-density components from an initial state with many components. Pruning of low-density components using the entropy-based density allows GALILEO to consistently find high-quality clusters and the same optimal number of clusters. GALILEO has shown promising results on a range of test datasets commonly used for categorical clustering benchmarks. We demonstrate that the scaling of GALILEO is linear in the number of records in the dataset, making this method suitable for very large categorical datasets.

* Proceedings of the International Conference on Data Mining (DMIN 17). The Steering Committee of The World Congress in Computer Science, Computer Engineering and Applied Computing (WorldComp). 2017
* 7 pages, 8 figures, 3 tables

Via

Access Paper or Ask Questions

Bayesian Learning of Clique Tree Structure

Aug 23, 2017

Cetin Savkli, J. Ryan Carr, Philip Graff, Lauren Kennell

Figure 1 for Bayesian Learning of Clique Tree Structure

Figure 2 for Bayesian Learning of Clique Tree Structure

Figure 3 for Bayesian Learning of Clique Tree Structure

Figure 4 for Bayesian Learning of Clique Tree Structure

Abstract:The problem of categorical data analysis in high dimensions is considered. A discussion of the fundamental difficulties of probability modeling is provided, and a solution to the derivation of high dimensional probability distributions based on Bayesian learning of clique tree decomposition is presented. The main contributions of this paper are an automated determination of the optimal clique tree structure for probability modeling, the resulting derived probability distribution, and a corresponding unified approach to clustering and anomaly detection based on the probability distribution.

* Proceedings of the International Conference on Data Mining (DMIN). The Steering Committee of The World Congress in Computer Science, Computer Engineering and Applied Computing (WorldComp). p 201, 2016
* 7 pages, 11 figures; see http://worldcomp-proceedings.com/proc/p2016/DMIN16_Contents.html

Via

Access Paper or Ask Questions