Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Marek Smieja

Semi-Supervised Clustering via Markov Chain Aggregation

Dec 17, 2021

Sophie Steger, Bernhard C. Geiger, Marek Smieja

Figure 1 for Semi-Supervised Clustering via Markov Chain Aggregation

Figure 2 for Semi-Supervised Clustering via Markov Chain Aggregation

Figure 3 for Semi-Supervised Clustering via Markov Chain Aggregation

Figure 4 for Semi-Supervised Clustering via Markov Chain Aggregation

Abstract:We connect the problem of semi-supervised clustering to constrained Markov aggregation, i.e., the task of partitioning the state space of a Markov chain. We achieve this connection by considering every data point in the dataset as an element of the Markov chain's state space, by defining the transition probabilities between states via similarities between corresponding data points, and by incorporating semi-supervision information as hard constraints in a Hartigan-style algorithm. The introduced Constrained Markov Clustering (CoMaC) is an extension of a recent information-theoretic framework for (unsupervised) Markov aggregation to the semi-supervised case. Instantiating CoMaC for certain parameter settings further generalizes two previous information-theoretic objectives for unsupervised clustering. Our results indicate that CoMaC is competitive with the state-of-the-art. Furthermore, our approach is less sensitive to hyperparameter settings than the unsupervised counterpart, which is especially attractive in the semi-supervised setting characterized by little labeled data.

* 13 pages, 6 figures; this is an extended version of a short paper accepted at ACM SAC 2022

Via

Access Paper or Ask Questions

Processing of missing data by neural networks

Sep 12, 2018

Marek Smieja, Łukasz Struski, Jacek Tabor, Bartosz Zieliński, Przemysław Spurek

Figure 1 for Processing of missing data by neural networks

Figure 2 for Processing of missing data by neural networks

Figure 3 for Processing of missing data by neural networks

Figure 4 for Processing of missing data by neural networks

Abstract:We propose a general, theoretically justified mechanism for processing missing data by neural networks. Our idea is to replace typical neuron response in the first hidden layer by its expected value. This approach can be applied for various types of networks at minimal cost in their modification. Moreover, in contrast to recent approaches, it does not require complete data for training. Experimental results performed on different types of architectures show that our method gives better results than typical imputation strategies and other methods dedicated for incomplete data.

Via

Access Paper or Ask Questions