Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Marthinus Christoffel du Plessis

Semi-Supervised Classification Based on Classification from Positive and Unlabeled Data

Jun 16, 2017

Tomoya Sakai, Marthinus Christoffel du Plessis, Gang Niu, Masashi Sugiyama

Figure 1 for Semi-Supervised Classification Based on Classification from Positive and Unlabeled Data

Figure 2 for Semi-Supervised Classification Based on Classification from Positive and Unlabeled Data

Figure 3 for Semi-Supervised Classification Based on Classification from Positive and Unlabeled Data

Figure 4 for Semi-Supervised Classification Based on Classification from Positive and Unlabeled Data

Abstract:Most of the semi-supervised classification methods developed so far use unlabeled data for regularization purposes under particular distributional assumptions such as the cluster assumption. In contrast, recently developed methods of classification from positive and unlabeled data (PU classification) use unlabeled data for risk evaluation, i.e., label information is directly extracted from unlabeled data. In this paper, we extend PU classification to also incorporate negative data and propose a novel semi-supervised classification approach. We establish generalization error bounds for our novel methods and show that the bounds decrease with respect to the number of unlabeled data without the distributional assumptions that are required in existing semi-supervised classification methods. Through experiments, we demonstrate the usefulness of the proposed methods.

* Accepted to the 34th International Conference on Machine Learning (ICML 2017)

Via

Access Paper or Ask Questions

Theoretical Comparisons of Positive-Unlabeled Learning against Positive-Negative Learning

Oct 28, 2016

Gang Niu, Marthinus Christoffel du Plessis, Tomoya Sakai, Yao Ma, Masashi Sugiyama

Figure 1 for Theoretical Comparisons of Positive-Unlabeled Learning against Positive-Negative Learning

Figure 2 for Theoretical Comparisons of Positive-Unlabeled Learning against Positive-Negative Learning

Figure 3 for Theoretical Comparisons of Positive-Unlabeled Learning against Positive-Negative Learning

Figure 4 for Theoretical Comparisons of Positive-Unlabeled Learning against Positive-Negative Learning

Abstract:In PU learning, a binary classifier is trained from positive (P) and unlabeled (U) data without negative (N) data. Although N data is missing, it sometimes outperforms PN learning (i.e., ordinary supervised learning). Hitherto, neither theoretical nor experimental analysis has been given to explain this phenomenon. In this paper, we theoretically compare PU (and NU) learning against PN learning based on the upper bounds on estimation errors. We find simple conditions when PU and NU learning are likely to outperform PN learning, and we prove that, in terms of the upper bounds, either PU or NU learning (depending on the class-prior probability and the sizes of P and N data) given infinite U data will improve on PN learning. Our theoretical findings well agree with the experimental results on artificial and benchmark data even when the experimental setup does not match the theoretical assumptions exactly.

* NIPS 2016 camera-ready version

Via

Access Paper or Ask Questions

Transductive Learning with Multi-class Volume Approximation

Feb 03, 2014

Gang Niu, Bo Dai, Marthinus Christoffel du Plessis, Masashi Sugiyama

Figure 1 for Transductive Learning with Multi-class Volume Approximation

Figure 2 for Transductive Learning with Multi-class Volume Approximation

Figure 3 for Transductive Learning with Multi-class Volume Approximation

Figure 4 for Transductive Learning with Multi-class Volume Approximation

Abstract:Given a hypothesis space, the large volume principle by Vladimir Vapnik prioritizes equivalence classes according to their volume in the hypothesis space. The volume approximation has hitherto been successfully applied to binary learning problems. In this paper, we extend it naturally to a more general definition which can be applied to several transductive problem settings, such as multi-class, multi-label and serendipitous learning. Even though the resultant learning method involves a non-convex optimization problem, the globally optimal solution is almost surely unique and can be obtained in O(n^3) time. We theoretically provide stability and error analyses for the proposed method, and then experimentally show that it is promising.

Via

Access Paper or Ask Questions

Clustering Unclustered Data: Unsupervised Binary Labeling of Two Datasets Having Different Class Balances

May 01, 2013

Marthinus Christoffel du Plessis, Masashi Sugiyama

Figure 1 for Clustering Unclustered Data: Unsupervised Binary Labeling of Two Datasets Having Different Class Balances

Figure 2 for Clustering Unclustered Data: Unsupervised Binary Labeling of Two Datasets Having Different Class Balances

Figure 3 for Clustering Unclustered Data: Unsupervised Binary Labeling of Two Datasets Having Different Class Balances

Abstract:We consider the unsupervised learning problem of assigning labels to unlabeled data. A naive approach is to use clustering methods, but this works well only when data is properly clustered and each cluster corresponds to an underlying class. In this paper, we first show that this unsupervised labeling problem in balanced binary cases can be solved if two unlabeled datasets having different class balances are available. More specifically, estimation of the sign of the difference between probability densities of two unlabeled datasets gives the solution. We then introduce a new method to directly estimate the sign of the density difference without density estimation. Finally, we demonstrate the usefulness of the proposed method against several clustering methods on various toy problems and real-world datasets.

Via

Access Paper or Ask Questions

Density-Difference Estimation

Jun 30, 2012

Masashi Sugiyama, Takafumi Kanamori, Taiji Suzuki, Marthinus Christoffel du Plessis, Song Liu, Ichiro Takeuchi

Abstract:We address the problem of estimating the difference between two probability densities. A naive approach is a two-step procedure of first estimating two densities separately and then computing their difference. However, such a two-step procedure does not necessarily work well because the first step is performed without regard to the second step and thus a small error incurred in the first stage can cause a big error in the second stage. In this paper, we propose a single-shot procedure for directly estimating the density difference without separately estimating two densities. We derive a non-parametric finite-sample error bound for the proposed single-shot density-difference estimator and show that it achieves the optimal convergence rate. The usefulness of the proposed method is also demonstrated experimentally.

Via

Access Paper or Ask Questions