Skip to main content

Showing 1–7 of 7 results for author: Plessis, M C d

Searching in archive cs. Search in all archives.
.
  1. arXiv:1703.00593  [pdf, other

    cs.LG stat.ML

    Positive-Unlabeled Learning with Non-Negative Risk Estimator

    Authors: Ryuichi Kiryo, Gang Niu, Marthinus C. du Plessis, Masashi Sugiyama

    Abstract: From only positive (P) and unlabeled (U) data, a binary classifier could be trained with PU learning, in which the state of the art is unbiased PU learning. However, if its model is very flexible, empirical risks on training data will go negative, and we will suffer from serious overfitting. In this paper, we propose a non-negative risk estimator for PU learning: when getting minimized, it is more… ▽ More

    Submitted 4 November, 2017; v1 submitted 1 March, 2017; originally announced March 2017.

    Comments: NIPS 2017 camera-ready version (this paper was selected for oral presentation)

  2. Class-prior Estimation for Learning from Positive and Unlabeled Data

    Authors: Marthinus C. du Plessis, Gang Niu, Masashi Sugiyama

    Abstract: We consider the problem of estimating the class prior in an unlabeled dataset. Under the assumption that an additional labeled dataset is available, the class prior can be estimated by fitting a mixture of class-wise data distributions to the unlabeled data distribution. However, in practice, such an additional labeled dataset is often not available. In this paper, we show that, with additional sa… ▽ More

    Submitted 4 November, 2016; originally announced November 2016.

    Comments: To appear in Machine Learning

  3. arXiv:1605.06955  [pdf, other

    cs.LG

    Semi-Supervised Classification Based on Classification from Positive and Unlabeled Data

    Authors: Tomoya Sakai, Marthinus Christoffel du Plessis, Gang Niu, Masashi Sugiyama

    Abstract: Most of the semi-supervised classification methods developed so far use unlabeled data for regularization purposes under particular distributional assumptions such as the cluster assumption. In contrast, recently developed methods of classification from positive and unlabeled data (PU classification) use unlabeled data for risk evaluation, i.e., label information is directly extracted from unlabel… ▽ More

    Submitted 16 June, 2017; v1 submitted 23 May, 2016; originally announced May 2016.

    Comments: Accepted to the 34th International Conference on Machine Learning (ICML 2017)

  4. arXiv:1603.03130  [pdf, other

    cs.LG stat.ML

    Theoretical Comparisons of Positive-Unlabeled Learning against Positive-Negative Learning

    Authors: Gang Niu, Marthinus Christoffel du Plessis, Tomoya Sakai, Yao Ma, Masashi Sugiyama

    Abstract: In PU learning, a binary classifier is trained from positive (P) and unlabeled (U) data without negative (N) data. Although N data is missing, it sometimes outperforms PN learning (i.e., ordinary supervised learning). Hitherto, neither theoretical nor experimental analysis has been given to explain this phenomenon. In this paper, we theoretically compare PU (and NU) learning against PN learning ba… ▽ More

    Submitted 28 October, 2016; v1 submitted 9 March, 2016; originally announced March 2016.

    Comments: NIPS 2016 camera-ready version

  5. arXiv:1402.0288  [pdf, other

    cs.LG stat.ML

    Transductive Learning with Multi-class Volume Approximation

    Authors: Gang Niu, Bo Dai, Marthinus Christoffel du Plessis, Masashi Sugiyama

    Abstract: Given a hypothesis space, the large volume principle by Vladimir Vapnik prioritizes equivalence classes according to their volume in the hypothesis space. The volume approximation has hitherto been successfully applied to binary learning problems. In this paper, we extend it naturally to a more general definition which can be applied to several transductive problem settings, such as multi-class, m… ▽ More

    Submitted 3 February, 2014; originally announced February 2014.

  6. arXiv:1305.0103  [pdf, ps, other

    cs.LG

    Clustering Unclustered Data: Unsupervised Binary Labeling of Two Datasets Having Different Class Balances

    Authors: Marthinus Christoffel du Plessis, Masashi Sugiyama

    Abstract: We consider the unsupervised learning problem of assigning labels to unlabeled data. A naive approach is to use clustering methods, but this works well only when data is properly clustered and each cluster corresponds to an underlying class. In this paper, we first show that this unsupervised labeling problem in balanced binary cases can be solved if two unlabeled datasets having different class b… ▽ More

    Submitted 1 May, 2013; originally announced May 2013.

  7. arXiv:1207.0099  [pdf, ps, other

    cs.LG stat.ML

    Density-Difference Estimation

    Authors: Masashi Sugiyama, Takafumi Kanamori, Taiji Suzuki, Marthinus Christoffel du Plessis, Song Liu, Ichiro Takeuchi

    Abstract: We address the problem of estimating the difference between two probability densities. A naive approach is a two-step procedure of first estimating two densities separately and then computing their difference. However, such a two-step procedure does not necessarily work well because the first step is performed without regard to the second step and thus a small error incurred in the first stage can… ▽ More

    Submitted 30 June, 2012; originally announced July 2012.