Skip to main content

Showing 1–20 of 20 results for author: Mattei, P

Searching in archive stat. Search in all archives.
.
  1. arXiv:2402.12232  [pdf, other

    stat.ML cs.AI cs.LG

    Kernel KMeans clustering splits for end-to-end unsupervised decision trees

    Authors: Louis Ohl, Pierre-Alexandre Mattei, Mickaël Leclercq, Arnaud Droit, Frédéric Precioso

    Abstract: Trees are convenient models for obtaining explainable predictions on relatively small datasets. Although there are many proposals for the end-to-end construction of such trees in supervised learning, learning a tree end-to-end for clustering without labels remains an open challenge. As most works focus on interpreting with trees the result of another clustering algorithm, we present here a novel e… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    MSC Class: 62h30 ACM Class: G.3

  2. arXiv:2311.17885  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    Are Ensembles Getting Better all the Time?

    Authors: Pierre-Alexandre Mattei, Damien Garreau

    Abstract: Ensemble methods combine the predictions of several base models. We study whether or not including more models always improves their average performance. This question depends on the kind of ensemble considered, as well as the predictive metric chosen. We focus on situations where all members of the ensemble are a priori expected to perform as well, which is the case of several popular methods suc… ▽ More

    Submitted 20 March, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

    MSC Class: 62-08 (Primary) 60F10 (Secondary) ACM Class: G.3

  3. arXiv:2309.02858  [pdf, other

    stat.ML cs.AI cs.IT cs.LG stat.ME

    Generalised Mutual Information: a Framework for Discriminative Clustering

    Authors: Louis Ohl, Pierre-Alexandre Mattei, Charles Bouveyron, Warith Harchaoui, Mickaël Leclercq, Arnaud Droit, Frédéric Precioso

    Abstract: In the last decade, recent successes in deep clustering majorly involved the Mutual Information (MI) as an unsupervised objective for training neural networks with increasing regularisations. While the quality of the regularisations have been largely discussed for improvements, little attention has been dedicated to the relevance of MI as a clustering objective. In this paper, we first highlight h… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: Submitted for review at the IEEE Transactions on Pattern Analysis and Machine Intelligence. This article is an extension of an original NeurIPS 2022 article [arXiv:2210.06300]

    MSC Class: 62H30 ACM Class: G.3

  4. arXiv:2304.08054  [pdf, other

    stat.ML cs.LG

    Fed-MIWAE: Federated Imputation of Incomplete Data via Deep Generative Models

    Authors: Irene Balelli, Aude Sportisse, Francesco Cremonesi, Pierre-Alexandre Mattei, Marco Lorenzi

    Abstract: Federated learning allows for the training of machine learning models on multiple decentralized local datasets without requiring explicit data exchange. However, data pre-processing, including strategies for handling missing data, remains a major bottleneck in real-world federated learning deployment, and is typically performed locally. This approach may be biased, since the subpopulations locally… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

  5. arXiv:2302.07540  [pdf, other

    stat.ML

    Are labels informative in semi-supervised learning? -- Estimating and leveraging the missing-data mechanism

    Authors: Aude Sportisse, Hugo Schmutz, Olivier Humbert, Charles Bouveyron, Pierre-Alexandre Mattei

    Abstract: Semi-supervised learning is a powerful technique for leveraging unlabeled data to improve machine learning models, but it can be affected by the presence of ``informative'' labels, which occur when some classes are more likely to be labeled than others. In the missing data literature, such labels are called missing not at random. In this paper, we propose a novel approach to address this issue by… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

  6. arXiv:2302.03391  [pdf, other

    stat.ML cs.AI cs.LG stat.CO stat.ME

    Sparse GEMINI for Joint Discriminative Clustering and Feature Selection

    Authors: Louis Ohl, Pierre-Alexandre Mattei, Charles Bouveyron, Mickaël Leclercq, Arnaud Droit, Frédéric Precioso

    Abstract: Feature selection in clustering is a hard task which involves simultaneously the discovery of relevant clusters as well as relevant variables with respect to these clusters. While feature selection algorithms are often model-based through optimised model selection or strong assumptions on $p(\pmb{x})$, we introduce a discriminative clustering model trying to maximise a geometry-aware generalisatio… ▽ More

    Submitted 7 February, 2023; originally announced February 2023.

    MSC Class: 62H30 ACM Class: G.3

  7. arXiv:2212.03131  [pdf, other

    cs.LG cs.AI stat.ME

    Explainability as statistical inference

    Authors: Hugo Henri Joseph Senetaire, Damien Garreau, Jes Frellsen, Pierre-Alexandre Mattei

    Abstract: A wide variety of model explanation approaches have been proposed in recent years, all guided by very different rationales and heuristics. In this paper, we take a new route and cast interpretability as a statistical inference problem. We propose a general deep probabilistic model designed to produce interpretable predictions. The model parameters can be learned via maximum likelihood, and the met… ▽ More

    Submitted 29 December, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

    Comments: 10 pages, 22 figures, published at ICLR 2023

    Journal ref: Proceedings of the 40th International Conference on Machine Learning, PMLR 202:30584-30612, 2023

  8. arXiv:2210.06300  [pdf, other

    stat.ML cs.AI cs.IT cs.LG stat.ME

    Generalised Mutual Information for Discriminative Clustering

    Authors: Louis Ohl, Pierre-Alexandre Mattei, Charles Bouveyron, Warith Harchaoui, Mickaël Leclercq, Arnaud Droit, Frederic Precioso

    Abstract: In the last decade, recent successes in deep clustering majorly involved the mutual information (MI) as an unsupervised objective for training neural networks with increasing regularisations. While the quality of the regularisations have been largely discussed for improvements, little attention has been dedicated to the relevance of MI as a clustering objective. In this paper, we first highlight h… ▽ More

    Submitted 14 October, 2022; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: To be published in Neural Information Processing Systems 2022

    MSC Class: 62H30 ACM Class: G.3

  9. arXiv:2203.07512  [pdf, other

    stat.ML cs.AI cs.LG stat.CO stat.ME

    Don't fear the unlabelled: safe semi-supervised learning via simple debiasing

    Authors: Hugo Schmutz, Olivier Humbert, Pierre-Alexandre Mattei

    Abstract: Semi-supervised learning (SSL) provides an effective means of leveraging unlabelled data to improve a model performance. Even though the domain has received a considerable amount of attention in the past years, most methods present the common drawback of lacking theoretical guarantees. Our starting point is to notice that the estimate of the risk that most discriminative SSL methods minimise is bi… ▽ More

    Submitted 3 March, 2023; v1 submitted 14 March, 2022; originally announced March 2022.

  10. arXiv:2203.01097  [pdf, other

    stat.ML cs.LG

    Model-agnostic out-of-distribution detection using combined statistical tests

    Authors: Federico Bergamin, Pierre-Alexandre Mattei, Jakob D. Havtorn, Hugo Senetaire, Hugo Schmutz, Lars Maaløe, Søren Hauberg, Jes Frellsen

    Abstract: We present simple methods for out-of-distribution detection using a trained generative model. These techniques, based on classical statistical tests, are model-agnostic in the sense that they can be applied to any differentiable generative model. The idea is to combine a classical parametric test (Rao's score test) with the recently introduced typicality test. These two test statistics are both th… ▽ More

    Submitted 2 March, 2022; originally announced March 2022.

    Comments: Accepted at the 25th International Conference on Artificial Intelligence and Statistics (AISTATS), 2022

  11. arXiv:2201.10989  [pdf, other

    stat.ML cs.AI cs.LG stat.CO stat.ME

    Uphill Roads to Variational Tightness: Monotonicity and Monte Carlo Objectives

    Authors: Pierre-Alexandre Mattei, Jes Frellsen

    Abstract: We revisit the theory of importance weighted variational inference (IWVI), a promising strategy for learning latent variable models. IWVI uses new variational bounds, known as Monte Carlo objectives (MCOs), obtained by replacing intractable integrals by Monte Carlo estimates -- usually simply obtained via importance sampling. Burda, Grosse and Salakhutdinov (2016) showed that increasing the number… ▽ More

    Submitted 26 January, 2022; originally announced January 2022.

    MSC Class: 62-08

  12. Tensor decomposition for learning Gaussian mixtures from moments

    Authors: Rima Khouja, Pierre-Alexandre Mattei, Bernard Mourrain

    Abstract: In data processing and machine learning, an important challenge is to recover and exploit models that can represent accurately the data. We consider the problem of recovering Gaussian mixture models from datasets. We investigate symmetric tensor decomposition methods for tackling this problem, where the tensor is built from empirical moments of the data distribution. We consider identifiable tenso… ▽ More

    Submitted 21 June, 2022; v1 submitted 1 June, 2021; originally announced June 2021.

    Journal ref: Journal of Symbolic Computation, Elsevier, 2022, 113, pp.193-210

  13. arXiv:2102.01982  [pdf, other

    stat.ME stat.CO stat.ML

    Unobserved classes and extra variables in high-dimensional discriminant analysis

    Authors: Michael Fop, Pierre-Alexandre Mattei, Charles Bouveyron, Thomas Brendan Murphy

    Abstract: In supervised classification problems, the test set may contain data points belonging to classes not observed in the learning phase. Moreover, the same units in the test data may be measured on a set of additional variables recorded at a subsequent stage with respect to when the learning sample was collected. In this situation, the classifier built in the learning phase needs to adapt to handle po… ▽ More

    Submitted 3 February, 2021; originally announced February 2021.

    Comments: 29 pages, 29 figures

  14. arXiv:2006.12871  [pdf, other

    stat.ML cs.LG stat.ME

    not-MIWAE: Deep Generative Modelling with Missing not at Random Data

    Authors: Niels Bruun Ipsen, Pierre-Alexandre Mattei, Jes Frellsen

    Abstract: When a missing process depends on the missing values themselves, it needs to be explicitly modelled and taken into account while doing likelihood-based inference. We present an approach for building and fitting deep latent variable models (DLVMs) in cases where the missing process is dependent on the missing data. Specifically, a deep neural network enables us to flexibly model the conditional dis… ▽ More

    Submitted 18 March, 2021; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: Camera-ready version for ICLR 2021

  15. arXiv:1902.05539  [pdf, other

    stat.ME

    A Parsimonious Tour of Bayesian Model Uncertainty

    Authors: Pierre-Alexandre Mattei

    Abstract: Modern statistical software and machine learning libraries are enabling semi-automated statistical inference. Within this context, it appears easier and easier to try and fit many models to the data at hand, reversing thereby the Fisherian way of conducting science by collecting data after the scientific hypothesis (and hence the model) has been determined. The renewed goal of the statistician bec… ▽ More

    Submitted 25 September, 2020; v1 submitted 14 February, 2019; originally announced February 2019.

    MSC Class: 62-01 ACM Class: I.2.6

  16. arXiv:1901.10230  [pdf, other

    stat.ML cs.LG stat.CO

    Partially Exchangeable Networks and Architectures for Learning Summary Statistics in Approximate Bayesian Computation

    Authors: Samuel Wiqvist, Pierre-Alexandre Mattei, Umberto Picchini, Jes Frellsen

    Abstract: We present a novel family of deep neural architectures, named partially exchangeable networks (PENs) that leverage probabilistic symmetries. By design, PENs are invariant to block-switch transformations, which characterize the partial exchangeability properties of conditionally Markovian processes. Moreover, we show that any block-switch invariant function has a PEN-like representation. The DeepSe… ▽ More

    Submitted 17 May, 2019; v1 submitted 29 January, 2019; originally announced January 2019.

    Comments: Forthcoming on the Proceedings of ICML 2019. New comparisons with several different networks. We now use the Wasserstein distance to produce comparisons. Code available on GitHub. 16 pages, 5 figures, 21 tables

    Journal ref: Proceedings of the 36th International Conference on Machine Learning, PMLR 97:6798--6807, 2019

  17. arXiv:1812.02633  [pdf, other

    stat.ML cs.LG stat.ME

    MIWAE: Deep Generative Modelling and Imputation of Incomplete Data

    Authors: Pierre-Alexandre Mattei, Jes Frellsen

    Abstract: We consider the problem of handling missing data with deep latent variable models (DLVMs). First, we present a simple technique to train DLVMs when the training set contains missing-at-random data. Our approach, called MIWAE, is based on the importance-weighted autoencoder (IWAE), and maximises a potentially tight lower bound of the log-likelihood of the observed data. Compared to the original IWA… ▽ More

    Submitted 4 February, 2019; v1 submitted 6 December, 2018; originally announced December 2018.

    Comments: A short version of this paper was presented at the 3rd NeurIPS workshop on Bayesian Deep Learning

  18. arXiv:1802.04826  [pdf, other

    stat.ML cs.LG stat.ME

    Leveraging the Exact Likelihood of Deep Latent Variable Models

    Authors: Pierre-Alexandre Mattei, Jes Frellsen

    Abstract: Deep latent variable models (DLVMs) combine the approximation abilities of deep neural networks and the statistical foundations of generative models. Variational methods are commonly used for inference; however, the exact likelihood of these models has been largely overlooked. The purpose of this work is to study the general properties of this quantity and to show how they can be leveraged in prac… ▽ More

    Submitted 28 June, 2018; v1 submitted 13 February, 2018; originally announced February 2018.

    MSC Class: 62H25

  19. arXiv:1703.02834  [pdf, ps, other

    stat.ME math.ST stat.ML

    Exact Dimensionality Selection for Bayesian PCA

    Authors: Charles Bouveyron, Pierre Latouche, Pierre-Alexandre Mattei

    Abstract: We present a Bayesian model selection approach to estimate the intrinsic dimensionality of a high-dimensional dataset. To this end, we introduce a novel formulation of the probabilisitic principal component analysis model based on a normal-gamma prior distribution. In this context, we exhibit a closed-form expression of the marginal likelihood which allows to infer an optimal number of components.… ▽ More

    Submitted 21 May, 2019; v1 submitted 8 March, 2017; originally announced March 2017.

  20. Bayesian Variable Selection for Globally Sparse Probabilistic PCA

    Authors: Charles Bouveyron, Pierre Latouche, Pierre-Alexandre Mattei

    Abstract: Sparse versions of principal component analysis (PCA) have imposed themselves as simple, yet powerful ways of selecting relevant features of high-dimensional data in an unsupervised manner. However, when several sparse principal components are computed, the interpretation of the selected variables is difficult since each axis has its own sparsity pattern and has to be interpreted separately. To ov… ▽ More

    Submitted 20 September, 2016; v1 submitted 19 May, 2016; originally announced May 2016.

    Comments: An earlier version of this paper appeared in the Proceedings of the 19th International Conference on Artificial Intelligence and Statistics (AISTATS 2016)