Skip to main content

Showing 1–23 of 23 results for author: Germain, P

Searching in archive stat. Search in all archives.
.
  1. arXiv:2310.04935  [pdf, other

    cs.LG stat.ML

    Statistical Guarantees for Variational Autoencoders using PAC-Bayesian Theory

    Authors: Sokhna Diarra Mbacke, Florence Clerc, Pascal Germain

    Abstract: Since their inception, Variational Autoencoders (VAEs) have become central in machine learning. Despite their widespread use, numerous questions regarding their theoretical properties remain open. Using PAC-Bayesian theory, this work develops statistical guarantees for VAEs. First, we derive the first PAC-Bayesian bound for posterior distributions conditioned on individual samples from the data-ge… ▽ More

    Submitted 7 December, 2023; v1 submitted 7 October, 2023; originally announced October 2023.

    Comments: Spotlight Paper at NeurIPS 2023

  2. arXiv:2306.04777  [pdf, other

    cs.LG stat.ME stat.ML

    Invariant Causal Set Covering Machines

    Authors: Thibaud Godon, Baptiste Bauvin, Pascal Germain, Jacques Corbeil, Alexandre Drouin

    Abstract: Rule-based models, such as decision trees, appeal to practitioners due to their interpretable nature. However, the learning algorithms that produce such models are often vulnerable to spurious associations and thus, they are not guaranteed to extract causally-relevant insights. In this work, we build on ideas from the invariant causal prediction literature to propose Invariant Causal Set Covering… ▽ More

    Submitted 19 July, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

  3. arXiv:2302.08942  [pdf, other

    cs.LG cs.AI stat.ML

    PAC-Bayesian Generalization Bounds for Adversarial Generative Models

    Authors: Sokhna Diarra Mbacke, Florence Clerc, Pascal Germain

    Abstract: We extend PAC-Bayesian theory to generative models and develop generalization bounds for models based on the Wasserstein distance and the total variation distance. Our first result on the Wasserstein distance assumes the instance space is bounded, while our second result takes advantage of dimensionality reduction. Our results naturally apply to Wasserstein GANs and Energy-Based GANs, and our boun… ▽ More

    Submitted 13 November, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

    Comments: Published at ICML 2023

  4. arXiv:2106.12535  [pdf, other

    cs.LG stat.ME stat.ML

    Learning Stochastic Majority Votes by Minimizing a PAC-Bayes Generalization Bound

    Authors: Valentina Zantedeschi, Paul Viallard, Emilie Morvant, Rémi Emonet, Amaury Habrard, Pascal Germain, Benjamin Guedj

    Abstract: We investigate a stochastic counterpart of majority votes over finite ensembles of classifiers, and study its generalization properties. While our approach holds for arbitrary distributions, we instantiate it with Dirichlet distributions: this allows for a closed-form and differentiable expression for the expected risk, which then turns the generalization bound into a tractable training objective.… ▽ More

    Submitted 19 October, 2021; v1 submitted 23 June, 2021; originally announced June 2021.

    Journal ref: Proceedings of the 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  5. arXiv:2104.13626  [pdf, other

    stat.ML cs.LG

    Self-Bounding Majority Vote Learning Algorithms by the Direct Minimization of a Tight PAC-Bayesian C-Bound

    Authors: Paul Viallard, Pascal Germain, Amaury Habrard, Emilie Morvant

    Abstract: In the PAC-Bayesian literature, the C-Bound refers to an insightful relation between the risk of a majority vote classifier (under the zero-one loss) and the first two moments of its margin (i.e., the expected margin and the voters' diversity). Until now, learning algorithms developed in this framework minimize the empirical version of the C-Bound, instead of explicit PAC-Bayesian generalization b… ▽ More

    Submitted 31 August, 2021; v1 submitted 28 April, 2021; originally announced April 2021.

    Journal ref: ECML PKDD 2021, Sep 2021, Bilbao, Spain

  6. arXiv:2102.08649  [pdf, other

    stat.ML cs.LG

    A General Framework for the Practical Disintegration of PAC-Bayesian Bounds

    Authors: Paul Viallard, Pascal Germain, Amaury Habrard, Emilie Morvant

    Abstract: PAC-Bayesian bounds are known to be tight and informative when studying the generalization ability of randomized classifiers. However, they require a loose and costly derandomization step when applied to some families of deterministic models such as neural networks. As an alternative to this step, we introduce new PAC-Bayesian generalization bounds that have the originality to provide disintegrate… ▽ More

    Submitted 18 September, 2023; v1 submitted 17 February, 2021; originally announced February 2021.

    Comments: Machine Learning, In press

  7. arXiv:2010.12995  [pdf, other

    cs.LG cs.AI stat.ML

    Out-of-distribution detection for regression tasks: parameter versus predictor entropy

    Authors: Yann Pequignot, Mathieu Alain, Patrick Dallaire, Alireza Yeganehparast, Pascal Germain, Josée Desharnais, François Laviolette

    Abstract: It is crucial to detect when an instance lies downright too far from the training samples for the machine learning model to be trusted, a challenge known as out-of-distribution (OOD) detection. For neural networks, one approach to this task consists of learning a diversity of predictors that all can explain the training data. This information can be used to estimate the epistemic uncertainty at a… ▽ More

    Submitted 11 September, 2023; v1 submitted 24 October, 2020; originally announced October 2020.

  8. arXiv:1912.03036  [pdf, ps, other

    cs.LG stat.ML

    Improved PAC-Bayesian Bounds for Linear Regression

    Authors: Vera Shalaeva, Alireza Fakhrizadeh Esfahani, Pascal Germain, Mihaly Petreczky

    Abstract: In this paper, we improve the PAC-Bayesian error bound for linear regression derived in Germain et al. [10]. The improvements are twofold. First, the proposed error bound is tighter, and converges to the generalization loss with a well-chosen temperature parameter. Second, the error bound also holds for training data that are not independently sampled. In particular, the error bound applies to cer… ▽ More

    Submitted 6 December, 2019; originally announced December 2019.

    Journal ref: Thirty-Fourth AAAI Conference on Artificial Intelligence, Feb 2020, New York, United States

  9. arXiv:1910.04464  [pdf, ps, other

    cs.LG math.ST stat.ML

    PAC-Bayesian Contrastive Unsupervised Representation Learning

    Authors: Kento Nozawa, Pascal Germain, Benjamin Guedj

    Abstract: Contrastive unsupervised representation learning (CURL) is the state-of-the-art technique to learn representations (as a set of features) from unlabelled data. While CURL has collected several empirical successes recently, theoretical understanding of its performance was still missing. In a recent work, Arora et al. (2019) provide the first generalisation bounds for CURL, relying on a Rademacher c… ▽ More

    Submitted 17 July, 2020; v1 submitted 10 October, 2019; originally announced October 2019.

    Comments: Published in the proceedings of the Conference on Uncertainty in Artificial Intelligence 2020 (UAI)

    Journal ref: PMLR, volume 124 (UAI 2020), 2020

  10. arXiv:1906.06203  [pdf, other

    stat.ML cs.LG

    Learning Landmark-Based Ensembles with Random Fourier Features and Gradient Boosting

    Authors: Léo Gautheron, Pascal Germain, Amaury Habrard, Emilie Morvant, Marc Sebban, Valentina Zantedeschi

    Abstract: We propose a Gradient Boosting algorithm for learning an ensemble of kernel functions adapted to the task at hand. Unlike state-of-the-art Multiple Kernel Learning techniques that make use of a pre-computed dictionary of kernel functions to select from, at each iteration we fit a kernel by approximating it as a weighted sum of Random Fourier Features (RFF) and by optimizing their barycenter. This… ▽ More

    Submitted 14 June, 2019; originally announced June 2019.

  11. arXiv:1905.10259  [pdf, other

    cs.LG stat.ML

    Dichotomize and Generalize: PAC-Bayesian Binary Activated Deep Neural Networks

    Authors: Gaël Letarte, Pascal Germain, Benjamin Guedj, François Laviolette

    Abstract: We present a comprehensive study of multilayer neural networks with binary activation, relying on the PAC-Bayesian theory. Our contributions are twofold: (i) we develop an end-to-end framework to train a binary activated deep neural network, (ii) we provide nonvacuous PAC-Bayesian generalization bounds for binary activated deep neural networks. Our results are obtained by minimizing the expected l… ▽ More

    Submitted 4 February, 2020; v1 submitted 24 May, 2019; originally announced May 2019.

    Journal ref: NeurIPS 2019

  12. arXiv:1810.12683  [pdf, other

    stat.ML cs.LG

    Pseudo-Bayesian Learning with Kernel Fourier Transform as Prior

    Authors: Gaël Letarte, Emilie Morvant, Pascal Germain

    Abstract: We revisit Rahimi and Recht (2007)'s kernel random Fourier features (RFF) method through the lens of the PAC-Bayesian theory. While the primary goal of RFF is to approximate a kernel, we look at the Fourier transform as a prior distribution over trigonometric hypotheses. It naturally suggests learning a posterior on these hypotheses. We derive generalization bounds that are optimized by learning a… ▽ More

    Submitted 27 March, 2019; v1 submitted 30 October, 2018; originally announced October 2018.

    Comments: Published at AISTATS 2019

  13. arXiv:1808.05784  [pdf, other

    stat.ML cs.LG

    Multiview Boosting by Controlling the Diversity and the Accuracy of View-specific Voters

    Authors: Anil Goyal, Emilie Morvant, Pascal Germain, Massih-Reza Amini

    Abstract: In this paper we propose a boosting based multiview learning algorithm, referred to as PB-MVBoost, which iteratively learns i) weights over view-specific voters capturing view-specific information; and ii) weights over views by optimizing a PAC-Bayes multiview C-Bound that takes into account the accuracy of view-specific classifiers and the diversity between the views. We derive a generalization b… ▽ More

    Submitted 27 August, 2018; v1 submitted 17 August, 2018; originally announced August 2018.

  14. PAC-Bayes and Domain Adaptation

    Authors: Pascal Germain, Amaury Habrard, François Laviolette, Emilie Morvant

    Abstract: We provide two main contributions in PAC-Bayesian theory for domain adaptation where the objective is to learn, from a source distribution, a well-performing majority vote on a different, but related, target distribution. Firstly, we propose an improvement of the previous approach we proposed in Germain et al. (2013), which relies on a novel distribution pseudodistance based on a disagreement aver… ▽ More

    Submitted 15 November, 2019; v1 submitted 17 July, 2017; originally announced July 2017.

    Comments: Neurocomputing, Elsevier, 2019. arXiv admin note: substantial text overlap with arXiv:1503.06944

  15. arXiv:1606.07240  [pdf, other

    stat.ML

    PAC-Bayesian Analysis for a two-step Hierarchical Multiview Learning Approach

    Authors: Anil Goyal, Emilie Morvant, Pascal Germain, Massih-Reza Amini

    Abstract: We study a two-level multiview learning with more than two views under the PAC-Bayesian framework. This approach, sometimes referred as late fusion, consists in learning sequentially multiple view-specific classifiers at the first level, and then combining these view-specific classifiers at the second level. Our main theoretical result is a generalization bound on the risk of the majority vote whi… ▽ More

    Submitted 13 July, 2017; v1 submitted 23 June, 2016; originally announced June 2016.

  16. arXiv:1605.08636  [pdf, other

    stat.ML cs.LG

    PAC-Bayesian Theory Meets Bayesian Inference

    Authors: Pascal Germain, Francis Bach, Alexandre Lacoste, Simon Lacoste-Julien

    Abstract: We exhibit a strong link between frequentist PAC-Bayesian risk bounds and the Bayesian marginal likelihood. That is, for the negative log-likelihood loss function, we show that the minimization of PAC-Bayesian generalization risk bounds maximizes the Bayesian marginal likelihood. This provides an alternative explanation to the Bayesian Occam's razor criteria, under the assumption that the data is… ▽ More

    Submitted 13 February, 2017; v1 submitted 27 May, 2016; originally announced May 2016.

    Comments: Published at NIPS 2015 (http://papers.nips.cc/paper/6569-pac-bayesian-theory-meets-bayesian-inference)

    Journal ref: Advances in Neural Information Processing Systems 29 (NIPS 2016), p. 1884-1892

  17. arXiv:1506.04573  [pdf, other

    stat.ML cs.LG

    A New PAC-Bayesian Perspective on Domain Adaptation

    Authors: Pascal Germain, Amaury Habrard, François Laviolette, Emilie Morvant

    Abstract: We study the issue of PAC-Bayesian domain adaptation: We want to learn, from a source domain, a majority vote model dedicated to a target one. Our theoretical contribution brings a new perspective by deriving an upper-bound on the target risk where the distributions' divergence---expressed as a ratio---controls the trade-off between a source error measure and the target voters' disagreement. Our b… ▽ More

    Submitted 26 July, 2016; v1 submitted 15 June, 2015; originally announced June 2015.

    Comments: Published at ICML 2016

  18. arXiv:1505.07818  [pdf, other

    stat.ML cs.LG cs.NE

    Domain-Adversarial Training of Neural Networks

    Authors: Yaroslav Ganin, Evgeniya Ustinova, Hana Ajakan, Pascal Germain, Hugo Larochelle, François Laviolette, Mario Marchand, Victor Lempitsky

    Abstract: We introduce a new representation learning approach for domain adaptation, in which data at training and test time come from similar but different distributions. Our approach is directly inspired by the theory on domain adaptation suggesting that, for effective domain transfer to be achieved, predictions must be made based on features that cannot discriminate between the training (source) and test… ▽ More

    Submitted 26 May, 2016; v1 submitted 28 May, 2015; originally announced May 2015.

    Comments: Published in JMLR: http://jmlr.org/papers/v17/15-239.html

    Journal ref: Journal of Machine Learning Research 2016, vol. 17, p. 1-35

  19. arXiv:1503.08329  [pdf, other

    stat.ML cs.LG

    Risk Bounds for the Majority Vote: From a PAC-Bayesian Analysis to a Learning Algorithm

    Authors: Pascal Germain, Alexandre Lacasse, François Laviolette, Mario Marchand, Jean-Francis Roy

    Abstract: We propose an extensive analysis of the behavior of majority votes in binary classification. In particular, we introduce a risk bound for majority votes, called the C-bound, that takes into account the average quality of the voters and their average disagreement. We also propose an extensive PAC-Bayesian analysis that shows how the C-bound can be estimated from various observations contained in th… ▽ More

    Submitted 28 July, 2015; v1 submitted 28 March, 2015; originally announced March 2015.

    Comments: Published in JMLR http://jmlr.org/papers/v16/germain15a.html

    Journal ref: Journal of Machine Learning Research 2015, vol. 16, p. 787-860

  20. arXiv:1503.06944  [pdf, other

    stat.ML cs.LG

    PAC-Bayesian Theorems for Domain Adaptation with Specialization to Linear Classifiers

    Authors: Pascal Germain, Amaury Habrard, François Laviolette, Emilie Morvant

    Abstract: In this paper, we provide two main contributions in PAC-Bayesian theory for domain adaptation where the objective is to learn, from a source distribution, a well-performing majority vote on a different target distribution. On the one hand, we propose an improvement of the previous approach proposed by Germain et al. (2013), that relies on a novel distribution pseudodistance based on a disagreement… ▽ More

    Submitted 9 August, 2016; v1 submitted 24 March, 2015; originally announced March 2015.

    Comments: This report is a long version of our paper entitled A PAC-Bayesian Approach for Domain Adaptation with Specialization to Linear Classifiers published in the proceedings of the International Conference on Machine Learning (ICML) 2013. We improved our main results, extended our experiments, and proposed an extension to multisource domain adaptation

  21. arXiv:1501.03002  [pdf, ps, other

    stat.ML cs.LG

    An Improvement to the Domain Adaptation Bound in a PAC-Bayesian context

    Authors: Pascal Germain, Amaury Habrard, Francois Laviolette, Emilie Morvant

    Abstract: This paper provides a theoretical analysis of domain adaptation based on the PAC-Bayesian theory. We propose an improvement of the previous domain adaptation bound obtained by Germain et al. in two ways. We first give another generalization bound tighter and easier to interpret. Moreover, we provide a new analysis of the constant term appearing in the bound that can be of high interest for develop… ▽ More

    Submitted 13 January, 2015; originally announced January 2015.

    Comments: NIPS 2014 Workshop on Transfer and Multi-task learning: Theory Meets Practice, Dec 2014, Montr{é}al, Canada

  22. arXiv:1412.4446  [pdf, other

    stat.ML cs.LG cs.NE

    Domain-Adversarial Neural Networks

    Authors: Hana Ajakan, Pascal Germain, Hugo Larochelle, François Laviolette, Mario Marchand

    Abstract: We introduce a new representation learning algorithm suited to the context of domain adaptation, in which data at training and test time come from similar but different distributions. Our algorithm is directly inspired by theory on domain adaptation suggesting that, for effective domain transfer to be achieved, predictions must be made based on a data representation that cannot discriminate betwee… ▽ More

    Submitted 9 February, 2015; v1 submitted 14 December, 2014; originally announced December 2014.

    Comments: The first version of this paper was accepted at the "Second Workshop on Transfer and Multi-Task Learning: Theory meets Practice" (NIPS 2014, Montreal, Canada). See: https://sites.google.com/site/multitaskwsnips2014/

  23. arXiv:1212.2340  [pdf, other

    stat.ML cs.LG

    PAC-Bayesian Learning and Domain Adaptation

    Authors: Pascal Germain, Amaury Habrard, François Laviolette, Emilie Morvant

    Abstract: In machine learning, Domain Adaptation (DA) arises when the distribution gen- erating the test (target) data differs from the one generating the learning (source) data. It is well known that DA is an hard task even under strong assumptions, among which the covariate-shift where the source and target distributions diverge only in their marginals, i.e. they have the same labeling function. Another p… ▽ More

    Submitted 11 December, 2012; originally announced December 2012.

    Comments: https://sites.google.com/site/multitradeoffs2012/

    Journal ref: Multi-Trade-offs in Machine Learning, NIPS 2012 Workshop, Lake Tahoe : United States (2012)