Skip to main content

Showing 1–24 of 24 results for author: Morvant, E

Searching in archive stat. Search in all archives.
.
  1. arXiv:2402.13285  [pdf, other

    stat.ML cs.LG

    Leveraging PAC-Bayes Theory and Gibbs Distributions for Generalization Bounds with Complexity Measures

    Authors: Paul Viallard, Rémi Emonet, Amaury Habrard, Emilie Morvant, Valentina Zantedeschi

    Abstract: In statistical learning theory, a generalization bound usually involves a complexity measure imposed by the considered theoretical framework. This limits the scope of such bounds, as other forms of capacity measures or regularizations are used in algorithms. In this paper, we leverage the framework of disintegrated PAC-Bayes bounds to derive a general generalization bound instantiable with arbitra… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: AISTATS 2024

  2. arXiv:2106.12535  [pdf, other

    cs.LG stat.ME stat.ML

    Learning Stochastic Majority Votes by Minimizing a PAC-Bayes Generalization Bound

    Authors: Valentina Zantedeschi, Paul Viallard, Emilie Morvant, Rémi Emonet, Amaury Habrard, Pascal Germain, Benjamin Guedj

    Abstract: We investigate a stochastic counterpart of majority votes over finite ensembles of classifiers, and study its generalization properties. While our approach holds for arbitrary distributions, we instantiate it with Dirichlet distributions: this allows for a closed-form and differentiable expression for the expected risk, which then turns the generalization bound into a tractable training objective.… ▽ More

    Submitted 19 October, 2021; v1 submitted 23 June, 2021; originally announced June 2021.

    Journal ref: Proceedings of the 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  3. arXiv:2104.13626  [pdf, other

    stat.ML cs.LG

    Self-Bounding Majority Vote Learning Algorithms by the Direct Minimization of a Tight PAC-Bayesian C-Bound

    Authors: Paul Viallard, Pascal Germain, Amaury Habrard, Emilie Morvant

    Abstract: In the PAC-Bayesian literature, the C-Bound refers to an insightful relation between the risk of a majority vote classifier (under the zero-one loss) and the first two moments of its margin (i.e., the expected margin and the voters' diversity). Until now, learning algorithms developed in this framework minimize the empirical version of the C-Bound, instead of explicit PAC-Bayesian generalization b… ▽ More

    Submitted 31 August, 2021; v1 submitted 28 April, 2021; originally announced April 2021.

    Journal ref: ECML PKDD 2021, Sep 2021, Bilbao, Spain

  4. arXiv:2102.11069  [pdf, other

    cs.LG cs.AI stat.ML

    A PAC-Bayes Analysis of Adversarial Robustness

    Authors: Paul Viallard, Guillaume Vidot, Amaury Habrard, Emilie Morvant

    Abstract: We propose the first general PAC-Bayesian generalization bounds for adversarial robustness, that estimate, at test time, how much a model will be invariant to imperceptible perturbations in the input. Instead of deriving a worst-case analysis of the risk of a hypothesis over all the possible perturbations, we leverage the PAC-Bayesian framework to bound the averaged risk on the perturbations for m… ▽ More

    Submitted 27 October, 2021; v1 submitted 19 February, 2021; originally announced February 2021.

    Journal ref: NeurIPS 2021, Dec 2021, Sydney, Australia

  5. arXiv:2102.08649  [pdf, other

    stat.ML cs.LG

    A General Framework for the Practical Disintegration of PAC-Bayesian Bounds

    Authors: Paul Viallard, Pascal Germain, Amaury Habrard, Emilie Morvant

    Abstract: PAC-Bayesian bounds are known to be tight and informative when studying the generalization ability of randomized classifiers. However, they require a loose and costly derandomization step when applied to some families of deterministic models such as neural networks. As an alternative to this step, we introduce new PAC-Bayesian generalization bounds that have the originality to provide disintegrate… ▽ More

    Submitted 18 September, 2023; v1 submitted 17 February, 2021; originally announced February 2021.

    Comments: Machine Learning, In press

  6. arXiv:2004.11829  [pdf, other

    cs.LG stat.ML

    A survey on domain adaptation theory: learning bounds and theoretical guarantees

    Authors: Ievgen Redko, Emilie Morvant, Amaury Habrard, Marc Sebban, Younès Bennani

    Abstract: All famous machine learning algorithms that comprise both supervised and semi-supervised learning work well only under a common assumption: the training and test data follow the same distribution. When the distribution changes, most statistical models must be reconstructed from newly collected data, which for some applications can be costly or impossible to obtain. Therefore, it has become necessa… ▽ More

    Submitted 13 July, 2022; v1 submitted 24 April, 2020; originally announced April 2020.

  7. arXiv:1909.01651  [pdf, other

    stat.ML cs.LG

    Metric Learning from Imbalanced Data

    Authors: Léo Gautheron, Emilie Morvant, Amaury Habrard, Marc Sebban

    Abstract: A key element of any machine learning algorithm is the use of a function that measures the dis/similarity between data points. Given a task, such a function can be optimized with a metric learning algorithm. Although this research field has received a lot of attention during the past decade, very few approaches have focused on learning a metric in an imbalanced scenario where the number of positiv… ▽ More

    Submitted 4 September, 2019; originally announced September 2019.

  8. arXiv:1906.06203  [pdf, other

    stat.ML cs.LG

    Learning Landmark-Based Ensembles with Random Fourier Features and Gradient Boosting

    Authors: Léo Gautheron, Pascal Germain, Amaury Habrard, Emilie Morvant, Marc Sebban, Valentina Zantedeschi

    Abstract: We propose a Gradient Boosting algorithm for learning an ensemble of kernel functions adapted to the task at hand. Unlike state-of-the-art Multiple Kernel Learning techniques that make use of a pre-computed dictionary of kernel functions to select from, at each iteration we fit a kernel by approximating it as a weighted sum of Random Fourier Features (RFF) and by optimizing their barycenter. This… ▽ More

    Submitted 14 June, 2019; originally announced June 2019.

  9. arXiv:1810.12683  [pdf, other

    stat.ML cs.LG

    Pseudo-Bayesian Learning with Kernel Fourier Transform as Prior

    Authors: Gaël Letarte, Emilie Morvant, Pascal Germain

    Abstract: We revisit Rahimi and Recht (2007)'s kernel random Fourier features (RFF) method through the lens of the PAC-Bayesian theory. While the primary goal of RFF is to approximate a kernel, we look at the Fourier transform as a prior distribution over trigonometric hypotheses. It naturally suggests learning a posterior on these hypotheses. We derive generalization bounds that are optimized by learning a… ▽ More

    Submitted 27 March, 2019; v1 submitted 30 October, 2018; originally announced October 2018.

    Comments: Published at AISTATS 2019

  10. arXiv:1808.05784  [pdf, other

    stat.ML cs.LG

    Multiview Boosting by Controlling the Diversity and the Accuracy of View-specific Voters

    Authors: Anil Goyal, Emilie Morvant, Pascal Germain, Massih-Reza Amini

    Abstract: In this paper we propose a boosting based multiview learning algorithm, referred to as PB-MVBoost, which iteratively learns i) weights over view-specific voters capturing view-specific information; and ii) weights over views by optimizing a PAC-Bayes multiview C-Bound that takes into account the accuracy of view-specific classifiers and the diversity between the views. We derive a generalization b… ▽ More

    Submitted 27 August, 2018; v1 submitted 17 August, 2018; originally announced August 2018.

  11. arXiv:1805.10212  [pdf, other

    stat.ML cs.LG

    Multiview Learning of Weighted Majority Vote by Bregman Divergence Minimization

    Authors: Anil Goyal, Emilie Morvant, Massih-Reza Amini

    Abstract: We tackle the issue of classifier combinations when observations have multiple views. Our method jointly learns view-specific weighted majority vote classifiers (i.e. for each view) over a set of base voters, and a second weighted majority vote classifier over the set of these view-specific weighted majority vote classifiers. We show that the empirical risk minimization of the final majority vote… ▽ More

    Submitted 25 May, 2018; originally announced May 2018.

  12. PAC-Bayes and Domain Adaptation

    Authors: Pascal Germain, Amaury Habrard, François Laviolette, Emilie Morvant

    Abstract: We provide two main contributions in PAC-Bayesian theory for domain adaptation where the objective is to learn, from a source distribution, a well-performing majority vote on a different, but related, target distribution. Firstly, we propose an improvement of the previous approach we proposed in Germain et al. (2013), which relies on a novel distribution pseudodistance based on a disagreement aver… ▽ More

    Submitted 15 November, 2019; v1 submitted 17 July, 2017; originally announced July 2017.

    Comments: Neurocomputing, Elsevier, 2019. arXiv admin note: substantial text overlap with arXiv:1503.06944

  13. arXiv:1606.07240  [pdf, other

    stat.ML

    PAC-Bayesian Analysis for a two-step Hierarchical Multiview Learning Approach

    Authors: Anil Goyal, Emilie Morvant, Pascal Germain, Massih-Reza Amini

    Abstract: We study a two-level multiview learning with more than two views under the PAC-Bayesian framework. This approach, sometimes referred as late fusion, consists in learning sequentially multiple view-specific classifiers at the first level, and then combining these view-specific classifiers at the second level. Our main theoretical result is a generalization bound on the risk of the majority vote whi… ▽ More

    Submitted 13 July, 2017; v1 submitted 23 June, 2016; originally announced June 2016.

  14. arXiv:1506.04573  [pdf, other

    stat.ML cs.LG

    A New PAC-Bayesian Perspective on Domain Adaptation

    Authors: Pascal Germain, Amaury Habrard, François Laviolette, Emilie Morvant

    Abstract: We study the issue of PAC-Bayesian domain adaptation: We want to learn, from a source domain, a majority vote model dedicated to a target one. Our theoretical contribution brings a new perspective by deriving an upper-bound on the target risk where the distributions' divergence---expressed as a ratio---controls the trade-off between a source error measure and the target voters' disagreement. Our b… ▽ More

    Submitted 26 July, 2016; v1 submitted 15 June, 2015; originally announced June 2015.

    Comments: Published at ICML 2016

  15. arXiv:1503.06944  [pdf, other

    stat.ML cs.LG

    PAC-Bayesian Theorems for Domain Adaptation with Specialization to Linear Classifiers

    Authors: Pascal Germain, Amaury Habrard, François Laviolette, Emilie Morvant

    Abstract: In this paper, we provide two main contributions in PAC-Bayesian theory for domain adaptation where the objective is to learn, from a source distribution, a well-performing majority vote on a different target distribution. On the one hand, we propose an improvement of the previous approach proposed by Germain et al. (2013), that relies on a novel distribution pseudodistance based on a disagreement… ▽ More

    Submitted 9 August, 2016; v1 submitted 24 March, 2015; originally announced March 2015.

    Comments: This report is a long version of our paper entitled A PAC-Bayesian Approach for Domain Adaptation with Specialization to Linear Classifiers published in the proceedings of the International Conference on Machine Learning (ICML) 2013. We improved our main results, extended our experiments, and proposed an extension to multisource domain adaptation

  16. arXiv:1501.03002  [pdf, ps, other

    stat.ML cs.LG

    An Improvement to the Domain Adaptation Bound in a PAC-Bayesian context

    Authors: Pascal Germain, Amaury Habrard, Francois Laviolette, Emilie Morvant

    Abstract: This paper provides a theoretical analysis of domain adaptation based on the PAC-Bayesian theory. We propose an improvement of the previous domain adaptation bound obtained by Germain et al. in two ways. We first give another generalization bound tighter and easier to interpret. Moreover, we provide a new analysis of the constant term appearing in the bound that can be of high interest for develop… ▽ More

    Submitted 13 January, 2015; originally announced January 2015.

    Comments: NIPS 2014 Workshop on Transfer and Multi-task learning: Theory Meets Practice, Dec 2014, Montr{é}al, Canada

  17. arXiv:1501.03001  [pdf, other

    stat.ML cs.LG

    On Generalizing the C-Bound to the Multiclass and Multi-label Settings

    Authors: Francois Laviolette, Emilie Morvant, Liva Ralaivola, Jean-Francis Roy

    Abstract: The C-bound, introduced in Lacasse et al., gives a tight upper bound on the risk of a binary majority vote classifier. In this work, we present a first step towards extending this work to more complex outputs, by providing generalizations of the C-bound to the multiclass and multi-label settings.

    Submitted 13 January, 2015; originally announced January 2015.

    Comments: NIPS 2014 Workshop on Representation and Learning Methods for Complex Outputs, Dec 2014, Montr{é}al, Canada

  18. arXiv:1410.0334  [pdf, other

    stat.ML cs.LG

    Domain adaptation of weighted majority votes via perturbed variation-based self-labeling

    Authors: Emilie Morvant

    Abstract: In machine learning, the domain adaptation problem arrives when the test (target) and the train (source) data are generated from different distributions. A key applied issue is thus the design of algorithms able to generalize on a new distribution, for which we have no label information. We focus on learning classification models defined as a weighted majority vote over a set of real-val ued funct… ▽ More

    Submitted 1 October, 2014; originally announced October 2014.

    Journal ref: Pattern Recognition Letters (2014) To be published

  19. arXiv:1408.1336  [pdf, other

    stat.ML

    On the Generalization of the C-Bound to Structured Output Ensemble Methods

    Authors: François Laviolette, Emilie Morvant, Liva Ralaivola, Jean-Francis Roy

    Abstract: This paper generalizes an important result from the PAC-Bayesian literature for binary classification to the case of ensemble methods for structured outputs. We prove a generic version of the \Cbound, an upper bound over the risk of models expressed as a weighted majority vote that is based on the first and second statistical moments of the vote's margin. This bound may advantageously $(i)$ be app… ▽ More

    Submitted 15 June, 2015; v1 submitted 6 August, 2014; originally announced August 2014.

  20. arXiv:1404.7796  [pdf, other

    stat.ML cs.LG cs.MM

    Majority Vote of Diverse Classifiers for Late Fusion

    Authors: Emilie Morvant, Amaury Habrard, Stéphane Ayache

    Abstract: In the past few years, a lot of attention has been devoted to multimedia indexing by fusing multimodal informations. Two kinds of fusion schemes are generally considered: The early fusion and the late fusion. We focus on late classifier fusion, where one combines the scores of each modality at the decision level. To tackle this problem, we investigate a recent and elegant well-founded quadratic pr… ▽ More

    Submitted 19 June, 2014; v1 submitted 30 April, 2014; originally announced April 2014.

    Comments: IAPR Joint International Workshops on Statistical Techniques in Pattern Recognition and Structural and Syntactic Pattern Recignition, Joensuu : Finland (2014)

  21. Domain Adaptation of Majority Votes via Perturbed Variation-based Label Transfer

    Authors: Emilie Morvant

    Abstract: We tackle the PAC-Bayesian Domain Adaptation (DA) problem. This arrives when one desires to learn, from a source distribution, a good weighted majority vote (over a set of classifiers) on a different target distribution. In this context, the disagreement between classifiers is known crucial to control. In non-DA supervised setting, a theoretical bound - the C-bound - involves this disagreement and… ▽ More

    Submitted 19 November, 2013; originally announced November 2013.

    Journal ref: Pattern Recognition Letters 2015

  22. arXiv:1212.2340  [pdf, other

    stat.ML cs.LG

    PAC-Bayesian Learning and Domain Adaptation

    Authors: Pascal Germain, Amaury Habrard, François Laviolette, Emilie Morvant

    Abstract: In machine learning, Domain Adaptation (DA) arises when the distribution gen- erating the test (target) data differs from the one generating the learning (source) data. It is well known that DA is an hard task even under strong assumptions, among which the covariate-shift where the source and target distributions diverge only in their marginals, i.e. they have the same labeling function. Another p… ▽ More

    Submitted 11 December, 2012; originally announced December 2012.

    Comments: https://sites.google.com/site/multitradeoffs2012/

    Journal ref: Multi-Trade-offs in Machine Learning, NIPS 2012 Workshop, Lake Tahoe : United States (2012)

  23. arXiv:1207.1019  [pdf, ps, other

    stat.ML cs.CV cs.LG cs.MM

    PAC-Bayesian Majority Vote for Late Classifier Fusion

    Authors: Emilie Morvant, Amaury Habrard, Stéphane Ayache

    Abstract: A lot of attention has been devoted to multimedia indexing over the past few years. In the literature, we often consider two kinds of fusion schemes: The early fusion and the late fusion. In this paper we focus on late classifier fusion, where one combines the scores of each modality at the decision level. To tackle this problem, we investigate a recent and elegant well-founded quadratic program n… ▽ More

    Submitted 4 July, 2012; originally announced July 2012.

    Comments: 7 pages, Research report

  24. arXiv:1202.6228  [pdf, other

    stat.ML cs.LG

    PAC-Bayesian Generalization Bound on Confusion Matrix for Multi-Class Classification

    Authors: Emilie Morvant, Sokol Koço, Liva Ralaivola

    Abstract: In this work, we propose a PAC-Bayes bound for the generalization risk of the Gibbs classifier in the multi-class classification framework. The novelty of our work is the critical use of the confusion matrix of a classifier as an error measure; this puts our contribution in the line of work aiming at dealing with performance measure that are richer than mere scalar criterion such as the misclassif… ▽ More

    Submitted 22 October, 2013; v1 submitted 28 February, 2012; originally announced February 2012.

    Comments: Arxiv: http://arxiv.longhoe.net/abs/1202.6228, Accepted at ICML 2012