Showing 1–13 of 13 results for author: Marchand, M

  1. arXiv:2304.02064  [pdf, other]

    cs.LG stat.ML

    Algorithm-Dependent Bounds for Representation Learning of Multi-Source Domain Adaptation

    Authors: Qi Chen, Mario Marchand

    Abstract: We use information-theoretic tools to derive a novel analysis of Multi-source Domain Adaptation (MDA) from the representation learning perspective. Concretely, we study joint distribution alignment for supervised MDA with few target labels and unsupervised MDA with pseudo labels, where the latter is relatively hard and less commonly studied. We further provide algorithm-dependent generalization bo…

    Submitted 4 April, 2023; originally announced April 2023.

  2. arXiv:2210.10781  [pdf, ps, other]

    stat.ML cs.LG

    Generalization Properties of Decision Trees on Real-valued and Categorical Features

    Authors: Jean-Samuel Leboeuf, Frédéric LeBlanc, Mario Marchand

    Abstract: We revisit binary decision trees from the perspective of partitions of the data. We introduce the notion of partitioning function, and we relate it to the growth function and to the VC dimension. We consider three types of features: real-valued, categorical ordinal and categorical nominal, with different split rules for each. For each feature type, we upper bound the partitioning function of the c…

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: 79 pages. arXiv admin note: text overlap with arXiv:2010.07374

  3. arXiv:2109.14595  [pdf, other]

    cs.LG stat.ML

    Generalization Bounds For Meta-Learning: An Information-Theoretic Analysis

    Authors: Qi Chen, Changjian Shui, Mario Marchand

    Abstract: We derive a novel information-theoretic analysis of the generalization property of meta-learning algorithms. Concretely, our analysis proposes a generic understanding of both the conventional learning-to-learn framework and the modern model-agnostic meta-learning (MAML) algorithms. Moreover, we provide a data-dependent generalization bound for a stochastic variant of MAML, which is non-vacuous for…

    Submitted 10 December, 2021; v1 submitted 29 September, 2021; originally announced September 2021.

  4. arXiv:1905.12131  [pdf, other]

    cs.LG stat.ML

    Adaptive Deep Kernel Learning

    Authors: Prudencio Tossou, Basile Dura, Francois Laviolette, Mario Marchand, Alexandre Lacoste

    Abstract: Deep kernel learning provides an elegant and principled framework for combining the structural properties of deep learning algorithms with the flexibility of kernel methods. By means of a deep neural network, we learn a parametrized kernel operator that can be combined with a differentiable kernel algorithm during inference. While previous work within this framework has focused on learning a singl…

    Submitted 11 December, 2020; v1 submitted 28 May, 2019; originally announced May 2019.

  5. arXiv:1612.01030  [pdf, other]

    q-bio.GN cs.LG stat.ML

    Large scale modeling of antimicrobial resistance with interpretable classifiers

    Authors: Alexandre Drouin, Frédéric Raymond, Gaël Letarte St-Pierre, Mario Marchand, Jacques Corbeil, François Laviolette

    Abstract: Antimicrobial resistance is an important public health concern that has implications in the practice of medicine worldwide. Accurately predicting resistance phenotypes from genome sequences shows great promise in promoting better use of antimicrobial agents, by determining which antibiotics are likely to be effective in specific clinical cases. In healthcare, this would allow for the design of tre…

    Submitted 3 December, 2016; originally announced December 2016.

    Comments: Peer-reviewed and accepted for presentation at the Machine Learning for Health Workshop, NIPS 2016, Barcelona, Spain

  6. arXiv:1505.07818  [pdf, other]

    stat.ML cs.LG cs.NE

    Domain-Adversarial Training of Neural Networks

    Authors: Yaroslav Ganin, Evgeniya Ustinova, Hana Ajakan, Pascal Germain, Hugo Larochelle, François Laviolette, Mario Marchand, Victor Lempitsky

    Abstract: We introduce a new representation learning approach for domain adaptation, in which data at training and test time come from similar but different distributions. Our approach is directly inspired by the theory on domain adaptation suggesting that, for effective domain transfer to be achieved, predictions must be made based on features that cannot discriminate between the training (source) and test…

    Submitted 26 May, 2016; v1 submitted 28 May, 2015; originally announced May 2015.

    Comments: Published in JMLR: http://jmlr.org/papers/v17/15-239.html

    Journal ref: Journal of Machine Learning Research 2016, vol. 17, p. 1-35

  7. arXiv:1505.06249  [pdf, other]

    q-bio.GN cs.LG stat.ML

    Greedy Biomarker Discovery in the Genome with Applications to Antimicrobial Resistance

    Authors: Alexandre Drouin, Sébastien Giguère, Maxime Déraspe, François Laviolette, Mario Marchand, Jacques Corbeil

    Abstract: The Set Covering Machine (SCM) is a greedy learning algorithm that produces sparse classifiers. We extend the SCM for datasets that contain a huge number of features. The whole genetic material of living organisms is an example of such a case, where the number of features exceeds 10^7. Three human pathogens were used to evaluate the performance of the SCM at predicting antimicrobial resistance. Our…

    Submitted 22 May, 2015; originally announced May 2015.

    Comments: Peer-reviewed and accepted for an oral presentation in the Greed is Great workshop at the International Conference on Machine Learning, Lille, France, 2015

  8. arXiv:1503.08329  [pdf, other]

    stat.ML cs.LG

    Risk Bounds for the Majority Vote: From a PAC-Bayesian Analysis to a Learning Algorithm

    Authors: Pascal Germain, Alexandre Lacasse, François Laviolette, Mario Marchand, Jean-Francis Roy

    Abstract: We propose an extensive analysis of the behavior of majority votes in binary classification. In particular, we introduce a risk bound for majority votes, called the C-bound, that takes into account the average quality of the voters and their average disagreement. We also propose an extensive PAC-Bayesian analysis that shows how the C-bound can be estimated from various observations contained in th…

    Submitted 28 July, 2015; v1 submitted 28 March, 2015; originally announced March 2015.

    Comments: Published in JMLR http://jmlr.org/papers/v16/germain15a.html

    Journal ref: Journal of Machine Learning Research 2015, vol. 16, p. 787-860

  9. arXiv:1412.4446  [pdf, other]

    stat.ML cs.LG cs.NE

    Domain-Adversarial Neural Networks

    Authors: Hana Ajakan, Pascal Germain, Hugo Larochelle, François Laviolette, Mario Marchand

    Abstract: We introduce a new representation learning algorithm suited to the context of domain adaptation, in which data at training and test time come from similar but different distributions. Our algorithm is directly inspired by theory on domain adaptation suggesting that, for effective domain transfer to be achieved, predictions must be made based on a data representation that cannot discriminate betwee…

    Submitted 9 February, 2015; v1 submitted 14 December, 2014; originally announced December 2014.

    Comments: The first version of this paper was accepted at the "Second Workshop on Transfer and Multi-Task Learning: Theory meets Practice" (NIPS 2014, Montreal, Canada). See: https://sites.google.com/site/multitaskwsnips2014/

  10. arXiv:1412.1074  [pdf, other]

    q-bio.GN cs.CE cs.LG stat.ML

    Learning interpretable models of phenotypes from whole genome sequences with the Set Covering Machine

    Authors: Alexandre Drouin, Sébastien Giguère, Vladana Sagatovich, Maxime Déraspe, François Laviolette, Mario Marchand, Jacques Corbeil

    Abstract: The increased affordability of whole genome sequencing has motivated its use for phenotypic studies. We address the problem of learning interpretable models for discrete phenotypes from whole genomes. We propose a general approach that relies on the Set Covering Machine and a k-mer representation of the genomes. We show results for the problem of predicting the resistance of Pseudomonas aeruginosa…

    Submitted 2 December, 2014; originally announced December 2014.

    Comments: Presented at Machine Learning in Computational Biology 2014, Montréal, Québec, Canada

  11. arXiv:1402.0796  [pdf, other]

    cs.LG stat.ML

    Sequential Model-Based Ensemble Optimization

    Authors: Alexandre Lacoste, Hugo Larochelle, François Laviolette, Mario Marchand

    Abstract: One of the most tedious tasks in the application of machine learning is model selection, i.e. hyperparameter selection. Fortunately, recent progress has been made in the automation of this process, through the use of sequential model-based optimization (SMBO) methods. This can be used to optimize a cross-validation performance of a learning algorithm over the value of its hyperparameters. However,…

    Submitted 4 February, 2014; originally announced February 2014.

  12. arXiv:1207.7253  [pdf, other]

    q-bio.QM cs.LG q-bio.BM stat.ML

    Learning a peptide-protein binding affinity predictor with kernel ridge regression

    Authors: Sébastien Giguère, Mario Marchand, François Laviolette, Alexandre Drouin, Jacques Corbeil

    Abstract: We propose a specialized string kernel for small bio-molecules, peptides and pseudo-sequences of binding interfaces. The kernel incorporates physico-chemical properties of amino acids and elegantly generalizes eight kernels, such as the Oligo, the Weighted Degree, the Blended Spectrum, and the Radial Basis Function. We provide a low complexity dynamic programming algorithm for the exact computation…

    Submitted 31 July, 2012; originally announced July 2012.

    Comments: 22 pages, 4 figures, 5 tables

    MSC Class: 92B05

    ACM Class: I.2.6; J.3; G.3; G.4; I.5.2

    Journal ref: BMC Bioinformatics 2013, 14:82

  13. arXiv:1005.0530  [pdf, ps, other]

    cs.LG cs.AI stat.ML

    Feature Selection with Conjunctions of Decision Stumps and Learning from Microarray Data

    Authors: Mohak Shah, Mario Marchand, Jacques Corbeil

    Abstract: One of the objectives of designing feature selection learning algorithms is to obtain classifiers that depend on a small number of attributes and have verifiable future performance guarantees. There are few, if any, approaches that successfully address the two goals simultaneously. Performance guarantees become crucial for tasks such as microarray data analysis due to very small sample sizes resul…

    Submitted 4 May, 2010; originally announced May 2010.