Skip to main content

Showing 1–9 of 9 results for author: Jacob, L

Searching in archive stat. Search in all archives.
.
  1. arXiv:2212.12542  [pdf, ps, other

    q-bio.GN cs.LG stat.ML

    Neural Networks beyond explainability: Selective inference for sequence motifs

    Authors: Antoine VilliƩ, Philippe Veber, Yohann de Castro, Laurent Jacob

    Abstract: Over the past decade, neural networks have been successful at making predictions from biological sequences, especially in the context of regulatory genomics. As in other fields of deep learning, tools have been devised to extract features such as sequence motifs that can explain the predictions made by a trained network. Here we intend to go beyond explainable machine learning and introduce SEISM,… ▽ More

    Submitted 23 December, 2022; originally announced December 2022.

  2. arXiv:2003.05189  [pdf, other

    stat.ML cs.LG

    Convolutional Kernel Networks for Graph-Structured Data

    Authors: Dexiong Chen, Laurent Jacob, Julien Mairal

    Abstract: We introduce a family of multilayer graph kernels and establish new links between graph convolutional neural networks and kernel methods. Our approach generalizes convolutional kernel networks to graph-structured data, by representing graphs as a sequence of kernel feature maps, where each node carries information about local graph substructures. On the one hand, the kernel point of view offers an… ▽ More

    Submitted 29 June, 2020; v1 submitted 11 March, 2020; originally announced March 2020.

    Report number: hal-02151135

    Journal ref: International Conference on Machine Learning (ICML), Jul 2020

  3. arXiv:1906.03200  [pdf, other

    stat.ML cs.LG

    Recurrent Kernel Networks

    Authors: Dexiong Chen, Laurent Jacob, Julien Mairal

    Abstract: Substring kernels are classical tools for representing biological sequences or text. However, when large amounts of annotated data are available, models that allow end-to-end training such as neural networks are often preferred. Links between recurrent neural networks (RNNs) and substring kernels have recently been drawn, by formally showing that RNNs with specific activation functions were points… ▽ More

    Submitted 17 October, 2019; v1 submitted 7 June, 2019; originally announced June 2019.

    Report number: hal-02151135

    Journal ref: Advances in Neural Information Processing Systems (NeurIPS), Dec 2019, Vancouver, Canada

  4. arXiv:1211.4259  [pdf, ps, other

    stat.AP q-bio.GN stat.ME

    Correcting gene expression data when neither the unwanted variation nor the factor of interest are observed

    Authors: Laurent Jacob, Johann Gagnon-Bartsch, Terence P. Speed

    Abstract: When dealing with large scale gene expression studies, observations are commonly contaminated by unwanted variation factors such as platforms or batches. Not taking this unwanted variation into account when analyzing the data can lead to spurious associations and to missing important signals. When the analysis is unsupervised, e.g., when the goal is to cluster the samples or to build a corrected v… ▽ More

    Submitted 18 November, 2012; originally announced November 2012.

  5. arXiv:1206.6980  [pdf, ps, other

    stat.AP q-bio.QM

    More power via graph-structured tests for differential expression of gene networks

    Authors: Laurent Jacob, Pierre Neuvial, Sandrine Dudoit

    Abstract: We consider multivariate two-sample tests of means, where the location shift between the two populations is expected to be related to a known graph structure. An important application of such tests is the detection of differentially expressed genes between two patient populations, as shifts in expression levels are expected to be coherent with the structure of graphs reflecting gene properties suc… ▽ More

    Submitted 29 June, 2012; originally announced June 2012.

    Comments: Published in at http://dx.doi.org/10.1214/11-AOAS528 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org). arXiv admin note: substantial text overlap with arXiv:1009.5173

    Report number: IMS-AOAS-AOAS528

    Journal ref: Annals of Applied Statistics 2012, Vol. 6, No. 2, 561-600

  6. arXiv:1110.0413  [pdf, other

    stat.ML cs.LG

    Group Lasso with Overlaps: the Latent Group Lasso approach

    Authors: Guillaume Obozinski, Laurent Jacob, Jean-Philippe Vert

    Abstract: We study a norm for structured sparsity which leads to sparse linear predictors whose supports are unions of prede ned overlap** groups of variables. We call the obtained formulation latent group Lasso, since it is based on applying the usual group Lasso penalty on a set of latent variables. A detailed analysis of the norm and its properties is presented and we characterize conditions under whic… ▽ More

    Submitted 3 October, 2011; originally announced October 2011.

  7. arXiv:1108.2401  [pdf, ps, other

    math.ST stat.ME stat.ML

    A More Powerful Two-Sample Test in High Dimensions using Random Projection

    Authors: Miles E. Lopes, Laurent J. Jacob, Martin J. Wainwright

    Abstract: We consider the hypothesis testing problem of detecting a shift between the means of two multivariate normal distributions in the high-dimensional setting, allowing for the data dimension p to exceed the sample size n. Specifically, we propose a new test statistic for the two-sample test of means that integrates a random projection with the classical Hotelling T^2 statistic. Working under a high-d… ▽ More

    Submitted 13 September, 2015; v1 submitted 11 August, 2011; originally announced August 2011.

    Comments: Version 3 is an extended version of our NIPS 2011 conference paper. This should be regarded as the final version and cited as a NIPS 2011 paper. Note that version3=version1. Also, version 2 should be considered as defunct, as it contains an error in the variance formula in equation (4)

  8. arXiv:1009.5173  [pdf, ps, other

    q-bio.QM stat.AP

    Gains in Power from Structured Two-Sample Tests of Means on Graphs

    Authors: Laurent Jacob, Pierre Neuvial, Sandrine Dudoit

    Abstract: We consider multivariate two-sample tests of means, where the location shift between the two populations is expected to be related to a known graph structure. An important application of such tests is the detection of differentially expressed genes between two patient populations, as shifts in expression levels are expected to be coherent with the structure of graphs reflecting gene properties suc… ▽ More

    Submitted 27 September, 2010; originally announced September 2010.

    Journal ref: Annals of Applied Statistics 2012, Vol. 6, No. 2, 561-600

  9. arXiv:1001.3109  [pdf, ps, other

    stat.ML q-bio.GN q-bio.QM stat.AP

    Increasing stability and interpretability of gene expression signatures

    Authors: Anne-Claire Haury, Laurent Jacob, Jean-Philippe Vert

    Abstract: Motivation : Molecular signatures for diagnosis or prognosis estimated from large-scale gene expression data often lack robustness and stability, rendering their biological interpretation challenging. Increasing the signature's interpretability and stability across perturbations of a given dataset and, if possible, across datasets, is urgently needed to ease the discovery of important biological… ▽ More

    Submitted 18 January, 2010; originally announced January 2010.