Neural Networks beyond explainability: Selective inference for sequence motifs

Authors: Antoine Villié, Philippe Veber, Yohann de Castro, Laurent Jacob

Abstract: Over the past decade, neural networks have been successful at making predictions from biological sequences, especially in the context of regulatory genomics. As in other fields of deep learning, tools have been devised to extract features such as sequence motifs that can explain the predictions made by a trained network. Here we intend to go beyond explainable machine learning and introduce SEISM, a selective inference procedure to test the association between these extracted features and the predicted phenotype. In particular, we discuss how training a one-layer convolutional network is formally equivalent to selecting motifs maximizing some association score. We adapt existing sampling-based selective inference procedures by quantizing this selection over an infinite set to a large but finite grid. Finally, we show that sampling under a specific choice of parameters is sufficient to characterize the composite null hypothesis typically used for selective inference, a result that goes well beyond our particular framework. We illustrate the behavior of our method in terms of calibration, power and speed and discuss its power/speed trade-off with a simpler data-split strategy. SEISM paves the way to an easier analysis of neural networks used in regulatory genomics, and to more powerful methods for genome-wide association studies (GWAS).

Submitted 23 December, 2022; originally announced December 2022.

arXiv:2003.05189 [pdf, other]

Convolutional Kernel Networks for Graph-Structured Data

Authors: Dexiong Chen, Laurent Jacob, Julien Mairal

Abstract: We introduce a family of multilayer graph kernels and establish new links between graph convolutional neural networks and kernel methods. Our approach generalizes convolutional kernel networks to graph-structured data, by representing graphs as a sequence of kernel feature maps, where each node carries information about local graph substructures. On the one hand, the kernel point of view offers an unsupervised, expressive, and easy-to-regularize data representation, which is useful when limited samples are available. On the other hand, our model can also be trained end-to-end on large-scale data, leading to new types of graph convolutional neural networks. We show that our method achieves competitive performance on several graph classification benchmarks, while offering simple model interpretation. Our code is freely available at https://github.com/claying/GCKN.

Submitted 29 June, 2020; v1 submitted 11 March, 2020; originally announced March 2020.

Report number: hal-02151135

Journal ref: International Conference on Machine Learning (ICML), Jul 2020

arXiv:1906.03200 [pdf, other]

Recurrent Kernel Networks

Authors: Dexiong Chen, Laurent Jacob, Julien Mairal

Abstract: Substring kernels are classical tools for representing biological sequences or text. However, when large amounts of annotated data are available, models that allow end-to-end training such as neural networks are often preferred. Links between recurrent neural networks (RNNs) and substring kernels have recently been drawn, by formally showing that RNNs with specific activation functions were points in a reproducing kernel Hilbert space (RKHS). In this paper, we revisit this link by generalizing convolutional kernel networks (originally related to a relaxation of the mismatch kernel) to model gaps in sequences. It results in a new type of recurrent neural network which can be trained end-to-end with backpropagation, or without supervision by using kernel approximation techniques. We experimentally show that our approach is well suited to biological sequences, where it outperforms existing methods for protein classification tasks.

Submitted 17 October, 2019; v1 submitted 7 June, 2019; originally announced June 2019.

Report number: hal-02151135

Journal ref: Advances in Neural Information Processing Systems (NeurIPS), Dec 2019, Vancouver, Canada

arXiv:1811.11821 [pdf]

doi 10.5748/9788599693148-15CONTECSI/PS-5898

Adoção de Social CRM em Micro e Pequenas Empresas: Uma Análise do Mercado Santareno

Authors: Gustavo Nogueira de Sousa, Luan Vinícius Huppes, Antônio Fernando Lavareda Jacob Jr, Fábio Manoel França Lobato

Abstract: Online social networks have changed the ways of communication and social interaction, especially in Customer Relationship Management (CRM). In this sense, a new concept of business strategy involving CRM and social media has arisen, known as Social Customer Relationship Management. Although this is an emerging and promising research field, Micro and Small Enterprises (MSE) appear to have implemented few or no Social CRM processes. Aiming to test this hypothesis, this work conducts a market analysis in Santarém City, located in the Pará State, evaluating the adoption of Social CRM by MSE. The main contribution of this study is related to the understanding of the dynamics between Social CRM and MSE. The result is a list of insights into products and solutions suitable for the implementation of Social CRM by MSE, with the potential to guide research and development projects in this area.

Submitted 26 November, 2018; originally announced November 2018.

Comments: in Portuguese, Paper presented at the 15th International Conference On Information Systems & Technology Management

arXiv:1809.03020 [pdf, other]

Development of a Social Network for Research Support and Individual Well-being Improvement

Authors: Lucas V. A. Caldas, Antonio F. L. Jacob Jr., Simone S. C. Silva, Fernando A. R. Pontes, Fábio M. F. Lobato

Abstract: The ways of communication and social interaction are changing. Web users are becoming increasingly engaged with Online Social Networks (OSN), which has a significant impact on the relationship mechanisms between individuals and communities. Most OSN platforms have strict policies regarding data access, which hampers the use of their data in studies of psychological and social phenomena. It also affects the development of computational methods to evaluate and improve social and individual well-being via the web. Aiming to fill this gap, we propose a platform that brings together social network dynamics with forum features, together with gamification elements, targeting researchers interested in obtaining access to user data to study psychological and social phenomena.

Submitted 9 September, 2018; originally announced September 2018.

Comments: This paper was accepted in the IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2018

arXiv:1110.0413 [pdf, other]

Group Lasso with Overlaps: the Latent Group Lasso approach

Authors: Guillaume Obozinski, Laurent Jacob, Jean-Philippe Vert

Abstract: We study a norm for structured sparsity which leads to sparse linear predictors whose supports are unions of predefined overlapping groups of variables. We call the obtained formulation latent group Lasso, since it is based on applying the usual group Lasso penalty on a set of latent variables. A detailed analysis of the norm and its properties is presented and we characterize conditions under which the set of groups associated with latent variables are correctly identified. We motivate and discuss the delicate choice of weights associated to each group, and illustrate this approach on simulated data and on the problem of breast cancer prognosis from gene expression data.

Submitted 3 October, 2011; originally announced October 2011.

arXiv:0809.2085 [pdf, ps, other]

Clustered Multi-Task Learning: A Convex Formulation

Authors: Laurent Jacob, Francis Bach, Jean-Philippe Vert

Abstract: In multi-task learning several related tasks are considered simultaneously, with the hope that by an appropriate sharing of information across tasks, each task may benefit from the others. In the context of learning linear functions for supervised classification or regression, this can be achieved by including a priori information about the weight vectors associated with the tasks, and how they are expected to be related to each other. In this paper, we assume that tasks are clustered into groups, which are unknown beforehand, and that tasks within a group have similar weight vectors. We design a new spectral norm that encodes this a priori assumption, without the prior knowledge of the partition of tasks into groups, resulting in a new convex optimization formulation for multi-task learning. We show in simulations on synthetic examples and on the IEDB MHC-I binding dataset, that our approach outperforms well-known convex methods for multi-task learning, as well as related non convex methods dedicated to the same problem.

Submitted 11 September, 2008; originally announced September 2008.

Showing 1–7 of 7 results for author: Jacob, L