-
Neural Networks beyond explainability: Selective inference for sequence motifs
Authors:
Antoine Villié,
Philippe Veber,
Yohann de Castro,
Laurent Jacob
Abstract:
Over the past decade, neural networks have been successful at making predictions from biological sequences, especially in the context of regulatory genomics. As in other fields of deep learning, tools have been devised to extract features such as sequence motifs that can explain the predictions made by a trained network. Here we intend to go beyond explainable machine learning and introduce SEISM, a selective inference procedure to test the association between these extracted features and the predicted phenotype. In particular, we discuss how training a one-layer convolutional network is formally equivalent to selecting motifs maximizing some association score. We adapt existing sampling-based selective inference procedures by quantizing this selection over an infinite set to a large but finite grid. Finally, we show that sampling under a specific choice of parameters is sufficient to characterize the composite null hypothesis typically used for selective inference, a result that goes well beyond our particular framework. We illustrate the behavior of our method in terms of calibration, power and speed and discuss its power/speed trade-off with a simpler data-split strategy. SEISM paves the way to an easier analysis of neural networks used in regulatory genomics, and to more powerful methods for genome-wide association studies (GWAS).
Submitted 23 December, 2022;
originally announced December 2022.
-
Convolutional Kernel Networks for Graph-Structured Data
Authors:
Dexiong Chen,
Laurent Jacob,
Julien Mairal
Abstract:
We introduce a family of multilayer graph kernels and establish new links between graph convolutional neural networks and kernel methods. Our approach generalizes convolutional kernel networks to graph-structured data, by representing graphs as a sequence of kernel feature maps, where each node carries information about local graph substructures. On the one hand, the kernel point of view offers an unsupervised, expressive, and easy-to-regularize data representation, which is useful when limited samples are available. On the other hand, our model can also be trained end-to-end on large-scale data, leading to new types of graph convolutional neural networks. We show that our method achieves competitive performance on several graph classification benchmarks, while offering simple model interpretation. Our code is freely available at https://github.com/claying/GCKN.
Submitted 29 June, 2020; v1 submitted 11 March, 2020;
originally announced March 2020.
-
Recurrent Kernel Networks
Authors:
Dexiong Chen,
Laurent Jacob,
Julien Mairal
Abstract:
Substring kernels are classical tools for representing biological sequences or text. However, when large amounts of annotated data are available, models that allow end-to-end training such as neural networks are often preferred. Links between recurrent neural networks (RNNs) and substring kernels have recently been drawn, by formally showing that RNNs with specific activation functions were points in a reproducing kernel Hilbert space (RKHS). In this paper, we revisit this link by generalizing convolutional kernel networks (originally related to a relaxation of the mismatch kernel) to model gaps in sequences. This results in a new type of recurrent neural network which can be trained end-to-end with backpropagation, or without supervision by using kernel approximation techniques. We experimentally show that our approach is well suited to biological sequences, where it outperforms existing methods for protein classification tasks.
Submitted 17 October, 2019; v1 submitted 7 June, 2019;
originally announced June 2019.
-
Adoção de Social CRM em Micro e Pequenas Empresas: Uma Análise do Mercado Santareno [Adoption of Social CRM in Micro and Small Enterprises: An Analysis of the Santarém Market]
Authors:
Gustavo Nogueira de Sousa,
Luan Vinícius Huppes,
Antônio Fernando Lavareda Jacob Jr,
Fábio Manoel França Lobato
Abstract:
Online social networks have changed how people communicate and interact, especially in Customer Relationship Management (CRM). As a result, a new concept in business strategy combining CRM and social media has arisen, known as Social Customer Relationship Management (Social CRM). Although this is an emerging and promising research field, Micro and Small Enterprises (MSE) appear to have implemented few or no Social CRM processes. To test this hypothesis, this work conducts a market analysis in the city of Santarém, in the state of Pará, evaluating the adoption of Social CRM by MSE. The main contribution of this study is a better understanding of the dynamics between Social CRM and MSE. The results include a list of insights into products and solutions suitable for implementing Social CRM in MSE, with the potential to guide research and development projects in this area.
Submitted 26 November, 2018;
originally announced November 2018.
-
Development of a Social Network for Research Support and Individual Well-being Improvement
Authors:
Lucas V. A. Caldas,
Antonio F. L. Jacob Jr.,
Simone S. C. Silva,
Fernando A. R. Pontes,
Fábio M. F. Lobato
Abstract:
The ways of communication and social interaction are changing. Web users are becoming increasingly engaged with Online Social Networks (OSN), which has a significant impact on the relationship mechanisms between individuals and communities. Most OSN platforms have strict data access policies, which hinders their use in studies of psychological and social phenomena. This also hampers the development of computational methods to evaluate and improve social and individual well-being via the web. Aiming to fill this gap, we propose a platform that combines social network dynamics with forum features and gamification elements, targeting researchers interested in obtaining access to user data to study psychological and social phenomena.
Submitted 9 September, 2018;
originally announced September 2018.
-
Group Lasso with Overlaps: the Latent Group Lasso approach
Authors:
Guillaume Obozinski,
Laurent Jacob,
Jean-Philippe Vert
Abstract:
We study a norm for structured sparsity which leads to sparse linear predictors whose supports are unions of predefined overlapping groups of variables. We call the obtained formulation latent group Lasso, since it is based on applying the usual group Lasso penalty on a set of latent variables. A detailed analysis of the norm and its properties is presented and we characterize conditions under which the set of groups associated with latent variables are correctly identified. We motivate and discuss the delicate choice of weights associated to each group, and illustrate this approach on simulated data and on the problem of breast cancer prognosis from gene expression data.
Submitted 3 October, 2011;
originally announced October 2011.
-
Clustered Multi-Task Learning: A Convex Formulation
Authors:
Laurent Jacob,
Francis Bach,
Jean-Philippe Vert
Abstract:
In multi-task learning, several related tasks are considered simultaneously, with the hope that by an appropriate sharing of information across tasks, each task may benefit from the others. In the context of learning linear functions for supervised classification or regression, this can be achieved by including a priori information about the weight vectors associated with the tasks, and how they are expected to be related to each other. In this paper, we assume that tasks are clustered into groups, which are unknown beforehand, and that tasks within a group have similar weight vectors. We design a new spectral norm that encodes this a priori assumption, without prior knowledge of the partition of tasks into groups, resulting in a new convex optimization formulation for multi-task learning. We show in simulations on synthetic examples and on the IEDB MHC-I binding dataset that our approach outperforms well-known convex methods for multi-task learning, as well as related non-convex methods dedicated to the same problem.
Submitted 11 September, 2008;
originally announced September 2008.