-
CHANI: Correlation-based Hawkes Aggregation of Neurons with bio-Inspiration
Authors:
Sophie Jaffard,
Samuel Vaiter,
Patricia Reynaud-Bouret
Abstract:
The present work aims to prove mathematically that a biologically inspired neural network can learn a classification task using only local transformations. To this end, we propose a spiking neural network named CHANI (Correlation-based Hawkes Aggregation of Neurons with bio-Inspiration), whose neurons' activity is modeled by Hawkes processes. Synaptic weights are updated by an expert aggregation algorithm, providing a local and simple learning rule. We were able to prove that our network can learn on average and asymptotically. Moreover, we demonstrated that it automatically produces neuronal assemblies, in the sense that the network can encode several classes and that the same neuron in the intermediate layers might be activated by more than one class, and we provided numerical simulations on a synthetic dataset. This theoretical approach contrasts with the traditional empirical validation of biologically inspired networks and paves the way for understanding how local learning rules enable neurons to form assemblies able to represent complex concepts.
Submitted 29 May, 2024;
originally announced May 2024.
-
General oracle inequalities for a penalized log-likelihood criterion based on non-stationary data
Authors:
Julien Aubert,
Luc Lehéricy,
Patricia Reynaud-Bouret
Abstract:
We prove oracle inequalities for a penalized log-likelihood criterion that hold even if the data are not independent and not stationary, based on a martingale approach. The assumptions are checked for various contexts: density estimation with independent and identically distributed (i.i.d.) data, hidden Markov models, spiking neural networks, and adversarial bandits. In each case, we compare our results to the literature, showing that, although we lose some logarithmic factors in the most classical case (i.i.d.), these results are comparable to or more general than the existing results in the more dependent cases.
Submitted 17 May, 2024;
originally announced May 2024.
-
Separation rates for the detection of synchronization of interacting point processes in a mean field frame. Application to neuroscience
Authors:
Josué Tchouanti,
Éva Löcherbach,
Patricia Reynaud-Bouret,
Etienne Tanré
Abstract:
We develop and study a statistical test to detect synchrony in spike trains. Our test is based on the number of coincidences between two trains of spikes. The data are supplied in the form of \(n\) pairs (assumed to be independent) of spike trains. The aim is to assess whether the two trains in a pair are also independent. Our approach is based on previous results of Albert et al. (2015, 2019) and Kim et al. (2022) that we extend to our setting, focusing on the construction of a non-asymptotic criterion ensuring the detection of synchronization in the framework of permutation tests. Our criterion is constructed such that it ensures the control of the Type II error, while the Type I error is controlled by construction. We illustrate our results within two classical models of interacting neurons: the jittering Poisson model, and Hawkes processes having \(M\) components interacting in a mean field frame and evolving in the stationary regime. For this latter model, we obtain a lower bound on the sample size \(n\) necessary to detect the dependency between two neurons.
Submitted 2 February, 2024;
originally announced February 2024.
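The abstract above describes a permutation test built on coincidence counts between paired spike trains. The following is a minimal sketch of that general idea, not the authors' procedure: the coincidence definition (pairs of spikes closer than a hypothetical window `delta`), the function names, and the Monte Carlo p-value are all assumptions made for illustration.

```python
import random


def coincidence_count(train_a, train_b, delta):
    """Count pairs (s, t), s in train_a and t in train_b, with |s - t| <= delta."""
    return sum(1 for s in train_a for t in train_b if abs(s - t) <= delta)


def permutation_pvalue(pairs, delta, n_perm=999, seed=0):
    """Monte Carlo permutation p-value for independence within pairs.

    `pairs` is a list of (train_a, train_b) trials. Under independence,
    re-pairing the second trains across trials leaves the distribution of
    the total coincidence count unchanged, so large observed counts are
    evidence of synchrony.
    """
    rng = random.Random(seed)
    firsts = [a for a, _ in pairs]
    seconds = [b for _, b in pairs]

    def total_coincidences(order):
        # Pair trial i of the first neuron with trial order[i] of the second.
        return sum(
            coincidence_count(firsts[i], seconds[j], delta)
            for i, j in enumerate(order)
        )

    observed = total_coincidences(range(len(pairs)))
    exceed = 0
    for _ in range(n_perm):
        order = list(range(len(pairs)))
        rng.shuffle(order)
        if total_coincidences(order) >= observed:
            exceed += 1
    # The "+1" terms count the observed statistic among the permutations.
    return (1 + exceed) / (1 + n_perm)
```

Rejecting when the returned value is at most a chosen level controls the Type I error by the usual exchangeability argument; the non-asymptotic Type II guarantees discussed in the abstract are not reproduced by this sketch.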
-
Non-asymptotic statistical test of the diffusion coefficient of stochastic differential equations
Authors:
Anna Melnykova,
Patricia Reynaud-Bouret,
Adeline Samson
Abstract:
We develop several statistical tests of the determinant of the diffusion coefficient of a stochastic differential equation, based on discrete observations on a time interval $[0,T]$ sampled with a time step $\Delta$. Our main contribution is to control the Type I and Type II errors of the test in a non-asymptotic setting, i.e. when the number of observations and the time step are fixed. The test statistics are calculated from the process increments. In dimension 1, the density of the test statistic is explicit. In dimension 2, the test statistic has no explicit density but upper and lower bounds are proved. We also propose a multiple testing procedure in dimension greater than 2. Every test is proved to be of a given non-asymptotic level and separability conditions to control their power are also provided. A numerical study illustrates the properties of the tests for stochastic processes with known or estimated drifts.
Submitted 21 March, 2024; v1 submitted 20 July, 2023;
originally announced July 2023.
-
On the convergence of the MLE as an estimator of the learning rate in the Exp3 algorithm
Authors:
Julien Aubert,
Luc Lehéricy,
Patricia Reynaud-Bouret
Abstract:
When fitting the learning data of an individual to algorithm-like learning models, the observations are so dependent and non-stationary that one may wonder what the classical Maximum Likelihood Estimator (MLE) could do, even if it is the usual tool applied to experimental cognition. Our objective in this work is to show that the estimation of the learning rate cannot be efficient if the learning rate is constant in the classical Exp3 (Exponential weights for Exploration and Exploitation) algorithm. Secondly, we show that if the learning rate decreases polynomially with the sample size, then the prediction error and in some cases the estimation error of the MLE satisfy bounds in probability that decrease at a polynomial rate.
Submitted 11 May, 2023;
originally announced May 2023.
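For readers unfamiliar with Exp3, the role of the learning rate mentioned in the abstract above can be sketched as follows. This is a common textbook variant (exponential weights on importance-weighted gain estimates, without an explicit exploration mixture); the exact variant analysed in the paper is not stated in the abstract, and the names `run_exp3`, `reward_fn` and `eta` are assumptions introduced here.

```python
import math
import random


def exp3_probabilities(gain_estimates, eta):
    """Softmax of eta * estimated cumulative gains (shifted for stability)."""
    top = max(gain_estimates)
    weights = [math.exp(eta * (g - top)) for g in gain_estimates]
    total = sum(weights)
    return [w / total for w in weights]


def run_exp3(reward_fn, n_arms, n_rounds, eta, seed=0):
    """Play n_rounds of Exp3 with a constant learning rate eta.

    reward_fn(t, arm) must return a reward in [0, 1]; only the reward of
    the chosen arm is observed, as in the bandit setting.
    """
    rng = random.Random(seed)
    gains = [0.0] * n_arms
    choices = []
    for t in range(n_rounds):
        probs = exp3_probabilities(gains, eta)
        arm = rng.choices(range(n_arms), weights=probs)[0]
        reward = reward_fn(t, arm)
        # Importance weighting keeps the gain estimate unbiased.
        gains[arm] += reward / probs[arm]
        choices.append(arm)
    return choices, exp3_probabilities(gains, eta)
```

Fitting such a model to behavioural data amounts to maximising, over `eta`, the product of the probabilities assigned to the observed choices; a learning rate that decreases with the sample size would replace the constant `eta` inside the loop.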
-
Provable local learning rule by expert aggregation for a Hawkes network
Authors:
Sophie Jaffard,
Samuel Vaiter,
Alexandre Muzy,
Patricia Reynaud-Bouret
Abstract:
We propose a simple network of Hawkes processes as a cognitive model capable of learning to classify objects. Our learning algorithm, named HAN for Hawkes Aggregation of Neurons, is based on a local synaptic learning rule based on spiking probabilities at each output node. We were able to use local regret bounds to prove mathematically that the network is able to learn on average and even asymptotically under more restrictive assumptions.
Submitted 23 February, 2024; v1 submitted 17 April, 2023;
originally announced April 2023.
-
Neural Coding as a Statistical Testing Problem
Authors:
Guilherme Ost,
Patricia Reynaud-Bouret
Abstract:
We take the testing perspective to understand what the minimal discrimination time between two stimuli is for different types of rate coding neurons. Our main goal is to describe the testing abilities of two different encoding systems: place cells and grid cells. In particular, we show, through the notion of adaptation, that a fixed place cell system can have a minimum discrimination time that decreases when the stimuli are further away. This could be a considerable advantage for the place cell system that could complement the grid cell system, which is able to discriminate stimuli that are much closer than place cells.
Submitted 27 December, 2023; v1 submitted 2 September, 2022;
originally announced September 2022.
-
Sliding window strategy for convolutional spike sorting with Lasso : Algorithm, theoretical guarantees and complexity
Authors:
Laurent Dragoni,
Rémi Flamary,
Karim Lounici,
Patricia Reynaud-Bouret
Abstract:
Spike sorting is a class of algorithms used in neuroscience to attribute the time occurrences of particular electric signals, called action potentials or spikes, to neurons. We rephrase this problem as a particular optimization problem: Lasso for convolutional models in high dimension. Lasso (i.e. least absolute shrinkage and selection operator) is a very generic tool in machine learning that helps us to look for sparse solutions (here the time occurrences). However, for the size of the problem at hand in this neuroscience context, the classical Lasso solvers fail. We present here a new and much faster algorithm. Making use of biological properties related to neurons, we explain how the particular structure of the problem allows several optimizations, leading to an algorithm with a temporal complexity which grows linearly with respect to the size of the recorded signal and can be performed online. Moreover, the spatial separability of the initial problem allows it to be broken into subproblems, further reducing the complexity and making possible its application to the latest recording devices, which comprise a large number of sensors. We provide several mathematical results: the size and numerical complexity of the subproblems can be estimated mathematically by using percolation theory. We also show under reasonable assumptions that the Lasso estimator retrieves the true time occurrences of the spikes with large probability. Finally, the theoretical time complexity of the algorithm is given. Numerical simulations are also provided in order to illustrate the efficiency of our approach.
Submitted 11 April, 2022; v1 submitted 29 October, 2021;
originally announced October 2021.
-
Kalikow decomposition for counting processes with stochastic intensity and application to simulation algorithms
Authors:
Tien Cuong Phi,
Eva Löcherbach,
Patricia Reynaud-Bouret
Abstract:
We propose a new Kalikow decomposition for continuous time multivariate counting processes, on potentially infinite networks. We prove the existence of such a decomposition in various cases. This decomposition allows us to derive simulation algorithms that hold either for stationary processes with potentially infinite network but bounded intensities, or for processes with unbounded intensities in a finite network and with empty past before 0. The Kalikow decomposition is not unique and we discuss the choice of the decomposition in terms of algorithmic efficiency in certain cases. We apply these methods on several examples: linear Hawkes process, age dependent Hawkes process, exponential Hawkes process, Galves-Löcherbach process.
Submitted 2 May, 2022; v1 submitted 1 April, 2021;
originally announced April 2021.
-
Optimal Change-Point Detection and Localization
Authors:
Nicolas Verzelen,
Magalie Fromont,
Matthieu Lerasle,
Patricia Reynaud-Bouret
Abstract:
Given a time series ${\bf Y}$ in $\mathbb{R}^n$, with a piecewise constant mean and independent components, the twin problems of change-point detection and change-point localization respectively amount to detecting the existence of times where the mean varies and estimating the positions of those change-points. In this work, we tightly characterize optimal rates for both problems and uncover the phase transition phenomenon from a global testing problem to a local estimation problem. Introducing a suitable definition of the energy of a change-point, we first establish in the single change-point setting that the optimal detection threshold is $\sqrt{2\log\log(n)}$. When the energy is just above the detection threshold, then the problem of localizing the change-point becomes purely parametric: it only depends on the difference in means and not on the position of the change-point anymore. Interestingly, for most change-point positions, it is possible to detect and localize them at a much smaller energy level. In the multiple change-point setting, we establish the energy detection threshold and show similarly that the optimal localization error of a specific change-point becomes purely parametric. Along the way, tight optimal rates for Hausdorff and $l_1$ estimation losses of the vector of all change-point positions are also established. Two procedures achieving these optimal rates are introduced. The first one is a least-squares estimator with a new multiscale penalty that favours well spread change-points. The second one is a two-step multiscale post-processing procedure whose computational complexity can be as low as $O(n\log(n))$. Notably, these two procedures accommodate the presence of possibly many low-energy and therefore undetectable change-points and are still able to detect and localize high-energy change-points even in the presence of those nuisance parameters.
Submitted 15 November, 2020; v1 submitted 22 October, 2020;
originally announced October 2020.
-
Efficient Simulation of Sparse Graphs of Point Processes
Authors:
Cyrille Mascart,
Alexandre Muzy,
Patricia Reynaud-Bouret
Abstract:
We derive new discrete event simulation algorithms for marked time point processes. The main idea is to couple a special structure, namely the associated local independence graph, as defined by Didelez arXiv:0710.5874, with the activity tracking algorithm [muzy, 2019] for achieving high performance asynchronous simulations. Compared with classical algorithms, this drastically reduces the computational complexity, especially when the graph is sparse.
[muzy, 2019] A. Muzy. 2019. Exploiting activity for the modeling and simulation of dynamics and learning processes in hierarchical (neurocognitive) systems. (Submitted to) Magazine of Computing in Science & Engineering (2019)
Submitted 4 March, 2021; v1 submitted 6 January, 2020;
originally announced January 2020.
-
Event-scheduling algorithms with Kalikow decomposition for simulating potentially infinite neuronal networks
Authors:
Tien Cuong Phi,
Alexandre Muzy,
Patricia Reynaud-Bouret
Abstract:
Event-scheduling algorithms can compute in continuous time the next occurrence of points (as events) of a counting process based on their current conditional intensity. In particular, event-scheduling algorithms can be adapted to simulate the activity of finite neuronal networks. These algorithms are based on Ogata's thinning strategy [Oga81], which always needs to simulate the whole network to access the behaviour of one particular neuron of the network. On the other hand, for discrete time models, theoretical algorithms based on Kalikow decomposition can pick at random influencing neurons and perform a perfect simulation (meaning without approximations) of the behaviour of one given neuron embedded in an infinite network, at every time step. These algorithms are currently not computationally tractable in continuous time. To solve this problem, an event-scheduling algorithm with Kalikow decomposition is proposed here for the sequential simulation of point process neuronal models satisfying this decomposition. This new algorithm is applied to infinite neuronal networks whose finite time simulation is a prerequisite to realistic brain modeling.
Submitted 23 October, 2019;
originally announced October 2019.
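As background for the thinning strategy named in the abstract above, here is a minimal sketch of Ogata-style thinning for a single linear Hawkes process with an exponential kernel. It illustrates the classical baseline only, not the Kalikow-based algorithm of the paper; the parametrisation (baseline `mu`, jump `alpha`, decay `beta`) and the function name are assumptions chosen for illustration.

```python
import math
import random


def simulate_hawkes(mu, alpha, beta, horizon, seed=0):
    """Simulate event times on [0, horizon] by thinning.

    Intensity: lambda(t) = mu + sum over past events t_i of
    alpha * exp(-beta * (t - t_i)). Between events the intensity only
    decreases, so its current value is a valid upper bound for proposals.
    """
    rng = random.Random(seed)
    t = 0.0
    excitation = 0.0  # self-exciting part of the intensity at time t
    events = []
    while True:
        upper = mu + excitation
        wait = rng.expovariate(upper)  # candidate from the dominating Poisson process
        if t + wait > horizon:
            break
        t += wait
        excitation *= math.exp(-beta * wait)  # decay up to the candidate time
        # Accept the candidate with probability lambda(t) / upper.
        if rng.random() * upper <= mu + excitation:
            events.append(t)
            excitation += alpha  # each accepted event raises the intensity
    return events
```

In a network, the same scheme requires the intensities of all neurons to be tracked jointly, which is the cost the abstract attributes to thinning-based event scheduling.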
-
Large scale Lasso with windowed active set for convolutional spike sorting
Authors:
Laurent Dragoni,
Rémi Flamary,
Karim Lounici,
Patricia Reynaud-Bouret
Abstract:
Spike sorting is a fundamental preprocessing step in neuroscience that is central to accessing simultaneous but distinct neuronal activities and therefore to better understanding the animal or even human brain. But numerical complexity limits studies that require processing large scale datasets in terms of number of electrodes, neurons, spikes and length of the recorded signals. We propose in this work a novel active set algorithm aimed at solving the Lasso for a classical convolutional model. Our algorithm can be implemented efficiently on parallel architectures and has a linear complexity w.r.t. the temporal dimensionality, which ensures scaling and will open the door to online spike sorting. We provide theoretical results about the complexity of the algorithm and illustrate it in numerical experiments along with results about the accuracy of the spike recovery and robustness to the regularization parameter.
Submitted 28 June, 2019;
originally announced June 2019.
-
Exponential inequality for chaos based on sampling without replacement
Authors:
P Hodara,
Patricia Reynaud-Bouret
Abstract:
We are interested in the behavior of particular functionals, in a framework where the only source of randomness is a sampling without replacement. More precisely the aim of this short note is to prove an exponential concentration inequality for special U-statistics of order 2, that can be seen as chaos.
Submitted 28 August, 2018;
originally announced August 2018.
-
Sparse space-time models: Concentration Inequalities and Lasso
Authors:
Guilherme Ost,
Patricia Reynaud-Bouret
Abstract:
Inspired by Kalikow-type decompositions, we introduce a new stochastic model of infinite neuronal networks, for which we establish sharp oracle inequalities for Lasso methods and restricted eigenvalue properties for the associated Gram matrix with high probability. These results hold even if the network is only partially observed. The main argument relies on the fact that concentration inequalities can easily be derived whenever the transition probabilities of the underlying process admit a sparse space-time representation.
Submitted 12 August, 2019; v1 submitted 19 July, 2018;
originally announced July 2018.
-
Continuous testing for Poisson process intensities: A new perspective on scanning statistics
Authors:
Franck Picard,
Patricia Reynaud-Bouret,
Etienne Roquain
Abstract:
We propose a novel continuous testing framework to test the intensities of Poisson processes. This framework allows a rigorous definition of the complete testing procedure, from an infinite number of hypotheses to joint error rates. Our work extends traditional procedures based on scanning windows, by controlling the family-wise error rate and the false discovery rate in a non-asymptotic manner and in a continuous way. The decision rule is based on a $p$-value process that can be estimated by a Monte-Carlo procedure. We also propose new test statistics based on kernels. Our method is applied in Neurosciences and Genomics through the standard test of homogeneity, and the two-sample test.
Submitted 24 May, 2017;
originally announced May 2017.
-
Optimal kernel selection for density estimation
Authors:
M Lerasle,
N Magalhães,
P Reynaud-Bouret
Abstract:
We provide new general kernel selection rules thanks to penalized least-squares criteria. We derive optimal oracle inequalities using adequate concentration tools. We also investigate the problem of minimal penalty as described in [BM07].
Submitted 6 November, 2015;
originally announced November 2015.
-
A data-dependent weighted LASSO under Poisson noise
Authors:
Xin Jiang,
Patricia Reynaud-Bouret,
Vincent Rivoirard,
Laure Sansonnet,
Rebecca Willett
Abstract:
Sparse linear inverse problems appear in a variety of settings, but often the noise contaminating observations cannot accurately be described as bounded by or arising from a Gaussian distribution. Poisson observations in particular are a feature of several real-world applications. Previous work on sparse Poisson inverse problems encountered several limiting technical hurdles. This paper describes a novel alternative analysis approach for sparse Poisson inverse problems that (a) sidesteps the technical challenges in previous work, (b) admits estimators that can readily be computed using off-the-shelf LASSO algorithms, and (c) hints at a general framework for broad classes of noise in sparse linear inverse problems. At the heart of this new approach lies a weighted LASSO estimator for which data-dependent weights are based on Poisson concentration inequalities. Unlike previous analyses of the weighted LASSO, the proposed analysis depends on conditions which can be checked or shown to hold in general settings with high probability.
Submitted 13 February, 2018; v1 submitted 29 September, 2015;
originally announced September 2015.
-
Microscopic approach of a time elapsed neural model
Authors:
Julien Chevallier,
Maria J. Caceres,
Marie Doumic,
Patricia Reynaud-Bouret
Abstract:
Spike trains are the main components of information processing in the brain. To model spike trains, several point processes have been investigated in the literature. More macroscopic approaches have also been studied, using partial differential equation models. The main aim of the present article is to build a bridge between several point process models (Poisson, Wold, Hawkes) that have been proved to statistically fit real spike train data and age-structured partial differential equations as introduced by Pakdaman, Perthame and Salort.
Submitted 8 June, 2015;
originally announced June 2015.
-
A Distribution Free Unitary Events Method based on Delayed Coincidence Count
Authors:
Mélisande Albert,
Yann Bouret,
Magalie Fromont,
Patricia Reynaud-Bouret
Abstract:
We investigate several distribution free dependence detection procedures, mainly based on bootstrap principles and their approximation properties. Thanks to this study, we introduce a new distribution free Unitary Events (UE) method, named Permutation UE, which consists of a multiple testing procedure based on permutation and delayed coincidence count. Each single test involved in this procedure achieves the prescribed level, so that the corresponding multiple testing procedure controls the False Discovery Rate (FDR), and this with as few assumptions as possible on the underlying distribution. Some simulations show that this method outperforms the trial-shuffling and the MTGAUE method in terms of single levels and FDR, for a comparable amount of false negatives. Application on real data is also provided.
Submitted 22 May, 2015;
originally announced May 2015.
-
Bootstrap and permutation tests of independence for point processes
Authors:
Mélisande Albert,
Yann Bouret,
Magalie Fromont,
Patricia Reynaud-Bouret
Abstract:
Motivated by a neuroscience question about synchrony detection in spike train analysis, we deal with the independence testing problem for point processes. We introduce non-parametric test statistics, which are rescaled general $U$-statistics, whose corresponding critical values are constructed from bootstrap and randomization/permutation approaches, making as few assumptions as possible on the underlying distribution of the point processes. We derive general consistency results for the bootstrap and for the permutation w.r.t. Wasserstein's metric, which induce weak convergence as well as convergence of second order moments. The obtained bootstrap or permutation independence tests are thus proved to be asymptotically of the prescribed size, and to be consistent against any reasonable alternative. A simulation study is performed to illustrate the derived theoretical results, and to compare the performance of our new tests with existing ones in the neuroscientific literature.
Submitted 27 May, 2015; v1 submitted 6 June, 2014;
originally announced June 2014.
-
Lasso and probabilistic inequalities for multivariate point processes
Authors:
Niels Richard Hansen,
Patricia Reynaud-Bouret,
Vincent Rivoirard
Abstract:
Due to its low computational cost, Lasso is an attractive regularization method for high-dimensional statistical settings. In this paper, we consider multivariate counting processes depending on an unknown function parameter to be estimated by linear combinations of a fixed dictionary. To select coefficients, we propose an adaptive $\ell_1$-penalization methodology, where data-driven weights of the penalty are derived from new Bernstein type inequalities for martingales. Oracle inequalities are established under assumptions on the Gram matrix of the dictionary. Nonasymptotic probabilistic results for multivariate Hawkes processes are proven, which allows us to check these assumptions by considering general dictionaries based on histograms, Fourier or wavelet bases. Motivated by problems of neuronal activity inference, we finally carry out a simulation study for multivariate Hawkes processes and compare our methodology with the adaptive Lasso procedure proposed by Zou in (J. Amer. Statist. Assoc. 101 (2006) 1418-1429). We observe an excellent behavior of our procedure. We rely on theoretical aspects for the essential question of tuning our methodology. Unlike adaptive Lasso of (J. Amer. Statist. Assoc. 101 (2006) 1418-1429), our tuning procedure is proven to be robust with respect to all the parameters of the problem, revealing its potential for concrete purposes, in particular in neuroscience.
Submitted 7 April, 2015; v1 submitted 2 August, 2012;
originally announced August 2012.
-
The two-sample problem for Poisson processes: adaptive tests with a non-asymptotic wild bootstrap approach
Authors:
Magalie Fromont,
Béatrice Laurent,
Patricia Reynaud-Bouret
Abstract:
Considering two independent Poisson processes, we address the question of testing equality of their respective intensities. We first propose single tests whose test statistics are U-statistics based on general kernel functions. The corresponding critical values are constructed from a non-asymptotic wild bootstrap approach, leading to level α tests. Various choices for the kernel functions are possible, including projection, approximation or reproducing kernels. In this last case, we obtain a parametric rate of testing for a weak metric defined in the RKHS associated with the considered reproducing kernel. Then we introduce, in the other cases, an aggregation procedure, which allows us to import ideas coming from model selection, thresholding and/or approximation kernels adaptive estimation. The resulting multiple tests are proved to be of level α, and to satisfy non-asymptotic oracle type conditions for the classical L2-norm. From these conditions, we deduce that they are adaptive in the minimax sense over a large variety of classes of alternatives based on classical and weak Besov bodies in the univariate case, but also Sobolev and anisotropic Nikol'skii-Besov balls in the multivariate case.
Submitted 13 November, 2012; v1 submitted 15 March, 2012;
originally announced March 2012.
-
Nonparametric estimation of the division rate of a size-structured population
Authors:
Marie Doumic Jauffret,
Marc Hoffmann,
Patricia Reynaud-Bouret,
Vincent Rivoirard
Abstract:
We consider the problem of estimating the division rate of a size-structured population in a nonparametric setting. The size of the system evolves according to a transport-fragmentation equation: each individual grows with a given transport rate, and splits into two offspring of the same size, following a binary fragmentation process with unknown division rate that depends on its size. In contrast to a deterministic inverse problem approach, as in (Perthame, Zubelli, 2007) and (Doumic, Perthame, Zubelli, 2009), we take in this paper the perspective of statistical inference: our data consist of a large sample of the size of individuals, when the evolution of the system is close to its time-asymptotic behavior, so that it can be related to the eigenproblem of the considered transport-fragmentation equation (see \cite{PR} for instance). By estimating statistically each term of the eigenvalue problem and by suitably inverting a certain linear operator (see previously quoted articles), we are able to construct a more realistic estimator of the division rate that achieves the same optimal error bound as in related deterministic inverse problems. Our procedure relies on kernel methods with automatic bandwidth selection. It is inspired by model selection and recent results of Goldenshluger and Lepski.
Submitted 22 March, 2011;
originally announced March 2011.
-
Adaptive density estimation: a curse of support?
Authors:
Patricia Reynaud-Bouret,
Vincent Rivoirard,
Christine Tuleau-Malot
Abstract:
This paper deals with the classical problem of density estimation on the real line. Most of the existing papers devoted to minimax properties assume that the support of the underlying density is bounded and known. But this assumption may be very difficult to handle in practice. In this work, we show that, exactly as a curse of dimensionality exists when the data lie in $\mathbb{R}^d$, there exists a curse of support as well when the support of the density is infinite. As for the dimensionality problem where the rates of convergence deteriorate when the dimension grows, the minimax rates of convergence may deteriorate as well when the support becomes infinite. This problem is not purely theoretical since the simulations show that the support-dependent methods are really affected in practice by the size of the density support, or by the weight of the density tail. We propose a method based on a biorthogonal wavelet thresholding rule that is adaptive with respect to the nature of the support and the regularity of the signal, but that is also robust in practice to this curse of support. The threshold proposed here is very accurately calibrated so that the gap between optimal theoretical and practical tuning parameters is almost filled.
Submitted 10 July, 2009;
originally announced July 2009.
-
Adaptive tests of homogeneity for a Poisson process
Authors:
M. Fromont,
B. Laurent,
P. Reynaud-Bouret
Abstract:
We propose to test the homogeneity of a Poisson process observed on a finite interval. In this framework, we first provide lower bounds for the uniform separation rates in $\mathbb{L}^2$ norm over classical Besov bodies and weak Besov bodies. Surprisingly, the obtained lower bounds over weak Besov bodies coincide with the minimax estimation rates over such classes. Then we construct nonasymptotic and nonparametric testing procedures that are adaptive in the sense that they achieve, up to a possible logarithmic factor, the optimal uniform separation rates over various Besov bodies simultaneously. These procedures are based on model selection and thresholding methods. We finally complete our theoretical study with a Monte Carlo evaluation of the power of our tests under various alternatives.
Submitted 7 May, 2009;
originally announced May 2009.
-
Calibration of thresholding rules for Poisson intensity estimation
Authors:
Patricia Reynaud-Bouret,
Vincent Rivoirard
Abstract:
In this paper, we deal with the problem of calibrating thresholding rules in the setting of Poisson intensity estimation. By using sharp concentration inequalities, oracle inequalities are derived and we establish the optimality of our estimate up to a logarithmic term. This result is proved under mild assumptions and we do not impose any condition on the support of the signal to be estimated. Our procedure is based on data-driven thresholds. As usual, they depend on a threshold parameter $\gamma$ whose optimal value is hard to estimate from the data. Our main concern is to provide some theoretical and numerical results to handle this issue. In particular, we establish the existence of a minimal threshold parameter from the theoretical point of view: taking $\gamma<1$ deteriorates oracle performances of our procedure. In the same spirit, we establish the existence of a maximal threshold parameter and our theoretical results point out the optimal range $\gamma\in[1,12]$. Then, we conduct a numerical study showing that choosing $\gamma$ larger than 1 but close to 1 is a fairly good choice. Finally, we compare our procedure with classical ones, revealing the harmful role of the support of functions when estimated by classical procedures.
Submitted 7 April, 2009;
originally announced April 2009.
-
Adaptive estimation for Hawkes processes; application to genome analysis
Authors:
Patricia Reynaud-Bouret,
Sophie Schbath
Abstract:
The aim of this paper is to provide a new method for the detection of either favored or avoided distances between genomic events along DNA sequences. These events are modeled by a Hawkes process. The biological problem is actually complex enough to need a nonasymptotic penalized model selection approach. We provide a theoretical penalty that satisfies an oracle inequality even for quite complex families of models. The resulting theoretical estimator is shown to be adaptive minimax for Hölderian functions with regularity in $(1/2,1]$: those aspects have not yet been studied for the Hawkes process. Moreover, we introduce an efficient strategy, named Islands, which is not classically used in model selection, but that happens to be particularly relevant to the biological question we want to answer. Since a multiplicative constant in the theoretical penalty is not computable in practice, we provide extensive simulations to find a data-driven calibration of this constant. The results obtained on real genomic data are consistent with biological knowledge and possibly refine it.
Submitted 10 November, 2010; v1 submitted 17 March, 2009;
originally announced March 2009.
-
Near optimal thresholding estimation of a Poisson intensity on the real line
Authors:
Patricia Reynaud-Bouret,
Vincent Rivoirard
Abstract:
The purpose of this paper is to estimate the intensity of a Poisson process $N$ by using thresholding rules. In this paper, the intensity, defined as the derivative of the mean measure of $N$ with respect to $ndx$ where $n$ is a fixed parameter, is assumed to be non-compactly supported. The estimator $\tilde{f}_{n,\gamma}$ based on random thresholds is proved to achieve the same performance as the oracle estimator up to a possible logarithmic term. Then, minimax properties of $\tilde{f}_{n,\gamma}$ on Besov spaces ${\cal B}^{\alpha}_{p,q}$ are established. Under mild assumptions, we prove that $$\sup_{f\in {\cal B}^{\alpha}_{p,q}\cap \mathbb{L}_{\infty}} \mathbb{E}\left(\|\tilde{f}_{n,\gamma}-f\|_2^2\right)\leq C\left(\frac{\log n}{n}\right)^{\frac{\alpha}{\alpha+\frac{1}{2}+\left(\frac{1}{2}-\frac{1}{p}\right)_+}}$$ and the lower bound of the minimax risk for ${\cal B}^{\alpha}_{p,q}\cap \mathbb{L}_{\infty}$ coincides with the previous upper bound up to the logarithmic term. This new result has two consequences. First, it establishes that the minimax rate of Besov spaces ${\cal B}^{\alpha}_{p,q}$ with $p\leq 2$ when non-compactly supported functions are considered is the same as for compactly supported functions up to a logarithmic term. When $p>2$, the rate exponent, which depends on $p$, deteriorates when $p$ increases, which means that the support plays a harmful role in this case. Furthermore, $\tilde{f}_{n,\gamma}$ is adaptive minimax up to a logarithmic term.
Submitted 29 October, 2008;
originally announced October 2008.
-
Adaptive thresholding estimation of a Poisson intensity with infinite support
Authors:
Patricia Reynaud-Bouret,
Vincent Rivoirard
Abstract:
The purpose of this paper is to estimate the intensity of a Poisson process $N$ by using thresholding rules. In this paper, the intensity, defined as the derivative of the mean measure of $N$ with respect to $ndx$ where $n$ is a fixed parameter, is assumed to be non-compactly supported. The estimator $\tilde{f}_{n,\gamma}$ based on random thresholds is proved to achieve the same performance as the oracle estimator up to a logarithmic term. Oracle inequalities allow us to derive the maxiset of $\tilde{f}_{n,\gamma}$. Then, minimax properties of $\tilde{f}_{n,\gamma}$ are established. We first prove that the rate of this estimator on Besov spaces ${\cal B}^{\alpha}_{p,q}$ when $p\leq 2$ is $(\ln(n)/n)^{\alpha/(1+2\alpha)}$. This result has two consequences. First, it establishes that the minimax rate of Besov spaces ${\cal B}^{\alpha}_{p,q}$ with $p\leq 2$ when non-compactly supported functions are considered is the same as for compactly supported functions up to a logarithmic term. This result is new. Furthermore, $\tilde{f}_{n,\gamma}$ is adaptive minimax up to a logarithmic term. When $p>2$, the situation changes dramatically and the rate of $\tilde{f}_{n,\gamma}$ on Besov spaces ${\cal B}^{\alpha}_{p,q}$ is worse than $(\ln(n)/n)^{\alpha/(1+2\alpha)}$. Finally, the random threshold depends on a parameter $\gamma$ that has to be suitably chosen in practice. Some theoretical results provide upper and lower bounds of $\gamma$ to obtain satisfactory oracle inequalities. Simulations reinforce these results.
Submitted 21 January, 2008;
originally announced January 2008.
-
Concentration for norms of infinitely divisible vectors with independent components
Authors:
Christian Houdré,
Philippe Marchal,
Patricia Reynaud-Bouret
Abstract:
We obtain dimension-free concentration inequalities for $\ell^p$-norms, $p\geq2$, of infinitely divisible random vectors with independent coordinates and finite exponential moments. Besides such norms, the methods and results extend to some other classes of Lipschitz functions.
Submitted 14 November, 2008; v1 submitted 3 July, 2006;
originally announced July 2006.
-
Concentration for Infinitely Divisible Vectors with Independent Components
Authors:
C. Houdré,
P. Reynaud-Bouret
Abstract:
For various classes of Lipschitz functions we provide dimension free concentration inequalities for infinitely divisible random vectors with independent components and finite exponential moments.
Submitted 29 June, 2006;
originally announced June 2006.