Showing 1–19 of 19 results for author: De Castro, Y

Searching in archive cs.
  1. arXiv:2406.18397  [pdf, other]

    math.ST cs.LG math.DG math.PR stat.ML

    Second Maximum of a Gaussian Random Field and Exact (t-)Spacing test

    Authors: Jean-Marc Azaïs, Federico Dalmao, Yohann De Castro

    Abstract: In this article, we introduce the novel concept of the second maximum of a Gaussian random field on a Riemannian submanifold. This second maximum serves as a powerful tool for characterizing the distribution of the maximum. By utilizing an ad-hoc Kac Rice formula, we derive the explicit form of the maximum's distribution, conditioned on the second maximum and some regressed component of the Rieman…

    Submitted 5 July, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

    Comments: 5 figures, 22 pages main document, 2 pages supplements

    MSC Class: Primary 62E15; 62F03; 60G15; 62H10; 62H15; secondary 60E05; 60G10; 62J05; 94A08

  2. arXiv:2212.12542  [pdf, ps, other]

    q-bio.GN cs.LG stat.ML

    Neural Networks beyond explainability: Selective inference for sequence motifs

    Authors: Antoine Villié, Philippe Veber, Yohann de Castro, Laurent Jacob

    Abstract: Over the past decade, neural networks have been successful at making predictions from biological sequences, especially in the context of regulatory genomics. As in other fields of deep learning, tools have been devised to extract features such as sequence motifs that can explain the predictions made by a trained network. Here we intend to go beyond explainable machine learning and introduce SEISM,…

    Submitted 23 December, 2022; originally announced December 2022.

  3. arXiv:2203.15351  [pdf, other]

    cs.SI math.ST

    Random Geometric Graph: Some recent developments and perspectives

    Authors: Quentin Duchemin, Yohann de Castro

    Abstract: The Random Geometric Graph (RGG) is a random graph model for network data with an underlying spatial representation. Geometry endows RGGs with a rich dependence structure and often leads to desirable properties of real-world networks such as the small-world phenomenon and clustering. Originally introduced to model wireless communication networks, RGGs are now very popular with applications ranging…

    Submitted 24 August, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

    Comments: This is a research report that is part of a Chapter of a PhD thesis. An updated version will be available soon

  4. arXiv:2102.05314  [pdf, other]

    cs.LG math.ST stat.ML

    Forecasting Nonnegative Time Series via Sliding Mask Method (SMM) and Latent Clustered Forecast (LCF)

    Authors: Yohann de Castro, Luca Mencarelli

    Abstract: We consider a nonnegative time series forecasting framework. Based on recent advances in Nonnegative Matrix Factorization (NMF) and Archetypal Analysis, we introduce two procedures referred to as Sliding Mask Method (SMM) and Latent Clustered Forecast (LCF). SMM is a simple and powerful method based on time window prediction using Completion of Nonnegative Matrices. This new procedure combines low n…

    Submitted 10 February, 2021; originally announced February 2021.

  5. arXiv:2006.07001  [pdf, other]

    cs.LG cs.SI math.ST stat.ML

    Markov Random Geometric Graph (MRGG): A Growth Model for Temporal Dynamic Networks

    Authors: Quentin Duchemin, Yohann de Castro

    Abstract: We introduce Markov Random Geometric Graphs (MRGGs), a growth model for temporal dynamic networks. It is based on a Markovian latent space dynamic: consecutive latent points are sampled on the Euclidean Sphere using an unknown Markov kernel; and two nodes are connected with a probability depending on an unknown function of their latent geodesic distance. More precisely, at each stamp-time $k$ we ad…

    Submitted 9 March, 2022; v1 submitted 12 June, 2020; originally announced June 2020.

    Journal ref: Electronic Journal of Statistics, Shaker Heights, OH: Institute of Mathematical Statistics, 2022, 16 (1), pp. 671-699

  6. arXiv:1909.06841  [pdf, other]

    stat.ML cs.LG

    Latent Distance Estimation for Random Geometric Graphs

    Authors: Ernesto Araya, Yohann De Castro

    Abstract: Random geometric graphs are a popular choice for a latent points generative model for networks. Their definition is based on a sample of $n$ points $X_1,X_2,\cdots,X_n$ on the Euclidean sphere $\mathbb{S}^{d-1}$ which represents the latent positions of nodes of the network. The connection probabilities between the nodes are determined by an unknown function (referred to as the "link" function) eva…

    Submitted 15 September, 2019; originally announced September 2019.

  7. arXiv:1907.10592  [pdf, other]

    math.ST cs.LG stat.ML

    SuperMix: Sparse Regularization for Mixtures

    Authors: Yohann de Castro, Sébastien Gadat, Clément Marteau, Cathy Maugis

    Abstract: This paper investigates the statistical estimation of a discrete mixing measure $\mu_0$ involved in a kernel mixture model. Using some recent advances in $\ell_1$-regularization over the space of measures, we introduce a "data fitting and regularization" convex program for estimating $\mu_0$ in a grid-less manner from a sample of mixture law; this method is referred to as Beurling-LASSO. Our contribution is t…

    Submitted 18 June, 2020; v1 submitted 23 July, 2019; originally announced July 2019.

  8. arXiv:1906.12072  [pdf, other]

    math.ST cs.IT stat.ME stat.ML

    Multiple Testing and Variable Selection along the path of the Least Angle Regression

    Authors: J.-M. Azaïs, Y. De Castro

    Abstract: We investigate multiple testing and variable selection using the Least Angle Regression (LARS) algorithm in high dimensions under the assumption of Gaussian noise. LARS is known to produce a piecewise affine solution path with change points referred to as the knots of the LARS path. The key to our results is an expression in closed form of the exact joint law of a $K$-tuple of knots conditional on…

    Submitted 4 May, 2022; v1 submitted 28 June, 2019; originally announced June 2019.

    Comments: FINAL version (the paper has improved and we advise you to disregard the previous versions); NEW: link with the Polyhedral lemma is now explicit and the conditional law of the estimation of the variance is given

    MSC Class: 62E15; 62F03; 60G15; 62H10; 62H15

    Journal ref: Information and Inference: A journal of the IMA (2022)

  9. arXiv:1812.04355  [pdf, other]

    math.OC cs.IT

    Convex Regularization and Representer Theorems

    Authors: Claire Boyer, Antonin Chambolle, Yohann de Castro, Vincent Duval, Frédéric de Gournay, Pierre Weiss

    Abstract: We establish a result which states that regularizing an inverse problem with the gauge of a convex set $C$ yields solutions which are linear combinations of a few extreme points or elements of the extreme rays of $C$. These can be understood as the atoms of the regularizer. We then make that general principle explicit using a few popular applications. In particular, we relate it to the comm…

    Submitted 11 December, 2018; originally announced December 2018.

    Comments: in Proceedings of iTWIST'18, Paper-ID: 30, Marseille, France, November, 21-23, 2018

    MSC Class: 15A29

  10. arXiv:1806.09810  [pdf, other]

    math.OC cs.IT

    On Representer Theorems and Convex Regularization

    Authors: Claire Boyer, Antonin Chambolle, Yohann De Castro, Vincent Duval, Frédéric De Gournay, Pierre Weiss

    Abstract: We establish a general principle which states that regularizing an inverse problem with a convex function yields solutions which are convex combinations of a small number of atoms. These atoms are identified with the extreme points and elements of the extreme rays of the regularizer level sets. An extension to a broader class of quasi-convex regularizers is also discussed. As a side result, we cha…

    Submitted 26 November, 2018; v1 submitted 26 June, 2018; originally announced June 2018.

  11. arXiv:1706.04059  [pdf, other]

    math.ST cs.IT math.NA stat.CO stat.ME

    Approximate Optimal Designs for Multivariate Polynomial Regression

    Authors: Yohann De Castro, Fabrice Gamboa, Didier Henrion, Roxana Hess, Jean-Bernard Lasserre

    Abstract: We introduce a new approach aiming at computing approximate optimal designs for multivariate polynomial regressions on compact (semi-algebraic) design spaces. We use the moment-sum-of-squares hierarchy of semidefinite programming problems to solve numerically the approximate optimal design problem. The geometry of the design is recovered via semidefinite programming duality theory. This article sh…

    Submitted 25 October, 2017; v1 submitted 9 June, 2017; originally announced June 2017.

    Comments: 30 Pages, 8 Figures. arXiv admin note: substantial text overlap with arXiv:1703.01777

    MSC Class: 62K05; 90C25 (Primary) 41A10; 49M29; 90C90; 15A15 (secondary)

  12. arXiv:1706.00679  [pdf, other]

    math.ST cs.IT math.PR

    Testing Gaussian Process with Applications to Super-Resolution

    Authors: Jean-Marc Azaïs, Yohann De Castro, Stéphane Mourareau

    Abstract: This article introduces exact testing procedures on the mean of a Gaussian process $X$ derived from the outcomes of $\ell_1$-minimization over the space of complex valued measures. The process $X$ can be thought of as the sum of two terms: first, the convolution between some kernel and a target atomic measure (mean of the process); second, a random perturbation by an additive centered Gaussian proces…

    Submitted 2 July, 2018; v1 submitted 2 June, 2017; originally announced June 2017.

    Comments: Final version, 6 figures, Python code and Jupyter notebook available at https://github.com/ydecastro/super-resolution-testing

    MSC Class: 62E15; 62F03; 60G15; 62H10; 62H15 (Primary) 60E05; 60G10; 62J05; 94A08 (secondary)

  13. arXiv:1606.04760  [pdf, other]

    cs.IT math.OC math.ST

    Adapting to unknown noise level in sparse deconvolution

    Authors: Claire Boyer, Yohann De Castro, Joseph Salmon

    Abstract: In this paper, we study sparse spike deconvolution over the space of complex-valued measures when the input measure is a finite sum of Dirac masses. We introduce a modified version of the Beurling Lasso (BLasso), a semi-definite program that we refer to as the Concomitant Beurling Lasso (CBLasso). This new procedure estimates the target measure and the unknown noise level simultaneously. Contrary…

    Submitted 19 October, 2016; v1 submitted 15 June, 2016; originally announced June 2016.

  14. arXiv:1604.01171  [pdf, other]

    math.ST cs.IT math.PR stat.ML

    Sparse Recovery from Extreme Eigenvalues Deviation Inequalities

    Authors: Sandrine Dallaporta, Yohann De Castro

    Abstract: This article provides a new toolbox to derive sparse recovery guarantees from small deviations on extreme singular values or extreme eigenvalues obtained in Random Matrix Theory. This work is based on Restricted Isometry Constants (RICs) which are a pivotal notion in Compressed Sensing and High-Dimensional Statistics as these constants finely assess how a linear operator is conditioned on the set…

    Submitted 14 November, 2018; v1 submitted 5 April, 2016; originally announced April 2016.

    Comments: 33 pages, 1 figure, final version

  15. arXiv:1603.08113  [pdf, other]

    math.ST cs.IT stat.ME stat.ML

    Reconstructing undirected graphs from eigenspaces

    Authors: Yohann De Castro, Thibault Espinasse, Paul Rochet

    Abstract: In this paper, we aim at recovering an undirected weighted graph of $N$ vertices from the knowledge of a perturbed version of the eigenspaces of its adjacency matrix $W$. For instance, this situation arises for stationary signals on graphs or for Markov chains observed at random times. Our approach is based on minimizing a cost function given by the Frobenius norm of the commutator…

    Submitted 15 March, 2017; v1 submitted 26 March, 2016; originally announced March 2016.

    Comments: 25 pages, some figures. Final version

  16. arXiv:1502.02436  [pdf, other]

    cs.IT math.OC stat.CO

    Exact solutions to Super Resolution on semi-algebraic domains in higher dimensions

    Authors: Y De Castro, F Gamboa, D Henrion, J.-B Lasserre

    Abstract: We investigate the multi-dimensional Super Resolution problem on closed semi-algebraic domains for various sampling schemes such as Fourier or moments. We present a new semidefinite programming (SDP) formulation of the $\ell_1$-minimization in the space of Radon measures in the multi-dimensional frame on semi-algebraic sets. While standard approaches have focused on SDP relaxations of the dual program (…

    Submitted 9 February, 2015; originally announced February 2015.

  17. arXiv:1108.5533  [pdf, ps, other]

    math.ST cs.IT math.FA

    A Remark on the Lasso and the Dantzig Selector

    Authors: Yohann de Castro

    Abstract: This article investigates a new parameter for the high-dimensional regression with noise: the distortion. The latter has attracted a lot of attention recently with the appearance of new deterministic constructions of 'almost'-Euclidean sections of the L1-ball. It measures how far the intersection between the kernel of the design matrix and the unit L1-ball is from an L2-ball. We show that the dis…

    Submitted 28 September, 2012; v1 submitted 29 August, 2011; originally announced August 2011.

    Comments: Final Version. This article was written mostly during his Ph.D. at the Institut de Mathématiques de Toulouse (IMT)

  18. arXiv:1103.4951  [pdf, ps, other]

    math.ST cs.IT math.OC math.PR

    Exact Reconstruction using Beurling Minimal Extrapolation

    Authors: Yohann de Castro, Fabrice Gamboa

    Abstract: We show that measures with finite support on the real line are the unique solution to an algorithm, named generalized minimal extrapolation, involving only a finite number of generalized moments (which encompass the standard moments, the Laplace transform, the Stieltjes transformation, etc). Generalized minimal extrapolation shares related geometric properties with basis pursuit of Chen, Donoho an…

    Submitted 5 April, 2012; v1 submitted 25 March, 2011; originally announced March 2011.

    Comments: 27 pages, 3 figures. Version 2: minor changes and new title

  19. arXiv:1010.2457  [pdf, ps, other]

    math.ST cs.IT math.PR stat.ME stat.ML

    Optimal designs for Lasso and Dantzig selector using Expander Codes

    Authors: Yohann de Castro

    Abstract: We investigate the high-dimensional regression problem using adjacency matrices of unbalanced expander graphs. In this frame, we prove that the $\ell_{2}$-prediction error and the $\ell_{1}$-risk of the lasso and the Dantzig selector are optimal up to an explicit multiplicative constant. Thus we can estimate a high-dimensional target vector with an error term similar to the one obtained in a situa…

    Submitted 22 July, 2014; v1 submitted 12 October, 2010; originally announced October 2010.

    Comments: Last version with optimal bounds

    MSC Class: 62G05; 62J05; 62J12