Showing 1–16 of 16 results for author: Levrard, C

  1. arXiv:2406.06168  [pdf, other]

    math.ST stat.ML

    Topological Analysis for Detecting Anomalies (TADA) in Time Series

    Authors: Frédéric Chazal, Martin Royer, Clément Levrard

    Abstract: This paper introduces a new methodology based on the field of Topological Data Analysis for detecting anomalies in multivariate time series, which aims to detect global changes in the dependency structure between channels. The proposed approach is lean enough to handle large-scale datasets, and extensive numerical experiments back the intuition that it is more suitable for detecting global changes of…

    Submitted 10 June, 2024; originally announced June 2024.

  2. arXiv:2311.18613  [pdf, ps, other]

    math.ST

    Wasserstein GANs are Minimax Optimal Distribution Estimators

    Authors: Arthur Stéphanovitch, Eddie Aamari, Clément Levrard

    Abstract: We provide non-asymptotic rates of convergence of the Wasserstein Generative Adversarial Networks (WGAN) estimator. We build neural network classes representing the generators and discriminators which yield a GAN that achieves the minimax optimal rate for estimating a certain probability measure $μ$ with support in $\mathbb{R}^p$. The probability $μ$ is considered to be the push forward of the Le…

    Submitted 30 November, 2023; originally announced November 2023.

  3. arXiv:2303.08456  [pdf, other]

    cs.CG math.ST stat.ML

    Statistical learning on measures: an application to persistence diagrams

    Authors: Olympio Hacquard, Gilles Blanchard, Clément Levrard

    Abstract: We consider a binary supervised learning classification problem where instead of having data in a finite-dimensional Euclidean space, we observe measures on a compact space $\mathcal{X}$. Formally, we observe data $D_N = (μ_1, Y_1), \ldots, (μ_N, Y_N)$ where $μ_i$ is a measure on $\mathcal{X}$ and $Y_i$ is a label in $\{0, 1\}$. Given a set $\mathcal{F}$ of base-classifiers on $\mathcal{X}$, we bu…

    Submitted 31 May, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

  4. arXiv:2207.06074  [pdf, other]

    math.ST math.MG

    Optimal Reach Estimation and Metric Learning

    Authors: Eddie Aamari, Clément Berenfeld, Clément Levrard

    Abstract: We study the estimation of the reach, a ubiquitous regularity parameter in manifold estimation and geometric data analysis. Given an i.i.d. sample over an unknown $d$-dimensional $\mathcal{C}^k$-smooth submanifold of $\mathbb{R}^D$, we provide optimal nonasymptotic bounds for the estimation of its reach. We build upon a formulation of the reach in terms of maximal curvature on the one hand, and geode…

    Submitted 13 July, 2022; originally announced July 2022.

    MSC Class: 62G05; 62C20; 68U05

  5. arXiv:2110.13749  [pdf, other]

    cs.LG math.ST

    Topologically penalized regression on manifolds

    Authors: Olympio Hacquard, Krishnakumar Balasubramanian, Gilles Blanchard, Clément Levrard, Wolfgang Polonik

    Abstract: We study a regression problem on a compact manifold M. In order to take advantage of the underlying geometry and topology of the data, the regression task is performed on the basis of the first several eigenfunctions of the Laplace-Beltrami operator of the manifold, that are regularized with topological penalties. The proposed penalties are based on the topology of the sub-level sets of either the…

    Submitted 10 June, 2022; v1 submitted 26 October, 2021; originally announced October 2021.

    Journal ref: JMLR, 2022

  6. arXiv:2108.03135  [pdf, other]

    math.ST

    Minimax Boundary Estimation and Estimation with Boundary

    Authors: Eddie Aamari, Catherine Aaron, Clément Levrard

    Abstract: We derive non-asymptotic minimax bounds for the Hausdorff estimation of $d$-dimensional submanifolds $M \subset \mathbb{R}^D$ with (possibly) non-empty boundary $\partial M$. The model reunites and extends the most prevalent $\mathcal{C}^2$-type set estimation models: manifolds without boundary, and full-dimensional domains. We consider both the estimation of the manifold $M$ itself and that of it…

    Submitted 10 March, 2023; v1 submitted 6 August, 2021; originally announced August 2021.

  7. arXiv:2002.01216  [pdf, other]

    math.ST

    Optimal quantization of the mean measure and applications to statistical learning

    Authors: Frédéric Chazal, Clément Levrard, Martin Royer

    Abstract: This paper addresses the case where data come as point sets, or more generally as discrete measures. Our motivation is twofold: first we intend to approximate with a compactly supported measure the mean of the measure generating process, that coincides with the intensity measure in the point process framework, or with the expected persistence diagram in the framework of persistence-based topologic…

    Submitted 18 March, 2021; v1 submitted 4 February, 2020; originally announced February 2020.

  8. arXiv:1812.04356  [pdf, other]

    math.ST stat.ML

    Robust Bregman Clustering

    Authors: Aurélie Fischer, Clément Levrard, Claire Brécheteau

    Abstract: Using a trimming approach, we investigate a k-means type method based on Bregman divergences for clustering data possibly corrupted with clutter noise. The main interest of Bregman divergences is that the standard Lloyd algorithm adapts to these distortion measures, and they are well-suited for clustering data sampled according to mixture models from exponential families. We prove that there exist…

    Submitted 9 September, 2020; v1 submitted 11 December, 2018; originally announced December 2018.

    Comments: Annals of Statistics, Institute of Mathematical Statistics, In press

  9. arXiv:1801.10346  [pdf, other]

    math.ST cs.CG

    The k-PDTM : a coreset for robust geometric inference

    Authors: Claire Brécheteau, Clément Levrard

    Abstract: Analyzing the sub-level sets of the distance to a compact sub-manifold of $\mathbb{R}^d$ is a common method in TDA to understand its topology. The distance to measure (DTM) was introduced by Chazal, Cohen-Steiner and Mérigot in [7] to face the non-robustness of the distance to a compact set to noise and outliers. This function makes possible the inference of the topology of a compact subset of $\mathbb{R}^d$ from a n…

    Submitted 31 January, 2018; originally announced January 2018.

  10. arXiv:1801.03742  [pdf, ps, other]

    math.ST

    Quantization/clustering: when and why does k-means work?

    Authors: Clément Levrard

    Abstract: Though mostly used as a clustering algorithm, k-means was originally designed as a quantization algorithm. Namely, it aims at providing a compression of a probability distribution with k points. Building upon [21, 33], we try to investigate how and when these two approaches are compatible. Namely, we show that provided the sample distribution satisfies a margin-like condition (in the sense of [27]…

    Submitted 30 January, 2018; v1 submitted 11 January, 2018; originally announced January 2018.

  11. arXiv:1705.00989  [pdf, other]

    math.ST

    Non-Asymptotic Rates for Manifold, Tangent Space, and Curvature Estimation

    Authors: Eddie Aamari, Clément Levrard

    Abstract: Given an $n$-sample drawn on a submanifold $M \subset \mathbb{R}^D$, we derive optimal rates for the estimation of tangent spaces $T_X M$, the second fundamental form $II_X^M$, and the submanifold $M$. After motivating their study, we introduce a quantitative class of $\mathcal{C}^k$-submanifolds in analogy with Hölder classes. The proposed estimators are based on local polynomials and allow to…

    Submitted 5 February, 2018; v1 submitted 2 May, 2017; originally announced May 2017.

  12. arXiv:1512.02857  [pdf, other]

    math.ST

    Stability and Minimax Optimality of Tangential Delaunay Complexes for Manifold Reconstruction

    Authors: Eddie Aamari, Clément Levrard

    Abstract: We consider the problem of optimality in manifold reconstruction. A random sample $\mathbb{X}_n = \left\{X_1,\ldots,X_n\right\}\subset \mathbb{R}^D$ composed of points close to a $d$-dimensional submanifold $M$, with or without outliers drawn in the ambient space, is observed. Based on the Tangential Delaunay Complex, we construct an estimator $\hat{M}$ that is ambient isotopic and Hausdorff-close…

    Submitted 31 January, 2018; v1 submitted 9 December, 2015; originally announced December 2015.

    ACM Class: I.3.5; G.1.2; G.3

  13. arXiv:1406.3334  [pdf, ps, other]

    math.ST

    Sparse Oracle Inequalities for Variable Selection via Regularized Quantization

    Authors: Clément Levrard

    Abstract: We give oracle inequalities on procedures that combine quantization and variable selection via a weighted Lasso $k$-means type algorithm. The results are derived for a general family of weights, which can be tuned to size the influence of the variables in different ways. Moreover, these theoretical guarantees are proved to adapt to the corresponding sparsity of the optimal codebooks, if appropriat…

    Submitted 6 July, 2016; v1 submitted 12 June, 2014; originally announced June 2014.

  14. Nonasymptotic bounds for vector quantization in Hilbert spaces

    Authors: Clément Levrard

    Abstract: Recent results in quantization theory show that the mean-squared expected distortion can reach a rate of convergence of $\mathcal{O}(1/n)$, where $n$ is the sample size [see, e.g., IEEE Trans. Inform. Theory 60 (2014) 7279-7292 or Electron. J. Stat. 7 (2013) 1716-1746]. This rate is attained for the empirical risk minimizer strategy, if the source distribution satisfies some regularity conditions.…

    Submitted 1 April, 2015; v1 submitted 26 May, 2014; originally announced May 2014.

    Comments: Published at http://dx.doi.org/10.1214/14-AOS1293 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org). arXiv admin note: substantial text overlap with arXiv:1310.7138

    Report number: IMS-AOS-AOS1293

    Journal ref: Annals of Statistics 2015, Vol. 43, No. 2, 592-619

  15. arXiv:1310.7138  [pdf, ps, other]

    math.ST

    Margin conditions for vector quantization

    Authors: Clément Levrard

    Abstract: In this report, oracle inequalities on the excess risk of the empirical risk minimizer in the quantization framework are derived. These inequalities are based on conditions which may be thought of as margin-type conditions, such as those derived in the statistical learning framework. Furthermore, these inequalities derive from innovative chaining techniques and their use for Dudley's entropy integral.

    Submitted 25 April, 2014; v1 submitted 26 October, 2013; originally announced October 2013.

    Comments: 43 pages

  16. arXiv:1201.6052  [pdf, ps, other]

    math.ST

    Fast rates for empirical vector quantization

    Authors: Clément Levrard

    Abstract: We consider the rate of convergence of the expected loss of empirically optimal vector quantizers. Earlier results show that the mean-squared expected distortion for any fixed distribution supported on a bounded set and satisfying some regularity conditions decreases at the rate O(log n/n). We prove that this rate is actually O(1/n). Although these conditions are hard to check, we show that well-p…

    Submitted 29 January, 2012; originally announced January 2012.

    Comments: 18 pages