Showing 1–20 of 20 results for author: Polonik, W

  1. arXiv:2403.09960  [pdf, other]

    math.ST math.PR stat.ML

    Multivariate Gaussian Approximation for Random Forest via Region-based Stabilization

    Authors: Zhaoyang Shi, Chinmoy Bhattacharjee, Krishnakumar Balasubramanian, Wolfgang Polonik

    Abstract: We derive Gaussian approximation bounds for random forest predictions based on a set of training points given by a Poisson process, under fairly mild regularity assumptions on the data generating process. Our approach is based on the key observation that the random forest predictions satisfy a certain geometric property called region-based stabilization. In the process of developing our results fo…

    Submitted 25 March, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

  2. arXiv:2402.14985  [pdf, other]

    math.ST stat.ML

    Nonsmooth Nonparametric Regression via Fractional Laplacian Eigenmaps

    Authors: Zhaoyang Shi, Krishnakumar Balasubramanian, Wolfgang Polonik

    Abstract: We develop nonparametric regression methods for the case when the true regression function is not necessarily smooth. More specifically, our approach is using the fractional Laplacian and is designed to handle the case when the true regression function lies in an $L_2$-fractional Sobolev space with order $s\in (0,1)$. This function class is a Hilbert space lying between the space of square-integra…

    Submitted 22 February, 2024; originally announced February 2024.

  3. arXiv:2311.00140  [pdf, ps, other]

    math.ST stat.ML

    Adaptive and non-adaptive minimax rates for weighted Laplacian-eigenmap based nonparametric regression

    Authors: Zhaoyang Shi, Krishnakumar Balasubramanian, Wolfgang Polonik

    Abstract: We show both adaptive and non-adaptive minimax rates of convergence for a family of weighted Laplacian-Eigenmap based nonparametric regression methods, when the true regression function belongs to a Sobolev space and the sampling density is bounded from above and below. The adaptation methodology is based on extensions of Lepski's method and is over both the smoothness parameter (…

    Submitted 31 October, 2023; originally announced November 2023.

  4. arXiv:2210.10744  [pdf, ps, other]

    math.PR math.ST stat.ML

    A Flexible Approach for Normal Approximation of Geometric and Topological Statistics

    Authors: Zhaoyang Shi, Krishnakumar Balasubramanian, Wolfgang Polonik

    Abstract: We derive normal approximation results for a class of stabilizing functionals of binomial or Poisson point process, that are not necessarily expressible as sums of certain score functions. Our approach is based on a flexible notion of the add-one cost operator, which helps one to deal with the second-order cost operator via suitably appropriate first-order operators. We combine this flexible notio…

    Submitted 19 October, 2022; originally announced October 2022.

  5. arXiv:2110.13749  [pdf, other]

    cs.LG math.ST

    Topologically penalized regression on manifolds

    Authors: Olympio Hacquard, Krishnakumar Balasubramanian, Gilles Blanchard, Clément Levrard, Wolfgang Polonik

    Abstract: We study a regression problem on a compact manifold M. In order to take advantage of the underlying geometry and topology of the data, the regression task is performed on the basis of the first several eigenfunctions of the Laplace-Beltrami operator of the manifold, that are regularized with topological penalties. The proposed penalties are based on the topology of the sub-level sets of either the…

    Submitted 10 June, 2022; v1 submitted 26 October, 2021; originally announced October 2021.

    Journal ref: JMLR, 2022

  6. arXiv:2104.12314  [pdf, other]

    stat.ML cs.LG math.ST

    Algorithms for ridge estimation with convergence guarantees

    Authors: Wanli Qiao, Wolfgang Polonik

    Abstract: The extraction of filamentary structure from a point cloud is discussed. The filaments are modeled as ridge lines or higher dimensional ridges of an underlying density. We propose two novel algorithms, and provide theoretical guarantees for their convergences. We consider the new algorithms as alternatives to the Subspace Constraint Mean Shift (SCMS) algorithm that do not suffer from a shortcoming…

    Submitted 25 April, 2021; originally announced April 2021.

    Comments: 41 pages, 8 figures

    MSC Class: 62G05

  7. arXiv:2103.14668  [pdf, other]

    stat.ME math.ST

    Testing For Global Covariate Effects in Dynamic Interaction Event Networks

    Authors: Alexander Kreiss, Enno Mammen, Wolfgang Polonik

    Abstract: In statistical network analysis it is common to observe so-called interaction data. Such data is characterized by actors forming the vertices and interacting along edges of the network, where edges are randomly formed and dissolved over the observation horizon. In addition, covariates are observed and the goal is to model the impact of the covariates on the interactions. We distinguish two types of…

    Submitted 15 June, 2023; v1 submitted 26 March, 2021; originally announced March 2021.

  8. arXiv:2005.07557  [pdf, other]

    math.PR math.ST

    On approximation theorems for the Euler characteristic with applications to the bootstrap

    Authors: Johannes Krebs, Benjamin Roycraft, Wolfgang Polonik

    Abstract: We study approximation theorems for the Euler characteristic of the Vietoris-Rips and Čech filtration. The filtration is obtained from a Poisson or binomial sampling scheme in the critical regime. We apply our results to the smooth bootstrap of the Euler characteristic and determine its rate of convergence in the Kantorovich-Wasserstein distance and in the Kolmogorov distance.

    Submitted 20 September, 2021; v1 submitted 15 May, 2020; originally announced May 2020.

  9. arXiv:2005.01417  [pdf, other]

    math.ST math.AT

    Bootstrapping Persistent Betti Numbers and Other Stabilizing Statistics

    Authors: Benjamin Roycraft, Johannes Krebs, Wolfgang Polonik

    Abstract: The present contribution investigates multivariate bootstrap procedures for general stabilizing statistics, with specific application to topological data analysis. Existing limit theorems for topological statistics prove difficult to use in practice for the construction of confidence intervals, motivating the use of the bootstrap in this capacity. However, the standard nonparametric bootstrap does…

    Submitted 25 March, 2021; v1 submitted 4 May, 2020; originally announced May 2020.

    Comments: 59 pages, 3 figures. Restructured paper with alternate problem settings moved to appendix. Rewrote data analysis and simulations study sections to be more comprehensive, moved each to the end of the paper

    MSC Class: 62F40; 62H10; 62G05

  10. arXiv:1903.03280  [pdf, other]

    math.PR math.ST

    On the asymptotic normality of persistent Betti numbers

    Authors: Johannes T. N. Krebs, Wolfgang Polonik

    Abstract: Persistent Betti numbers are a major tool in persistent homology, a subfield of topological data analysis. Many tools in persistent homology rely on the properties of persistent Betti numbers considered as a two-dimensional stochastic process $(r,s) \mapsto n^{-1/2} (\beta^{r,s}_q(\mathcal{K}(n^{1/d} S_n)) - \mathbb{E}[\beta^{r,s}_q(\mathcal{K}(n^{1/d} S_n))])$. So far, pointwise limit theorems have b…

    Submitted 2 March, 2020; v1 submitted 7 March, 2019; originally announced March 2019.

    MSC Class: Primary: 60G55; 60D05; Secondary: 60F05

  11. arXiv:1903.01430  [pdf, other]

    math.ST

    Nonparametric Confidence Regions for Level Sets: Statistical Properties and Geometry

    Authors: Wanli Qiao, Wolfgang Polonik

    Abstract: This paper studies and critically discusses the construction of nonparametric confidence regions for density level sets. Methodologies based on both vertical variation and horizontal variation are considered. The investigations provide theoretical insight into the behavior of these confidence regions via large sample theory. We also discuss the geometric relationships underlying the construction o…

    Submitted 4 March, 2019; originally announced March 2019.

    Comments: 46 pages, 2 figures

    MSC Class: Primary 62G20; secondary 62G05

  12. arXiv:1811.10178  [pdf, other]

    math.ST

    Multiscale geometric feature extraction for high-dimensional and non-Euclidean data with application

    Authors: Gabriel Chandler, Wolfgang Polonik

    Abstract: A method for extracting multiscale geometric features from a data cloud is proposed and analyzed. The basic idea is to map each pair of data points into a real-valued feature function defined on $[0,1]$. The construction of these feature functions is heavily based on geometric considerations, which has the benefits of enhancing interpretability. Further statistical analysis is then based on the co…

    Submitted 12 December, 2019; v1 submitted 26 November, 2018; originally announced November 2018.

  13. On the choice of weight functions for linear representations of persistence diagrams

    Authors: Vincent Divol, Wolfgang Polonik

    Abstract: Persistence diagrams are efficient descriptors of the topology of a point cloud. As they do not naturally belong to a Hilbert space, standard statistical methods cannot be directly applied to them. Instead, feature maps (or representations) are commonly used for the analysis. A large class of feature maps, which we call linear, depends on some weight functions, the choice of which is a critical is…

    Submitted 30 November, 2020; v1 submitted 10 July, 2018; originally announced July 2018.

    Journal ref: J Appl. and Comput. Topology 3, 249-283 (2019)

  14. arXiv:1711.06305  [pdf, other]

    math.ST

    Neighborhood selection with application to social networks

    Authors: Nana Wang, Wolfgang Polonik

    Abstract: The topic of this paper is modeling and analyzing dependence in stochastic social networks. Using a latent variable block model allows the analysis of dependence between blocks via the analysis of a latent graphical model. Our approach to the analysis of the graphical model then is based on the idea underlying the neighborhood selection scheme put forward by Meinshausen and Bühlmann (2006). Howeve…

    Submitted 23 August, 2018; v1 submitted 16 November, 2017; originally announced November 2017.

  15. Nonparametric inference for continuous-time event counting and link-based dynamic network models

    Authors: Alexander Kreiß, Enno Mammen, Wolfgang Polonik

    Abstract: A flexible approach for modeling both dynamic event counting and dynamic link-based networks based on counting processes is proposed, and estimation in these models is studied. We consider nonparametric likelihood based estimation of parameter functions via kernel smoothing. The asymptotic behavior of these estimators is rigorously analyzed by allowing the number of nodes to tend to infinity. The…

    Submitted 28 May, 2019; v1 submitted 10 May, 2017; originally announced May 2017.

    Journal ref: Electron. J. Statist. 13 (2) 2764 - 2829, 2019

  16. arXiv:1510.07105  [pdf, other]

    math.ST

    Theoretical Analysis of Nonparametric Filament Estimation

    Authors: Wanli Qiao, Wolfgang Polonik

    Abstract: This paper provides a rigorous study of the nonparametric estimation of filaments or ridge lines of a probability density $f$. Points on the filament are considered as local extrema of the density when traversing the support of $f$ along the integral curve driven by the vector field of second eigenvectors of the Hessian of $f$. We 'parametrize' points on the filaments by such integral curves, and…

    Submitted 24 October, 2015; originally announced October 2015.

    Comments: 55 pages, 1 figure

  17. arXiv:1510.06833  [pdf, other]

    math.PR

    Extrema of locally stationary Gaussian fields on growing manifolds

    Authors: Wanli Qiao, Wolfgang Polonik

    Abstract: We consider a class of non-homogeneous, continuous, centered Gaussian random fields $\{X_h(t), t \in {\cal M}_h;\,0 < h \le 1\}$ where ${\cal M}_h$ denotes a rescaled smooth manifold, i.e. ${\cal M}_h = \frac{1}{h} {\cal M},$ and study the limit behavior of the extreme values of these Gaussian random fields when $h$ tends to zero, which means that the manifold is growing. Our main result can be th…

    Submitted 23 October, 2015; originally announced October 2015.

    Comments: 28 pages, 1 figure

  18. Asymptotic normality of plug-in level set estimates

    Authors: David M. Mason, Wolfgang Polonik

    Abstract: We establish the asymptotic normality of the $G$-measure of the symmetric difference between the level set and a plug-in-type estimator of it formed by replacing the density in the definition of the level set by a kernel density estimator. Our proof will highlight the efficacy of Poissonization methods in the treatment of large sample theory problems of this kind.

    Submitted 7 August, 2009; originally announced August 2009.

    Comments: Published at http://dx.doi.org/10.1214/08-AAP569 in the Annals of Applied Probability (http://www.imstat.org/aap/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AAP-AAP569

    MSC Class: 60F05; 60F15; 62E20; 62G07 (Primary)

    Journal ref: Annals of Applied Probability 2009, Vol. 19, No. 3, 1108-1142

  19. Empirical spectral processes for locally stationary time series

    Authors: Rainer Dahlhaus, Wolfgang Polonik

    Abstract: A time-varying empirical spectral process indexed by classes of functions is defined for locally stationary time series. We derive weak convergence in a function space, and prove a maximal exponential inequality and a Glivenko-Cantelli-type convergence result. The results use conditions based on the metric entropy of the index class. In contrast to related earlier work, no Gaussian assumption i…

    Submitted 9 February, 2009; originally announced February 2009.

    Comments: Published at http://dx.doi.org/10.3150/08-BEJ137 in Bernoulli (http://isi.cbs.nl/bernoulli/) by the International Statistical Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm)

    Report number: IMS-BEJ-BEJ137

    Journal ref: Bernoulli 2009, Vol. 15, No. 1, 1-39

  20. Nonparametric quasi-maximum likelihood estimation for Gaussian locally stationary processes

    Authors: Rainer Dahlhaus, Wolfgang Polonik

    Abstract: This paper deals with nonparametric maximum likelihood estimation for Gaussian locally stationary processes. Our nonparametric MLE is constructed by minimizing a frequency domain likelihood over a class of functions. The asymptotic behavior of the resulting estimator is studied. The results depend on the richness of the class of functions. Both sieve estimation and global estimation are consider…

    Submitted 1 August, 2007; originally announced August 2007.

    Comments: Published at http://dx.doi.org/10.1214/009053606000000867 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS0138

    MSC Class: 62M10 (Primary) 62F30 (Secondary)

    Journal ref: Annals of Statistics 2006, Vol. 34, No. 6, 2790-2824