Skip to main content

Showing 1–39 of 39 results for author: Schmidt-Hieber, J

.
  1. arXiv:2311.03483  [pdf, ps, other

    math.ST cs.LG cs.NE

    Hebbian learning inspired estimation of the linear regression parameters from queries

    Authors: Johannes Schmidt-Hieber, Wouter M Koolen

    Abstract: Local learning rules in biological neural networks (BNNs) are commonly referred to as Hebbian learning. [26] links a biologically motivated Hebbian learning rule to a specific zeroth-order optimization method. In this work, we study a variation of this Hebbian learning rule to recover the regression vector in the linear regression model. Zeroth-order optimization methods are known to converge with… ▽ More

    Submitted 26 September, 2023; originally announced November 2023.

    Comments: 34 pages

    MSC Class: Primary: 62L20; secondary: 62J05

  2. Convergence guarantees for forward gradient descent in the linear regression model

    Authors: Thijs Bos, Johannes Schmidt-Hieber

    Abstract: Renewed interest in the relationship between artificial and biological neural networks motivates the study of gradient-free methods. Considering the linear regression model with random design, we theoretically analyze in this work the biologically motivated (weight-perturbed) forward gradient scheme that is based on random linear combination of the gradient. If d denotes the number of parameters a… ▽ More

    Submitted 20 June, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: 17 pages

    MSC Class: Primary: 62L20; secondary: 62J05

    Journal ref: Journal of Statistical Planning and Inference, Volume 233, 106174, 2024

  3. arXiv:2306.10529  [pdf, other

    math.ST stat.ML

    Dropout Regularization Versus $\ell_2$-Penalization in the Linear Model

    Authors: Gabriel Clara, Sophie Langer, Johannes Schmidt-Hieber

    Abstract: We investigate the statistical behavior of gradient descent iterates with dropout in the linear regression model. In particular, non-asymptotic bounds for the convergence of expectations and covariance matrices of the iterates are derived. The results shed more light on the widely cited connection between dropout and l2-regularization in the linear model. We indicate a more subtle relationship, ow… ▽ More

    Submitted 25 April, 2024; v1 submitted 18 June, 2023; originally announced June 2023.

    Comments: 52 pages, 2 figures

  4. arXiv:2306.10471  [pdf, other

    math.ST

    A supervised deep learning method for nonparametric density estimation

    Authors: Thijs Bos, Johannes Schmidt-Hieber

    Abstract: Nonparametric density estimation is an unsupervised learning problem. In this work we propose a two-step procedure that casts the density estimation problem in the first step into a supervised regression problem. The advantage is that we can afterwards apply supervised learning methods. Compared to the standard nonparametric regression setting, the proposed procedure creates, however, dependence a… ▽ More

    Submitted 3 June, 2024; v1 submitted 18 June, 2023; originally announced June 2023.

    Comments: Keywords: Neural networks, nonparametric density estimation, statistical estimation rates, (un)supervised learning

    MSC Class: Primary: 62G07; secondary 68T07

  5. Lower bounds for the trade-off between bias and mean absolute deviation

    Authors: Alexis Derumigny, Johannes Schmidt-Hieber

    Abstract: In nonparametric statistics, rate-optimal estimators typically balance bias and stochastic error. The recent work on overparametrization raises the question whether rate-optimal estimators exist that do not obey this trade-off. In this work we consider pointwise estimation in the Gaussian white noise model with regression function $f$ in a class of $β$-Hölder smooth functions. Let 'worst-case' ref… ▽ More

    Submitted 20 June, 2024; v1 submitted 21 March, 2023; originally announced March 2023.

    Comments: This is an extended version of Section 7 of arXiv:2006.00278v3. The material has been removed from later versions of arXiv:2006.00278

    MSC Class: 62C20; 62G05; 62C05

    Journal ref: Statistics and Probability Letters, Volume 213, 110182, 2024

  6. arXiv:2303.08122  [pdf, ps, other

    math.ST cs.IT math.PR

    Codivergences and information matrices

    Authors: Alexis Derumigny, Johannes Schmidt-Hieber

    Abstract: We propose a new concept of codivergence, which quantifies the similarity between two probability measures $P_1, P_2$ relative to a reference probability measure $P_0$. In the neighborhood of the reference measure $P_0$, a codivergence behaves like an inner product between the measures $P_1 - P_0$ and $P_2 - P_0$. Codivergences of covariance-type and correlation-type are introduced and studied wit… ▽ More

    Submitted 9 May, 2024; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: 30 pages, 1 figure, 1 table. This is an extended version of Section 2.2 of arXiv:2006.00278v3 (most of this content has been removed in the next version (arXiv:2006.00278v4) and link to this separate paper instead)

    MSC Class: 62B11; 46E27; 15A63

  7. arXiv:2301.11777  [pdf, other

    cs.LG cs.NE math.ST

    Interpreting learning in biological neural networks as zero-order optimization method

    Authors: Johannes Schmidt-Hieber

    Abstract: Recently, significant progress has been made regarding the statistical understanding of artificial neural networks (ANNs). ANNs are motivated by the functioning of the brain, but differ in several crucial aspects. In particular, the locality in the updating rule of the connection parameters in biological neural networks (BNNs) makes it biologically implausible that the learning of the brain is bas… ▽ More

    Submitted 23 March, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

  8. arXiv:2206.02151  [pdf, other

    math.ST

    A statistical analysis of an image classification problem

    Authors: Sophie Langer, Johannes Schmidt-Hieber

    Abstract: The availability of massive image databases resulted in the development of scalable machine learning methods such as convolutional neural network (CNNs) filtering and processing these data. While the very recent theoretical work on CNNs focuses on standard nonparametric denoising problems, the variability in image classification datasets does, however, not originate from additive noise but from va… ▽ More

    Submitted 5 June, 2022; originally announced June 2022.

  9. arXiv:2205.07764  [pdf, ps, other

    stat.ML cs.LG math.ST

    On the inability of Gaussian process regression to optimally learn compositional functions

    Authors: Matteo Giordano, Kolyan Ray, Johannes Schmidt-Hieber

    Abstract: We rigorously prove that deep Gaussian process priors can outperform Gaussian process priors if the target function has a compositional structure. To this end, we study information-theoretic lower bounds for posterior contraction rates for Gaussian process regression in a continuous regression model. We show that if the true function is a generalized additive function, then the posterior based on… ▽ More

    Submitted 27 September, 2022; v1 submitted 16 May, 2022; originally announced May 2022.

    Comments: 20 pages, to appear in Advances in Neural Information Processing Systems 36 (NeurIPS 2022)

  10. arXiv:2204.05003  [pdf, other

    math.ST

    Local convergence rates of the nonparametric least squares estimator with applications to transfer learning

    Authors: Johannes Schmidt-Hieber, Petr Zamolodtchikov

    Abstract: Convergence properties of empirical risk minimizers can be conveniently expressed in terms of the associated population risk. To derive bounds for the performance of the estimator under covariate shift, however, pointwise convergence rates are required. Under weak assumptions on the design distribution, it is shown that least squares estimators (LSE) over 1-Lipschitz functions are also minimax rat… ▽ More

    Submitted 29 December, 2023; v1 submitted 11 April, 2022; originally announced April 2022.

  11. On generalization bounds for deep networks based on loss surface implicit regularization

    Authors: Masaaki Imaizumi, Johannes Schmidt-Hieber

    Abstract: The classical statistical learning theory implies that fitting too many parameters leads to overfitting and poor performance. That modern deep neural networks generalize well despite a large number of parameters contradicts this finding and constitutes a major unsolved problem towards explaining the success of deep learning. While previous work focuses on the implicit regularization induced by sto… ▽ More

    Submitted 16 October, 2022; v1 submitted 12 January, 2022; originally announced January 2022.

    Comments: To appear in IEEE Transaction on Information Theory

  12. arXiv:2108.00969  [pdf, ps, other

    math.ST cs.LG

    Convergence rates of deep ReLU networks for multiclass classification

    Authors: Thijs Bos, Johannes Schmidt-Hieber

    Abstract: For classification problems, trained deep neural networks return probabilities of class memberships. In this work we study convergence of the learned probabilities to the true conditional class probabilities. More specifically we consider sparse deep ReLU network reconstructions minimizing cross-entropy loss in the multiclass classification setup. Interesting phenomena occur when the class members… ▽ More

    Submitted 2 August, 2021; originally announced August 2021.

    Comments: convergence rates, ReLU networks, multiclass classification, conditional class probabilities, margin condition

    MSC Class: Primary: 62G05; secondary: 63H30; 68T07

  13. arXiv:2105.07410  [pdf, other

    math.ST

    Posterior contraction for deep Gaussian process priors

    Authors: Gianluca Finocchio, Johannes Schmidt-Hieber

    Abstract: We study posterior contraction rates for a class of deep Gaussian process priors applied to the nonparametric regression problem under a general composition assumption on the regression function. It is shown that the contraction rates can achieve the minimax convergence rate (up to $\log n$ factors), while being adaptive to the underlying structure and smoothness of the target function. The propos… ▽ More

    Submitted 13 August, 2022; v1 submitted 16 May, 2021; originally announced May 2021.

    Comments: 56 pages, 3 figures

    MSC Class: 62G08; 62G20 (Primary) 62C20; 62R07 (Secondary)

  14. arXiv:2007.15884  [pdf, other

    cs.LG cs.NE stat.ML

    The Kolmogorov-Arnold representation theorem revisited

    Authors: Johannes Schmidt-Hieber

    Abstract: There is a longstanding debate whether the Kolmogorov-Arnold representation theorem can explain the use of more than one hidden layer in neural networks. The Kolmogorov-Arnold representation decomposes a multivariate function into an interior and an outer function and therefore has indeed a similar structure as a neural network with two hidden layers. But there are distinctive differences. One of… ▽ More

    Submitted 2 January, 2021; v1 submitted 31 July, 2020; originally announced July 2020.

    Comments: 21 pages

    MSC Class: 41A30

  15. arXiv:2006.00278  [pdf, ps, other

    math.ST stat.ML

    On lower bounds for the bias-variance trade-off

    Authors: Alexis Derumigny, Johannes Schmidt-Hieber

    Abstract: It is a common phenomenon that for high-dimensional and nonparametric statistical models, rate-optimal estimators balance squared bias and variance. Although this balancing is widely observed, little is known whether methods exist that could avoid the trade-off between bias and variance. We propose a general strategy to obtain lower bounds on the variance of any estimator with bias smaller than a… ▽ More

    Submitted 20 March, 2023; v1 submitted 30 May, 2020; originally announced June 2020.

    Comments: 52 pages, 2 figures, 1 table

    MSC Class: 62G05; 62C05; 62C20

  16. arXiv:2003.04406  [pdf, other

    math.ST

    On frequentist coverage of Bayesian credible sets for estimation of the mean under constraints

    Authors: Kevin Duisters, Johannes Schmidt-Hieber

    Abstract: Frequentist coverage of $(1-α)$-highest posterior density (HPD) credible sets is studied in a signal plus noise model under a large class of noise distributions. We consider a specific class of spike-and-slab prior distributions. Different regimes are identified and we derive closed form expressions for the $(1-α)$-HPD on each of these regimes. Similar to the earlier work by Marchand and Strawderm… ▽ More

    Submitted 9 March, 2020; originally announced March 2020.

    Comments: 35 pages, 5 figures

    MSC Class: 62C10; 62G15; 62F15

  17. arXiv:1908.00695  [pdf, other

    stat.ML cs.LG

    Deep ReLU network approximation of functions on a manifold

    Authors: Johannes Schmidt-Hieber

    Abstract: Whereas recovery of the manifold from data is a well-studied topic, approximation rates for functions defined on manifolds are less known. In this work, we study a regression problem with inputs on a $d^*$-dimensional manifold that is embedded into a space with potentially much larger ambient dimension. It is shown that sparsely connected deep ReLU networks can approximate a Hölder function with s… ▽ More

    Submitted 2 August, 2019; originally announced August 2019.

  18. arXiv:1904.04525  [pdf, ps, other

    math.ST

    Bayesian variance estimation in the Gaussian sequence model with partial information on the means

    Authors: Gianluca Finocchio, Johannes Schmidt-Hieber

    Abstract: Consider the Gaussian sequence model under the additional assumption that a fixed fraction of the means is known. We study the problem of variance estimation from a frequentist Bayesian perspective. The maximum likelihood estimator (MLE) for $σ^2$ is biased and inconsistent. This raises the question whether the posterior is able to correct the MLE in this case. By develo** a new proving strategy… ▽ More

    Submitted 18 December, 2019; v1 submitted 9 April, 2019; originally announced April 2019.

    Comments: 33 pages, 1 table, corrected typos, improved proofs, expanded sections, references added

  19. arXiv:1809.04140  [pdf, other

    math.ST

    Nonparametric Bayesian analysis of the compound Poisson prior for support boundary recovery

    Authors: Markus Reiss, Johannes Schmidt-Hieber

    Abstract: Given data from a Poisson point process with intensity $(x,y) \mapsto n \mathbf{1}(f(x)\leq y),$ frequentist properties for the Bayesian reconstruction of the support boundary function $f$ are derived. We mainly study compound Poisson process priors with fixed intensity proving that the posterior contracts with nearly optimal rate for monotone and piecewise constant support boundaries and adapts t… ▽ More

    Submitted 11 September, 2018; originally announced September 2018.

    Comments: The first version of arXiv:1703.08358 has been expanded and rewritten. We decided to split it in two separate papers, a new version of arXiv:1703.08358 and this article

    MSC Class: 62C10; 62G05; 60G55

  20. arXiv:1809.02443  [pdf, other

    math.ST

    Posterior analysis of $n$ in the binomial $(n,p)$ problem with both parameters unknown -- with applications to quantitative nanoscopy

    Authors: Johannes Schmidt-Hieber, Laura Fee Schneider, Thomas Staudt, Andrea Kra**a, Timo Aspelmeier, Axel Munk

    Abstract: Estimation of the population size $n$ from $k$ i.i.d.\ binomial observations with unknown success probability $p$ is relevant to a multitude of applications and has a long history. Without additional prior information this is a notoriously difficult task when $p$ becomes small, and the Bayesian approach becomes particularly useful. For a large class of priors, we establish posterior contraction an… ▽ More

    Submitted 16 November, 2020; v1 submitted 7 September, 2018; originally announced September 2018.

    Comments: 66 pages; 37 pages main text and 29 pages supplement; contains link to a supplementary microscopy video

  21. arXiv:1804.02253  [pdf, other

    stat.ML cs.LG stat.ME

    A comparison of deep networks with ReLU activation function and linear spline-type methods

    Authors: Konstantin Eckle, Johannes Schmidt-Hieber

    Abstract: Deep neural networks (DNNs) generate much richer function spaces than shallow networks. Since the function spaces induced by shallow networks have several approximation theoretic drawbacks, this explains, however, not necessarily the success of deep networks. In this article we take another route by comparing the expressive power of DNNs with ReLU activation function to piecewise linear spline met… ▽ More

    Submitted 24 September, 2018; v1 submitted 6 April, 2018; originally announced April 2018.

    MSC Class: 62G08 (Primary); 62G20 (Secondary)

  22. Asymptotic nonequivalence of density estimation and Gaussian white noise for small densities

    Authors: Kolyan Ray, Johannes Schmidt-Hieber

    Abstract: It is well-known that density estimation on the unit interval is asymptotically equivalent to a Gaussian white noise experiment, provided the densities are sufficiently smooth and uniformly bounded away from zero. We show that a uniform lower bound, whose size we sharply characterize, is in general necessary for asymptotic equivalence to hold.

    Submitted 6 November, 2018; v1 submitted 9 February, 2018; originally announced February 2018.

    Comments: 20 pages, 1 figure. Some results from an early version of arXiv:1608.01824 are now found here

    MSC Class: 62B15 (Primary); 62G07; 62G10; 62G20 (Secondary)

    Journal ref: Ann. Inst. H. Poincare Probab. Statist. 55 (2019), no. 4, 2195-2208

  23. arXiv:1708.06633  [pdf, other

    math.ST cs.LG stat.ML

    Nonparametric regression using deep neural networks with ReLU activation function

    Authors: Johannes Schmidt-Hieber

    Abstract: Consider the multivariate nonparametric regression model. It is shown that estimators based on sparsely connected deep neural networks with ReLU activation function and properly chosen network architecture achieve the minimax rates of convergence (up to $\log n$-factors) under a general composition assumption on the regression function. The framework includes many well-studied structural constrain… ▽ More

    Submitted 13 September, 2020; v1 submitted 22 August, 2017; originally announced August 2017.

    Comments: article, rejoinder and supplementary material

    MSC Class: 62G08

    Journal ref: Article: Annals of Statistics, Volume 48, Number 4, 1875-1897, 2020, Rejoinder: Annals of Statistics, Volume 48, Number 4, 1916-1921, 2020

  24. arXiv:1704.01066  [pdf, other

    stat.ME econ.EM

    Tests for qualitative features in the random coefficients model

    Authors: Fabian Dunker, Konstantin Eckle, Katharina Proksch, Johannes Schmidt-Hieber

    Abstract: The random coefficients model is an extension of the linear regression model that allows for unobserved heterogeneity in the population by modeling the regression coefficients as random variables. Given data from this model, the statistical challenge is to recover information about the joint density of the random coefficients which is a multivariate and ill-posed problem. Because of the curse of d… ▽ More

    Submitted 13 March, 2018; v1 submitted 4 April, 2017; originally announced April 2017.

    MSC Class: 62G10; 62G15; 62G20

  25. arXiv:1703.08358  [pdf, other

    math.ST

    Posterior contraction rates for support boundary recovery

    Authors: Markus Reiss, Johannes Schmidt-Hieber

    Abstract: Given a sample of a Poisson point process with intensity $λ_f(x,y) = n \mathbf{1}(f(x) \leq y),$ we study recovery of the boundary function $f$ from a nonparametric Bayes perspective. Because of the irregularity of this model, the analysis is non-standard. We establish a general result for the posterior contraction rate with respect to the $L^1$-norm based on entropy and one-sided small probabilit… ▽ More

    Submitted 12 June, 2020; v1 submitted 24 March, 2017; originally announced March 2017.

    MSC Class: 62C10; 62G05; 60G55

  26. The Le Cam distance between density estimation, Poisson processes and Gaussian white noise

    Authors: Kolyan Ray, Johannes Schmidt-Hieber

    Abstract: It is well-known that density estimation on the unit interval is asymptotically equivalent to a Gaussian white noise experiment, provided the densities have Hölder smoothness larger than $1/2$ and are uniformly bounded away from zero. We derive matching lower and constructive upper bounds for the Le Cam deficiencies between these experiments, with explicit dependence on both the sample size and th… ▽ More

    Submitted 14 April, 2018; v1 submitted 5 August, 2016; originally announced August 2016.

    Comments: Some results from an earlier version of this preprint have been moved to arXiv:1802.03425

    MSC Class: 62G05 (Primary); 62G07; 62G20 (Secondary)

    Journal ref: Math. Stat. Learn. 1 (2018), 101-170

  27. Minimax theory for a class of non-linear statistical inverse problems

    Authors: Kolyan Ray, Johannes Schmidt-Hieber

    Abstract: We study a class of statistical inverse problems with non-linear pointwise operators motivated by concrete statistical applications. A two-step procedure is proposed, where the first step smoothes the data and inverts the non-linearity. This reduces the initial non-linear problem to a linear inverse problem with deterministic noise, which is then solved in a second step. The noise reduction step i… ▽ More

    Submitted 11 May, 2016; v1 submitted 1 December, 2015; originally announced December 2015.

    Comments: 37 pages

    MSC Class: 62G05 (Primary); 62G08; 62G20 (Secondary)

    Journal ref: Inverse Problems 32 (2016) 065003

  28. A regularity class for the roots of non-negative functions

    Authors: Kolyan Ray, Johannes Schmidt-Hieber

    Abstract: We investigate the regularity of the positive roots of a non-negative function of one-variable. A modified Hölder space $\mathcal{F}^β$ is introduced such that if $f\in \mathcal{F}^β$ then $f^α\in C^{αβ}$. This provides sufficient conditions to overcome the usual limitation in the square root case ($α= 1/2$) for Hölder functions that $f^{1/2}$ need be no more than $C^1$ in general. We also derive… ▽ More

    Submitted 16 March, 2017; v1 submitted 30 October, 2015; originally announced October 2015.

    Comments: 12 pages

    MSC Class: 26A16; 26A27

    Journal ref: Ann. Mat. Pura Appl. 196 (2017), 2091-2103

  29. Conditions for Posterior Contraction in the Sparse Normal Means Problem

    Authors: Stéphanie van der Pas, Jean-Bernard Salomond, Johannes Schmidt-Hieber

    Abstract: The first Bayesian results for the sparse normal means problem were proven for spike-and-slab priors. However, these priors are less convenient from a computational point of view. In the meanwhile, a large number of continuous shrinkage priors has been proposed. Many of these shrinkage priors can be written as a scale mixture of normals, which makes them particularly easy to implement. We propose… ▽ More

    Submitted 13 October, 2015; v1 submitted 8 October, 2015; originally announced October 2015.

    Journal ref: Electron. J. Statist. 10 (2016), no. 1, 976--1000. http://projecteuclid.org/euclid.ejs/1460463652

  30. arXiv:1403.0735  [pdf, ps, other

    math.ST stat.ME

    Bayesian linear regression with sparse priors

    Authors: Ismaël Castillo, Johannes Schmidt-Hieber, Aad van der Vaart

    Abstract: We study full Bayesian procedures for high-dimensional linear regression under sparsity constraints. The prior is a mixture of point masses at zero and continuous distributions. Under compatibility conditions on the design matrix, the posterior distribution is shown to contract at the optimal rate for recovery of the unknown sparse vector, and to give optimal prediction of the response vector. It… ▽ More

    Submitted 14 October, 2015; v1 submitted 4 March, 2014; originally announced March 2014.

    Comments: Published at http://dx.doi.org/10.1214/15-AOS1334 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1334

    Journal ref: Annals of Statistics 2015, Vol. 43, No. 5, 1986-2018

  31. Asymptotic equivalence for regression under fractional noise

    Authors: Johannes Schmidt-Hieber

    Abstract: Consider estimation of the regression function based on a model with equidistant design and measurement errors generated from a fractional Gaussian noise process. In previous literature, this model has been heuristically linked to an experiment, where the anti-derivative of the regression function is continuously observed under additive perturbation by a fractional Brownian motion. Based on a refo… ▽ More

    Submitted 1 December, 2014; v1 submitted 2 December, 2013; originally announced December 2013.

    Comments: Published in at http://dx.doi.org/10.1214/14-AOS1262 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1262

    Journal ref: Annals of Statistics 2014, Vol. 42, No. 6, 2557-2585

  32. arXiv:1309.6178  [pdf, ps, other

    stat.AP

    Spot volatility estimation for high-frequency data: adaptive estimation in practice

    Authors: Till Sabel, Johannes Schmidt-Hieber, Axel Munk

    Abstract: We develop further the spot volatility estimator introduced in Hoffmann, Munk and Schmidt-Hieber (2012) from a practical point of view and make it useful for the analysis of high-frequency financial data. In a first part, we adjust the estimator substantially in order to achieve good finite sample performance and to overcome difficulties arising from violations of the additive microstructure noise… ▽ More

    Submitted 24 September, 2013; originally announced September 2013.

    MSC Class: 91B84; 62G08; 65T60; 62M99

  33. On adaptive posterior concentration rates

    Authors: Marc Hoffmann, Judith Rousseau, Johannes Schmidt-Hieber

    Abstract: We investigate the problem of deriving posterior concentration rates under different loss functions in nonparametric Bayes. We first provide a lower bound on posterior coverages of shrinking neighbourhoods that relates the metric or loss under which the shrinking neighbourhood is considered, and an intrinsic pre-metric linked to frequentist separation rates. In the Gaussian white noise model, we c… ▽ More

    Submitted 5 November, 2015; v1 submitted 22 May, 2013; originally announced May 2013.

    Comments: Published at http://dx.doi.org/10.1214/15-AOS1341 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1341

    Journal ref: Annals of Statistics 2015, Vol. 43, No. 5, 2259-2295

  34. arXiv:1303.3118  [pdf, ps, other

    math.ST stat.ME

    On an estimator achieving the adaptive rate in nonparametric regression under $L^p$-loss for all $1\leq p \leq \infty$

    Authors: Johannes Schmidt-Hieber

    Abstract: Consider nonparametric function estimation under $L^p$-loss. The minimax rate for estimation of the regression function over a Hölder ball with smoothness index $β$ is $n^{-β/(2β+1)}$ if $1\leq p<\infty$ and $(n/\log n)^{-β/(2β+1)}$ if $p=\infty.$ There are many known procedures that either attain this rate for $p=\infty$ but are suboptimal by a $\log n$ factor in the case $p<\infty$ or the other… ▽ More

    Submitted 7 February, 2015; v1 submitted 13 March, 2013; originally announced March 2013.

    Comments: 21 pages

  35. Asymptotically efficient estimation of a scale parameter in Gaussian time series and closed-form expressions for the Fisher information

    Authors: Till Sabel, Johannes Schmidt-Hieber

    Abstract: Mimicking the maximum likelihood estimator, we construct first order Cramer-Rao efficient and explicitly computable estimators for the scale parameter $σ^2$ in the model $Z_{i,n}=σn^{-β}X_i+Y_i,i=1,\ldots,n,β>0$ with independent, stationary Gaussian processes $(X_i)_{i\in\mathbb{N}}$, $(Y_i)_{i\in\mathbb{N}}$, and $(X_i)_{i\in\mathbb{N}}$ exhibits possibly long-range dependence. In a second part,… ▽ More

    Submitted 13 March, 2014; v1 submitted 27 August, 2012; originally announced August 2012.

    Comments: Published in at http://dx.doi.org/10.3150/12-BEJ505 the Bernoulli (http://isi.cbs.nl/bernoulli/) by the International Statistical Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm)

    Report number: IMS-BEJ-BEJ505

    Journal ref: Bernoulli 2014, Vol. 20, No. 2, 747-774

  36. arXiv:1107.1404  [pdf, other

    math.ST stat.ME

    Multiscale Methods for Shape Constraints in Deconvolution: Confidence Statements for Qualitative Features

    Authors: Johannes Schmidt-Hieber, Axel Munk, Lutz Duembgen

    Abstract: We derive multiscale statistics for deconvolution in order to detect qualitative features of the unknown density. An important example covered within this framework is to test for local monotonicity on all scales simultaneously. We investigate the moderately ill-posed setting, where the Fourier transform of the error density in the deconvolution model is of polynomial decay. For multiscale testing… ▽ More

    Submitted 17 December, 2012; v1 submitted 7 July, 2011; originally announced July 2011.

    Comments: 55 pages, 5 figures, This is a revised version of a previous paper with the title: "Multiscale Methods for Shape Constraints in Deconvolution"

    MSC Class: 62G10 (Primary) 62G15; 62G20 (Secondary)

  37. arXiv:1007.4622  [pdf, ps, other

    math.ST

    Adaptive wavelet estimation of the diffusion coefficient under additive error measurements

    Authors: Marc Hoffmann, Axel Munk, Johannes Schmidt-Hieber

    Abstract: We study nonparametric estimation of the diffusion coefficient from discrete data, when the observations are blurred by additional noise. Such issues have been developed over the last 10 years in several application fields and in particular in high frequency financial data modelling, however mainly from a parametric and semiparametric point of view. This paper addresses the nonparametric estimatio… ▽ More

    Submitted 29 December, 2011; v1 submitted 27 July, 2010; originally announced July 2010.

    Comments: 46 pages. This is the second version. A first draft of the paper appeared as a working paper in 2010 under the title "Nonparametric estimation of the volatility under microstructure noise: wavelet adaptation"

    MSC Class: 62G99; 62M99; 60G99

  38. arXiv:1002.3045  [pdf, ps, other

    math.ST

    Lower bounds for volatility estimation in microstructure noise models

    Authors: Axel Munk, Johannes Schmidt-Hieber

    Abstract: In this paper we derive lower bounds in minimax sense for estimation of the instantaneous volatility if the diffusion type part cannot be observed directly but under some additional Gaussian noise. Three different models are considered. Our technique is based on a general inequality for Kullback-Leibler divergence of multivariate normal random variables and spectral analysis of the processes. Th… ▽ More

    Submitted 16 February, 2010; originally announced February 2010.

    Comments: 16 pages

  39. arXiv:0908.3163  [pdf, other

    stat.ME math.ST

    Nonparametric estimation of the volatility function in a high-frequency model corrupted by noise

    Authors: Axel Munk, Johannes Schmidt-Hieber

    Abstract: We consider the models Y_{i,n}=\int_0^{i/n} σ(s)dW_s+τ(i/n)ε_{i,n}, and \tilde Y_{i,n}=σ(i/n)W_{i/n}+τ(i/n)ε_{i,n}, i=1,...,n, where W_t denotes a standard Brownian motion and ε_{i,n} are centered i.i.d. random variables with E(ε_{i,n}^2)=1 and finite fourth moment. Furthermore, σand τare unknown deterministic functions and W_t and (ε_{1,n},...,ε_{n,n}) are assumed to be independent processes. Bas… ▽ More

    Submitted 6 April, 2010; v1 submitted 21 August, 2009; originally announced August 2009.

    Comments: 5 figures, corrected references, minor changes