Skip to main content

Showing 1–12 of 12 results for author: Walther, G

Searching in archive stat. Search in all archives.
.
  1. arXiv:2308.00950  [pdf, other

    stat.ME

    Beta-trees: Multivariate histograms with confidence statements

    Authors: Guenther Walther, Qian Zhao

    Abstract: Multivariate histograms are difficult to construct due to the curse of dimensionality. Motivated by $k$-d trees in computer science, we show how to construct an efficient data-adaptive partition of Euclidean space that possesses the following two properties: With high confidence the distribution from which the data are generated is close to uniform on each rectangle of the partition; and despite t… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

    MSC Class: 62G15

  2. arXiv:2109.06371  [pdf, ps, other

    math.ST stat.ME

    Tail bounds for empirically standardized sums

    Authors: Guenther Walther

    Abstract: Exponential tail bounds for sums play an important role in statistics, but the example of the $t$-statistic shows that the exponential tail decay may be lost when population parameters need to be estimated from the data. However, it turns out that if Studentizing is accompanied by estimating the location parameter in a suitable way, then the $t$-statistic regains the exponential tail behavior. Mot… ▽ More

    Submitted 19 March, 2022; v1 submitted 13 September, 2021; originally announced September 2021.

    MSC Class: 62G32; 60F10

  3. Calibrating the scan statistic with size-dependent critical values: heuristics, methodology and computation

    Authors: Guenther Walther

    Abstract: It is known that the scan statistic with variable window size favors the detection of signals with small spatial extent and there is a corresponding loss of power for signals with large spatial extent. Recent results have shown that this loss is not inevitable: Using critical values that depend on the size of the window allows optimal detection for all signal sizes simultaneously, so there is no s… ▽ More

    Submitted 14 February, 2022; v1 submitted 17 July, 2021; originally announced July 2021.

    Journal ref: In: Glaz, J, Koutras M.V. (eds) Handbook of Scan Statistics. 2022. Springer, New York, NY

  4. arXiv:2011.03668  [pdf, other

    math.ST stat.ME

    Confidence bands for a log-concave density

    Authors: Guenther Walther, Alnur Ali, Xinyue Shen, Stephen Boyd

    Abstract: We present a new approach for inference about a log-concave distribution: Instead of using the method of maximum likelihood, we propose to incorporate the log-concavity constraint in an appropriate nonparametric confidence set for the cdf $F$. This approach has the advantage that it automatically provides a measure of statistical uncertainty and it thus overcomes a marked limitation of the maximum… ▽ More

    Submitted 6 May, 2022; v1 submitted 6 November, 2020; originally announced November 2020.

    Comments: Added a discussion section, minor changes

  5. arXiv:1907.00085  [pdf, other

    math.ST stat.ME

    Large-scale inference with block structure

    Authors: Jiyao Kou, Guenther Walther

    Abstract: The detection of weak and rare effects in large amounts of data arises in a number of modern data analysis problems. Known results show that in this situation the potential of statistical inference is severely limited by the large-scale multiple testing that is inherent in these problems. Here we show that fundamentally more powerful statistical inference is possible when there is some structure i… ▽ More

    Submitted 7 May, 2022; v1 submitted 28 June, 2019; originally announced July 2019.

    MSC Class: 62G10; 62G32

  6. arXiv:1612.07216  [pdf, other

    math.ST stat.ME

    The Essential Histogram

    Authors: Housen Li, Axel Munk, Hannes Sieling, Guenther Walther

    Abstract: The histogram is widely used as a simple, exploratory display of data, but it is usually not clear how to choose the number and size of bins. We construct a confidence set of distribution functions that optimally address the two main tasks of the histogram: estimating probabilities and detecting features such as increases and modes in the distribution. We define the essential histogram as the hist… ▽ More

    Submitted 28 May, 2019; v1 submitted 21 December, 2016; originally announced December 2016.

    Comments: Extension to discrete data is included. A R-package "essHist" is available from https://CRAN.R-project.org/package=essHist

    MSC Class: 62G10; 62H30

    Journal ref: Biometrika, 2020

  7. arXiv:1503.06388  [pdf, other

    math.ST stat.ML

    Adaptive Concentration of Regression Trees, with Application to Random Forests

    Authors: Stefan Wager, Guenther Walther

    Abstract: We study the convergence of the predictive surface of regression trees and forests. To support our analysis we introduce a notion of adaptive concentration for regression trees. This approach breaks tree training into a model selection phase in which we pick the tree splits, followed by a model fitting phase where we find the best regression model consistent with these splits. We then show that th… ▽ More

    Submitted 30 April, 2016; v1 submitted 22 March, 2015; originally announced March 2015.

  8. arXiv:1410.3853  [pdf, other

    stat.AP physics.ed-ph

    Peer assessment enhances student learning

    Authors: Dennis L. Sun, Naftali Harris, Guenther Walther, Michael Baiocchi

    Abstract: Feedback has a powerful influence on learning, but it is also expensive to provide. In large classes, it may even be impossible for instructors to provide individualized feedback. Peer assessment has received attention lately as a way of providing personalized feedback that scales to large classes. Besides these obvious benefits, some researchers have also conjectured that students learn by peer a… ▽ More

    Submitted 14 October, 2014; originally announced October 2014.

  9. arXiv:1211.2859  [pdf, ps, other

    stat.ME

    Optimal detection of a jump in the intensity of a Poisson process or in a density with likelihood ratio statistics

    Authors: Camilo Rivera, Guenther Walther

    Abstract: We consider the problem of detecting a `bump' in the intensity of a Poisson process or in a density. We analyze two types of likelihood ratio based statistics which allow for exact finite sample inference and asymptotically optimal detection: The maximum of the penalized square root of log likelihood ratios (`penalized scan') evaluated over a certain sparse set of intervals, and a certain average… ▽ More

    Submitted 25 February, 2014; v1 submitted 12 November, 2012; originally announced November 2012.

    Journal ref: Scandinavian Journal of Statistics 40 (2013), 752-769

  10. arXiv:1111.0328  [pdf, other

    stat.ME

    The Average Likelihood Ratio for Large-scale Multiple Testing and Detecting Sparse Mixtures

    Authors: Guenther Walther

    Abstract: Large-scale multiple testing problems require the simultaneous assessment of many p-values. This paper compares several methods to assess the evidence in multiple binomial counts of p-values: the maximum of the binomial counts after standardization (the `higher-criticism statistic'), the maximum of the binomial counts after a log-likelihood ratio transformation (the `Berk-Jones statistic'), and a… ▽ More

    Submitted 1 November, 2011; originally announced November 2011.

    Journal ref: From Probability to Statistics and Back: High-Dimensional Models and Processes - A Festschrift in Honor of Jon A. Wellner. M. Bannerjee, F. Bunea, J. Huang, V. Koltchinskii, M.H. Maathuis (eds.), Inst. Math. Statistics (2013), 317-326

  11. arXiv:1107.4344  [pdf, other

    stat.ME

    Detection with the scan and the average likelihood ratio

    Authors: Hock Peng Chan, Guenther Walther

    Abstract: We investigate the performance of the scan (maximum likelihood ratio statistic) and of the average likelihood ratio statistic in the problem of detecting a deterministic signal with unknown spatial extent in the prototypical univariate sampled data model with white Gaussian noise. Our results show that the scan statistic, a popular tool for detection problems, is optimal only for the detection of… ▽ More

    Submitted 25 February, 2014; v1 submitted 21 July, 2011; originally announced July 2011.

    Journal ref: Statistica Sinica 23 (2013), 409-428

  12. Inference and Modeling with Log-concave Distributions

    Authors: Guenther Walther

    Abstract: Log-concave distributions are an attractive choice for modeling and inference, for several reasons: The class of log-concave distributions contains most of the commonly used parametric distributions and thus is a rich and flexible nonparametric class of distributions. Further, the MLE exists and can be computed with readily available algorithms. Thus, no tuning parameter, such as a bandwidth, is n… ▽ More

    Submitted 2 October, 2010; originally announced October 2010.

    Comments: Published in at http://dx.doi.org/10.1214/09-STS303 the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-STS-STS303

    Journal ref: Statistical Science 2009, Vol. 24, No. 3, 319-327