Showing 1–12 of 12 results for author: Hammerling, D

Searching in archive stat.
  1. arXiv:2202.02616  [pdf, other]

    stat.CO cs.CV

    DSSIM: a structural similarity index for floating-point data

    Authors: Allison H. Baker, Alexander Pinard, Dorit M. Hammerling

    Abstract: Data visualization is a critical component in terms of interacting with floating-point output data from large model simulation codes. Indeed, postprocessing analysis workflows on simulation data often generate a large number of images from the raw data, many of which are then compared to each other or to specified reference images. In this image-comparison scenario, image quality assessment (IQA)…

    Submitted 19 March, 2023; v1 submitted 5 February, 2022; originally announced February 2022.

  2. arXiv:2111.13428  [pdf, other]

    stat.AP

    Nonstationary Spatial Modeling of Massive Global Satellite Data

    Authors: Huang Huang, Lewis R. Blake, Matthias Katzfuss, Dorit M. Hammerling

    Abstract: Earth-observing satellite instruments obtain a massive number of observations every day. For example, tens of millions of sea surface temperature (SST) observations on a global scale are collected daily by the Moderate Resolution Imaging Spectroradiometer (MODIS) instrument. Despite their size, such datasets are incomplete and noisy, necessitating spatial statistical inference to obtain complete,…

    Submitted 26 November, 2021; originally announced November 2021.

  3. arXiv:2101.02404  [pdf, other]

    stat.ME cs.LG stat.ML

    Modeling massive highly-multivariate nonstationary spatial data with the basis graphical lasso

    Authors: Mitchell Krock, William Kleiber, Dorit Hammerling, Stephen Becker

    Abstract: We propose a new modeling framework for highly-multivariate spatial processes that synthesizes ideas from recent multiscale and spectral approaches with graphical models. The basis graphical lasso writes a univariate Gaussian process as a linear combination of basis functions weighted with entries of a Gaussian graphical vector whose graph is estimated from optimizing an $\ell_1$ penalized likelih…

    Submitted 9 June, 2021; v1 submitted 7 January, 2021; originally announced January 2021.

    Journal ref: Journal of Computational and Graphical Statistics, 32(4), 1472-1487 (2023)

  4. arXiv:2010.04051  [pdf, other]

    stat.AP stat.ML

    HECT: High-Dimensional Ensemble Consistency Testing for Climate Models

    Authors: Niccolò Dalmasso, Galen Vincent, Dorit Hammerling, Ann B. Lee

    Abstract: Climate models play a crucial role in understanding the effect of environmental and man-made changes on climate to help mitigate climate risks and inform governmental decisions. Large global climate models such as the Community Earth System Model (CESM), developed by the National Center for Atmospheric Research, are very complex with millions of lines of code describing interactions of the atmosph…

    Submitted 30 November, 2020; v1 submitted 8 October, 2020; originally announced October 2020.

    Comments: Accepted at the Tackling Climate Change with Machine Learning workshop at NeurIPS 2020, 6 pages, 1 figure

  5. arXiv:2001.00074  [pdf, other]

    stat.AP

    Combining interdependent climate model outputs in CMIP5: A spatial Bayesian approach

    Authors: Huang Huang, Dorit Hammerling, Bo Li, Richard Smith

    Abstract: Projections of future climate change rely heavily on climate models, and combining climate models through a multi-model ensemble is both more accurate than a single climate model and valuable for uncertainty quantification. However, Bayesian approaches to multi-model ensembles have been criticized for making oversimplified assumptions about bias and variability, as well as treating different model…

    Submitted 26 February, 2020; v1 submitted 31 December, 2019; originally announced January 2020.

  6. Unlocking GOES: A Statistical Framework for Quantifying the Evolution of Convective Structure in Tropical Cyclones

    Authors: Trey McNeely, Ann B. Lee, Kimberly M. Wood, Dorit Hammerling

    Abstract: Tropical cyclones (TCs) rank among the most costly natural disasters in the United States, and accurate forecasts of track and intensity are critical for emergency response. Intensity guidance has improved steadily but slowly, as processes which drive intensity change are not fully understood. Because most TCs develop far from land-based observing networks, geostationary satellite imagery is criti…

    Submitted 3 August, 2020; v1 submitted 25 November, 2019; originally announced November 2019.

    Comments: 19 pages, 14 figures, Submitted to the Journal of Applied Meteorology and Climatology

    Journal ref: Journal of Applied Meteorology and Climatology 59.10 (2020): 1671-1689

  7. Pushing the Limit: A Hybrid Parallel Implementation of the Multi-resolution Approximation for Massive Data

    Authors: Huang Huang, Lewis R. Blake, Dorit M. Hammerling

    Abstract: The multi-resolution approximation (MRA) of Gaussian processes was recently proposed to conduct likelihood-based inference for massive spatial data sets. An advantage of the methodology is that it can be parallelized. We implemented the MRA in C++ for both serial and parallel versions. In the parallel implementation, we use a hybrid parallelism that employs both distributed and shared memory compu…

    Submitted 5 May, 2019; v1 submitted 30 April, 2019; originally announced May 2019.

  8. arXiv:1806.11388  [pdf, other]

    stat.CO

    Marginally Parametrized Spatio-Temporal Models and Stepwise Maximum Likelihood Estimation

    Authors: Matthew Edwards, Stefano Castruccio, Dorit Hammerling

    Abstract: In order to learn the complex features of large spatio-temporal data, models with large parameter sets are often required. However, estimating a large number of parameters is often infeasible due to the computational and memory costs of maximum likelihood estimation (MLE). We introduce the class of marginally parametrized (MP) models, where inference can be performed efficiently with a sequence of…

    Submitted 29 June, 2018; originally announced June 2018.

  9. arXiv:1711.08077  [pdf, other]

    stat.ME

    Modeling and emulation of nonstationary Gaussian fields

    Authors: Douglas Nychka, Dorit Hammerling, Mitchell Krock, Ashton Wiens

    Abstract: Geophysical and other natural processes often exhibit non-stationary covariances and this feature is important to take into account for statistical models that attempt to emulate the physical process. A convolution-based model is used to represent non-stationary Gaussian processes that allows for variation in the correlation range and variance of the process across space. Application of this mod…

    Submitted 21 November, 2017; originally announced November 2017.

    Comments: 32 pages total, 10 figures

  10. arXiv:1710.05013  [pdf, other]

    stat.ME

    A Case Study Competition Among Methods for Analyzing Large Spatial Data

    Authors: Matthew J. Heaton, Abhirup Datta, Andrew Finley, Reinhard Furrer, Rajarshi Guhaniyogi, Florian Gerber, Robert B. Gramacy, Dorit Hammerling, Matthias Katzfuss, Finn Lindgren, Douglas W. Nychka, Furong Sun, Andrew Zammit-Mangion

    Abstract: The Gaussian process is an indispensable tool for spatial data analysts. The onset of the "big data" era, however, has led to the traditional Gaussian process being computationally infeasible for modern spatial data. As such, various alternatives to the full Gaussian process that are more amenable to handling big spatial data have been proposed. These modern methods often exploit low rank structu…

    Submitted 25 April, 2018; v1 submitted 13 October, 2017; originally announced October 2017.

  11. Compression and Conditional Emulation of Climate Model Output

    Authors: Joseph Guinness, Dorit Hammerling

    Abstract: Numerical climate model simulations run at high spatial and temporal resolutions generate massive quantities of data. As our computing capabilities continue to increase, storing all of the data is not sustainable, and thus it is important to develop methods for representing the full datasets by smaller compressed versions. We propose a statistical compression and decompression algorithm based on s…

    Submitted 19 February, 2018; v1 submitted 25 May, 2016; originally announced May 2016.

  12. Parallel inference for massive distributed spatial data using low-rank models

    Authors: Matthias Katzfuss, Dorit Hammerling

    Abstract: Due to rapid data growth, statistical analysis of massive datasets often has to be carried out in a distributed fashion, either because several datasets stored in separate physical locations are all relevant to a given problem, or simply to achieve faster (parallel) computation through a divide-and-conquer scheme. In both cases, the challenge is to obtain valid inference that does not require proc…

    Submitted 5 February, 2016; v1 submitted 6 February, 2014; originally announced February 2014.

    Comments: 20 pages; published in Statistics and Computing