Skip to main content

Showing 1–31 of 31 results for author: Ginsbourger, D

Searching in archive stat. Search in all archives.
.
  1. arXiv:2311.12909  [pdf, other

    stat.ML cs.LG

    Non-Sequential Ensemble Kalman Filtering using Distributed Arrays

    Authors: Cédric Travelletti, Jörg Franke, David Ginsbourger, Stefan Brönnimann

    Abstract: This work introduces a new, distributed implementation of the Ensemble Kalman Filter (EnKF) that allows for non-sequential assimilation of large datasets in high-dimensional problems. The traditional EnKF algorithm is computationally intensive and exhibits difficulties in applications requiring interaction with the background covariance matrix, prompting the use of methods like sequential assimila… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  2. arXiv:2310.07315  [pdf, ps, other

    math.ST math.PR stat.ML

    Consistency of some sequential experimental design strategies for excursion set estimation based on vector-valued Gaussian processes

    Authors: Philip Stange, David Ginsbourger

    Abstract: We tackle the extension to the vector-valued case of consistency results for Stepwise Uncertainty Reduction sequential experimental design strategies established in [Bect et al., A supermartingale approach to Gaussian process based sequential design of experiments, Bernoulli 25, 2019]. This lead us in the first place to clarify, assuming a compact index set, how the connection between continuous G… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

  3. arXiv:2310.04082  [pdf, other

    stat.ME

    An energy-based model approach to rare event probability estimation

    Authors: Lea Friedli, David Ginsbourger, Arnaud Doucet, Niklas Linde

    Abstract: The estimation of rare event probabilities plays a pivotal role in diverse fields. Our aim is to determine the probability of a hazard or system failure occurring when a quantity of interest exceeds a critical value. In our approach, the distribution of the quantity of interest is represented by an energy density, characterized by a free energy function. To efficiently estimate the free energy, a… ▽ More

    Submitted 6 October, 2023; originally announced October 2023.

  4. arXiv:2307.05846  [pdf, other

    stat.ME stat.AP

    Assessing the calibration of multivariate probabilistic forecasts

    Authors: Sam Allen, Johanna Ziegel, David Ginsbourger

    Abstract: Rank and PIT histograms are established tools to assess the calibration of probabilistic forecasts. They not only check whether an ensemble forecast is calibrated, but they also reveal what systematic biases (if any) are present in the forecasts. Several extensions of rank histograms have been proposed to evaluate the calibration of probabilistic forecasts for multivariate outcomes. These extensio… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

  5. arXiv:2206.07588  [pdf, ps, other

    stat.ML cs.LG math.FA math.ST

    Characteristic kernels on Hilbert spaces, Banach spaces, and on sets of measures

    Authors: Johanna Ziegel, David Ginsbourger, Lutz Dümbgen

    Abstract: We present new classes of positive definite kernels on non-standard spaces that are integrally strictly positive definite or characteristic. In particular, we discuss radial kernels on separable Hilbert spaces, and introduce broad classes of kernels on Banach spaces and on metric spaces of strong negative type. The general results are used to give explicit classes of kernels on separable $L^p$ spa… ▽ More

    Submitted 15 June, 2022; originally announced June 2022.

  6. arXiv:2202.12732  [pdf, other

    stat.ME

    Evaluating forecasts for high-impact events using transformed kernel scores

    Authors: Sam Allen, David Ginsbourger, Johanna Ziegel

    Abstract: It is informative to evaluate a forecaster's ability to predict outcomes that have a large impact on the forecast user. Although weighted scoring rules have become a well-established tool to achieve this, such scores have been studied almost exclusively in the univariate case, with interest typically placed on extreme events. However, a large impact may also result from events not considered to be… ▽ More

    Submitted 25 February, 2022; originally announced February 2022.

  7. arXiv:2110.05210  [pdf, other

    physics.geo-ph stat.AP

    Lithological Tomography with the Correlated Pseudo-Marginal Method

    Authors: Lea Friedli, Niklas Linde, David Ginsbourger, Arnaud Doucet

    Abstract: We consider lithological tomography in which the posterior distribution of (hydro)geological parameters of interest is inferred from geophysical data by treating the intermediate geophysical properties as latent variables. In such a latent variable model, one needs to estimate the intractable likelihood of the (hydro)geological parameters given the geophysical data. The pseudo-marginal method is a… ▽ More

    Submitted 11 October, 2021; originally announced October 2021.

    Journal ref: Geophysical Journal International, Volume 228, Issue 2, February 2022

  8. arXiv:2109.03457  [pdf, other

    stat.ML cs.LG stat.AP stat.CO stat.ME

    Uncertainty Quantification and Experimental Design for Large-Scale Linear Inverse Problems under Gaussian Process Priors

    Authors: Cédric Travelletti, David Ginsbourger, Niklas Linde

    Abstract: We consider the use of Gaussian process (GP) priors for solving inverse problems in a Bayesian framework. As is well known, the computational complexity of GPs scales cubically in the number of datapoints. We here show that in the context of inverse problems involving integral operators, one faces additional difficulties that hinder inversion on large grids. Furthermore, in that context, covarianc… ▽ More

    Submitted 31 August, 2022; v1 submitted 8 September, 2021; originally announced September 2021.

    MSC Class: 86A22; 60G15; 62F15; 62L05

  9. arXiv:2104.08156  [pdf, other

    stat.ME cs.AI cs.LG stat.ML

    Fast ABC with joint generative modelling and subset simulation

    Authors: Eliane Maalouf, David Ginsbourger, Niklas Linde

    Abstract: We propose a novel approach for solving inverse-problems with high-dimensional inputs and an expensive forward map**. It leverages joint deep generative modelling to transfer the original problem spaces to a lower dimensional latent space. By jointly modelling input and output variables and endowing the latent with a prior distribution, the fitted probabilistic model indirectly gives access to t… ▽ More

    Submitted 16 April, 2021; originally announced April 2021.

    Comments: 13 pages, 6 figures

  10. arXiv:2102.07612  [pdf, other

    stat.ME math.OC stat.AP stat.CO stat.ML

    Goal-oriented adaptive sampling under random field modelling of response probability distributions

    Authors: Athénaïs Gautier, David Ginsbourger, Guillaume Pirot

    Abstract: In the study of natural and artificial complex systems, responses that are not completely determined by the considered decision variables are commonly modelled probabilistically, resulting in response distributions varying across decision space. We consider cases where the spatial variation of these response distributions does not only concern their mean and/or variance but also other features inc… ▽ More

    Submitted 17 March, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

  11. arXiv:2101.03108  [pdf, other

    stat.ME stat.CO stat.ML

    Fast calculation of Gaussian Process multiple-fold cross-validation residuals and their covariances

    Authors: David Ginsbourger, Cedric Schärer

    Abstract: We generalize fast Gaussian process leave-one-out formulae to multiple-fold cross-validation, highlighting in turn the covariance structure of cross-validation residuals in both Simple and Universal Kriging frameworks. We illustrate how resulting covariances affect model diagnostics. We further establish in the case of noiseless observations that correcting for covariances between residuals in cro… ▽ More

    Submitted 3 June, 2023; v1 submitted 8 January, 2021; originally announced January 2021.

  12. arXiv:2007.03722  [pdf, other

    stat.AP cs.LG stat.CO stat.ML

    Learning excursion sets of vector-valued Gaussian random fields for autonomous ocean sampling

    Authors: Trygve Olav Fossum, Cédric Travelletti, Jo Eidsvik, David Ginsbourger, Kanna Rajan

    Abstract: Improving and optimizing oceanographic sampling is a crucial task for marine science and maritime resource management. Faced with limited resources in understanding processes in the water-column, the combination of statistics and autonomous systems provide new opportunities for experimental design. In this work we develop efficient spatial sampling methods for characterizing regions defined by sim… ▽ More

    Submitted 18 August, 2020; v1 submitted 7 July, 2020; originally announced July 2020.

  13. arXiv:1912.11827  [pdf, other

    stat.AP stat.ME

    Area-covering postprocessing of ensemble precipitation forecasts using topographical and seasonal conditions

    Authors: Lea Friedli, David Ginsbourger, Jonas Bhend

    Abstract: Probabilistic weather forecasts from ensemble systems require statistical postprocessing to yield calibrated and sharp predictive distributions. This paper presents an area-covering postprocessing method for ensemble precipitation predictions. We rely on the ensemble model output statistics (EMOS) approach, which generates probabilistic forecasts with a parametric distribution whose parameters dep… ▽ More

    Submitted 12 October, 2020; v1 submitted 26 December, 2019; originally announced December 2019.

  14. arXiv:1910.04086  [pdf, other

    stat.ML cs.LG math.ST stat.AP stat.ME

    Kernels over Sets of Finite Sets using RKHS Embeddings, with Application to Bayesian (Combinatorial) Optimization

    Authors: Poompol Buathong, David Ginsbourger, Tipaluck Krityakierne

    Abstract: We focus on kernel methods for set-valued inputs and their application to Bayesian set optimization, notably combinatorial optimization. We investigate two classes of set kernels that both rely on Reproducing Kernel Hilbert Space embeddings, namely the ``Double Sum'' (DS) kernels recently considered in Bayesian set optimization, and a class introduced here called ``Deep Embedding'' (DE) kernels th… ▽ More

    Submitted 10 March, 2020; v1 submitted 9 October, 2019; originally announced October 2019.

  15. arXiv:1805.00753  [pdf, other

    stat.ME

    Gaussian processes with multidimensional distribution inputs via optimal transport and Hilbertian embedding

    Authors: Francois Bachoc, Alexandra Suvorikova, David Ginsbourger, Jean-Michel Loubes, Vladimir Spokoiny

    Abstract: In this work, we investigate Gaussian Processes indexed by multidimensional distributions. While directly constructing radial positive definite kernels based on the Wasserstein distance has been proven to be possible in the unidimensional case, such constructions do not extend well to the multidimensional case as we illustrate here. To tackle the problem of defining positive definite kernels betwe… ▽ More

    Submitted 11 April, 2019; v1 submitted 2 May, 2018; originally announced May 2018.

  16. arXiv:1711.01878  [pdf, other

    stat.ME

    Modeling non-stationary extreme dependence with stationary max-stable processes and multidimensional scaling

    Authors: Clément Chevalier, David Ginsbourger, Olivia Martius

    Abstract: Modeling the joint distribution of extreme weather events in multiple locations is a challenging task with important applications. In this study, we use max-stable models to study extreme daily precipitation events in Switzerland. The non-stationarity of the spatial process at hand involves important challenges, which are often dealt with by using a stationary model in a so-called climate space, w… ▽ More

    Submitted 28 November, 2018; v1 submitted 6 November, 2017; originally announced November 2017.

  17. Profile extrema for visualizing and quantifying uncertainties on excursion regions. Application to coastal flooding

    Authors: Dario Azzimonti, David Ginsbourger, Jérémy Rohmer, Déborah Idier

    Abstract: We consider the problem of describing excursion sets of a real-valued function $f$, i.e. the set of inputs where $f$ is above a fixed threshold. Such regions are hard to visualize if the input space dimension, $d$, is higher than 2. For a given projection matrix from the input space to a lower dimensional (usually $1,2$) subspace, we introduce profile sup (inf) functions that associate to each poi… ▽ More

    Submitted 3 December, 2018; v1 submitted 2 October, 2017; originally announced October 2017.

    Journal ref: Technometrics, 2019

  18. arXiv:1704.05318  [pdf, other

    math.OC stat.ME stat.ML

    On the choice of the low-dimensional domain for global optimization via random embeddings

    Authors: Mickaël Binois, David Ginsbourger, Olivier Roustant

    Abstract: The challenge of taking many variables into account in optimization problems may be overcome under the hypothesis of low effective dimensionality. Then, the search of solutions can be reduced to the random embedding of a low dimensional space into the original one, resulting in a more manageable optimization problem. Specifically, in the case of time consuming black-box functions and when the budg… ▽ More

    Submitted 22 October, 2018; v1 submitted 18 April, 2017; originally announced April 2017.

  19. arXiv:1611.07256  [pdf, other

    stat.ME math.ST stat.ML

    Adaptive Design of Experiments for Conservative Estimation of Excursion Sets

    Authors: Dario Azzimonti, David Ginsbourger, Clément Chevalier, Julien Bect, Yann Richet

    Abstract: We consider the problem of estimating the set of all inputs that leads a system to some particular behavior. The system is modeled by an expensive-to-evaluate function, such as a computer experiment, and we are interested in its excursion set, i.e. the set of points where the function takes values above or below some prescribed threshold. The objective function is emulated with a Gaussian Process… ▽ More

    Submitted 4 February, 2020; v1 submitted 22 November, 2016; originally announced November 2016.

    Journal ref: Technometrics, 63(1):13-26, 2021

  20. arXiv:1609.02700  [pdf, ps, other

    stat.ML stat.AP stat.ME

    Efficient batch-sequential Bayesian optimization with moments of truncated Gaussian vectors

    Authors: Sébastien Marmin, Clément Chevalier, David Ginsbourger

    Abstract: We deal with the efficient parallelization of Bayesian global optimization algorithms, and more specifically of those based on the expected improvement criterion and its variants. A closed form formula relying on multivariate Gaussian cumulative distribution functions is established for a generalized version of the multipoint expected improvement criterion. In turn, the latter relies on intermedia… ▽ More

    Submitted 9 September, 2016; originally announced September 2016.

  21. arXiv:1608.01118  [pdf, ps, other

    stat.ML math.PR math.ST

    A supermartingale approach to Gaussian process based sequential design of experiments

    Authors: Julien Bect, François Bachoc, David Ginsbourger

    Abstract: Gaussian process (GP) models have become a well-established frameworkfor the adaptive design of costly experiments, and notably of computerexperiments. GP-based sequential designs have been found practicallyefficient for various objectives, such as global optimization(estimating the global maximum or maximizer(s) of a function),reliability analysis (estimating a probability of failure) or theesti… ▽ More

    Submitted 30 August, 2018; v1 submitted 3 August, 2016; originally announced August 2016.

  22. Estimating orthant probabilities of high dimensional Gaussian vectors with an application to set estimation

    Authors: Dario Azzimonti, David Ginsbourger

    Abstract: The computation of Gaussian orthant probabilities has been extensively studied for low-dimensional vectors. Here, we focus on the high-dimensional case and we present a two-step procedure relying on both deterministic and stochastic techniques. The proposed estimator relies indeed on splitting the probability into a low-dimensional term and a remainder. While the low-dimensional probability can be… ▽ More

    Submitted 30 November, 2018; v1 submitted 16 March, 2016; originally announced March 2016.

    Journal ref: Journal of Computational and Graphical Statistics, Taylor \& Francis, 2018, 27 (2), pp.255-267

  23. arXiv:1503.05509  [pdf, ps, other

    stat.ML math.ST

    Differentiating the multipoint Expected Improvement for optimal batch design

    Authors: Sébastien Marmin, Clément Chevalier, David Ginsbourger

    Abstract: This work deals with parallel optimization of expensive objective functions which are modeled as sample realizations of Gaussian processes. The study is formalized as a Bayesian optimization problem, or continuous multi-armed bandit problem, where a batch of q > 0 arms is pulled in parallel at each iteration. Several algorithms have been developed for choosing batches by trading off exploitation a… ▽ More

    Submitted 2 September, 2019; v1 submitted 18 March, 2015; originally announced March 2015.

  24. arXiv:1501.03659  [pdf, ps, other

    math.ST stat.CO stat.ML

    Quantifying uncertainties on excursion sets under a Gaussian random field prior

    Authors: Dario Azzimonti, Julien Bect, Clément Chevalier, David Ginsbourger

    Abstract: We focus on the problem of estimating and quantifying uncertainties on the excursion set of a function under a limited evaluation budget. We adopt a Bayesian approach where the objective function is assumed to be a realization of a Gaussian random field. In this setting, the posterior distribution on the objective function gives rise to a posterior distribution on excursion sets. Several approache… ▽ More

    Submitted 13 April, 2016; v1 submitted 15 January, 2015; originally announced January 2015.

    Journal ref: SIAM/ASA Journal on Uncertainty Quantification, 4(1):850-874, 2016

  25. arXiv:1411.3685  [pdf, other

    math.OC stat.ML

    A warped kernel improving robustness in Bayesian optimization via random embeddings

    Authors: Mickaël Binois, David Ginsbourger, Olivier Roustant

    Abstract: This works extends the Random Embedding Bayesian Optimization approach by integrating a war** of the high dimensional subspace within the covariance kernel. The proposed war**, that relies on elementary geometric considerations, allows mitigating the drawbacks of the high extrinsic dimensionality while avoiding the algorithm to evaluate points giving redundant information. It also alleviates c… ▽ More

    Submitted 18 March, 2015; v1 submitted 13 November, 2014; originally announced November 2014.

  26. arXiv:1308.1359  [pdf, other

    math.ST math.PR stat.ME stat.ML

    Invariances of random fields paths, with applications in Gaussian Process Regression

    Authors: David Ginsbourger, Olivier Roustant, Nicolas Durrande

    Abstract: We study pathwise invariances of centred random fields that can be controlled through the covariance. A result involving composition operators is obtained in second-order settings, and we show that various path properties including additivity boil down to invariances of the covariance kernel. These results are extended to a broader class of operators in the Gaussian case, via the Loève isometry. S… ▽ More

    Submitted 6 August, 2013; originally announced August 2013.

  27. arXiv:1203.6452  [pdf, ps, other

    stat.ML stat.CO

    Corrected Kriging update formulae for batch-sequential data assimilation

    Authors: Clément Chevalier, David Ginsbourger

    Abstract: Recently, a lot of effort has been paid to the efficient computation of Kriging predictors when observations are assimilated sequentially. In particular, Kriging update formulae enabling significant computational savings were derived in Barnes and Watson (1992), Gao et al. (1996), and Emery (2009). Taking advantage of the previous Kriging mean and variance calculations helps avoiding a costly… ▽ More

    Submitted 29 March, 2012; originally announced March 2012.

  28. arXiv:1111.6233  [pdf, ps, other

    stat.ML

    Additive Covariance Kernels for High-Dimensional Gaussian Process Modeling

    Authors: Nicolas Durrande, David Ginsbourger, Olivier Roustant, Laurent Carraro

    Abstract: Gaussian process models -also called Kriging models- are often used as mathematical approximations of expensive experiments. However, the number of observation required for building an emulator becomes unrealistic when using classical covariance kernels when the dimension of input increases. In oder to get round the curse of dimensionality, a popular approach is to consider simplified models such… ▽ More

    Submitted 27 November, 2011; originally announced November 2011.

    Comments: arXiv admin note: substantial text overlap with arXiv:1103.4023

    Journal ref: Annales de la Faculté de Sciences de Toulouse Tome 21, numéro 3 (2012) p. 481-499

  29. arXiv:1106.3571  [pdf, ps, other

    stat.ML

    ANOVA kernels and RKHS of zero mean functions for model-based sensitivity analysis

    Authors: Nicolas Durrande, David Ginsbourger, Olivier Roustant, Laurent Carraro

    Abstract: Given a reproducing kernel Hilbert space H of real-valued functions and a suitable measure mu over the source space D (subset of R), we decompose H as the sum of a subspace of centered functions for mu and its orthogonal in H. This decomposition leads to a special case of ANOVA kernels, for which the functional ANOVA representation of the best predictor can be elegantly derived, either in an inter… ▽ More

    Submitted 7 December, 2012; v1 submitted 17 June, 2011; originally announced June 2011.

    Journal ref: Journal of Multivariate Analysis 115 (2013) 57-67

  30. arXiv:1103.4023  [pdf, ps, other

    stat.ML

    Additive Kernels for Gaussian Process Modeling

    Authors: Nicolas Durrande, David Ginsbourger, Olivier Roustant

    Abstract: Gaussian Process (GP) models are often used as mathematical approximations of computationally expensive experiments. Provided that its kernel is suitably chosen and that enough data is available to obtain a reasonable fit of the simulator, a GP model can beneficially be used for tasks such as prediction, optimization, or Monte-Carlo-based quantification of uncertainty. However, the former conditio… ▽ More

    Submitted 21 March, 2011; originally announced March 2011.

  31. Sequential design of computer experiments for the estimation of a probability of failure

    Authors: Julien Bect, David Ginsbourger, Ling Li, Victor Picheny, Emmanuel Vazquez

    Abstract: This paper deals with the problem of estimating the volume of the excursion set of a function $f:\mathbb{R}^d \to \mathbb{R}$ above a given threshold, under a probability measure on $\mathbb{R}^d$ that is assumed to be known. In the industrial world, this corresponds to the problem of estimating a probability of failure of a system. When only an expensive-to-simulate model of the system is availab… ▽ More

    Submitted 24 April, 2012; v1 submitted 27 September, 2010; originally announced September 2010.

    Comments: This is an author-generated postprint version. The published version is available at http://www.springerlink.com

    MSC Class: 62L05; 62C10; 62P30

    Journal ref: Statistics and Computing, 22(3):773-793, 2012