Search | arXiv e-print repository

Stochastic full waveform inversion with deep generative prior for uncertainty quantification

Authors: Yuke Xie, Hervé Chauris, Nicolas Desassis

Abstract: To obtain high-resolution images of subsurface structures from seismic data, seismic imaging techniques such as Full Waveform Inversion (FWI) serve as crucial tools. However, FWI involves solving a nonlinear and often non-unique inverse problem, presenting challenges such as local minima trap** and inadequate handling of inherent uncertainties. In addressing these challenges, we propose leveragi… ▽ More To obtain high-resolution images of subsurface structures from seismic data, seismic imaging techniques such as Full Waveform Inversion (FWI) serve as crucial tools. However, FWI involves solving a nonlinear and often non-unique inverse problem, presenting challenges such as local minima trap** and inadequate handling of inherent uncertainties. In addressing these challenges, we propose leveraging deep generative models as the prior distribution of geophysical parameters for stochastic Bayesian inversion. This approach integrates the adjoint state gradient for efficient back-propagation from the numerical solution of partial differential equations. Additionally, we introduce explicit and implicit variational Bayesian inference methods. The explicit method computes variational distribution density using a normalizing flow-based neural network, enabling computation of the Bayesian posterior of parameters. Conversely, the implicit method employs an inference network attached to a pretrained generative model to estimate density, incorporating an entropy estimator. Furthermore, we also experimented with the Stein Variational Gradient Descent (SVGD) method as another variational inference technique, using particles. We compare these variational Bayesian inference methods with conventional Markov chain Monte Carlo (McMC) sampling. Each method is able to quantify uncertainties and to generate seismic data-conditioned realizations of subsurface geophysical parameters. This framework provides insights into subsurface structures while accounting for inherent uncertainties. △ Less

Submitted 7 June, 2024; originally announced June 2024.

arXiv:2402.01855 [pdf, other]

SPDE priors for uncertainty quantification of end-to-end neural data assimilation schemes

Authors: Maxime Beauchamp, Nicolas Desassis, J. Emmanuel Johnson, Simon Benaichouche, Pierre Tandeo, Ronan Fablet

Abstract: The spatio-temporal interpolation of large geophysical datasets has historically been adressed by Optimal Interpolation (OI) and more sophisticated model-based or data-driven DA techniques. In the last ten years, the link established between Stochastic Partial Differential Equations (SPDE) and Gaussian Markov Random Fields (GMRF) opened a new way of handling both large datasets and physically-indu… ▽ More The spatio-temporal interpolation of large geophysical datasets has historically been adressed by Optimal Interpolation (OI) and more sophisticated model-based or data-driven DA techniques. In the last ten years, the link established between Stochastic Partial Differential Equations (SPDE) and Gaussian Markov Random Fields (GMRF) opened a new way of handling both large datasets and physically-induced covariance matrix in Optimal Interpolation. Recent advances in the deep learning community also enables to adress this problem as neural architecture embedding data assimilation variational framework. The reconstruction task is seen as a joint learning problem of the prior involved in the variational inner cost and the gradient-based minimization of the latter: both prior models and solvers are stated as neural networks with automatic differentiation which can be trained by minimizing a loss function, typically stated as the mean squared error between some ground truth and the reconstruction. In this work, we draw from the SPDE-based Gaussian Processes to estimate complex prior models able to handle non-stationary covariances in both space and time and provide a stochastic framework for interpretability and uncertainty quantification. Our neural variational scheme is modified to embed an augmented state formulation with both state and SPDE parametrization to estimate. Instead of a neural prior, we use a stochastic PDE as surrogate model along the data assimilation window. The training involves a loss function for both reconstruction task and SPDE prior model, where the likelihood of the SPDE parameters given the true states is involved in the training. Because the prior is stochastic, we can easily draw samples in the prior distribution before conditioning to provide a flexible way to estimate the posterior distribution based on thousands of members. △ Less

Submitted 2 February, 2024; originally announced February 2024.

arXiv:2305.13318 [pdf, other]

A stable deep adversarial learning approach for geological facies generation

Authors: Ferdinand Bhavsar, Nicolas Desassis, Fabien Ors, Thomas Romary

Abstract: The simulation of geological facies in an unobservable volume is essential in various geoscience applications. Given the complexity of the problem, deep generative learning is a promising approach to overcome the limitations of traditional geostatistical simulation models, in particular their lack of physical realism. This research aims to investigate the application of generative adversarial netw… ▽ More The simulation of geological facies in an unobservable volume is essential in various geoscience applications. Given the complexity of the problem, deep generative learning is a promising approach to overcome the limitations of traditional geostatistical simulation models, in particular their lack of physical realism. This research aims to investigate the application of generative adversarial networks and deep variational inference for conditionally simulating meandering channels in underground volumes. In this paper, we review the generative deep learning approaches, in particular the adversarial ones and the stabilization techniques that aim to facilitate their training. The proposed approach is tested on 2D and 3D simulations generated by the stochastic process-based model Flumy. Morphological metrics are utilized to compare our proposed method with earlier iterations of generative adversarial networks. The results indicate that by utilizing recent stabilization techniques, generative adversarial networks can efficiently sample from target data distributions. Moreover, we demonstrate the ability to simulate conditioned simulations through the latent variable model property of the proposed approach. △ Less

Submitted 4 March, 2024; v1 submitted 12 May, 2023; originally announced May 2023.

arXiv:2208.14015 [pdf, other]

The SPDE approach for spatio-temporal datasets with advection and diffusion

Authors: Lucia Clarotto, Denis Allard, Thomas Romary, Nicolas Desassis

Abstract: In the task of predicting spatio-temporal fields in environmental science using statistical methods, introducing statistical models inspired by the physics of the underlying phenomena that are numerically efficient is of growing interest. Large space-time datasets call for new numerical methods to efficiently process them. The Stochastic Partial Differential Equation (SPDE) approach has proven to… ▽ More In the task of predicting spatio-temporal fields in environmental science using statistical methods, introducing statistical models inspired by the physics of the underlying phenomena that are numerically efficient is of growing interest. Large space-time datasets call for new numerical methods to efficiently process them. The Stochastic Partial Differential Equation (SPDE) approach has proven to be effective for the estimation and the prediction in a spatial context. We present here the advection-diffusion SPDE with first order derivative in time which defines a large class of nonseparable spatio-temporal models. A Gaussian Markov random field approximation of the solution to the SPDE is built by discretizing the temporal derivative with a finite difference method (implicit Euler) and by solving the spatial SPDE with a finite element method (continuous Galerkin) at each time step. The ''Streamline Diffusion'' stabilization technique is introduced when the advection term dominates the diffusion. Computationally efficient methods are proposed to estimate the parameters of the SPDE and to predict the spatio-temporal field by kriging, as well as to perform conditional simulations. The approach is applied to a solar radiation dataset. Its advantages and limitations are discussed. △ Less

Submitted 24 March, 2023; v1 submitted 30 August, 2022; originally announced August 2022.

arXiv:2208.12501 [pdf, other]

Geostatistics for large datasets on Riemannian manifolds: a matrix-free approach

Authors: Mike Pereira, Nicolas Desassis, Denis Allard

Abstract: Large or very large spatial (and spatio-temporal) datasets have become common place in many environmental and climate studies. These data are often collected in non-Euclidean spaces (such as the planet Earth) and they often present non-stationary anisotropies. This paper proposes a generic approach to model Gaussian Random Fields (GRFs) on compact Riemannian manifolds that bridges the gap between… ▽ More Large or very large spatial (and spatio-temporal) datasets have become common place in many environmental and climate studies. These data are often collected in non-Euclidean spaces (such as the planet Earth) and they often present non-stationary anisotropies. This paper proposes a generic approach to model Gaussian Random Fields (GRFs) on compact Riemannian manifolds that bridges the gap between existing works on non-stationary GRFs and random fields on manifolds. This approach can be applied to any smooth compact manifolds, and in particular to any compact surface. By defining a Riemannian metric that accounts for the preferential directions of correlation, our approach yields an interpretation of the ''local anisotropies'' as resulting from ''local'' deformations of the domain. We provide scalable algorithms for the estimation of the parameters and for optimal prediction by kriging and simulation able to tackle very large grids. Stationary and non-stationary illustrations are provided. △ Less

Submitted 6 January, 2023; v1 submitted 26 August, 2022; originally announced August 2022.

arXiv:2004.02799 [pdf, other]

A matrix-free approach to geostatistical filtering

Authors: Mike Pereira, Nicolas Desassis, Cédric Magneron, Nathan Palmer

Abstract: In this paper, we present a novel approach to geostatistical filtering which tackles two challenges encountered when applying this method to complex spatial datasets: modeling the non-stationarity of the data while still being able to work with large datasets. The approach is based on a finite element approximation of Gaussian random fields expressed as an expansion of the eigenfunctions of a Lapl… ▽ More In this paper, we present a novel approach to geostatistical filtering which tackles two challenges encountered when applying this method to complex spatial datasets: modeling the non-stationarity of the data while still being able to work with large datasets. The approach is based on a finite element approximation of Gaussian random fields expressed as an expansion of the eigenfunctions of a Laplace--Beltrami operator defined to account for local anisotropies. The numerical approximation of the resulting random fields using a finite element approach is then leveraged to solve the scalability issue through a matrix-free approach. Finally, two cases of application of this approach, on simulated and real seismic data are presented. △ Less

Submitted 6 April, 2020; originally announced April 2020.

Comments: 25 pages, 8 figures

arXiv:1811.03004 [pdf, other]

Finite element approximation of non-Markovian random fields

Authors: Mike Pereira, Nicolas Desassis

Abstract: In this paper, we present finite element approximations of a class of Generalized random fields defined over a bounded domain of R d or a smooth d-dimensional Riemannian manifold (d $\ge$ 1). An explicit expression for the covariance matrix of the weights of the finite element representation of these fields is provided and an analysis of the approximation error is carried out. Finally, a method to… ▽ More In this paper, we present finite element approximations of a class of Generalized random fields defined over a bounded domain of R d or a smooth d-dimensional Riemannian manifold (d $\ge$ 1). An explicit expression for the covariance matrix of the weights of the finite element representation of these fields is provided and an analysis of the approximation error is carried out. Finally, a method to generate simulations of these weights while limiting computational and storage costs is presented. △ Less

Submitted 6 November, 2018; originally announced November 2018.

arXiv:1806.04999 [pdf, ps, other]

A general framework for SPDE-based stationary random fields

Authors: Ricardo Carrizo Vergara, Denis Allard, Nicolas Desassis

Abstract: This paper presents theoretical advances in the application of the Stochastic Partial Differential Equation (SPDE) approach in geostatistics. We show a general approach to construct stationary models related to a wide class of linear SPDEs, with applications to spatio-temporal models having non-trivial properties. Within the framework of Generalized Random Fields, a criterion for existence and uni… ▽ More This paper presents theoretical advances in the application of the Stochastic Partial Differential Equation (SPDE) approach in geostatistics. We show a general approach to construct stationary models related to a wide class of linear SPDEs, with applications to spatio-temporal models having non-trivial properties. Within the framework of Generalized Random Fields, a criterion for existence and uniqueness of stationary solutions for this class of SPDEs is proposed and proven. Their covariance are then obtained through their spectral measure. We present a result relating the covariance in the case of a White Noise source term with that of a generic case through convolution. Then, we obtain a variety of SPDE-based stationary random fields. In particular, well-known results regarding the Matérn Model and Markovian models are recovered. A new relationship between the Stein model and a particular SPDE is obtained. New spatio-temporal models obtained from evolution SPDEs of arbitrary temporal derivative order are then obtained, for which properties of separability and symmetry can be controlled. We also obtain results concerning stationary solutions for physically inspired models, such as solutions to the heat equation, the advection-diffusion equation, some Langevin's equations and the wave equation. △ Less

Submitted 27 July, 2018; v1 submitted 13 June, 2018; originally announced June 2018.

Comments: Corrected typos and style. Corrected mistakes in references (verified the cross cite, added new references and erasing non-used ones). Reorganization of the Section 6 in order to obtain a "general to particular" exposition. Appendix E is erased, its content is now present in the corpus. Some clarifications to proofs in Appendix B are added

arXiv:1806.01558 [pdf, other]

Combining covariance tapering and lasso driven low rank decomposition for the kriging of large spatial datasets

Authors: Thomas Romary, Nicolas Desassis

Abstract: Large spatial datasets are becoming ubiquitous in environmental sciences with the explosion in the amount of data produced by sensors that monitor and measure the Earth system. Consequently, the geostatistical analysis of these data requires adequate methods. Richer datasets lead to more complex modeling but may also prevent from using classical techniques. Indeed, the kriging predictor is… ▽ More Large spatial datasets are becoming ubiquitous in environmental sciences with the explosion in the amount of data produced by sensors that monitor and measure the Earth system. Consequently, the geostatistical analysis of these data requires adequate methods. Richer datasets lead to more complex modeling but may also prevent from using classical techniques. Indeed, the kriging predictor is not straightforwarldly available as it requires the inversion of the covariance matrix of the data. The challenge of handling such datasets is therefore to extract the maximum of information they contain while ensuring the numerical tractability of the associated inference and prediction algorithms. The different approaches that have been developed in the literature to address this problem can be classified into two families, both aiming at making the inversion of the covariance matrix computationally feasible. The covariance tapering approach circumvents the problem by enforcing the sparsity of the covariance matrix, making it invertible in a reasonable computation time. The second available approach assumes a low rank representation of the covariance function. While both approaches have their drawbacks, we propose a way to combine them and benefit from their advantages. The covariance model is assumed to have the form low rank plus sparse. The choice of the basis functions sustaining the low rank component is data driven and is achieved through a selection procedure, thus alleviating the computational burden of the low rank part. This model expresses as a spatial random effects model and the estimation of the parameters is conducted through a step by step approach treating each scale separately. The resulting model can account for second order non stationarity and handle large volumes of data. △ Less

Submitted 5 June, 2018; originally announced June 2018.

arXiv:1805.07423 [pdf, other]

doi 10.1016/j.spasta.2019.100359

Efficient simulation of Gaussian Markov random fields by Chebyshev polynomial approximation

Authors: Mike Pereira, Nicolas Desassis

Abstract: This paper presents an algorithm to simulate Gaussian random vectors whose precision matrix can be expressed as a polynomial of a sparse matrix. This situation arises in particular when simulating Gaussian Markov random fields obtained by the discretization by finite elements of the solutions of some stochastic partial derivative equations. The proposed algorithm uses a Chebyshev polynomial approx… ▽ More This paper presents an algorithm to simulate Gaussian random vectors whose precision matrix can be expressed as a polynomial of a sparse matrix. This situation arises in particular when simulating Gaussian Markov random fields obtained by the discretization by finite elements of the solutions of some stochastic partial derivative equations. The proposed algorithm uses a Chebyshev polynomial approximation to compute simulated vectors with a linear complexity. This method is asymptotically exact as the approximation order grows. Criteria based on tests of the statistical properties of the produced vectors are derived to determine minimal orders of approximation. △ Less

Submitted 29 June, 2018; v1 submitted 17 May, 2018; originally announced May 2018.

Comments: 20 pages, 5 figures

arXiv:1510.02668 [pdf, other]

A pairwise likelihood approach for the empirical estimation of the underlyingvariograms in the plurigaussian models

Authors: Nicolas Desassis, Didier Renard, Hélène Beucher, Sylvain Petiteau, Xavier Freulon

Abstract: The plurigaussian model is particularly suited to describe categorical regionalized variables. Starting from a simple principle, the thresh-olding of one or several Gaussian random fields (GRFs) to obtain categories, the plurigaussian model is well adapted for a wide range ofsituations. By acting on the form of the thresholding rule and/or the threshold values (which can vary along space) and the… ▽ More The plurigaussian model is particularly suited to describe categorical regionalized variables. Starting from a simple principle, the thresh-olding of one or several Gaussian random fields (GRFs) to obtain categories, the plurigaussian model is well adapted for a wide range ofsituations. By acting on the form of the thresholding rule and/or the threshold values (which can vary along space) and the variograms ofthe underlying GRFs, one can generate many spatial configurations for the categorical variables. One difficulty is to choose variogrammodel for the underlying GRFs. Indeed, these latter are hidden by the truncation and we only observe the simple and cross-variogramsof the category indicators. In this paper, we propose a semiparametric method based on the pairwise likelihood to estimate the empiricalvariogram of the GRFs. It provides an exploratory tool in order to choose a suitable model for each GRF and later to estimate its param-eters. We illustrate the efficiency of the method with a Monte-Carlo simulation study .The method presented in this paper is implemented in the R packageRGeostats. △ Less

Submitted 16 October, 2015; v1 submitted 9 October, 2015; originally announced October 2015.

Comments: To be submitted to Spatial Statistics

arXiv:1412.1373 [pdf, other]

A Generalized Convolution Model and Estimation for Non-stationary Random Functions

Authors: Francky Fouedjio, Nicolas Desassis, Jacques Rivoirard

Abstract: Standard geostatistical models assume second order stationarity of the underlying Random Function. In some instances, there is little reason to expect the spatial dependence structure to be stationary over the whole region of interest. In this paper, we introduce a new model for second order non-stationary Random Functions as a convolution of an orthogonal random measure with a spatially varying r… ▽ More Standard geostatistical models assume second order stationarity of the underlying Random Function. In some instances, there is little reason to expect the spatial dependence structure to be stationary over the whole region of interest. In this paper, we introduce a new model for second order non-stationary Random Functions as a convolution of an orthogonal random measure with a spatially varying random weighting function. This new model is a generalization of the common convolution model where a non-random weighting function is used. The resulting class of non-stationary covariance functions is very general, flexible and allows to retrieve classes of closed-form non-stationary covariance functions known from the literature, for a suitable choices of the random weighting functions family. Under the framework of a single realization and local stationarity, we develop parameter inference procedure of these explicit classes of non-stationary covariance functions. From a local variogram non-parametric kernel estimator, a weighted local least-squares approach in combination with kernel smoothing method is developed to estimate the parameters. Performances are assessed on two real datasets: soil and rainfall data. It is shown in particular that the proposed approach outperforms the stationary one, according to several criteria. Beyond the spatial predictions, we also show how conditional simulations can be carried out in this non-stationary framework. △ Less

Submitted 3 December, 2014; originally announced December 2014.

Comments: 24 pages, 10 figures, 2 tables

arXiv:1412.1344 [pdf, other]

Estimation of Space Deformation Model for Non-stationary Random Functions

Authors: Francky Fouedjio, Nicolas Desassis, Thomas Romary

Abstract: Stationary Random Functions have been successfully applied in geostatistical applications for decades. In some instances, the assumption of a homogeneous spatial dependence structure across the entire domain of interest is unrealistic. A practical approach for modelling and estimating non-stationary spatial dependence structure is considered. This consists in transforming a non-stationary Random F… ▽ More Stationary Random Functions have been successfully applied in geostatistical applications for decades. In some instances, the assumption of a homogeneous spatial dependence structure across the entire domain of interest is unrealistic. A practical approach for modelling and estimating non-stationary spatial dependence structure is considered. This consists in transforming a non-stationary Random Function into a stationary and isotropic one via a bijective continuous deformation of the index space. So far, this approach has been successfully applied in the context of data from several independent realizations of a Random Function. In this work, we propose an approach for non-stationary geostatistical modelling using space deformation in the context of a single realization with possibly irregularly spaced data. The estimation method is based on a non-stationary variogram kernel estimator which serves as a dissimilarity measure between two locations in the geographical space. The proposed procedure combines aspects of kernel smoothing, weighted non-metric multi-dimensional scaling and thin-plate spline radial basis functions. On a simulated data, the method is able to retrieve the true deformation. Performances are assessed on both synthetic and real datasets. It is shown in particular that our approach outperforms the stationary approach. Beyond the prediction, the proposed method can also serve as a tool for exploratory analysis of the non-stationarity. △ Less

Submitted 3 December, 2014; originally announced December 2014.

Comments: 17 pages, 9 figures, 2 tables

arXiv:0804.4780 [pdf, ps, other]

Incorporating a contrast in the Bayesian formula: What consequences for the MAP estimator and the posterior distribution? Applications in spatial statistics

Authors: S. Soubeyrand, F. Carpentier, N. Desassis, J. Chadœuf

Abstract: In order to estimate model parameters and circumvent possible difficulties encountered with the likelihood function, we propose to replace the likelihood in the formula of the posterior distribution by a function depending on a contrast. The properties of the contrast-based (CB) posterior distribution and MAP estimator are studied to understand what the consequences of incorporating a contrast i… ▽ More In order to estimate model parameters and circumvent possible difficulties encountered with the likelihood function, we propose to replace the likelihood in the formula of the posterior distribution by a function depending on a contrast. The properties of the contrast-based (CB) posterior distribution and MAP estimator are studied to understand what the consequences of incorporating a contrast in the Bayesian formula are. We show that the proposed method can be used to make frequentist inference and allows the reduction of analytical calculations to get the limit variance matrix of the estimator. For specific contrasts, the CB--posterior distribution directly approximates the limit distribution of the estimator; the calculation of the limit variance matrix is then avoided. Moreover, for these contrasts, the CB--posterior distribution can also be used to make inference in the Bayesian way. The method is applied to three spatial data sets. △ Less

Submitted 30 April, 2008; originally announced April 2008.

Comments: Submitted to the Electronic Journal of Statistics (http://www.i-journals.org/ejs/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-EJS-EJS_2008_235

Showing 1–14 of 14 results for author: Desassis, N