Search | arXiv e-print repository

SPDE priors for uncertainty quantification of end-to-end neural data assimilation schemes

Authors: Maxime Beauchamp, Nicolas Desassis, J. Emmanuel Johnson, Simon Benaichouche, Pierre Tandeo, Ronan Fablet

Abstract: The spatio-temporal interpolation of large geophysical datasets has historically been adressed by Optimal Interpolation (OI) and more sophisticated model-based or data-driven DA techniques. In the last ten years, the link established between Stochastic Partial Differential Equations (SPDE) and Gaussian Markov Random Fields (GMRF) opened a new way of handling both large datasets and physically-indu… ▽ More The spatio-temporal interpolation of large geophysical datasets has historically been adressed by Optimal Interpolation (OI) and more sophisticated model-based or data-driven DA techniques. In the last ten years, the link established between Stochastic Partial Differential Equations (SPDE) and Gaussian Markov Random Fields (GMRF) opened a new way of handling both large datasets and physically-induced covariance matrix in Optimal Interpolation. Recent advances in the deep learning community also enables to adress this problem as neural architecture embedding data assimilation variational framework. The reconstruction task is seen as a joint learning problem of the prior involved in the variational inner cost and the gradient-based minimization of the latter: both prior models and solvers are stated as neural networks with automatic differentiation which can be trained by minimizing a loss function, typically stated as the mean squared error between some ground truth and the reconstruction. In this work, we draw from the SPDE-based Gaussian Processes to estimate complex prior models able to handle non-stationary covariances in both space and time and provide a stochastic framework for interpretability and uncertainty quantification. Our neural variational scheme is modified to embed an augmented state formulation with both state and SPDE parametrization to estimate. Instead of a neural prior, we use a stochastic PDE as surrogate model along the data assimilation window. The training involves a loss function for both reconstruction task and SPDE prior model, where the likelihood of the SPDE parameters given the true states is involved in the training. Because the prior is stochastic, we can easily draw samples in the prior distribution before conditioning to provide a flexible way to estimate the posterior distribution based on thousands of members. △ Less

Submitted 2 February, 2024; originally announced February 2024.

arXiv:2311.01783 [pdf, other]

Neural SPDE solver for uncertainty quantification in high-dimensional space-time dynamics

Authors: Maxime Beauchamp, Ronan Fablet, Hugo Georgenthum

Abstract: Historically, the interpolation of large geophysical datasets has been tackled using methods like Optimal Interpolation (OI) or model-based data assimilation schemes. However, the recent connection between Stochastic Partial Differential Equations (SPDE) and Gaussian Markov Random Fields (GMRF) introduced a novel approach to handle large datasets making use of sparse precision matrices in OI. Rece… ▽ More Historically, the interpolation of large geophysical datasets has been tackled using methods like Optimal Interpolation (OI) or model-based data assimilation schemes. However, the recent connection between Stochastic Partial Differential Equations (SPDE) and Gaussian Markov Random Fields (GMRF) introduced a novel approach to handle large datasets making use of sparse precision matrices in OI. Recent advancements in deep learning also addressed this issue by incorporating data assimilation into neural architectures: it treats the reconstruction task as a joint learning problem involving both prior model and solver as neural networks. Though, it requires further developments to quantify the associated uncertainties. In our work, we leverage SPDEbased Gaussian Processes to estimate complex prior models capable of handling nonstationary covariances in space and time. We develop a specific architecture able to learn both state and SPDE parameters as a neural SPDE solver, while providing the precisionbased analytical form of the SPDE sampling. The latter is used as a surrogate model along the data assimilation window. Because the prior is stochastic, we can easily draw samples from it and condition the members by our neural solver, allowing flexible estimation of the posterior distribution based on large ensemble. We demonstrate this framework on realistic Sea Surface Height datasets. Our solution improves the OI baseline, aligns with neural prior while enabling uncertainty quantification and online parameter estimation. △ Less

Submitted 3 November, 2023; originally announced November 2023.

arXiv:2306.15041 [pdf]

A Comparison of Neuroelectrophysiology Databases

Authors: Priyanka Subash, Alex Gray, Misque Boswell, Samantha L. Cohen, Rachael Garner, Sana Salehi, Calvary Fisher, Samuel Hobel, Satrajit Ghosh, Yaroslav Halchenko, Benjamin Dichter, Russell A. Poldrack, Chris Markiewicz, Dora Hermes, Arnaud Delorme, Scott Makeig, Brendan Behan, Alana Sparks, Stephen R Arnott, Zhengjia Wang, John Magnotti, Michael S. Beauchamp, Nader Pouratian, Arthur W. Toga, Dominique Duncan

Abstract: As data sharing has become more prevalent, three pillars - archives, standards, and analysis tools - have emerged as critical components in facilitating effective data sharing and collaboration. This paper compares four freely available intracranial neuroelectrophysiology data repositories: Data Archive for the BRAIN Initiative (DABI), Distributed Archives for Neurophysiology Data Integration (DAN… ▽ More As data sharing has become more prevalent, three pillars - archives, standards, and analysis tools - have emerged as critical components in facilitating effective data sharing and collaboration. This paper compares four freely available intracranial neuroelectrophysiology data repositories: Data Archive for the BRAIN Initiative (DABI), Distributed Archives for Neurophysiology Data Integration (DANDI), OpenNeuro, and Brain-CODE. The aim of this review is to describe archives that provide researchers with tools to store, share, and reanalyze both human and non-human neurophysiology data based on criteria that are of interest to the neuroscientific community. The Brain Imaging Data Structure (BIDS) and Neurodata Without Borders (NWB) are utilized by these archives to make data more accessible to researchers by implementing a common standard. As the necessity for integrating large-scale analysis into data repository platforms continues to grow within the neuroscientific community, this article will highlight the various analytical and customizable tools developed within the chosen archives that may advance the field of neuroinformatics. △ Less

Submitted 30 August, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

Comments: 22 pages, 6 figures, 5 tables

arXiv:2211.07209 [pdf, other]

Learning Neural Optimal Interpolation Models and Solvers

Authors: Maxime Beauchamp, Joseph Thompson, Hugo Georgenthum, Quentin Febvre, Ronan Fablet

Abstract: The reconstruction of gap-free signals from observation data is a critical challenge for numerous application domains, such as geoscience and space-based earth observation, when the available sensors or the data collection processes lead to irregularly-sampled and noisy observations. Optimal interpolation (OI), also referred to as kriging, provides a theoretical framework to solve interpolation pr… ▽ More The reconstruction of gap-free signals from observation data is a critical challenge for numerous application domains, such as geoscience and space-based earth observation, when the available sensors or the data collection processes lead to irregularly-sampled and noisy observations. Optimal interpolation (OI), also referred to as kriging, provides a theoretical framework to solve interpolation problems for Gaussian processes (GP). The associated computational complexity being rapidly intractable for n-dimensional tensors and increasing numbers of observations, a rich literature has emerged to address this issue using ensemble methods, sparse schemes or iterative approaches. Here, we introduce a neural OI scheme. It exploits a variational formulation with convolutional auto-encoders and a trainable iterative gradient-based solver. Theoretically equivalent to the OI formulation, the trainable solver asymptotically converges to the OI solution when dealing with both stationary and non-stationary linear spatio-temporal GPs. Through a bi-level optimization formulation, we relate the learning step and the selection of the training loss to the theoretical properties of the OI, which is an unbiased estimator with minimal error variance. Numerical experiments for 2D+t synthetic GP datasets demonstrate the relevance of the proposed scheme to learn computationally-efficient and scalable OI models and solvers from data. As illustrated for a real-world interpolation problems for satellite-derived geophysical dynamics, the proposed framework also extends to non-linear and multimodal interpolation problems and significantly outperforms state-of-the-art interpolation methods, when dealing with very high missing data rates. △ Less

Submitted 14 November, 2022; originally announced November 2022.

arXiv:2211.05904 [pdf, other]

4DVarNet-SSH: end-to-end learning of variational interpolation schemes for nadir and wide-swath satellite altimetry

Authors: Maxime Beauchamp, Quentin Febvre, Hugo Georgentum, Ronan Fablet

Abstract: The reconstruction of sea surface currents from satellite altimeter data is a key challenge in spatial oceanography, especially with the upcoming wide-swath SWOT (Surface Ocean and Water Topography) altimeter mission. Operational systems however generally fail to retrieve mesoscale dynamics for horizontal scales below 100km and time-scale below 10 days. Here, we address this challenge through the… ▽ More The reconstruction of sea surface currents from satellite altimeter data is a key challenge in spatial oceanography, especially with the upcoming wide-swath SWOT (Surface Ocean and Water Topography) altimeter mission. Operational systems however generally fail to retrieve mesoscale dynamics for horizontal scales below 100km and time-scale below 10 days. Here, we address this challenge through the 4DVarnet framework, an end-to-end neural scheme backed on a variational data assimilation formulation. We introduce a parametrization of the 4DVarNet scheme dedicated to the space-time interpolation of satellite altimeter data. Within an observing system simulation experiment (NATL60), we demonstrate the relevance of the proposed approach both for nadir and nadir+swot altimeter configurations for two contrasted case-study regions in terms of upper ocean dynamics. We report relative improvement with respect to the operational optimal interpolation between 30% and 60% in terms of reconstruction error. Interestingly, for the nadir+swot altimeter configuration, we reach resolved space-time scales below 70km and 7days. The code is open-source to enable reproductibility and future collaborative developments. Beyond its applicability to large-scale domains, we also address uncertainty quantification issues and generalization properties of the proposed learning setting. We discuss further future research avenues and extensions to other ocean data assimilation and space oceanography challenges. △ Less

Submitted 10 November, 2022; originally announced November 2022.

arXiv:2011.09447 [pdf, other]

Interpretable Visualization and Higher-Order Dimension Reduction for ECoG Data

Authors: Kelly Geyer, Frederick Campbell, Andersen Chang, John Magnotti, Michael Beauchamp, Genevera I. Allen

Abstract: ElectroCOrticoGraphy (ECoG) technology measures electrical activity in the human brain via electrodes placed directly on the cortical surface during neurosurgery. Through its capability to record activity at a fast temporal resolution, ECoG experiments have allowed scientists to better understand how the human brain processes speech. By its nature, ECoG data is difficult for neuroscientists to dir… ▽ More ElectroCOrticoGraphy (ECoG) technology measures electrical activity in the human brain via electrodes placed directly on the cortical surface during neurosurgery. Through its capability to record activity at a fast temporal resolution, ECoG experiments have allowed scientists to better understand how the human brain processes speech. By its nature, ECoG data is difficult for neuroscientists to directly interpret for two major reasons. Firstly, ECoG data tends to be large in size, as each individual experiment yields data up to several gigabytes. Secondly, ECoG data has a complex, higher-order nature. After signal processing, this type of data may be organized as a 4-way tensor with dimensions representing trials, electrodes, frequency, and time. In this paper, we develop an interpretable dimension reduction approach called Regularized Higher Order Principal Components Analysis, as well as an extension to Regularized Higher Order Partial Least Squares, that allows neuroscientists to explore and visualize ECoG data. Our approach employs a sparse and functional Candecomp-Parafac (CP) decomposition that incorporates sparsity to select relevant electrodes and frequency bands, as well as smoothness over time and frequency, yielding directly interpretable factors. We demonstrate the performance and interpretability of our method with an ECoG case study on audio and visual processing of human speech. △ Less

Submitted 12 December, 2020; v1 submitted 15 November, 2020; originally announced November 2020.

arXiv:2006.10163 [pdf, other]

doi 10.1111/biom.13684

Functional Group Bridge for Simultaneous Regression and Support Estimation

Authors: Zhengjia Wang, John Magnotti, Michael S. Beauchamp, Meng Li

Abstract: This article is motivated by studying multisensory effects on brain activities in intracranial electroencephalography (iEEG) experiments. Differential brain activities to multisensory stimulus presentations are zero in most regions and non-zero in some local regions, yielding locally sparse functions. Such studies are essentially a function-on-scalar regression problem, with interest being focused… ▽ More This article is motivated by studying multisensory effects on brain activities in intracranial electroencephalography (iEEG) experiments. Differential brain activities to multisensory stimulus presentations are zero in most regions and non-zero in some local regions, yielding locally sparse functions. Such studies are essentially a function-on-scalar regression problem, with interest being focused not only on estimating nonparametric functions but also on recovering the function supports. We propose a weighted group bridge approach for simultaneous function estimation and support recovery in function-on-scalar mixed effect models, while accounting for heterogeneity present in functional data. We use B-splines to transform sparsity of functions to its sparse vector counterpart of increasing dimension, and propose a fast non-convex optimization algorithm using nested alternative direction method of multipliers (ADMM) for estimation. Large sample properties are established. In particular, we show that the estimated coefficient functions are rate optimal in the minimax sense under the $L_2$ norm and resemble a phase transition phenomenon. For support estimation, we derive a convergence rate under the $L_{\infty}$ norm that leads to a sparsistency property under $δ$-sparsity, and provide a simple sufficient regularity condition under which a strict sparsistency property is established. An adjusted extended Bayesian information criterion is proposed for parameter tuning. The developed method is illustrated through simulation and an application to a novel iEEG dataset to study multisensory integration. We integrate the proposed method into RAVE, an R package that gains increasing popularity in the iEEG community. △ Less

Submitted 15 November, 2021; v1 submitted 17 June, 2020; originally announced June 2020.

arXiv:1912.05542 [pdf, other]

doi 10.1093/mnras/stz3515

Realistic Models for Filling and Abundance Discrepancy Factors in Photoionised Nebulae

Authors: Brandon M. Bergerud, Steven R. Spangler, Kara M. Beauchamp

Abstract: When comparing nebular electron densities derived from collisionally excited lines (CELs) to those estimated using the emission measure, significant discrepancies are common. The standard solution is to view nebulae as aggregates of dense regions of constant density in an otherwise empty void. This porosity is parametrized by a filling factor $f<1$. Similarly, abundance and temperature discrepanci… ▽ More When comparing nebular electron densities derived from collisionally excited lines (CELs) to those estimated using the emission measure, significant discrepancies are common. The standard solution is to view nebulae as aggregates of dense regions of constant density in an otherwise empty void. This porosity is parametrized by a filling factor $f<1$. Similarly, abundance and temperature discrepancies between optical recombination lines (ORLs) and CELs are often explained by invoking a dual delta distribution of a dense, cool, metal-rich component immersed in a diffuse, warm, metal-poor plasma. In this paper, we examine the possibility that the observational diagnostics that lead to such discrepancies can be produced by a realistic distribution of density and temperature fluctuations, such as might arise in plasma turbulence. We produce simulated nebulae with density and temperature fluctuations described by various probability distribution functions (pdfs). Standard astronomical diagnostics are applied to these simulated observations to derive estimates of nebular densities, temperatures, and abundances. Our results show that for plausible density pdfs the simulated observations lead to filling factors in the observed range. None of our simulations satisfactorily reproduce the abundance discrepancy factors (ADFs) in planetary nebulae, although there is possible consistency with \ion{H}{ii} regions. Compared to the case of density-only and temperature-only fluctuations, a positive correlation between density and temperature reduces the filling factor and ADF (from optical CELs), whereas a negative correlation increases both, eventually causing the filling factor to exceed unity. This result suggests that real observations can provide constraints on the thermodynamics of small scale fluctuations. △ Less

Submitted 11 December, 2019; originally announced December 2019.

Comments: 13 pages, 12 figures

arXiv:1910.08466 [pdf, ps, other]

Analytic Estimates of the Effect of Plasma Density Fluctuations on HII Region Density Diagnostics

Authors: Steven R. Spangler, Brandon M. Bergerud, Kara M. Beauchamp

Abstract: An analytic calculation is made of the effect of plasma density fluctuations on some spectroscopic diagnostics commonly used in the study of HII regions and planetary nebulae. To permit an analytic treatment, attention is restricted to the case of density fluctuations possessing an exponential probability distribution function (pdf). The present investigation is made in support of a completely num… ▽ More An analytic calculation is made of the effect of plasma density fluctuations on some spectroscopic diagnostics commonly used in the study of HII regions and planetary nebulae. To permit an analytic treatment, attention is restricted to the case of density fluctuations possessing an exponential probability distribution function (pdf). The present investigation is made in support of a completely numerical and more extensive study of nebular diagnostics by Bergerud et al (2019). Results from this paper are presented in terms of graphs of the observed quantity (spectroscopic line ratio) versus mean nebular density. Our results yield a higher density estimate, given the same observed line ratio, for the case of a nebula with density fluctuations than for the case of a nebula with uniform density. This is qualitatively consistent with the typically observed case, in which the observations lead to the inference of a filling factor < 1. Our results are in quantitative agreement with those of Bergerud et al (2019), and thus corroborate those calculations for the case of an exponential pdf. △ Less

Submitted 18 October, 2019; originally announced October 2019.

Comments: Thirteen pages, four figures

Showing 1–9 of 9 results for author: Beauchamp, M