Search | arXiv e-print repository

arXiv:2406.19734 [pdf, ps, other]

Weyl formulae for some singular metrics with application to acoustic modes in gas giants

Authors: Yves Colin de Verdìère, Charlotte Dietze, Maarten V. de Hoop, Emmanuel Trélat

Abstract: This paper is motivated by recent works on inverse problems for acoustic wave propagation in the interior of gas giant planets. In such planets, the speed of sound is isotropic and tends to zero at the surface. Geometrically, this corresponds to a Riemannian manifold with boundary whose metric blows up near the boundary. Here, the spectral analysis of the corresponding Laplace-Beltrami operator is… ▽ More This paper is motivated by recent works on inverse problems for acoustic wave propagation in the interior of gas giant planets. In such planets, the speed of sound is isotropic and tends to zero at the surface. Geometrically, this corresponds to a Riemannian manifold with boundary whose metric blows up near the boundary. Here, the spectral analysis of the corresponding Laplace-Beltrami operator is presented and the Weyl law is derived. The involved exponents depend on the Hausdorff dimension which, in the supercritical case, is larger than the topological dimension. △ Less

Submitted 28 June, 2024; originally announced June 2024.

arXiv:2406.18978 [pdf, other]

Anisotropic extended Burgers model, its relaxation tensor and properties of the associated Boltzmann viscoelastic system

Authors: Maarten de Hoop, Masato Kimura, Ching-Lung Lin, Gen Nakamura, Kazumi Tanuma

Abstract: We provide a new method for constructing the anisotropic relaxation tensor and proving its exponential decay property for the extended Burgers model (abbreviated by EBM). The EBM is an important viscoelasticity model in rheology, and used in Earth and planetary sciences. Upon having this tensor, the EBM can be converted to a Boltzmann-type viscoelastic system of equations (abbreviated by BVS). His… ▽ More We provide a new method for constructing the anisotropic relaxation tensor and proving its exponential decay property for the extended Burgers model (abbreviated by EBM). The EBM is an important viscoelasticity model in rheology, and used in Earth and planetary sciences. Upon having this tensor, the EBM can be converted to a Boltzmann-type viscoelastic system of equations (abbreviated by BVS). Historically, the relaxation tensor for the EBM is derived by solving the constitutive equation using the Laplace transform. (We refer to this approach by the L-method.) Since inverting the inverse Laplace transform needs a partial fractions expansion, the L-method needs to assume that the EBM elasticity tensors satisfy a commutivity condition. The new method not only avoids this condition but also enables obtaining several important properties of the relaxation tensor, including its positivity, smoothness with respect to the time variable, its exponential decay property together with its derivative, and its causality. Furthermore, we show that the BVS converted from the EBM has the exponential decay property. That is, any solution for its initial boundary value problem with homogeneous boundary data and source decays exponentially as time tends to infinity. △ Less

Submitted 27 June, 2024; originally announced June 2024.

arXiv:2405.15676 [pdf, other]

Taming Score-Based Diffusion Priors for Infinite-Dimensional Nonlinear Inverse Problems

Authors: Lorenzo Baldassari, Ali Siahkoohi, Josselin Garnier, Knut Solna, Maarten V. de Hoop

Abstract: This work introduces a sampling method capable of solving Bayesian inverse problems in function space. It does not assume the log-concavity of the likelihood, meaning that it is compatible with nonlinear inverse problems. The method leverages the recently defined infinite-dimensional score-based diffusion models as a learning-based prior, while enabling provable posterior sampling through a Langev… ▽ More This work introduces a sampling method capable of solving Bayesian inverse problems in function space. It does not assume the log-concavity of the likelihood, meaning that it is compatible with nonlinear inverse problems. The method leverages the recently defined infinite-dimensional score-based diffusion models as a learning-based prior, while enabling provable posterior sampling through a Langevin-type MCMC algorithm defined on function spaces. A novel convergence analysis is conducted, inspired by the fixed-point methods established for traditional regularization-by-denoising algorithms and compatible with weighted annealing. The obtained convergence bound explicitly depends on the approximation error of the score; a well-approximated score is essential to obtain a well-approximated posterior. Stylized and PDE-based examples are provided, demonstrating the validity of our convergence analysis. We conclude by presenting a discussion of the method's challenges related to learning the score and computational complexity. △ Less

Submitted 24 May, 2024; originally announced May 2024.

MSC Class: 62F15; 65N21; 68Q32; 60Hxx; 60Jxx; 65C05; 82C31

arXiv:2405.15643 [pdf, other]

Reducing the cost of posterior sampling in linear inverse problems via task-dependent score learning

Authors: Fabian Schneider, Duc-Lam Duong, Matti Lassas, Maarten V. de Hoop, Tapio Helin

Abstract: Score-based diffusion models (SDMs) offer a flexible approach to sample from the posterior distribution in a variety of Bayesian inverse problems. In the literature, the prior score is utilized to sample from the posterior by different methods that require multiple evaluations of the forward map** in order to generate a single posterior sample. These methods are often designed with the objective… ▽ More Score-based diffusion models (SDMs) offer a flexible approach to sample from the posterior distribution in a variety of Bayesian inverse problems. In the literature, the prior score is utilized to sample from the posterior by different methods that require multiple evaluations of the forward map** in order to generate a single posterior sample. These methods are often designed with the objective of enabling the direct use of the unconditional prior score and, therefore, task-independent training. In this paper, we focus on linear inverse problems, when evaluation of the forward map** is computationally expensive and frequent posterior sampling is required for new measurement data, such as in medical imaging. We demonstrate that the evaluation of the forward map** can be entirely bypassed during posterior sample generation. Instead, without introducing any error, the computational effort can be shifted to an offline task of training the score of a specific diffusion-like random process. In particular, the training is task-dependent requiring information about the forward map** but not about the measurement data. It is shown that the conditional score corresponding to the posterior can be obtained from the auxiliary score by suitable affine transformations. We prove that this observation generalizes to the framework of infinite-dimensional diffusion models introduced recently and provide numerical analysis of the method. Moreover, we validate our findings with numerical experiments. △ Less

Submitted 24 May, 2024; originally announced May 2024.

Comments: 23 pages, 2 figues

MSC Class: 62F15; 65N21; 68Q32; 60Hxx; 60Jxx

arXiv:2404.09101 [pdf, ps, other]

Mixture of Experts Soften the Curse of Dimensionality in Operator Learning

Authors: Anastasis Kratsios, Takashi Furuya, Jose Antonio Lara Benitez, Matti Lassas, Maarten de Hoop

Abstract: In this paper, we construct a mixture of neural operators (MoNOs) between function spaces whose complexity is distributed over a network of expert neural operators (NOs), with each NO satisfying parameter scaling restrictions. Our main result is a \textit{distributed} universal approximation theorem guaranteeing that any Lipschitz non-linear operator between $L^2([0,1]^d)$ spaces can be approximat… ▽ More In this paper, we construct a mixture of neural operators (MoNOs) between function spaces whose complexity is distributed over a network of expert neural operators (NOs), with each NO satisfying parameter scaling restrictions. Our main result is a \textit{distributed} universal approximation theorem guaranteeing that any Lipschitz non-linear operator between $L^2([0,1]^d)$ spaces can be approximated uniformly over the Sobolev unit ball therein, to any given $\varepsilon>0$ accuracy, by an MoNO while satisfying the constraint that: each expert NO has a depth, width, and rank of $\mathcal{O}(\varepsilon^{-1})$. Naturally, our result implies that the required number of experts must be large, however, each NO is guaranteed to be small enough to be loadable into the active memory of most computers for reasonable accuracies $\varepsilon$. During our analysis, we also obtain new quantitative expression rates for classical NOs approximating uniformly continuous non-linear operators uniformly on compact subsets of $L^2([0,1]^d)$. △ Less

Submitted 13 April, 2024; originally announced April 2024.

arXiv:2403.05475 [pdf, ps, other]

Geometric inverse problems on gas giants

Authors: Maarten V. de Hoop, Joonas Ilmavirta, Antti Kykkänen, Rafe Mazzeo

Abstract: On gas giant planets the speed of sound is isotropic and goes to zero at the surface. Geometrically, this corresponds to a Riemannian manifold whose metric tensor has a conformal blow-up near the boundary. The blow-up is tamer than in asymptotically hyperbolic geometry: the boundary is at a finite distance. We study the differential geometry of such manifolds, especially the asymptotic behavior… ▽ More On gas giant planets the speed of sound is isotropic and goes to zero at the surface. Geometrically, this corresponds to a Riemannian manifold whose metric tensor has a conformal blow-up near the boundary. The blow-up is tamer than in asymptotically hyperbolic geometry: the boundary is at a finite distance. We study the differential geometry of such manifolds, especially the asymptotic behavior of geodesics near the boundary. We relate the geometry to the propagation of singularities of a hydrodynamic PDE and we give the basic properties of the Laplace--Beltrami operator. We solve two inverse problems, showing that the interior structure of a gas giant is uniquely determined by different types of boundary data. △ Less

Submitted 8 March, 2024; originally announced March 2024.

Comments: 42 pages

MSC Class: 53C22; 37D40; 53C65; 35R30

arXiv:2310.12698 [pdf, other]

Stable recovery of coefficients in an inverse fault friction problem

Authors: Maarten V. de Hoop, Matti Lassas, **peng Lu, Lauri Oksanen

Abstract: We consider the inverse fault friction problem of determining the friction coefficient in the Tresca friction model, which can be formulated as an inverse problem for differential inequalities. We show that the measurements of elastic waves during a rupture uniquely determine the friction coefficient at the rupture surface with explicit stability estimates. We consider the inverse fault friction problem of determining the friction coefficient in the Tresca friction model, which can be formulated as an inverse problem for differential inequalities. We show that the measurements of elastic waves during a rupture uniquely determine the friction coefficient at the rupture surface with explicit stability estimates. △ Less

Submitted 2 May, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

Comments: Final version with added details and corrections, to appear in Arch. Ration. Mech. Anal

MSC Class: 35R30; 35Q86

arXiv:2310.00545 [pdf, other]

Implicit Neural Representations and the Algebra of Complex Wavelets

Authors: T. Mitchell Roddenberry, Vishwanath Saragadam, Maarten V. de Hoop, Richard G. Baraniuk

Abstract: Implicit neural representations (INRs) have arisen as useful methods for representing signals on Euclidean domains. By parameterizing an image as a multilayer perceptron (MLP) on Euclidean space, INRs effectively represent signals in a way that couples spatial and spectral features of the signal that is not obvious in the usual discrete representation, paving the way for continuous signal processi… ▽ More Implicit neural representations (INRs) have arisen as useful methods for representing signals on Euclidean domains. By parameterizing an image as a multilayer perceptron (MLP) on Euclidean space, INRs effectively represent signals in a way that couples spatial and spectral features of the signal that is not obvious in the usual discrete representation, paving the way for continuous signal processing and machine learning approaches that were not previously possible. Although INRs using sinusoidal activation functions have been studied in terms of Fourier theory, recent works have shown the advantage of using wavelets instead of sinusoids as activation functions, due to their ability to simultaneously localize in both frequency and space. In this work, we approach such INRs and demonstrate how they resolve high-frequency features of signals from coarse approximations done in the first layer of the MLP. This leads to multiple prescriptions for the design of INR architectures, including the use of complex wavelets, decoupling of low and band-pass approximations, and initialization schemes based on the singularities of the desired signal. △ Less

Submitted 30 September, 2023; originally announced October 2023.

Comments: 10 pages, 6 figures. 2 appendix pages, 1 appendix figure

arXiv:2308.16322 [pdf, ps, other]

Resolvent Estimates for Viscoelastic Systems of Extended Maxwell Type and their Applications

Authors: Maarten V. de Hoop, Masato Kimura, Ching-Lung Lin, Gen Nakamura

Abstract: In the theory of viscoelasticity, an important class of models admits a representation in terms of springs and dashpots. Widely used members of this class are the Maxwell model and its extended version. This paper concerns resolvent estimates for the system of equations for the anisotropic, extended Maxwell model, abbreviated as the EMM, and its marginal realization which includes an inertia term;… ▽ More In the theory of viscoelasticity, an important class of models admits a representation in terms of springs and dashpots. Widely used members of this class are the Maxwell model and its extended version. This paper concerns resolvent estimates for the system of equations for the anisotropic, extended Maxwell model, abbreviated as the EMM, and its marginal realization which includes an inertia term; special attention is paid to the introduction of augmented variables. This leads to the augmented system that will also be referred to as the "original" system. A reduced system is then formed which encodes essentially the EMM; it is a closed system with respect to the particle velocity and the difference between the elastic and viscous strains. Based on resolvent estimates, it is shown that the original and reduced systems generate $C_0$-groups and the reduced system generates a $C_0$-semigroup of contraction. Naturally, the EMM can be written in integrodifferential form leading explicitly to relaxation and a viscoelastic integro-differential system. However, there is a difference between the original and integrodifferential systems, in general, with consequences for whether their solutions generate semigroups or not. Finally, an energy estimate is obtained for the reduced system, and it is proven that its solutions decay exponentially as time tends to infinity. The limiting amplitude principle follows readily from these two results. △ Less

Submitted 30 August, 2023; originally announced August 2023.

Comments: 21 pages, 2 figures

MSC Class: 35B37; 35B40; 35L55; 74D05

arXiv:2308.04338 [pdf, ps, other]

Coupling of flow, contact mechanics and friction, generating waves in a fractured porous medium

Authors: Maarten V. de Hoop, Kundan Kumar

Abstract: We present a mixed dimensional model for a fractured poro-elasic medium including contact mechanics. The fracture is a lower dimensional surface embedded in a bulk poro-elastic matrix. The flow equation on the fracture is a Darcy type model that follows the cubic law for permeability. The bulk poro-elasticity is governed by fully dynamic Biot equations. The resulting model is a mixed dimensional t… ▽ More We present a mixed dimensional model for a fractured poro-elasic medium including contact mechanics. The fracture is a lower dimensional surface embedded in a bulk poro-elastic matrix. The flow equation on the fracture is a Darcy type model that follows the cubic law for permeability. The bulk poro-elasticity is governed by fully dynamic Biot equations. The resulting model is a mixed dimensional type where the fracture flow on a surface is coupled to a bulk flow and geomechanics model. The particularity of the work here is in considering fully dynamic Biot equation, that is, including an inertia term, and the contact mechanics including friction for the fracture surface. We prove the well-posedness of the continuous model. △ Less

Submitted 8 August, 2023; originally announced August 2023.

arXiv:2308.03988 [pdf, ps, other]

Uniform Decaying Property of Solutions for Anisotropic Viscoelastic Systems

Authors: Maarten V. de Hoop, Ching-Lung Lin, Gen Nakamura

Abstract: The paper concerns about the uniform decaying property (abbreviated by UDP) of solutions for an anisotropic viscoelastic system in the form of integrodifferential system (abbreviated by VID system) with mixed type boundary condition. The mixed type condition consists of the homogeneous displacement boundary condition and a homogeneous traction boundary condition or with a dissipation. By using a d… ▽ More The paper concerns about the uniform decaying property (abbreviated by UDP) of solutions for an anisotropic viscoelastic system in the form of integrodifferential system (abbreviated by VID system) with mixed type boundary condition. The mixed type condition consists of the homogeneous displacement boundary condition and a homogeneous traction boundary condition or with a dissipation. By using a dissipative structure of this system, we will prove the UDP in a unified way for the two cases, which are, when the time derivative of relaxation tensor decays with polynomial order and it decays with exponential order. △ Less

Submitted 7 June, 2024; v1 submitted 7 August, 2023; originally announced August 2023.

Comments: 27 pages, 3 figures

MSC Class: 35B40; 35Q70; 35Q86

arXiv:2307.07572 [pdf, other]

High-Rate Phase Association with Travel Time Neural Fields

Authors: Cheng Shi, Maarten V. de Hoop, Ivan Dokmanić

Abstract: Our understanding of regional seismicity from multi-station seismograms relies on the ability to associate arrival phases with their originating earthquakes. Deep-learning-based phase detection now detects small, high-rate arrivals from seismicity clouds, even at negative magnitudes. This new data could give important insight into earthquake dynamics, but it is presents a challenging association t… ▽ More Our understanding of regional seismicity from multi-station seismograms relies on the ability to associate arrival phases with their originating earthquakes. Deep-learning-based phase detection now detects small, high-rate arrivals from seismicity clouds, even at negative magnitudes. This new data could give important insight into earthquake dynamics, but it is presents a challenging association task. Existing techniques relying on coarsely approximated, fixed wave speed models fail in this unexplored dense regime where the complexity of unknown wave speed cannot be ignored. We introduce Harpa, a high-rate association framework built on deep generative modeling and neural fields. Harpa incorporates wave physics by using optimal transport to compare arrival sequences. It is thus robust to unknown wave speeds and estimates the wave speed model as a by-product of association. Experiments with realistic, complex synthetic models show that Harpa is the first seismic phase association framework which is accurate in the high-rate regime, paving the way for new avenues in exploratory Earth science and improved understanding of seismicity. △ Less

Submitted 26 March, 2024; v1 submitted 14 July, 2023; originally announced July 2023.

arXiv:2307.03312 [pdf, other]

Reconstruction of generic anisotropic stiffness tensors from partial data around one polarization

Authors: Maarten V. de Hoop, Joonas Ilmavirta, Matti Lassas, Anthony Várilly-Alvarado

Abstract: We study inverse problems in anisotropic elasticity using tools from algebraic geometry. The singularities of solutions to the elastic wave equation in dimension $n$ with an anisotropic stiffness tensor have propagation kinematics captured by so-called slowness surfaces, which are hypersurfaces in the cotangent bundle of $\mathbb{R}^n$ that turn out to be algebraic varieties. Leveraging the algebr… ▽ More We study inverse problems in anisotropic elasticity using tools from algebraic geometry. The singularities of solutions to the elastic wave equation in dimension $n$ with an anisotropic stiffness tensor have propagation kinematics captured by so-called slowness surfaces, which are hypersurfaces in the cotangent bundle of $\mathbb{R}^n$ that turn out to be algebraic varieties. Leveraging the algebraic geometry of families of slowness surfaces we show that, for tensors in a dense open subset in the space of all stiffness tensors, a small amount of data around one polarization in an individual slowness surface uniquely determines the entire slowness surface and its stiffness tensor. Such partial data arises naturally from geophysical measurements or geometrized versions of seismic inverse problems. Additionally, we explain how the reconstruction of the stiffness tensor can be carried out effectively, using Gröbner bases. Our uniqueness results fail for very symmetric (e.g., fully isotropic) materials, evidencing the counterintuitive claim that inverse problems in elasticity can become more tractable with increasing asymmetry. △ Less

Submitted 6 July, 2023; originally announced July 2023.

Comments: 39 pages, 4 figures. Computer Code included in the ancillary files folder

MSC Class: Primary 86-10; 86A22; 14D06; Secondary 53Z05; 14P25; 14-04

arXiv:2306.03982 [pdf, ps, other]

Globally injective and bijective neural operators

Authors: Takashi Furuya, Michael Puthawala, Matti Lassas, Maarten V. de Hoop

Abstract: Recently there has been great interest in operator learning, where networks learn operators between function spaces from an essentially infinite-dimensional perspective. In this work we present results for when the operators learned by these networks are injective and surjective. As a warmup, we combine prior work in both the finite-dimensional ReLU and operator learning setting by giving sharp co… ▽ More Recently there has been great interest in operator learning, where networks learn operators between function spaces from an essentially infinite-dimensional perspective. In this work we present results for when the operators learned by these networks are injective and surjective. As a warmup, we combine prior work in both the finite-dimensional ReLU and operator learning setting by giving sharp conditions under which ReLU layers with linear neural operators are injective. We then consider the case the case when the activation function is pointwise bijective and obtain sufficient conditions for the layer to be injective. We remark that this question, while trivial in the finite-rank case, is subtler in the infinite-rank case and is proved using tools from Fredholm theory. Next, we prove that our supplied injective neural operators are universal approximators and that their implementation, with finite-rank neural networks, are still injective. This ensures that injectivity is not `lost' in the transcription from analytical operators to their finite-rank implementation with networks. Finally, we conclude with an increase in abstraction and consider general conditions when subnetworks, which may be many layers deep, are injective and surjective and provide an exact inversion from a `linearization.' This section uses general arguments from Fredholm theory and Leray-Schauder degree theory for non-linear integral equations to analyze the map** properties of neural operators in function spaces. These results apply to subnetworks formed from the layers considered in this work, under natural conditions. We believe that our work has applications in Bayesian UQ where injectivity enables likelihood estimation and in inverse problems where surjectivity and injectivity corresponds to existence and uniqueness, respectively. △ Less

Submitted 6 June, 2023; originally announced June 2023.

Comments: 39 pages

arXiv:2305.19147 [pdf, other]

Conditional score-based diffusion models for Bayesian inference in infinite dimensions

Authors: Lorenzo Baldassari, Ali Siahkoohi, Josselin Garnier, Knut Solna, Maarten V. de Hoop

Abstract: Since their initial introduction, score-based diffusion models (SDMs) have been successfully applied to solve a variety of linear inverse problems in finite-dimensional vector spaces due to their ability to efficiently approximate the posterior distribution. However, using SDMs for inverse problems in infinite-dimensional function spaces has only been addressed recently, primarily through methods… ▽ More Since their initial introduction, score-based diffusion models (SDMs) have been successfully applied to solve a variety of linear inverse problems in finite-dimensional vector spaces due to their ability to efficiently approximate the posterior distribution. However, using SDMs for inverse problems in infinite-dimensional function spaces has only been addressed recently, primarily through methods that learn the unconditional score. While this approach is advantageous for some inverse problems, it is mostly heuristic and involves numerous computationally costly forward operator evaluations during posterior sampling. To address these limitations, we propose a theoretically grounded method for sampling from the posterior of infinite-dimensional Bayesian linear inverse problems based on amortized conditional SDMs. In particular, we prove that one of the most successful approaches for estimating the conditional score in finite dimensions - the conditional denoising estimator - can also be applied in infinite dimensions. A significant part of our analysis is dedicated to demonstrating that extending infinite-dimensional SDMs to the conditional setting requires careful consideration, as the conditional score typically blows up for small times, contrarily to the unconditional score. We conclude by presenting stylized and large-scale numerical examples that validate our approach, offer additional insights, and demonstrate that our method enables large-scale, discretization-invariant Bayesian inference. △ Less

Submitted 27 October, 2023; v1 submitted 28 May, 2023; originally announced May 2023.

Comments: NeurIPS 2023 (Spotlight)

MSC Class: 62F15; 65N21; 68Q32; 60Hxx; 60Jxx

arXiv:2305.16189 [pdf, other]

Martian time-series unraveled: A multi-scale nested approach with factorial variational autoencoders

Authors: Ali Siahkoohi, Rudy Morel, Randall Balestriero, Erwan Allys, Grégory Sainton, Taichi Kawamura, Maarten V. de Hoop

Abstract: Unsupervised source separation involves unraveling an unknown set of source signals recorded through a mixing operator, with limited prior knowledge about the sources, and only access to a dataset of signal mixtures. This problem is inherently ill-posed and is further challenged by the variety of timescales exhibited by sources. Existing methods typically rely on a preselected window size that det… ▽ More Unsupervised source separation involves unraveling an unknown set of source signals recorded through a mixing operator, with limited prior knowledge about the sources, and only access to a dataset of signal mixtures. This problem is inherently ill-posed and is further challenged by the variety of timescales exhibited by sources. Existing methods typically rely on a preselected window size that determines their operating timescale, limiting their capacity to handle multi-scale sources. To address this issue, we propose an unsupervised multi-scale clustering and source separation framework by leveraging wavelet scattering spectra that provide a low-dimensional representation of stochastic processes, capable of distinguishing between different non-Gaussian stochastic processes. Nested within this representation space, we develop a factorial Gaussian-mixture variational autoencoder that is trained to (1) probabilistically cluster sources at different timescales and (2) independently sample scattering spectra representations associated with each cluster. As the final stage, using samples from each cluster as prior information, we formulate source separation as an optimization problem in the wavelet scattering spectra representation space, aiming to separate sources in the time domain. When applied to the entire seismic dataset recorded during the NASA InSight mission on Mars, containing sources varying greatly in timescale, our multi-scale nested approach proves to be a powerful tool for disentangling such different sources, e.g., minute-long transient one-sided pulses (known as ``glitches'') and structured ambient noises resulting from atmospheric activities that typically last for tens of minutes. These results provide an opportunity to conduct further investigations into the isolated sources related to atmospheric-surface interactions, thermal relaxations, and other complex phenomena. △ Less

Submitted 19 February, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

arXiv:2304.12231 [pdf, other]

An Approximation Theory for Metric Space-Valued Functions With A View Towards Deep Learning

Authors: Anastasis Kratsios, Chong Liu, Matti Lassas, Maarten V. de Hoop, Ivan Dokmanić

Abstract: Motivated by the develo** mathematics of deep learning, we build universal functions approximators of continuous maps between arbitrary Polish metric spaces $\mathcal{X}$ and $\mathcal{Y}$ using elementary functions between Euclidean spaces as building blocks. Earlier results assume that the target space $\mathcal{Y}$ is a topological vector space. We overcome this limitation by ``randomization'… ▽ More Motivated by the develo** mathematics of deep learning, we build universal functions approximators of continuous maps between arbitrary Polish metric spaces $\mathcal{X}$ and $\mathcal{Y}$ using elementary functions between Euclidean spaces as building blocks. Earlier results assume that the target space $\mathcal{Y}$ is a topological vector space. We overcome this limitation by ``randomization'': our approximators output discrete probability measures over $\mathcal{Y}$. When $\mathcal{X}$ and $\mathcal{Y}$ are Polish without additional structure, we prove very general qualitative guarantees; when they have suitable combinatorial structure, we prove quantitative guarantees for Hölder-like maps, including maps between finite graphs, solution operators to rough differential equations between certain Carnot groups, and continuous non-linear operators between Banach spaces arising in inverse problems. In particular, we show that the required number of Dirac measures is determined by the combinatorial structure of $\mathcal{X}$ and $\mathcal{Y}$. For barycentric $\mathcal{Y}$, including Banach spaces, $\mathbb{R}$-trees, Hadamard manifolds, or Wasserstein spaces on Polish metric spaces, our approximators reduce to $\mathcal{Y}$-valued functions. When the Euclidean approximators are neural networks, our constructions generalize transformer networks, providing a new probabilistic viewpoint of geometric deep learning. △ Less

Submitted 24 July, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

Comments: 14 Figures, 3 Tables, 78 Pages (Main 40, Proofs 26, Acknowledgments and References 12)

MSC Class: 41A65; 68T07; 60L50; 65N21; 46T99

arXiv:2304.03118 [pdf, other]

Early-warning inverse source problem for the elasto-gravitational equations

Authors: Lorenzo Baldassari, Maarten V. de Hoop, Elisa Francini, Sergio Vessella

Abstract: Through coupled physics, we study an early-warning inverse source problem for the elasto-gravitational equations. It consists of a mixed hyperbolic-elliptic system of partial differential equations describing elastic wave displacement and gravity perturbations produced by a source in a homogeneous bounded medium. Within the Cowling approximation, we prove uniqueness and Lipschitz stability for the… ▽ More Through coupled physics, we study an early-warning inverse source problem for the elasto-gravitational equations. It consists of a mixed hyperbolic-elliptic system of partial differential equations describing elastic wave displacement and gravity perturbations produced by a source in a homogeneous bounded medium. Within the Cowling approximation, we prove uniqueness and Lipschitz stability for the inverse problem of recovering the moment tensor and the location of the source from early-time measurements of the changes of the gravitational field. The setup studied in this paper is motivated by gravity-based earthquake early warning systems, which are gaining much attention recently. △ Less

Submitted 6 April, 2023; originally announced April 2023.

Comments: 25 pages, 1 figure

MSC Class: 35R30; 35Q86; 35J05; 35L10

arXiv:2302.14158 [pdf, other]

Spherically symmetric terrestrial planets with discontinuities are spectrally rigid

Authors: Joonas Ilmavirta, Maarten V. de Hoop, Vitaly Katsnelson

Abstract: We establish spectral rigidity for spherically symmetric manifolds with boundary and interior interfaces determined by discontinuities in the metric under certain conditions. Rather than a single metric, we allow two distinct metrics in between the interfaces enabling the consideration of two wave types, like P- and S-polarized waves in isotropic elastic solids. Terrestrial planets in our solar sy… ▽ More We establish spectral rigidity for spherically symmetric manifolds with boundary and interior interfaces determined by discontinuities in the metric under certain conditions. Rather than a single metric, we allow two distinct metrics in between the interfaces enabling the consideration of two wave types, like P- and S-polarized waves in isotropic elastic solids. Terrestrial planets in our solar system are approximately spherically symmetric and support toroidal and spheroidal modes. Discontinuities typically correspond with phase transitions in their interiors. Our rigidity result applies to such planets as we ensure that our conditions are satisfied in generally accepted models in the presence of a fluid outer core. The proof is based on a novel trace formula. We also prove that the length spectrum of the Euclidean ball is simple. △ Less

Submitted 7 December, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

Comments: 2 figures

MSC Class: 53C24; 58J50; 86A22

arXiv:2302.08173 [pdf, other]

doi 10.1088/1361-6420/ad2781

Inverse problem for Love waves in a layered, elastic half-space

Authors: Maarten V. de Hoop, Josselin Garnier, Alexei Iantchenko, Julien Ricaud

Abstract: In this paper we study Love waves in a layered, elastic half-space. We first address the direct problem and we characterize the existence of Love waves through the dispersion relation. We then address the inverse problem and we show how to recover the parameters of the elastic medium from the empirical knowledge of the frequency--wavenumber couples of the Love waves. In this paper we study Love waves in a layered, elastic half-space. We first address the direct problem and we characterize the existence of Love waves through the dispersion relation. We then address the inverse problem and we show how to recover the parameters of the elastic medium from the empirical knowledge of the frequency--wavenumber couples of the Love waves. △ Less

Submitted 8 September, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

Comments: 37 pages, 7 figures

MSC Class: 74J25; 74J15; 86A15; 86A22; 35R30

Journal ref: Inverse Problems 40 (4), 045013 (2024)

arXiv:2301.11981 [pdf, other]

Unearthing InSights into Mars: Unsupervised Source Separation with Limited Data

Authors: Ali Siahkoohi, Rudy Morel, Maarten V. de Hoop, Erwan Allys, Grégory Sainton, Taichi Kawamura

Abstract: Source separation involves the ill-posed problem of retrieving a set of source signals that have been observed through a mixing operator. Solving this problem requires prior knowledge, which is commonly incorporated by imposing regularity conditions on the source signals, or implicitly learned through supervised or unsupervised methods from existing data. While data-driven methods have shown great… ▽ More Source separation involves the ill-posed problem of retrieving a set of source signals that have been observed through a mixing operator. Solving this problem requires prior knowledge, which is commonly incorporated by imposing regularity conditions on the source signals, or implicitly learned through supervised or unsupervised methods from existing data. While data-driven methods have shown great promise in source separation, they often require large amounts of data, which rarely exists in planetary space missions. To address this challenge, we propose an unsupervised source separation scheme for domains with limited data access that involves solving an optimization problem in the wavelet scattering covariance representation space$\unicode{x2014}$an interpretable, low-dimensional representation of stationary processes. We present a real-data example in which we remove transient, thermally-induced microtilts$\unicode{x2014}$known as glitches$\unicode{x2014}$from data recorded by a seismometer during NASA's InSight mission on Mars. Thanks to the wavelet scattering covariances' ability to capture non-Gaussian properties of stochastic processes, we are able to separate glitches using only a few glitch-free data snippets. △ Less

Submitted 31 May, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

Comments: ICML 2023

arXiv:2301.11509 [pdf, other]

Out-of-distributional risk bounds for neural operators with applications to the Helmholtz equation

Authors: J. Antonio Lara Benitez, Takashi Furuya, Florian Faucher, Anastasis Kratsios, Xavier Tricoche, Maarten V. de Hoop

Abstract: Despite their remarkable success in approximating a wide range of operators defined by PDEs, existing neural operators (NOs) do not necessarily perform well for all physics problems. We focus here on high-frequency waves to highlight possible shortcomings. To resolve these, we propose a subfamily of NOs enabling an enhanced empirical approximation of the nonlinear operator map** wave speed to so… ▽ More Despite their remarkable success in approximating a wide range of operators defined by PDEs, existing neural operators (NOs) do not necessarily perform well for all physics problems. We focus here on high-frequency waves to highlight possible shortcomings. To resolve these, we propose a subfamily of NOs enabling an enhanced empirical approximation of the nonlinear operator map** wave speed to solution, or boundary values for the Helmholtz equation on a bounded domain. The latter operator is commonly referred to as the ''forward'' operator in the study of inverse problems. Our methodology draws inspiration from transformers and techniques such as stochastic depth. Our experiments reveal certain surprises in the generalization and the relevance of introducing stochastic depth. Our NOs show superior performance as compared with standard NOs, not only for testing within the training distribution but also for out-of-distribution scenarios. To delve into this observation, we offer an in-depth analysis of the Rademacher complexity associated with our modified models and prove an upper bound tied to their stochastic depth that existing NOs do not satisfy. Furthermore, we obtain a novel out-of-distribution risk bound tailored to Gaussian measures on Banach spaces, again relating stochastic depth with the bound. We conclude by proposing a hypernetwork version of the subfamily of NOs as a surrogate model for the mentioned forward operator. △ Less

Submitted 4 July, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

arXiv:2211.02922 [pdf, other]

Beyond Hawkes: Neural Multi-event Forecasting on Spatio-temporal Point Processes

Authors: Negar Erfanian, Santiago Segarra, Maarten de Hoop

Abstract: Predicting discrete events in time and space has many scientific applications, such as predicting hazardous earthquakes and outbreaks of infectious diseases. History-dependent spatio-temporal Hawkes processes are often used to mathematically model these point events. However, previous approaches have faced numerous challenges, particularly when attempting to forecast one or multiple future events.… ▽ More Predicting discrete events in time and space has many scientific applications, such as predicting hazardous earthquakes and outbreaks of infectious diseases. History-dependent spatio-temporal Hawkes processes are often used to mathematically model these point events. However, previous approaches have faced numerous challenges, particularly when attempting to forecast one or multiple future events. In this work, we propose a new neural architecture for simultaneous multi-event forecasting of spatio-temporal point processes, utilizing transformers, augmented with normalizing flows and probabilistic layers. Our network makes batched predictions of complex history-dependent spatio-temporal distributions of future discrete events, achieving state-of-the-art performance on a variety of benchmark datasets including the South California Earthquakes, Citibike, Covid-19, and Hawkes synthetic pinwheel datasets. More generally, we illustrate how our network can be applied to any dataset of discrete events with associated markers, even when no underlying physics is known. △ Less

Submitted 28 January, 2023; v1 submitted 5 November, 2022; originally announced November 2022.

Comments: Submitted to ICML2023

arXiv:2210.00577 [pdf, other]

Deep Invertible Approximation of Topologically Rich Maps between Manifolds

Authors: Michael Puthawala, Matti Lassas, Ivan Dokmanic, Pekka Pankka, Maarten de Hoop

Abstract: How can we design neural networks that allow for stable universal approximation of maps between topologically interesting manifolds? The answer is with a coordinate projection. Neural networks based on topological data analysis (TDA) use tools such as persistent homology to learn topological signatures of data and stabilize training but may not be universal approximators or have stable inverses. O… ▽ More How can we design neural networks that allow for stable universal approximation of maps between topologically interesting manifolds? The answer is with a coordinate projection. Neural networks based on topological data analysis (TDA) use tools such as persistent homology to learn topological signatures of data and stabilize training but may not be universal approximators or have stable inverses. Other architectures universally approximate data distributions on submanifolds but only when the latter are given by a single chart, making them unable to learn maps that change topology. By exploiting the topological parallels between locally bilipschitz maps, covering spaces, and local homeomorphisms, and by using universal approximation arguments from machine learning, we find that a novel network of the form $\mathcal{T} \circ p \circ \mathcal{E}$, where $\mathcal{E}$ is an injective network, $p$ a fixed coordinate projection, and $\mathcal{T}$ a bijective network, is a universal approximator of local diffeomorphisms between compact smooth submanifolds embedded in $\mathbb{R}^n$. We emphasize the case when the target map changes topology. Further, we find that by constraining the projection $p$, multivalued inversions of our networks can be computed without sacrificing universality. As an application, we show that learning a group invariant function with unknown group action naturally reduces to the question of learning local diffeomorphisms for finite groups. Our theory permits us to recover orbits of the group action. We also outline possible extensions of our architecture to address molecular imaging of molecules with symmetries. Finally, our analysis informs the choice of topologically expressive starting spaces in generative problems. △ Less

Submitted 2 October, 2022; originally announced October 2022.

arXiv:2209.15316 [pdf, other]

Uniqueness in an inverse problem of fractional elasticity

Authors: Giovanni Covi, Maarten de Hoop, Mikko Salo

Abstract: We study an inverse problem for fractional elasticity. In analogy to the classical problem of linear elasticity, we consider the unique recovery of the Lamé parameters associated to a linear, isotropic fractional elasticity operator from fractional Dirichlet-to-Neumann data. In our analysis we make use of a fractional matrix Schrödinger equation via a generalization of the so-called Liouville redu… ▽ More We study an inverse problem for fractional elasticity. In analogy to the classical problem of linear elasticity, we consider the unique recovery of the Lamé parameters associated to a linear, isotropic fractional elasticity operator from fractional Dirichlet-to-Neumann data. In our analysis we make use of a fractional matrix Schrödinger equation via a generalization of the so-called Liouville reduction, a technique classically used in the study of the scalar conductivity equation. We conclude that unique recovery is possible if the Lamé parameters agree and are constant in the exterior, and their Poisson ratios agree everywhere. Our study is motivated by the significant recent activity in the field of nonlocal elasticity. △ Less

Submitted 30 September, 2022; originally announced September 2022.

Comments: 31 pages, 1 figure

MSC Class: 35R30; 35R11; 74B99

arXiv:2209.09998 [pdf, ps, other]

doi 10.1098/rspa.2022.0845

Analysis of leaky modes or wavenumber resonances for the Rayleigh system in a half space

Authors: Maarten V. de Hoop, Alexei Iantchenko

Abstract: We present a comprehensive analysis of wavenumber resonances or leaky modes associated with the Rayleigh operator in a half space containing a heterogeneous slab, being motivated by seismology. To this end, we introduce Jost solutions on an appropriate Riemann surface, a boundary matrix and a reflection matrix in analogy to the studies of scattering resonances associated with the Schrödinger opera… ▽ More We present a comprehensive analysis of wavenumber resonances or leaky modes associated with the Rayleigh operator in a half space containing a heterogeneous slab, being motivated by seismology. To this end, we introduce Jost solutions on an appropriate Riemann surface, a boundary matrix and a reflection matrix in analogy to the studies of scattering resonances associated with the Schrödinger operator. We analyze their analytic properties and characterize the distribution of these wavenumber resonances. Furthermore, we show that the resonances appear as poles of the meromorphic continuation of the resolvent to the nonphysical sheets of the mentioned Riemann surface as expected. △ Less

Submitted 1 December, 2022; v1 submitted 20 September, 2022; originally announced September 2022.

Journal ref: Proceedings of the Royal Society A, 479(2277) 2023

arXiv:2204.07664 [pdf, other]

doi 10.1109/TCI.2023.3248949

Conditional Injective Flows for Bayesian Imaging

Authors: AmirEhsan Khorashadizadeh, Konik Kothari, Leonardo Salsi, Ali Aghababaei Harandi, Maarten de Hoop, Ivan Dokmanić

Abstract: Most deep learning models for computational imaging regress a single reconstructed image. In practice, however, ill-posedness, nonlinearity, model mismatch, and noise often conspire to make such point estimates misleading or insufficient. The Bayesian approach models images and (noisy) measurements as jointly distributed random vectors and aims to approximate the posterior distribution of unknowns… ▽ More Most deep learning models for computational imaging regress a single reconstructed image. In practice, however, ill-posedness, nonlinearity, model mismatch, and noise often conspire to make such point estimates misleading or insufficient. The Bayesian approach models images and (noisy) measurements as jointly distributed random vectors and aims to approximate the posterior distribution of unknowns. Recent variational inference methods based on conditional normalizing flows are a promising alternative to traditional MCMC methods, but they come with drawbacks: excessive memory and compute demands for moderate to high resolution images and underwhelming performance on hard nonlinear problems. In this work, we propose C-Trumpets -- conditional injective flows specifically designed for imaging problems, which greatly diminish these challenges. Injectivity reduces memory footprint and training time while low-dimensional latent space together with architectural innovations like fixed-volume-change layers and skip-connection revnet layers, C-Trumpets outperform regular conditional flow models on a variety of imaging and image restoration tasks, including limited-view CT and nonlinear inverse scattering, with a lower compute and memory budget. C-Trumpets enable fast approximation of point estimates like MMSE or MAP as well as physically-meaningful uncertainty quantification. △ Less

Submitted 3 April, 2023; v1 submitted 15 April, 2022; originally announced April 2022.

Comments: 23 pages, 23 figures

Journal ref: IEEE Transactions on Computational Imaging, vol. 9, pp. 224-237, 2023

arXiv:2203.13690 [pdf, other]

Quantitative unique continuation for the elasticity system with application to the kinematic inverse rupture problem

Authors: Maarten V. de Hoop, Matti Lassas, **peng Lu, Lauri Oksanen

Abstract: We obtain explicit estimates on the stability of the unique continuation for a linear system of hyperbolic equations. In particular our result applies to the elasticity system and also the Maxwell system. As an application, we study the kinematic inverse rupture problem of determining the jump in displacement and the friction force at the rupture surface, and we obtain new features on the stable u… ▽ More We obtain explicit estimates on the stability of the unique continuation for a linear system of hyperbolic equations. In particular our result applies to the elasticity system and also the Maxwell system. As an application, we study the kinematic inverse rupture problem of determining the jump in displacement and the friction force at the rupture surface, and we obtain new features on the stable unique continuation up to the rupture surface. △ Less

Submitted 9 February, 2023; v1 submitted 25 March, 2022; originally announced March 2022.

Comments: to appear in Comm. PDE

MSC Class: 35L10; 35R30; 35Q86

arXiv:2203.13181 [pdf, other]

The Cost-Accuracy Trade-Off In Operator Learning With Neural Networks

Authors: Maarten V. de Hoop, Daniel Zhengyu Huang, Elizabeth Qian, Andrew M. Stuart

Abstract: The term `surrogate modeling' in computational science and engineering refers to the development of computationally efficient approximations for expensive simulations, such as those arising from numerical solution of partial differential equations (PDEs). Surrogate modeling is an enabling methodology for many-query computations in science and engineering, which include iterative methods in optimiz… ▽ More The term `surrogate modeling' in computational science and engineering refers to the development of computationally efficient approximations for expensive simulations, such as those arising from numerical solution of partial differential equations (PDEs). Surrogate modeling is an enabling methodology for many-query computations in science and engineering, which include iterative methods in optimization and sampling methods in uncertainty quantification. Over the last few years, several approaches to surrogate modeling for PDEs using neural networks have emerged, motivated by successes in using neural networks to approximate nonlinear maps in other areas. In principle, the relative merits of these different approaches can be evaluated by understanding, for each one, the cost required to achieve a given level of accuracy. However, the absence of a complete theory of approximation error for these approaches makes it difficult to assess this cost-accuracy trade-off. The purpose of the paper is to provide a careful numerical study of this issue, comparing a variety of different neural network architectures for operator approximation across a range of problems arising from PDE models in continuum mechanics. △ Less

Submitted 11 August, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

Comments: 48 pages, 19 figures

arXiv:2203.08735 [pdf, ps, other]

Recovery of piecewise smooth density and Lamé parameters from high-frequency exterior Cauchy data

Authors: Sombuddha Bhattacharyya, Maarten V. de Hoop, Vitaly Katsnelson, Gunther Uhlmann

Abstract: We consider an isotropic elastic medium occupying a bounded domain D whose density and Lamé parameters are piecewise smooth. In the elastic wave initial value inverse problem, we are given the solution operator for the elastic wave equation, but only outside the domain D and only for initial data supported outside D, and we study the recovery of the density and Lamé parameters. For known density,… ▽ More We consider an isotropic elastic medium occupying a bounded domain D whose density and Lamé parameters are piecewise smooth. In the elastic wave initial value inverse problem, we are given the solution operator for the elastic wave equation, but only outside the domain D and only for initial data supported outside D, and we study the recovery of the density and Lamé parameters. For known density, results have recently been obtained using the scattering control method to recover wave speeds. Here, we extend this result to include the recovery of the density in addition to the Lamé parameters under certain geometric conditions using techniques from microlocal analysis and a connection to local tensor tomography. △ Less

Submitted 16 March, 2022; originally announced March 2022.

Comments: 25 pages

MSC Class: 35S30; 35L51

arXiv:2202.06739 [pdf, other]

doi 10.1088/1361-6420/acb008

Local recovery of a piecewise constant anisotropic conductivity in EIT on domains with exposed corners

Authors: Maarten V. de Hoop, Takashi Furuya, Ching-Lung Lin, Gen Nakamura, Manmohan Vashisth

Abstract: We study the local recovery of an unknown piecewise constant anisotropic conductivity in EIT (electric impedance tomography) on certain bounded Lipschitz domains $Ω$ in $\mathbb{R}^2$ with corners. The measurement is conducted on a connected open subset of the boundary $\partialΩ$ of $Ω$ containing corners and is given as a localized Neumann-to-Dirichlet map. The above unknown conductivity is defi… ▽ More We study the local recovery of an unknown piecewise constant anisotropic conductivity in EIT (electric impedance tomography) on certain bounded Lipschitz domains $Ω$ in $\mathbb{R}^2$ with corners. The measurement is conducted on a connected open subset of the boundary $\partialΩ$ of $Ω$ containing corners and is given as a localized Neumann-to-Dirichlet map. The above unknown conductivity is defined via a decomposition of $Ω$ into polygonal cells. Specifically, we consider a parallelogram-based decomposition and a trapezoid-based decomposition. We assume that the decomposition is known, but the conductivity on each cell is unknown. We prove that the local recovery is almost surely true near a known piecewise constant anisotropic conductivity $γ_0$. We do so by proving that the injectivity of the Fréchet derivative $F'(γ_0)$ of the forward map $F$, say, at $γ_0$ is almost surely true. The proof presented, here, involves defining different classes of decompositions for $γ_0$ and a perturbation or contrast $H$ in a proper way so that we can find in the interior of a cell for $γ_0$ exposed single or double corners of a cell of $\mbox{supp}H$ for the former decomposition and latter decomposition, respectively. Then, by adapting the usual proof near such corners, we establish the aforementioned injectivity. △ Less

Submitted 12 August, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

Comments: 29 pages, 13 figures

MSC Class: 35R30; 35J25

arXiv:2201.02607 [pdf, ps, other]

Recovery of wave speeds and density of mass across a heterogeneous smooth interface from acoustic and elastic wave reflection operators

Authors: Sombuddha Bhattacharyya, Maarten V. de Hoop, Vitaly Katsnelson, Gunther Uhlmann

Abstract: We revisit the problem of recovering wave speeds and density across a curved interface from reflected wave amplitudes. Such amplitudes have been exploited for decades in (exploration) seismology in this context. However, the analysis in seismology has been based on linearization and mostly flat interfaces. Here, we present a nonlinear analysis allowing curved interfaces, establish uniqueness and p… ▽ More We revisit the problem of recovering wave speeds and density across a curved interface from reflected wave amplitudes. Such amplitudes have been exploited for decades in (exploration) seismology in this context. However, the analysis in seismology has been based on linearization and mostly flat interfaces. Here, we present a nonlinear analysis allowing curved interfaces, establish uniqueness and provide a reconstruction, while making the notion of amplitude precise through a procedure rooted in microlocal analysis. △ Less

Submitted 7 January, 2022; originally announced January 2022.

Comments: submitted for journal publication 10/21/2021; 42 pages

arXiv:2112.03365 [pdf, ps, other]

doi 10.1063/5.0055827

Inverse problem for the Rayleigh system with spectral data

Authors: Maarten V. de Hoop, Alexei Iantchenko

Abstract: We analyze an inverse problem associated with the time-harmonic Rayleigh system on a flat elastic half-space concerning the recovery of Lamé parameters in a slab beneath a traction-free surface. We employ the Markushevich substitution, while the data are captured in a Jost function, and point out parallels with a corresponding problem for the Schrödinger equation. The Jost function can be identifi… ▽ More We analyze an inverse problem associated with the time-harmonic Rayleigh system on a flat elastic half-space concerning the recovery of Lamé parameters in a slab beneath a traction-free surface. We employ the Markushevich substitution, while the data are captured in a Jost function, and point out parallels with a corresponding problem for the Schrödinger equation. The Jost function can be identified with spectral data. We derive a Gel'fand-Levitan type equation and obtain uniqueness with two distinct frequencies. △ Less

Submitted 6 December, 2021; originally announced December 2021.

Comments: 51 pages

MSC Class: 35P99; 35Q86; 35J10; 34L05; 34L40; 34L25

Journal ref: J. Math. Phys. 63, 031505 (2022)

arXiv:2110.04227 [pdf, other]

Universal Joint Approximation of Manifolds and Densities by Simple Injective Flows

Authors: Michael Puthawala, Matti Lassas, Ivan Dokmanić, Maarten de Hoop

Abstract: We study approximation of probability measures supported on $n$-dimensional manifolds embedded in $\mathbb{R}^m$ by injective flows -- neural networks composed of invertible flows and injective layers. We show that in general, injective flows between $\mathbb{R}^n$ and $\mathbb{R}^m$ universally approximate measures supported on images of extendable embeddings, which are a subset of standard embed… ▽ More We study approximation of probability measures supported on $n$-dimensional manifolds embedded in $\mathbb{R}^m$ by injective flows -- neural networks composed of invertible flows and injective layers. We show that in general, injective flows between $\mathbb{R}^n$ and $\mathbb{R}^m$ universally approximate measures supported on images of extendable embeddings, which are a subset of standard embeddings: when the embedding dimension m is small, topological obstructions may preclude certain manifolds as admissible targets. When the embedding dimension is sufficiently large, $m \ge 3n+1$, we use an argument from algebraic topology known as the clean trick to prove that the topological obstructions vanish and injective flows universally approximate any differentiable embedding. Along the way we show that the studied injective flows admit efficient projections on the range, and that their optimality can be established "in reverse," resolving a conjecture made in Brehmer and Cranmer 2020. △ Less

Submitted 27 June, 2022; v1 submitted 8 October, 2021; originally announced October 2021.

Comments: 26 pages, 5 figures

arXiv:2108.12515 [pdf, other]

doi 10.1137/21M1442942

Convergence Rates for Learning Linear Operators from Noisy Data

Authors: Maarten V. de Hoop, Nikola B. Kovachki, Nicholas H. Nelsen, Andrew M. Stuart

Abstract: This paper studies the learning of linear operators between infinite-dimensional Hilbert spaces. The training data comprises pairs of random input vectors in a Hilbert space and their noisy images under an unknown self-adjoint linear operator. Assuming that the operator is diagonalizable in a known basis, this work solves the equivalent inverse problem of estimating the operator's eigenvalues give… ▽ More This paper studies the learning of linear operators between infinite-dimensional Hilbert spaces. The training data comprises pairs of random input vectors in a Hilbert space and their noisy images under an unknown self-adjoint linear operator. Assuming that the operator is diagonalizable in a known basis, this work solves the equivalent inverse problem of estimating the operator's eigenvalues given the data. Adopting a Bayesian approach, the theoretical analysis establishes posterior contraction rates in the infinite data limit with Gaussian priors that are not directly linked to the forward map of the inverse problem. The main results also include learning-theoretic generalization error guarantees for a wide range of distribution shifts. These convergence rates quantify the effects of data smoothness and true eigenvalue decay or growth, for compact or unbounded operators, respectively, on sample complexity. Numerical evidence supports the theory in diagonal and non-diagonal settings. △ Less

Submitted 2 November, 2022; v1 submitted 27 August, 2021; originally announced August 2021.

Comments: To appear in SIAM/ASA Journal on Uncertainty Quantification (JUQ); 34 pages, 5 figures, 2 tables

MSC Class: 62G20; 62C10; 68T05; 47A62

Journal ref: SIAM/ASA J. Uncertainty Quantification Vol. 11 No. 2 (2023) pp. 480-513

arXiv:2102.11799 [pdf, other]

doi 10.1088/1361-6420/ace6c9

Stable reconstruction of simple Riemannian manifolds from unknown interior sources

Authors: Maarten V. de Hoop, Joonas Ilmavirta, Matti Lassas, Teemu Saksala

Abstract: Consider the geometric inverse problem: There is a set of delta-sources in spacetime that emit waves travelling at unit speed. If we know all the arrival times at the boundary cylinder of the spacetime, can we reconstruct the space, a Riemannian manifold with boundary? With a finite set of sources we can only hope to get an approximate reconstruction, and we indeed provide a discrete metric approx… ▽ More Consider the geometric inverse problem: There is a set of delta-sources in spacetime that emit waves travelling at unit speed. If we know all the arrival times at the boundary cylinder of the spacetime, can we reconstruct the space, a Riemannian manifold with boundary? With a finite set of sources we can only hope to get an approximate reconstruction, and we indeed provide a discrete metric approximation to the manifold with explicit data-driven error bounds when the manifold is simple. This is the geometrization of a seismological inverse problem where we measure the arrival times on the surface of waves from an unknown number of unknown interior microseismic events at unknown times. The closeness of two metric spaces with a marked boundary is measured by a labeled Gromov--Hausdorff distance. If measurements are done for infinite time and spatially dense sources, our construction produces the true Riemannian manifold and the finite-time approximations converge to it in the metric sense. △ Less

Submitted 23 February, 2021; originally announced February 2021.

Comments: 39 pages, 1 figure

arXiv:2102.10461 [pdf, other]

Trumpets: Injective Flows for Inference and Inverse Problems

Authors: Konik Kothari, AmirEhsan Khorashadizadeh, Maarten de Hoop, Ivan Dokmanić

Abstract: We propose injective generative models called Trumpets that generalize invertible normalizing flows. The proposed generators progressively increase dimension from a low-dimensional latent space. We demonstrate that Trumpets can be trained orders of magnitudes faster than standard flows while yielding samples of comparable or better quality. They retain many of the advantages of the standard flows… ▽ More We propose injective generative models called Trumpets that generalize invertible normalizing flows. The proposed generators progressively increase dimension from a low-dimensional latent space. We demonstrate that Trumpets can be trained orders of magnitudes faster than standard flows while yielding samples of comparable or better quality. They retain many of the advantages of the standard flows such as training based on maximum likelihood and a fast, exact inverse of the generator. Since Trumpets are injective and have fast inverses, they can be effectively used for downstream Bayesian inference. To wit, we use Trumpet priors for maximum a posteriori estimation in the context of image reconstruction from compressive measurements, outperforming competitive baselines in terms of reconstruction quality and speed. We then propose an efficient method for posterior characterization and uncertainty quantification with Trumpets by taking advantage of the low-dimensional latent space. △ Less

Submitted 20 February, 2021; originally announced February 2021.

Comments: 16 pages

Journal ref: Uncertainty in Artificial Intelligence (UAI 2021)

arXiv:2102.10383 [pdf, other]

Reconstruction along a geodesic from sphere data in Finsler geometry and anisotropic elasticity

Authors: Maarten V. de Hoop, Joonas Ilmavirta, Matti Lassas

Abstract: Dix formulated the inverse problem of recovering an elastic body from the measurements of wave fronts of point scatterers. We geometrize this problem in the framework of linear elasticity, leading to the geometrical inverse problem of recovering a Finsler manifold from certain sphere data in a given open subset of the manifold. We solve this problem locally along any geodesic through the measureme… ▽ More Dix formulated the inverse problem of recovering an elastic body from the measurements of wave fronts of point scatterers. We geometrize this problem in the framework of linear elasticity, leading to the geometrical inverse problem of recovering a Finsler manifold from certain sphere data in a given open subset of the manifold. We solve this problem locally along any geodesic through the measurement set. △ Less

Submitted 20 February, 2021; originally announced February 2021.

Comments: 20 pages

arXiv:2012.00149 [pdf, ps, other]

Unique recovery of electrical conductivity and magnetic permeability from Magneto-Telluric data

Authors: Yernat M. Assylbekov, Maarten V. de Hoop

Abstract: We present a comprehensive mathematical study of the Magneto-Telluric (MT) method, on bounded domain in $\mathbb{R}^3$. We show that electrical conductivity and magnetic permeability, assumed to be $C^2$, can be uniquely recovered from MT data measured on the boundary of the domain. The proof is based on the construction of complex geometric optics solutions. Furthermore, we obtain a unique determ… ▽ More We present a comprehensive mathematical study of the Magneto-Telluric (MT) method, on bounded domain in $\mathbb{R}^3$. We show that electrical conductivity and magnetic permeability, assumed to be $C^2$, can be uniquely recovered from MT data measured on the boundary of the domain. The proof is based on the construction of complex geometric optics solutions. Furthermore, we obtain a unique determination result in the case when the MT data are measured only on an open subset of the boundary. Here, we assume that the part of the boundary inaccessible for measurements is a subset of a sphere. △ Less

Submitted 30 November, 2020; originally announced December 2020.

Comments: 26 pages

MSC Class: 35R30; 35Q86

arXiv:2011.01529 [pdf, other]

A high order discontinuous Galerkin method for the symmetric form of the anisotropic viscoelastic wave equation

Authors: Khemraj Shukla, Jesse Chan, Maarten V. de Hoop

Abstract: Wave propagation in real media is affected by various non-trivial physical phenomena, e.g., anisotropy, an-elasticity and dissipation. Assumptions on the stress-strain relationship are an integral part of seismic modeling and determine the deformation and relaxation of the medium. Stress-strain relationships based on simplified rheologies will incorrectly predict seismic amplitudes, which are used… ▽ More Wave propagation in real media is affected by various non-trivial physical phenomena, e.g., anisotropy, an-elasticity and dissipation. Assumptions on the stress-strain relationship are an integral part of seismic modeling and determine the deformation and relaxation of the medium. Stress-strain relationships based on simplified rheologies will incorrectly predict seismic amplitudes, which are used for quantitative reservoir characterization. Constitutive equations for the rheological model include the generalized Hooke's law and Boltzmann's superposition principal with dissipation models based on standard linear solids or a Zener approximation. In this work, we introduce a high-order discontinuous Galerkin finite element method for wave equation in inhomogeneous and anisotropic dissipative medium. This method is based on a new symmetric treatment of the anisotropic viscoelastic terms, as well as an appropriate memory variable treatment of the stress-strain convolution terms. Together, these result in a symmetric system of first order linear hyperbolic partial differential equations. The accuracy of the proposed numerical scheme is proven and verified using convergence studies against analytical plane wave solutions and analytical solutions of viscoelastic wave equation. Computational experiments are shown for various combinations of homogeneous and heterogeneous viscoelastic media in two and three dimensions. △ Less

Submitted 31 October, 2020; originally announced November 2020.

Comments: arXiv admin note: substantial text overlap with arXiv:1904.02578

arXiv:2006.08464 [pdf, other]

Globally Injective ReLU Networks

Authors: Michael Puthawala, Konik Kothari, Matti Lassas, Ivan Dokmanić, Maarten de Hoop

Abstract: Injectivity plays an important role in generative models where it enables inference; in inverse problems and compressed sensing with generative priors it is a precursor to well posedness. We establish sharp characterizations of injectivity of fully-connected and convolutional ReLU layers and networks. First, through a layerwise analysis, we show that an expansivity factor of two is necessary and s… ▽ More Injectivity plays an important role in generative models where it enables inference; in inverse problems and compressed sensing with generative priors it is a precursor to well posedness. We establish sharp characterizations of injectivity of fully-connected and convolutional ReLU layers and networks. First, through a layerwise analysis, we show that an expansivity factor of two is necessary and sufficient for injectivity by constructing appropriate weight matrices. We show that global injectivity with iid Gaussian matrices, a commonly used tractable model, requires larger expansivity between 3.4 and 10.5. We also characterize the stability of inverting an injective network via worst-case Lipschitz constants of the inverse. We then use arguments from differential topology to study injectivity of deep networks and prove that any Lipschitz map can be approximated by an injective ReLU network. Finally, using an argument based on random projections, we show that an end-to-end -- rather than layerwise -- doubling of the dimension suffices for injectivity. Our results establish a theoretical basis for the study of nonlinear inverse and inference problems using neural networks. △ Less

Submitted 8 October, 2021; v1 submitted 15 June, 2020; originally announced June 2020.

Comments: 48 pages, 18 figures, submitted to JMLR

arXiv:2006.05854 [pdf, other]

Learning the geometry of wave-based imaging

Authors: Konik Kothari, Maarten de Hoop, Ivan Dokmanić

Abstract: We propose a general physics-based deep learning architecture for wave-based imaging problems. A key difficulty in imaging problems with a varying background wave speed is that the medium "bends" the waves differently depending on their position and direction. This space-bending geometry makes the equivariance to translations of convolutional networks an undesired inductive bias. We build an inter… ▽ More We propose a general physics-based deep learning architecture for wave-based imaging problems. A key difficulty in imaging problems with a varying background wave speed is that the medium "bends" the waves differently depending on their position and direction. This space-bending geometry makes the equivariance to translations of convolutional networks an undesired inductive bias. We build an interpretable neural architecture inspired by Fourier integral operators (FIOs) which approximate the wave physics. FIOs model a wide range of imaging modalities, from seismology and radar to Doppler and ultrasound. We focus on learning the geometry of wave propagation captured by FIOs, which is implicit in the data, via a loss based on optimal transport. The proposed FIONet performs significantly better than the usual baselines on a number of imaging inverse problems, especially in out-of-distribution tests. △ Less

Submitted 10 November, 2020; v1 submitted 10 June, 2020; originally announced June 2020.

Comments: Accepted as spotlight presentation to NeurIPS '20

arXiv:2004.04580 [pdf, other]

Reciprocity-gap misfit functional for Distributed Acoustic Sensing, combining data from passive and active sources

Authors: Florian Faucher, Maarten V. de Hoop, Otmar Scherzer

Abstract: Quantitative imaging of sub-surface Earth's properties in elastic media is performed from Distributed Acoustic Sensing data. A new misfit functional based upon the reciprocity-gap is designed, taking cross-correlations of displacement and strain, and these products further associate an observation with a simulation. In comparison with other misfit functionals, this one has the advantage to only re… ▽ More Quantitative imaging of sub-surface Earth's properties in elastic media is performed from Distributed Acoustic Sensing data. A new misfit functional based upon the reciprocity-gap is designed, taking cross-correlations of displacement and strain, and these products further associate an observation with a simulation. In comparison with other misfit functionals, this one has the advantage to only require little a-priori information on the exciting sources. In particular, the misfit criterion enables the use of data from regional earthquakes (teleseismic events can be included as well), followed by exploration data to perform a multi-resolution reconstruction. The data from regional earthquakes contain the low-frequency content which is missing in the exploration ones, allowing for the recovery of the long spatial wavelength, even with very few sources. These data are used to build prior models for the subsequent reconstruction from the higher-frequency exploration data. This gives the elastic Full Reciprocity-gap Waveform Inversion method, and we demonstrate its performance with a pilot experiment for elastic isotropic reconstruction. △ Less

Submitted 4 November, 2020; v1 submitted 9 April, 2020; originally announced April 2020.

Comments: 21 pages, 6 figures

arXiv:2003.12657 [pdf, other]

doi 10.2140/paa.2021.3.789

A foliated and reversible Finsler manifold is determined by its broken scattering relation

Authors: Maarten V. de Hoop, Joonas Ilmavirta, Matti Lassas, Teemu Saksala

Abstract: The broken scattering relation consists of the total lengths of broken geodesics that start from the boundary, change direction once inside the manifold, and propagate to the boundary. We show that if two reversible Finsler manifolds satisfying a convex foliation condition have the same broken scattering relation, then they are isometric. This implies that some anisotropic material parameters of t… ▽ More The broken scattering relation consists of the total lengths of broken geodesics that start from the boundary, change direction once inside the manifold, and propagate to the boundary. We show that if two reversible Finsler manifolds satisfying a convex foliation condition have the same broken scattering relation, then they are isometric. This implies that some anisotropic material parameters of the Earth can be in principle reconstructed from single scattering measurements at the surface. △ Less

Submitted 21 May, 2021; v1 submitted 27 March, 2020; originally announced March 2020.

MSC Class: 86A22; 53Z05; 53C60

Journal ref: Pure Appl. Analysis 3 (2021) 789-811

arXiv:1912.11090 [pdf, other]

Deep learning architectures for nonlinear operator functions and nonlinear inverse problems

Authors: Maarten V. de Hoop, Matti Lassas, Christopher A. Wong

Abstract: We develop a theoretical analysis for special neural network architectures, termed operator recurrent neural networks, for approximating nonlinear functions whose inputs are linear operators. Such functions commonly arise in solution algorithms for inverse boundary value problems. Traditional neural networks treat input data as vectors, and thus they do not effectively capture the multiplicative s… ▽ More We develop a theoretical analysis for special neural network architectures, termed operator recurrent neural networks, for approximating nonlinear functions whose inputs are linear operators. Such functions commonly arise in solution algorithms for inverse boundary value problems. Traditional neural networks treat input data as vectors, and thus they do not effectively capture the multiplicative structure associated with the linear operators that correspond to the data in such inverse problems. We therefore introduce a new family that resembles a standard neural network architecture, but where the input data acts multiplicatively on vectors. Motivated by compact operators appearing in boundary control and the analysis of inverse boundary value problems for the wave equation, we promote structure and sparsity in selected weight matrices in the network. After describing this architecture, we study its representation properties as well as its approximation properties. We furthermore show that an explicit regularization can be introduced that can be derived from the mathematical analysis of the mentioned inverse problems, and which leads to certain guarantees on the generalization properties. We observe that the sparsity of the weight matrices improves the generalization estimates. Lastly, we discuss how operator recurrent networks can be viewed as a deep learning analogue to deterministic algorithms such as boundary control for reconstructing the unknown wavespeed in the acoustic wave equation from boundary measurements. △ Less

Submitted 3 January, 2022; v1 submitted 23 December, 2019; originally announced December 2019.

Comments: To appear in Mathematical Statistics and Learning

MSC Class: 68T05; 35R30; 62M45

arXiv:1912.06087 [pdf, other]

Attention network forecasts time-to-failure in laboratory shear experiments

Authors: Hope Jasperson, David C. Bolton, Paul Johnson, Robert Guyer, Chris Marone, Maarten V. de Hoop

Abstract: Rocks under stress deform by creep mechanisms that include formation and slip on small-scale internal cracks. Intragranular cracks and slip along grain contacts release energy as elastic waves termed acoustic emissions (AE). AEs are thought to contain predictive information that can be used for fault failure forecasting. Here we present a method using unsupervised classification and an attention n… ▽ More Rocks under stress deform by creep mechanisms that include formation and slip on small-scale internal cracks. Intragranular cracks and slip along grain contacts release energy as elastic waves termed acoustic emissions (AE). AEs are thought to contain predictive information that can be used for fault failure forecasting. Here we present a method using unsupervised classification and an attention network to forecast labquakes using AE waveform features. Our data were generated in a laboratory setting using a biaxial shearing device with granular fault gouge intended to mimic the conditions of tectonic faults. Here we analyzed the temporal evolution of AEs generated throughout several hundred laboratory earthquake cycles. We used a Conscience Self-Organizing Map (CSOM) to perform topologically ordered vector quantization based on waveform properties. The resulting map was used to interactively cluster AEs. We examined the clusters over time to identify those with predictive ability. Finally, we used a variety of LSTM and attention-based networks to test the predictive power of the AE clusters. By tracking cumulative waveform features over the seismic cycle, the network is able to forecast the time-to-failure (TTF) of lab earthquakes. Our results show that analyzing the data to isolate predictive signals and using a more sophisticated network architecture are key to robustly forecasting labquakes. In the future, this method could be applied on tectonic faults monitor earthquakes and augment current early warning systems. △ Less

Submitted 20 September, 2021; v1 submitted 12 December, 2019; originally announced December 2019.

arXiv:1912.00114

The computation of seismic normal modes with rotation as a quadratic eigenvalue problem

Authors: Jia Shi, Ruipeng Li, Yuanzhe Xi, Yousef Saad, Maarten V. de Hoop

Abstract: A new approach is presented to compute the seismic normal modes of a fully heterogeneous, rotating planet. Special care is taken to separate out the essential spectrum in the presence of a fluid outer core. The relevant elastic-gravitational system of equations, including the Coriolis force, is subjected to a mixed finite-element method, while self-gravitation is accounted for with the fast multip… ▽ More A new approach is presented to compute the seismic normal modes of a fully heterogeneous, rotating planet. Special care is taken to separate out the essential spectrum in the presence of a fluid outer core. The relevant elastic-gravitational system of equations, including the Coriolis force, is subjected to a mixed finite-element method, while self-gravitation is accounted for with the fast multipole method (FMM). To solve the resulting quadratic eigenvalue problem (QEP), the approach utilizes extended Lanczos vectors forming a subspace computed from a non-rotating planet -- with the shape of boundaries of a rotating planet and accounting for the centrifugal potential -- to reduce the dimension of the original problem significantly. The subspace is guaranteed to be contained in the space of functions to which the seismic normal modes belong. The reduced system can further be solved with a standard eigensolver. The computational accuracy is illustrated using all the modes with relative small meshes and also tested against standard perturbation calculations relative to a standard Earth model. The algorithm and code are used to compute the point spectra of eigenfrequencies in several Mars models studying the effects of heterogeneity on a large range of scales. △ Less

Submitted 25 September, 2021; v1 submitted 29 November, 2019; originally announced December 2019.

Comments: It was merged with another paper, arXiv:1906.11082

arXiv:1909.11172 [pdf, other]

Generic uniqueness and stability for the mixed ray transform

Authors: Maarten V. de Hoop, Teemu Saksala, Gunther Uhlmann, Jian Zhai

Abstract: We consider the mixed ray transform of tensor fields on a three-dimensional compact simple Riemannian manifold with boundary. We prove the injectivity of the transform, up to natural obstructions, and establish stability estimates for the normal operator on generic three dimensional simple manifold in the case of 1+1 and 2+2 tensors fields. We show how the anisotropic perturbations of averaged i… ▽ More We consider the mixed ray transform of tensor fields on a three-dimensional compact simple Riemannian manifold with boundary. We prove the injectivity of the transform, up to natural obstructions, and establish stability estimates for the normal operator on generic three dimensional simple manifold in the case of 1+1 and 2+2 tensors fields. We show how the anisotropic perturbations of averaged isotopic travel-times of qS-polarized elastic waves provide partial information about the mixed ray transform of 2+2 tensors fields. If in addition we include the measurement of the shear wave amplitude, the complete mixed ray transform can be recovered. We also show how one can obtain the mixed ray transform from an anisotropic perturbation of the Dirichlet-to-Neumann map of an isotropic elastic wave equation on a smooth and bounded domain in three dimensional Euclidean space. △ Less

Submitted 18 August, 2020; v1 submitted 24 September, 2019; originally announced September 2019.

MSC Class: 53C22 53C65

arXiv:1908.11698 [pdf, other]

Semiclassical inverse spectral problem for elastic Rayleigh waves in isotropic media

Authors: Maarten V. de Hoop, Alexei Iantchenko, Robert D. van der Hilst, Jian Zhai

Abstract: We analyze the inverse spectral problem on the half line associated with elastic surface waves. Here, we extend the treatment of Love waves [arXiv: 1908.10529] to Rayleigh waves. Under certain conditions, and assuming that the Poisson ratio is constant, we establish uniqueness and present a reconstruction scheme for the S-wave speed with multiple wells from the semiclassical spectrum of these wave… ▽ More We analyze the inverse spectral problem on the half line associated with elastic surface waves. Here, we extend the treatment of Love waves [arXiv: 1908.10529] to Rayleigh waves. Under certain conditions, and assuming that the Poisson ratio is constant, we establish uniqueness and present a reconstruction scheme for the S-wave speed with multiple wells from the semiclassical spectrum of these waves. △ Less

Submitted 29 August, 2019; originally announced August 2019.

Comments: arXiv admin note: text overlap with arXiv:1908.10529

arXiv:1908.10529 [pdf, other]

Semiclassical inverse spectral problem for elastic Love waves in isotropic media

Authors: Maarten V. de Hoop, Alexei Iantchenko, Robert D. van der Hilst, Jian Zhai

Abstract: We analyze the inverse spectral problem on the half line associated with elastic surface waves. Here, we focus on Love waves. Under certain generic conditions, we establish uniqueness and present a reconstruction scheme for the S- wavespeed with multiple wells from the semiclassical spectrum of these waves. We analyze the inverse spectral problem on the half line associated with elastic surface waves. Here, we focus on Love waves. Under certain generic conditions, we establish uniqueness and present a reconstruction scheme for the S- wavespeed with multiple wells from the semiclassical spectrum of these waves. △ Less

Submitted 27 August, 2019; originally announced August 2019.

Showing 1–50 of 113 results for author: de Hoop, M