Search | arXiv e-print repository

Fully invertible hyperbolic neural networks for segmenting large-scale surface and sub-surface data

Authors: Bas Peters, Eldad Haber, Keegan Lensink

Abstract: The large spatial/temporal/frequency scale of geoscience and remote-sensing datasets causes memory issues when using convolutional neural networks for (sub-) surface data segmentation. Recently developed fully reversible or fully invertible networks can mostly avoid memory limitations by recomputing the states during the backward pass through the network. This results in a low and fixed memory requirement for storing network states, as opposed to the typical linear memory growth with network depth. This work focuses on a fully invertible network based on the telegraph equation. While reversibility saves the major amount of memory used in deep networks by the data, the convolutional kernels can take up most memory if fully invertible networks contain multiple invertible pooling/coarsening layers. We address the explosion of the number of convolutional kernels by combining fully invertible networks with layers that contain the convolutional kernels in a compressed form directly. A second challenge is that invertible networks output a tensor the same size as its input. This property prevents the straightforward application of invertible networks to applications that map between different input-output dimensions, need to map to outputs with more channels than present in the input data, or desire outputs that decrease/increase the resolution compared to the input data. However, we show that by employing invertible networks in a non-standard fashion, we can still use them for these tasks.
Examples in hyperspectral land-use classification, airborne geophysical surveying, and seismic imaging illustrate that we can input large data volumes in one chunk and do not need to work on small patches, use dimensionality reduction, or employ methods that classify a patch to a single central pixel.

Submitted 30 June, 2024; originally announced July 2024.

Comments: 22 pages, 13 figures

MSC Class: 86A04
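
The idea of recomputing states during the backward pass, described in the abstract above, can be illustrated with a minimal sketch. It assumes a leapfrog (second-order) update, which is a common construction for hyperbolic reversible networks and not necessarily the authors' telegraph-equation scheme. The layer function `f`, the step size `h`, and the dense matrices standing in for convolutional kernels are all hypothetical.

```python
import numpy as np

def f(y, K):
    # Hypothetical layer function in the symmetric form -K^T tanh(K y).
    return -K.T @ np.tanh(K @ y)

def forward(y0, y1, kernels, h=0.1):
    """Propagate with the leapfrog update, keeping only the last two states."""
    y_prev, y = y0, y1
    for K in kernels:
        # y_{j+1} = 2 y_j - y_{j-1} + h^2 f(y_j)
        y_prev, y = y, 2.0 * y - y_prev + h**2 * f(y, K)
    return y_prev, y

def backward(y_prev, y, kernels, h=0.1):
    """Recompute earlier states from the final two by inverting each step."""
    for K in reversed(kernels):
        # y_{j-1} = 2 y_j - y_{j+1} + h^2 f(y_j)
        y_prev, y = 2.0 * y_prev - y + h**2 * f(y_prev, K), y_prev
    return y_prev, y
```

Because each step can be inverted algebraically, the states needed for gradients can be rebuilt layer by layer from the output, so storage for states does not grow with the number of layers in this sketch.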

arXiv:2407.00257 [pdf, other]

Inverting airborne electromagnetic data with machine learning

Authors: Michael S. McMillan, Bas Peters, Ophir Greif, Paulina Wozniakowska, Eldad Haber

Abstract: This study focuses on inverting time-domain airborne electromagnetic data in 2D by training a neural-network to understand the relationship between data and conductivity, thereby removing the need for expensive forward modeling during the inversion process. Instead the forward modeling is completed in the training stage, where training models are built before calculating 3D forward modeling training data. The method relies on training data being similar to the field dataset of choice, therefore, the field data was first inverted in 1D to get an idea of the expected conductivity distribution. With this information, 10,000 training models were built with similar conductivity ranges, and the research shows that this provided enough information for the network to produce realistic 2D inversion models over an aquifer-bearing region in California. Once the training was completed, the actual inversion time took only a matter of seconds on a generic laptop, which means that if future data was collected in this region it could be inverted in near real-time. Better results are expected by increasing the number of training models and eventually the goal is to extend the method to 3D inversion.

Submitted 28 June, 2024; originally announced July 2024.

Comments: 4 pages, 5 figures, conference submission

MSC Class: 86A22

arXiv:2312.12969 [pdf, ps, other]

Explicit form for the most general Lorentz transformation revisited

Authors: Howard E. Haber

Abstract: Explicit formulae for the $4\times 4$ Lorentz transformation matrices corresponding to a pure boost and a pure three-dimensional rotation are very well-known. Significantly less well-known is the explicit formula for a general Lorentz transformation with arbitrary boost and rotation parameters. We revisit this more general formula by presenting two different derivations. The first derivation (which is somewhat simpler than previous ones appearing in the literature) evaluates the exponential of a $4\times 4$ matrix $A$, where $GA$ is an arbitrary $4\times 4$ real antisymmetric matrix and $G$ is a diagonal matrix corresponding to the Minkowski metric. The formula for $\exp A$ depends only on the eigenvalues of $A$ and makes use of the Lagrange interpolating polynomial. The second derivation exploits the assertion that the spinor product $η^\dagger{\barσ}^{\,μ}χ$ transforms as a Lorentz four-vector, where $χ$ and $η$ are two-component spinors. The advantage of this derivation is that the formula for a general Lorentz transformation $Λ$ reduces to the computation of the trace of a product of $2\times 2$ matrices. Both computations are shown to yield equivalent expressions for $Λ$.

Submitted 4 February, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

Comments: 24 pages; v2: typographical errors fixed and a minor improvement of notation is implemented. In addition, the explicit form for a Lorentz transformation in 2+1 spacetime dimensions is provided
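
The construction in the abstract above (exponentiating $A$ where $GA$ is antisymmetric) can be checked numerically with a short sketch. It substitutes a truncated Taylor series for the paper's closed-form expression based on eigenvalues and the Lagrange interpolating polynomial, and it assumes the metric signature (+,-,-,-). The function names and the six-parameter input are hypothetical.

```python
import numpy as np

# Minkowski metric; the signature (+,-,-,-) is an assumption of this sketch.
G = np.diag([1.0, -1.0, -1.0, -1.0])

def antisymmetric(params):
    """Build a real antisymmetric 4x4 matrix from six free parameters."""
    M = np.zeros((4, 4))
    M[np.triu_indices(4, k=1)] = params
    return M - M.T

def expm_taylor(A, terms=40):
    """Truncated Taylor series for exp(A); adequate for small-norm A."""
    result = np.eye(A.shape[0])
    term = np.eye(A.shape[0])
    for k in range(1, terms):
        term = term @ A / k
        result = result + term
    return result

def lorentz(params):
    """Return exp(A) with G @ A antisymmetric (G is its own inverse)."""
    A = G @ antisymmetric(params)
    return expm_taylor(A)
```

For any six parameters of moderate size, the result `L = lorentz(params)` should satisfy `L.T @ G @ L == G` up to rounding, which is the defining property of a Lorentz transformation.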

arXiv:2303.11404 [pdf, other]

Semi-Automated Segmentation of Geoscientific Data Using Superpixels

Authors: Conrad P. Koziol, Eldad Haber

Abstract: Geological processes determine the distribution of resources such as critical minerals, water, and geothermal energy. However, direct observation of geology is often prevented by surface cover such as overburden or vegetation. In such cases, remote and in-situ surveys are frequently conducted to collect physical measurements of the earth indicative of the geology. Developing a geological segmentation based on these measurements is challenging since individual datasets can differ in properties (e.g. units, dynamic ranges, textures) and because the data does not uniquely constrain the geology. Further, as the number of datasets grows the information to constrain geology increases while simultaneously becoming harder to make sense of. Inspired by the concept of superpixels, we propose a deep-learning based approach to segment rasterized survey data into regions with similar characteristics. We demonstrate its use for semi-automated geoscientific mapping with datasets arising from independent sensors and with diverse properties. In addition, we introduce a new loss function for superpixels including a novel regularization parameter penalizing image segmentation with non-connected component superpixels. This improves integration of prior knowledge by allowing better control over the number of superpixels generated.

Submitted 20 March, 2023; originally announced March 2023.

Comments: 11 pages, 7 figures

arXiv:2211.14302 [pdf, other]

Neural DAEs: Constrained neural networks

Authors: Tue Boesen, Eldad Haber, Uri Michael Ascher

Abstract: This article investigates the effect of explicitly adding auxiliary algebraic trajectory information to neural networks for dynamical systems. We draw inspiration from the field of differential-algebraic equations and differential equations on manifolds and implement related methods in residual neural networks, despite some fundamental scenario differences. Constraint or auxiliary information effects are incorporated through stabilization as well as projection methods, and we show when to use which method based on experiments involving simulations of multi-body pendulums and molecular dynamics scenarios. Several of our methods are easy to implement in existing code and have limited impact on training performance while giving significant boosts in terms of inference.

Submitted 12 March, 2024; v1 submitted 25 November, 2022; originally announced November 2022.

Comments: Extended the paper to PDEs, added a third experiment denoising a vector field and updated the introduction to make the distinction between this work and physics informed neural networks more clear

MSC Class: 70H99; 34A09

arXiv:2107.11235 [pdf, other]

doi 10.1063/5.0064458

Robust deep learning for emulating turbulent viscosities

Authors: Aakash Patil, Jonathan Viquerat, George El Haber, Elie Hachem

Abstract: From the simplest models to complex deep neural networks, modeling turbulence with machine learning techniques still offers multiple challenges. In this context, the present contribution proposes a robust strategy using patch-based training to learn turbulent viscosity from flow velocities, and demonstrates its efficient use on the Spalart-Allmaras turbulence model. Training datasets are generated for flow past two-dimensional (2D) obstacles at high Reynolds numbers and used to train an auto-encoder type convolutional neural network with local patch inputs. Compared to a standard training technique, patch-based learning not only yields increased accuracy but also reduces the computational cost required for training.

Submitted 1 October, 2021; v1 submitted 23 July, 2021; originally announced July 2021.

arXiv:2003.08466 [pdf, other]

Fully reversible neural networks for large-scale 3D seismic horizon tracking

Authors: Bas Peters, Eldad Haber

Abstract: Tracking a horizon in seismic images or 3D volumes is an integral part of seismic interpretation. The last few decades saw progress in using neural networks for this task, starting from shallow networks for 1D traces, to deeper convolutional neural networks for large 2D images. Because geological structures are intrinsically 3D, we hope to see improved horizon tracking by training networks on 3D seismic data cubes. While there are some 3D convolutional neural networks for various seismic interpretation tasks, they are restricted to shallow networks or relatively small 3D inputs because of memory limitations. The required memory for the network states and weights increases with network depth. We present a fully reversible network for horizon tracking that has a memory requirement that is independent of network depth. To tackle memory issues regarding the network weights, we use layers that train in a factorized form directly. Therefore, we can maintain a large number of network channels while keeping the number of convolutional kernels low. We use the saved memory to increase the input size of the data by an order of magnitude such that the network can better learn from large structures in the data. A field data example verifies the proposed network structure is suitable for seismic horizon tracking.

Submitted 18 March, 2020; originally announced March 2020.

MSC Class: 68U10 ACM Class: I.4.6

arXiv:2003.07474 [pdf, other]

Fully reversible neural networks for large-scale surface and sub-surface characterization via remote sensing

Authors: Bas Peters, Eldad Haber, Keegan Lensink

Abstract: The large spatial/frequency scale of hyperspectral and airborne magnetic and gravitational data causes memory issues when using convolutional neural networks for (sub-) surface characterization. Recently developed fully reversible networks can mostly avoid memory limitations by virtue of having a low and fixed memory requirement for storing network states, as opposed to the typical linear memory growth with depth. Fully reversible networks enable the training of deep neural networks that take in entire data volumes, and create semantic segmentations in one go. This approach avoids the need to work in small patches or map a data patch to the class of just the central pixel. The cross-entropy loss function requires small modifications to work in conjunction with a fully reversible network and learn from sparsely sampled labels without ever seeing fully labeled ground truth. We show examples from land-use change detection from hyperspectral time-lapse data, and regional aquifer mapping from airborne geophysical and geological data.

Submitted 16 March, 2020; originally announced March 2020.

MSC Class: 68T45 ACM Class: I.4.6

arXiv:1904.04413 [pdf, other]

doi 10.1190/segam2019-3216640.1

Does shallow geological knowledge help neural-networks to predict deep units?

Authors: Bas Peters, Eldad Haber, Justin Granek

Abstract: Geological interpretation of seismic images is a visual task that can be automated by training neural networks. While neural networks have been shown to be effective at various interpretation tasks, a fundamental challenge is the lack of labeled data points in the subsurface. For example, the interpolation and extrapolation of well-based lithology using seismic images relies on a small number of known labels. Besides well-known data augmentation techniques, as well as regularization of the network output, we propose and test another approach to deal with the lack of labels. Non learning-based horizon trackers work very well in the shallow subsurface where seismic images are of higher quality and the geological units are roughly layered. We test if these segmented and shallow units can help train neural networks to predict deeper geological units that are not layered and flat. We show that knowledge of shallow geological units helps to predict deeper units when there are only a few labels for training using a dataset from the Sea of Ireland. We employ U-net based multi-resolution networks, and we show that these networks can be described using matrix-vector product notation in a similar fashion as standard geophysical inverse problems.

Submitted 8 April, 2019; originally announced April 2019.

Comments: 7 pages, 5 figures

MSC Class: 86A99

arXiv:1903.11215 [pdf, other]

Neural-networks for geophysicists and their application to seismic data interpretation

Authors: Bas Peters, Eldad Haber, Justin Granek

Abstract: Neural-networks have seen a surge of interest for the interpretation of seismic images during the last few years. Network-based learning methods can provide fast and accurate automatic interpretation, provided there are sufficiently many training labels. We provide an introduction to the field aimed at geophysicists who are familiar with the framework of forward modeling and inversion. We explain the similarities and differences between deep networks and other geophysical inverse problems and show their utility in solving problems such as lithology interpolation between wells, horizon tracking and segmentation of seismic images. The benefits of our approach are demonstrated on field data from the Sea of Ireland and the North Sea.

Submitted 26 March, 2019; originally announced March 2019.

Comments: 8 pages, 5 figures

MSC Class: 86A04

arXiv:1901.03457 [pdf]

doi 10.1103/PhysRevLett.122.203901

Cooperative energy transfer controls the spontaneous emission rate beyond field enhancement limits

Authors: Mohamed ElKabbash, Ermanno Miele, Ahmad K. Fumani, Michael S. Wolf, Angelo Bozzola, Elisha Haber, Tigran V. Shahbazyan, Jesse Berezovsky, Francesco De Angelis, Giuseppe Strangi

Abstract: Quantum emitters located in proximity to a metal nanostructure individually transfer their energy via near-field excitation of surface plasmons. The energy transfer process increases the spontaneous emission (SE) rate due to plasmon-enhanced local field. Here, we demonstrate significant acceleration of quantum emitter SE rate in a plasmonic nano-cavity due to cooperative energy transfer (CET) from plasmon-correlated emitters. Using an integrated plasmonic nano-cavity, we realize up to six-fold enhancement in the emission rate of emitters coupled to the same nano-cavity on top of the plasmonic enhancement of the local density of states. The radiated power spectrum retains the plasmon resonance central frequency and lineshape, with the peak amplitude proportional to the number of excited emitters, indicating that the observed cooperative SE is distinct from super-radiance. Plasmon-assisted CET offers unprecedented control over the SE rate and allows dynamic control of the spontaneous emission rate at room temperature, enabling an SE-rate-based optical modulator.

Submitted 10 January, 2019; originally announced January 2019.

Comments: 23 pages. Includes Supplemental Material

Journal ref: Phys. Rev. Lett. 122, 203901 (2019)

arXiv:1812.11092 [pdf, other]

Multi-resolution neural networks for tracking seismic horizons from few training images

Authors: Bas Peters, Justin Granek, Eldad Haber

Abstract: Detecting a specific horizon in seismic images is a valuable tool for geological interpretation. Because hand-picking the locations of the horizon is a time-consuming process, automated computational methods were developed starting three decades ago. Older techniques for such picking include interpolation of control points; however, in recent years neural networks have been used for this task. Until now, most networks trained on small patches from larger images. This limits the network's ability to learn from large-scale geologic structures. Moreover, currently available networks and training strategies require label patches that have full and continuous annotations, which are also time-consuming to generate. We propose a projected loss-function for training convolutional networks with a multi-resolution structure, including variants of the U-net. Our networks learn from a small number of large seismic images without creating patches. The projected loss-function enables training on labels with just a few annotated pixels and has no issue with the other unknown label pixels. Training uses all data without reserving some for validation. Only the labels are split into training/testing. Contrary to other work on horizon tracking, we train the network to perform non-linear regression, and not classification. As such, we propose labels as the convolution of a Gaussian kernel and the known horizon locations that indicate uncertainty in the labels. The network output is the probability of the horizon location.
We demonstrate the proposed computational ingredients on two different datasets, for horizon extrapolation and interpolation. We show that the predictions of our methodology are accurate even in areas far from known horizon locations because our learning strategy exploits all data in large seismic images.

Submitted 26 December, 2018; originally announced December 2018.

Comments: 24 pages, 13 figures

MSC Class: 68T45 (Primary)
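
Two ingredients from the abstract above, Gaussian-smoothed horizon labels and a loss that ignores unannotated pixels, can be sketched as follows. This is a simplified illustration under stated assumptions, not the paper's projected loss-function: the function names, the width `sigma`, and the use of a binary mask with a squared-error misfit are all hypothetical choices.

```python
import numpy as np

def gaussian_labels(horizon_rows, n_rows, sigma=2.0):
    """One column per trace: a Gaussian bump centred on the picked horizon row.

    Convolving a unit spike at the pick with a Gaussian kernel gives this bump.
    """
    z = np.arange(n_rows)[:, None]
    picks = np.asarray(horizon_rows)[None, :]
    return np.exp(-0.5 * ((z - picks) / sigma) ** 2)

def masked_squared_loss(prediction, labels, mask):
    """Mean squared error over annotated pixels only (mask == 1)."""
    diff = (prediction - labels) * mask
    return np.sum(diff**2) / max(np.sum(mask), 1)
```

With a mask that selects only the annotated traces, errors at unlabeled pixels contribute nothing to the loss, so a few picks in a large image are enough to define a training signal in this sketch.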

arXiv:1804.08697 [pdf, other]

Simultaneous shot inversion for nonuniform geometries using fast data interpolation

Authors: Michelle Liu, Rajiv Kumar, Eldad Haber, Aleksandr Aravkin

Abstract: Stochastic optimization is key to efficient inversion in PDE-constrained optimization. Using 'simultaneous shots', or random superposition of source terms, works very well in simple acquisition geometries where all sources see all receivers, but this rarely occurs in practice. We develop an approach that interpolates data to an ideal acquisition geometry while solving the inverse problem using simultaneous shots. The approach is formulated as a joint inverse problem, combining ideas from low-rank interpolation with full-waveform inversion. Results using synthetic experiments illustrate the flexibility and efficiency of the approach.

Submitted 23 April, 2018; originally announced April 2018.

Comments: 16 pages, 10 figures

MSC Class: 65K05; 65K10; 86-08
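
The 'simultaneous shots' idea in the abstract above relies on linearity of the forward problem in the source term, which a toy example can make concrete. Everything here is a stand-in: a random dense matrix `F` plays the role of the forward operator (sources to data at all receivers), and random signs are one possible choice of superposition weights.

```python
import numpy as np

rng = np.random.default_rng(0)
n, n_src = 6, 4
F = rng.standard_normal((n, n))   # stand-in linear forward operator
Q = np.eye(n)[:, :n_src]          # one source term per column
D = F @ Q                         # data from n_src separate simulations

def simultaneous_shot(Q, D, rng):
    """Superpose sources and data with the same random +/-1 weights."""
    w = rng.choice([-1.0, 1.0], size=Q.shape[1])
    return Q @ w, D @ w

q_sim, d_sim = simultaneous_shot(Q, D, rng)
```

A single simulation with `q_sim` reproduces `d_sim`, so one solve replaces `n_src` solves per random draw. This only works when every source is recorded by every receiver, which is the limitation the abstract says rarely holds in practice.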

arXiv:1712.06091 [pdf, other]

A multigrid solver to the Helmholtz equation with a point source based on travel time and amplitude

Authors: Eran Treister, Eldad Haber

Abstract: The Helmholtz equation arises when modeling wave propagation in the frequency domain. The equation is discretized as an indefinite linear system, which is difficult to solve at high wave numbers. In many applications, the solution of the Helmholtz equation is required for a point source. In this case, it is possible to reformulate the equation as two separate equations: one for the travel time of the wave and one for its amplitude. The travel time is obtained by a solution of the factored eikonal equation, and the amplitude is obtained by solving a complex-valued advection-diffusion-reaction (ADR) equation. The reformulated equation is equivalent to the original Helmholtz equation, and the differences between the numerical solutions of these equations arise only from discretization errors. We develop an efficient multigrid solver for obtaining the amplitude given the travel time, which can be efficiently computed. This approach is advantageous because the amplitude is typically smooth in this case, and hence, more suitable for multigrid solvers than the standard Helmholtz discretization. We demonstrate that our second order ADR discretization is more accurate than the standard second order discretization at high wave numbers, as long as there are no reflections or caustics. Moreover, we show that using our approach, the problem can be solved more efficiently than using the common shifted Laplacian multigrid approach.

Submitted 17 December, 2017; originally announced December 2017.

arXiv:1706.03381 [pdf, other]

doi 10.1016/j.cageo.2018.04.006

A numerical method for efficient 3D inversions using Richards equation

Authors: Rowan Cockett, Lindsey J. Heagy, Eldad Haber

Abstract: Fluid flow in the vadose zone is governed by Richards equation; it is parameterized by hydraulic conductivity, which is a nonlinear function of pressure head. Investigations in the vadose zone typically require characterizing distributed hydraulic properties. Saturation or pressure head data may include direct measurements made from boreholes. Increasingly, proxy measurements from hydrogeophysics are being used to supply more spatially and temporally dense data sets. Inferring hydraulic parameters from such datasets requires the ability to efficiently solve and deterministically optimize the nonlinear time domain Richards equation. This is particularly important as the number of parameters to be estimated in a vadose zone inversion continues to grow. In this paper, we describe an efficient technique to invert for distributed hydraulic properties in 1D, 2D, and 3D. Our algorithm does not store the Jacobian, but rather computes the product with a vector, which allows the size of the inversion problem to become much larger than methods such as finite difference or automatic differentiation, which are constrained by computation and memory, respectively. We show our algorithm in practice for a 3D inversion of saturated hydraulic conductivity using saturation data through time. The code to run our examples is open source and the algorithm presented allows this inversion process to run on modest computational resources.

Submitted 11 June, 2017; originally announced June 2017.

Comments: Computers and Geosciences (2018)

arXiv:1610.02948 [pdf, other]

doi 10.1016/j.cam.2016.11.051

A Framework for the Upscaling of the Electrical Conductivity in the Quasi-static Maxwell's Equations

Authors: Luz Angelica Caudillo-Mata, Eldad Haber, Lindsey J. Heagy, Christoph Schwarzbach

Abstract: Electromagnetic simulations of complex geologic settings are computationally expensive. One reason for this is the fact that a fine mesh is required to accurately discretize the electrical conductivity model of a given setting. This conductivity model may vary over several orders of magnitude and these variations can occur over a large range of length scales. Using a very fine mesh for the discretization of this setting leads to the necessity to solve a large system of equations that is often difficult to deal with. To keep the simulations computationally tractable, coarse meshes are often employed for the discretization of the model. Such coarse meshes typically fail to capture the fine-scale variations in the conductivity model resulting in inaccuracies in the predicted data. In this work, we introduce a framework for constructing a coarse-mesh or upscaled conductivity model based on a prescribed fine-mesh model. Rather than using analytical expressions, we opt to pose upscaling as a parameter estimation problem. By solving an optimization problem, we obtain a coarse-mesh conductivity model. The optimization criterion can be tailored to the survey setting in order to produce coarse models that accurately reproduce the predicted data generated on the fine mesh. This allows us to upscale arbitrary conductivity structures, as well as to better understand the meaning of the upscaled quantity.
We use 1D and 3D examples to demonstrate that the proposed framework is able to emulate the behavior of the heterogeneity in the fine-mesh conductivity model, and to produce an accurate description of the desired predicted data obtained by using a coarse mesh in the simulation process.

Submitted 6 October, 2016; originally announced October 2016.

Comments: 27 pages, 13 figures

MSC Class: 78M40; 35K55; 65N08; 78A25; 86-08

Journal ref: Journal of Computational and Applied Mathematics, 317, 388-402 (2017)

arXiv:1610.02105 [pdf, other]

An oversampling technique for the multiscale finite volume method to simulate electromagnetic responses in the frequency domain

Authors: Luz Angelica Caudillo Mata, Eldad Haber, Christoph Schwarzbach

Abstract: In order to reduce the computational cost of the simulation of electromagnetic responses in geophysical settings that involve highly heterogeneous media, we develop a multiscale finite volume method with oversampling for the quasi-static Maxwell's equations in the frequency domain. We assume a coarse mesh nested within a fine mesh that accurately discretizes the problem. For each coarse cell, we independently solve a local version of the original Maxwell's system subject to linear boundary conditions on an extended domain, which includes the coarse cell and a neighborhood of fine cells around it. The local Maxwell's system is solved using the fine mesh contained in the extended domain and the mimetic finite volume method. Next, these local solutions (basis functions) together with a weak-continuity condition are used to construct a coarse-mesh version of the global problem. The basis functions can be used to obtain the fine-mesh details from the solution of the coarse-mesh problem. Our approach leads to a significant reduction in the size of the final system of equations and the computational time, while accurately approximating the behavior of the fine-mesh solutions. We demonstrate the performance of our method using a synthetic 3D example of a mineral deposit.

Submitted 6 October, 2016; originally announced October 2016.

Comments: 15 pages, 18 figures

MSC Class: 35K55; 35B99; 65N08; 78A25; 86-08

arXiv:1608.01352 [pdf, other]

Joint Hydrogeophysical Inversion: State Estimation for Seawater Intrusion Models in 3D

Authors: K. Steklova, E. Haber

Abstract: Seawater intrusion (SWI) is a complex process, where 3D modeling is often necessary in order to monitor and manage the affected aquifers. Here, we present a synthetic study to test a joint hydrogeophysical inversion approach aimed at solving the inverse problem of estimating initial and current saltwater distribution. First, we use a 3D groundwater model for variable density flow based on discretized flow and solute mass balance equations. In addition to the groundwater model, a 3D geophysical model was developed for direct current resistivity imaging and inversion. The objective function of the coupled problem consists of data misfit and regularization terms as well as a coupling term that relates groundwater and geophysical states. We present a novel approach to solve the inverse problem using an Alternating Direction Method of Multipliers (ADMM) to minimize this coupled objective function. The sensitivities are derived analytically for the discretized system of equations, which allows us to efficiently compute the gradients in the minimization procedure and reduce the computational complexity of the problem. The method was tested on different synthetic scenarios with groundwater and geophysical data represented by solute mass fraction data and direct current resistivity data. With the ADMM approach, we were able to obtain better estimates for the solute distribution, compared to just considering each data set separately or solving with a simple coupled approach.

Submitted 3 August, 2016; originally announced August 2016.

Showing 1–18 of 18 results for author: Haber, E