-
Fully invertible hyperbolic neural networks for segmenting large-scale surface and sub-surface data
Authors:
Bas Peters,
Eldad Haber,
Keegan Lensink
Abstract:
The large spatial/temporal/frequency scale of geoscience and remote-sensing datasets causes memory issues when using convolutional neural networks for (sub-) surface data segmentation. Recently developed fully reversible or fully invertible networks can mostly avoid memory limitations by recomputing the states during the backward pass through the network. This results in a low and fixed memory req…
▽ More
The large spatial/temporal/frequency scale of geoscience and remote-sensing datasets causes memory issues when using convolutional neural networks for (sub-) surface data segmentation. Recently developed fully reversible or fully invertible networks can mostly avoid memory limitations by recomputing the states during the backward pass through the network. This results in a low and fixed memory requirement for storing network states, as opposed to the typical linear memory growth with network depth. This work focuses on a fully invertible network based on the telegraph equation. While reversibility saves the major amount of memory used in deep networks by the data, the convolutional kernels can take up most memory if fully invertible networks contain multiple invertible pooling/coarsening layers. We address the explosion of the number of convolutional kernels by combining fully invertible networks with layers that contain the convolutional kernels in a compressed form directly. A second challenge is that invertible networks output a tensor the same size as its input. This property prevents the straightforward application of invertible networks to applications that map between different input-output dimensions, need to map to outputs with more channels than present in the input data, or desire outputs that decrease/increase the resolution compared to the input data. However, we show that by employing invertible networks in a non-standard fashion, we can still use them for these tasks. Examples in hyperspectral land-use classification, airborne geophysical surveying, and seismic imaging illustrate that we can input large data volumes in one chunk and do not need to work on small patches, use dimensionality reduction, or employ methods that classify a patch to a single central pixel.
△ Less
Submitted 30 June, 2024;
originally announced July 2024.
-
Inverting airborne electromagnetic data with machine learning
Authors:
Michael S. McMillan,
Bas Peters,
Ophir Greif,
Paulina Wozniakowska,
Eldad Haber
Abstract:
This study focuses on inverting time-domain airborne electromagnetic data in 2D by training a neural-network to understand the relationship between data and conductivity, thereby removing the need for expensive forward modeling during the inversion process. Instead the forward modeling is completed in the training stage, where training models are built before calculating 3D forward modeling traini…
▽ More
This study focuses on inverting time-domain airborne electromagnetic data in 2D by training a neural-network to understand the relationship between data and conductivity, thereby removing the need for expensive forward modeling during the inversion process. Instead the forward modeling is completed in the training stage, where training models are built before calculating 3D forward modeling training data. The method relies on training data being similar to the field dataset of choice, therefore, the field data was first inverted in 1D to get an idea of the expected conductivity distribution. With this information, $ 10,000 $ training models were built with similar conductivity ranges, and the research shows that this provided enough information for the network to produce realistic 2D inversion models over an aquifer-bearing region in California. Once the training was completed, the actual inversion time took only a matter of seconds on a generic laptop, which means that if future data was collected in this region it could be inverted in near real-time. Better results are expected by increasing the number of training models and eventually the goal is to extend the method to 3D inversion.
△ Less
Submitted 28 June, 2024;
originally announced July 2024.
-
Explicit form for the most general Lorentz transformation revisited
Authors:
Howard E. Haber
Abstract:
Explicit formulae for the $4\times 4$ Lorentz transformation matrices corresponding to a pure boost and a pure three-dimensional rotation are very well-known. Significantly less well-known is the explicit formula for a general Lorentz transformation with arbitrary boost and rotation parameters. We revisit this more general formula by presenting two different derivations. The first derivation (whic…
▽ More
Explicit formulae for the $4\times 4$ Lorentz transformation matrices corresponding to a pure boost and a pure three-dimensional rotation are very well-known. Significantly less well-known is the explicit formula for a general Lorentz transformation with arbitrary boost and rotation parameters. We revisit this more general formula by presenting two different derivations. The first derivation (which is somewhat simpler than previous ones appearing in the literature) evaluates the exponential of a $4\times 4$ matrix $A$, where $GA$ is an arbitrary $4\times 4$ real antisymmetric matrix and $G$ is a diagonal matrix corresponding to the Minkowski metric. The formula for $\exp A$ depends only on the eigenvalues of $A$ and makes use of the Lagrange interpolating polynomial. The second derivation exploits the assertion that the spinor product $η^\dagger{\barσ}^{\,μ}χ$ transforms as a Lorentz four-vector, where $χ$ and $η$ are two-component spinors. The advantage of this derivation is that the formula for a general Lorentz transformation $Λ$ reduces to the computation of the trace of a product of $2\times 2$ matrices. Both computations are shown to yield equivalent expressions for $Λ$.
△ Less
Submitted 4 February, 2024; v1 submitted 20 December, 2023;
originally announced December 2023.
-
Semi-Automated Segmentation of Geoscientific Data Using Superpixels
Authors:
Conrad P. Koziol,
Eldad Haber
Abstract:
Geological processes determine the distribution of resources such as critical minerals, water, and geothermal energy. However, direct observation of geology is often prevented by surface cover such as overburden or vegetation. In such cases, remote and in-situ surveys are frequently conducted to collect physical measurements of the earth indicative of the geology. Develo** a geological segmentat…
▽ More
Geological processes determine the distribution of resources such as critical minerals, water, and geothermal energy. However, direct observation of geology is often prevented by surface cover such as overburden or vegetation. In such cases, remote and in-situ surveys are frequently conducted to collect physical measurements of the earth indicative of the geology. Develo** a geological segmentation based on these measurements is challenging since individual datasets can differ in properties (e.g. units, dynamic ranges, textures) and because the data does not uniquely constrain the geology. Further, as the number of datasets grows the information to constrain geology increases while simultaneously becoming harder to make sense of. Inspired by the concept of superpixels, we propose a deep-learning based approach to segment rasterized survey data into regions with similar characteristics. We demonstrate its use for semi-automated geoscientific map** with datasets arising from independent sensors and with diverse properties. In addition, we introduce a new loss function for superpixels including a novel regularization parameter penalizing image segmentation with non-connected component superpixels. This improves integration of prior knowledge by allowing better control over the number of superpixels generated.
△ Less
Submitted 20 March, 2023;
originally announced March 2023.
-
Neural DAEs: Constrained neural networks
Authors:
Tue Boesen,
Eldad Haber,
Uri Michael Ascher
Abstract:
This article investigates the effect of explicitly adding auxiliary algebraic trajectory information to neural networks for dynamical systems. We draw inspiration from the field of differential-algebraic equations and differential equations on manifolds and implement related methods in residual neural networks, despite some fundamental scenario differences. Constraint or auxiliary information effe…
▽ More
This article investigates the effect of explicitly adding auxiliary algebraic trajectory information to neural networks for dynamical systems. We draw inspiration from the field of differential-algebraic equations and differential equations on manifolds and implement related methods in residual neural networks, despite some fundamental scenario differences. Constraint or auxiliary information effects are incorporated through stabilization as well as projection methods, and we show when to use which method based on experiments involving simulations of multi-body pendulums and molecular dynamics scenarios. Several of our methods are easy to implement in existing code and have limited impact on training performance while giving significant boosts in terms of inference.
△ Less
Submitted 12 March, 2024; v1 submitted 25 November, 2022;
originally announced November 2022.
-
Robust deep learning for emulating turbulent viscosities
Authors:
Aakash Patil,
Jonathan Viquerat,
George El Haber,
Elie Hachem
Abstract:
From the simplest models to complex deep neural networks, modeling turbulence with machine learning techniques still offers multiple challenges. In this context, the present contribution proposes a robust strategy using patch-based training to learn turbulent viscosity from flow velocities, and demonstrates its efficient use on the Spallart-Allmaras turbulence model. Training datasets are generate…
▽ More
From the simplest models to complex deep neural networks, modeling turbulence with machine learning techniques still offers multiple challenges. In this context, the present contribution proposes a robust strategy using patch-based training to learn turbulent viscosity from flow velocities, and demonstrates its efficient use on the Spallart-Allmaras turbulence model. Training datasets are generated for flow past two-dimensional (2D) obstacles at high Reynolds numbers and used to train an auto-encoder type convolutional neural network with local patch inputs. Compared to a standard training technique, patch-based learning not only yields increased accuracy but also reduces the computational cost required for training.
△ Less
Submitted 1 October, 2021; v1 submitted 23 July, 2021;
originally announced July 2021.
-
Fully reversible neural networks for large-scale 3D seismic horizon tracking
Authors:
Bas Peters,
Eldad Haber
Abstract:
Tracking a horizon in seismic images or 3D volumes is an integral part of seismic interpretation. The last few decades saw progress in using neural networks for this task, starting from shallow networks for 1D traces, to deeper convolutional neural networks for large 2D images. Because geological structures are intrinsically 3D, we hope to see improved horizon tracking by training networks on 3D s…
▽ More
Tracking a horizon in seismic images or 3D volumes is an integral part of seismic interpretation. The last few decades saw progress in using neural networks for this task, starting from shallow networks for 1D traces, to deeper convolutional neural networks for large 2D images. Because geological structures are intrinsically 3D, we hope to see improved horizon tracking by training networks on 3D seismic data cubes. While there are some 3D convolutional neural networks for various seismic interpretation tasks, they are restricted to shallow networks or relatively small 3D inputs because of memory limitations. The required memory for the network states and weights increases with network depth. We present a fully reversible network for horizon tracking that has a memory requirement that is independent of network depth. To tackle memory issues regarding the network weights, we use layers that train in a factorized form directly. Therefore, we can maintain a large number of network channels while kee** the number of convolutional kernels low. We use the saved memory to increase the input size of the data by order of magnitude such that the network can better learn from large structures in the data. A field data example verifies the proposed network structure is suitable for seismic horizon tracking.
△ Less
Submitted 18 March, 2020;
originally announced March 2020.
-
Fully reversible neural networks for large-scale surface and sub-surface characterization via remote sensing
Authors:
Bas Peters,
Eldad Haber,
Keegan Lensink
Abstract:
The large spatial/frequency scale of hyperspectral and airborne magnetic and gravitational data causes memory issues when using convolutional neural networks for (sub-) surface characterization. Recently developed fully reversible networks can mostly avoid memory limitations by virtue of having a low and fixed memory requirement for storing network states, as opposed to the typical linear memory g…
▽ More
The large spatial/frequency scale of hyperspectral and airborne magnetic and gravitational data causes memory issues when using convolutional neural networks for (sub-) surface characterization. Recently developed fully reversible networks can mostly avoid memory limitations by virtue of having a low and fixed memory requirement for storing network states, as opposed to the typical linear memory growth with depth. Fully reversible networks enable the training of deep neural networks that take in entire data volumes, and create semantic segmentations in one go. This approach avoids the need to work in small patches or map a data patch to the class of just the central pixel. The cross-entropy loss function requires small modifications to work in conjunction with a fully reversible network and learn from sparsely sampled labels without ever seeing fully labeled ground truth. We show examples from land-use change detection from hyperspectral time-lapse data, and regional aquifer map** from airborne geophysical and geological data.
△ Less
Submitted 16 March, 2020;
originally announced March 2020.
-
Does shallow geological knowledge help neural-networks to predict deep units?
Authors:
Bas Peters,
Eldad Haber,
Justin Granek
Abstract:
Geological interpretation of seismic images is a visual task that can be automated by training neural networks. While neural networks have shown to be effective at various interpretation tasks, a fundamental challenge is the lack of labeled data points in the subsurface. For example, the interpolation and extrapolation of well-based lithology using seismic images relies on a small number of known…
▽ More
Geological interpretation of seismic images is a visual task that can be automated by training neural networks. While neural networks have shown to be effective at various interpretation tasks, a fundamental challenge is the lack of labeled data points in the subsurface. For example, the interpolation and extrapolation of well-based lithology using seismic images relies on a small number of known labels. Besides well-known data augmentation techniques, as well as regularization of the network output, we propose and test another approach to deal with the lack of labels. Non learning-based horizon trackers work very well in the shallow subsurface where seismic images are of higher quality and the geological units are roughly layered. We test if these segmented and shallow units can help train neural networks to predict deeper geological units that are not layered and flat. We show that knowledge of shallow geological units helps to predict deeper units when there are only a few labels for training using a dataset from the Sea of Ireland. We employ U-net based multi-resolution networks, and we show that these networks can be described using matrix-vector product notation in a similar fashion as standard geophysical inverse problems.
△ Less
Submitted 8 April, 2019;
originally announced April 2019.
-
Neural-networks for geophysicists and their application to seismic data interpretation
Authors:
Bas Peters,
Eldad Haber,
Justin Granek
Abstract:
Neural-networks have seen a surge of interest for the interpretation of seismic images during the last few years. Network-based learning methods can provide fast and accurate automatic interpretation, provided there are sufficiently many training labels. We provide an introduction to the field aimed at geophysicists that are familiar with the framework of forward modeling and inversion. We explain…
▽ More
Neural-networks have seen a surge of interest for the interpretation of seismic images during the last few years. Network-based learning methods can provide fast and accurate automatic interpretation, provided there are sufficiently many training labels. We provide an introduction to the field aimed at geophysicists that are familiar with the framework of forward modeling and inversion. We explain the similarities and differences between deep networks to other geophysical inverse problems and show their utility in solving problems such as lithology interpolation between wells, horizon tracking and segmentation of seismic images. The benefits of our approach are demonstrated on field data from the Sea of Ireland and the North Sea.
△ Less
Submitted 26 March, 2019;
originally announced March 2019.
-
Cooperative energy transfer controls the spontaneous emission rate beyond field enhancement limits
Authors:
Mohamed ElKabbash,
Ermanno Miele,
Ahmad K. Fumani,
Michael S. Wolf,
Angelo Bozzola,
Elisha Haber,
Tigran V. Shahbazyan,
Jesse Berezovsky,
Francesco De Angelis,
Giuseppe Strangi
Abstract:
Quantum emitters located in proximity to a metal nanostructure individually transfer their energy via near-field excitation of surface plasmons. The energy transfer process increases the spontaneous emission (SE) rate due to plasmon-enhanced local field. Here, we demonstrate significant acceleration of quantum emitter SE rate in a plasmonic nano-cavity due to cooperative energy transfer (CET) from…
▽ More
Quantum emitters located in proximity to a metal nanostructure individually transfer their energy via near-field excitation of surface plasmons. The energy transfer process increases the spontaneous emission (SE) rate due to plasmon-enhanced local field. Here, we demonstrate significant acceleration of quantum emitter SE rate in a plasmonic nano-cavity due to cooperative energy transfer (CET) from plasmon-correlated emitters. Using an integrated plasmonic nano-cavity, we realize up to six-fold enhancement in the emission rate of emitters coupled to the same nano-cavity on top of the plasmonic enhancement of the local density of states. The radiated power spectrum retains the plasmon resonance central frequency and lineshape, with the peak amplitude proportional to the number of excited emitters indicating that the observed cooperative SE is distinct from super-radiance. Plasmon-assisted CET offers unprecedented control over the SE rate and allows to dynamically control the spontaneous emission rate at room temperature enabling an SE rate based optical modulator.
△ Less
Submitted 10 January, 2019;
originally announced January 2019.
-
Multi-resolution neural networks for tracking seismic horizons from few training images
Authors:
Bas Peters,
Justin Granek,
Eldad Haber
Abstract:
Detecting a specific horizon in seismic images is a valuable tool for geological interpretation. Because hand-picking the locations of the horizon is a time-consuming process, automated computational methods were developed starting three decades ago. Older techniques for such picking include interpolation of control points however, in recent years neural networks have been used for this task. Unti…
▽ More
Detecting a specific horizon in seismic images is a valuable tool for geological interpretation. Because hand-picking the locations of the horizon is a time-consuming process, automated computational methods were developed starting three decades ago. Older techniques for such picking include interpolation of control points however, in recent years neural networks have been used for this task. Until now, most networks trained on small patches from larger images. This limits the networks ability to learn from large-scale geologic structures. Moreover, currently available networks and training strategies require label patches that have full and continuous annotations, which are also time-consuming to generate.
We propose a projected loss-function for training convolutional networks with a multi-resolution structure, including variants of the U-net. Our networks learn from a small number of large seismic images without creating patches. The projected loss-function enables training on labels with just a few annotated pixels and has no issue with the other unknown label pixels. Training uses all data without reserving some for validation. Only the labels are split into training/testing. Contrary to other work on horizon tracking, we train the network to perform non-linear regression, and not classification. As such, we propose labels as the convolution of a Gaussian kernel and the known horizon locations that indicate uncertainty in the labels. The network output is the probability of the horizon location. We demonstrate the proposed computational ingredients on two different datasets, for horizon extrapolation and interpolation. We show that the predictions of our methodology are accurate even in areas far from known horizon locations because our learning strategy exploits all data in large seismic images.
△ Less
Submitted 26 December, 2018;
originally announced December 2018.
-
Simultaneous shot inversion for nonuniform geometries using fast data interpolation
Authors:
Michelle Liu,
Rajiv Kumar,
Eldad Haber,
Aleksandr Aravkin
Abstract:
Stochastic optimization is key to efficient inversion in PDE-constrained optimization. Using 'simultaneous shots', or random superposition of source terms, works very well in simple acquisition geometries where all sources see all receivers, but this rarely occurs in practice. We develop an approach that interpolates data to an ideal acquisition geometry while solving the inverse problem using sim…
▽ More
Stochastic optimization is key to efficient inversion in PDE-constrained optimization. Using 'simultaneous shots', or random superposition of source terms, works very well in simple acquisition geometries where all sources see all receivers, but this rarely occurs in practice. We develop an approach that interpolates data to an ideal acquisition geometry while solving the inverse problem using simultaneous shots. The approach is formulated as a joint inverse problem, combining ideas from low-rank interpolation with full-waveform inversion. Results using synthetic experiments illustrate the flexibility and efficiency of the approach.
△ Less
Submitted 23 April, 2018;
originally announced April 2018.
-
A multigrid solver to the Helmholtz equation with a point source based on travel time and amplitude
Authors:
Eran Treister,
Eldad Haber
Abstract:
The Helmholtz equation arises when modeling wave propagation in the frequency domain. The equation is discretized as an indefinite linear system, which is difficult to solve at high wave numbers. In many applications, the solution of the Helmholtz equation is required for a point source. In this case, it is possible to reformulate the equation as two separate equations: one for the travel time of…
▽ More
The Helmholtz equation arises when modeling wave propagation in the frequency domain. The equation is discretized as an indefinite linear system, which is difficult to solve at high wave numbers. In many applications, the solution of the Helmholtz equation is required for a point source. In this case, it is possible to reformulate the equation as two separate equations: one for the travel time of the wave and one for its amplitude. The travel time is obtained by a solution of the factored eikonal equation, and the amplitude is obtained by solving a complex-valued advection-diffusion-reaction (ADR) equation. The reformulated equation is equivalent to the original Helmholtz equation, and the differences between the numerical solutions of these equations arise only from discretization errors. We develop an efficient multigrid solver for obtaining the amplitude given the travel time, which can be efficiently computed. This approach is advantageous because the amplitude is typically smooth in this case, and hence, more suitable for multigrid solvers than the standard Helmholtz discretization. We demonstrate that our second order ADR discretization is more accurate than the standard second order discretization at high wave numbers, as long as there are no reflections or caustics. Moreover, we show that using our approach, the problem can be solved more efficiently than using the common shifted Laplacian multigrid approach.
△ Less
Submitted 17 December, 2017;
originally announced December 2017.
-
A numerical method for efficient 3D inversions using Richards equation
Authors:
Rowan Cockett,
Lindsey J. Heagy,
Eldad Haber
Abstract:
Fluid flow in the vadose zone is governed by Richards equation; it is parameterized by hydraulic conductivity, which is a nonlinear function of pressure head. Investigations in the vadose zone typically require characterizing distributed hydraulic properties. Saturation or pressure head data may include direct measurements made from boreholes. Increasingly, proxy measurements from hydrogeophysics…
▽ More
Fluid flow in the vadose zone is governed by Richards equation; it is parameterized by hydraulic conductivity, which is a nonlinear function of pressure head. Investigations in the vadose zone typically require characterizing distributed hydraulic properties. Saturation or pressure head data may include direct measurements made from boreholes. Increasingly, proxy measurements from hydrogeophysics are being used to supply more spatially and temporally dense data sets. Inferring hydraulic parameters from such datasets requires the ability to efficiently solve and deterministically optimize the nonlinear time domain Richards equation. This is particularly important as the number of parameters to be estimated in a vadose zone inversion continues to grow. In this paper, we describe an efficient technique to invert for distributed hydraulic properties in 1D, 2D, and 3D. Our algorithm does not store the Jacobian, but rather computes the product with a vector, which allows the size of the inversion problem to become much larger than methods such as finite difference or automatic differentiation; which are constrained by computation and memory, respectively. We show our algorithm in practice for a 3D inversion of saturated hydraulic conductivity using saturation data through time. The code to run our examples is open source and the algorithm presented allows this inversion process to run on modest computational resources.
△ Less
Submitted 11 June, 2017;
originally announced June 2017.
-
A Framework for the Upscaling of the Electrical Conductivity in the Quasi-static Maxwell's Equations
Authors:
Luz Angelica Caudillo-Mata,
Eldad Haber,
Lindsey J. Heagy,
Christoph Schwarzbach
Abstract:
Electromagnetic simulations of complex geologic settings are computationally expensive. One reason for this is the fact that a fine mesh is required to accurately discretize the electrical conductivity model of a given setting. This conductivity model may vary over several orders of magnitude and these variations can occur over a large range of length scales. Using a very fine mesh for the discret…
▽ More
Electromagnetic simulations of complex geologic settings are computationally expensive. One reason for this is the fact that a fine mesh is required to accurately discretize the electrical conductivity model of a given setting. This conductivity model may vary over several orders of magnitude and these variations can occur over a large range of length scales. Using a very fine mesh for the discretization of this setting leads to the necessity to solve a large system of equations that is often difficult to deal with. To keep the simulations computationally tractable, coarse meshes are often employed for the discretization of the model. Such coarse meshes typically fail to capture the fine-scale variations in the conductivity model resulting in inaccuracies in the predicted data. In this work, we introduce a framework for constructing a coarse-mesh or upscaled conductivity model based on a prescribed fine-mesh model. Rather than using analytical expressions, we opt to pose upscaling as a parameter estimation problem. By solving an optimization problem, we obtain a coarse-mesh conductivity model. The optimization criterion can be tailored to the survey setting in order to produce coarse models that accurately reproduce the predicted data generated on the fine mesh. This allows us to upscale arbitrary conductivity structures, as well as to better understand the meaning of the upscaled quantity. We use 1D and 3D examples to demonstrate that the proposed framework is able to emulate the behavior of the heterogeneity in the fine-mesh conductivity model, and to produce an accurate description of the desired predicted data obtained by using a coarse mesh in the simulation process.
△ Less
Submitted 6 October, 2016;
originally announced October 2016.
-
An oversampling technique for the multiscale finite volume method to simulate electromagnetic responses in the frequency domain
Authors:
Luz Angelica Caudillo Mata,
Eldad Haber,
Christoph Schwarzbach
Abstract:
In order to reduce the computational cost of the simulation of electromagnetic responses in geophysical settings that involve highly heterogeneous media, we develop a multiscale finite volume method with oversampling for the quasi-static Maxwell's equations in the frequency domain. We assume a coarse mesh nested within a fine mesh that accurately discretizes the problem. For each coarse cell, we i…
▽ More
In order to reduce the computational cost of the simulation of electromagnetic responses in geophysical settings that involve highly heterogeneous media, we develop a multiscale finite volume method with oversampling for the quasi-static Maxwell's equations in the frequency domain. We assume a coarse mesh nested within a fine mesh that accurately discretizes the problem. For each coarse cell, we independently solve a local version of the original Maxwell's system subject to linear boundary conditions on an extended domain, which includes the coarse cell and a neighborhood of fine cells around it. The local Maxwell's system is solved using the fine mesh contained in the extended domain and the mimetic finite volume method. Next, these local solutions (basis functions) together with a weak-continuity condition are used to construct a coarse-mesh version of the global problem. The basis functions can be used to obtain the fine-mesh details from the solution of the coarse-mesh problem. Our approach leads to a significant reduction in the size of the final system of equations and the computational time, while accurately approximating the behavior of the fine-mesh solutions. We demonstrate the performance of our method using a synthetic 3D example of a mineral deposit.
△ Less
Submitted 6 October, 2016;
originally announced October 2016.
-
Joint Hydrogeophysical Inversion: State Estimation for Seawater Intrusion Models in 3D
Authors:
K. Steklova,
E. Haber
Abstract:
Seawater intrusion (SWI) is a complex process, where 3D modeling is often necessary in order to monitor and manage the affected aquifers. Here, we present a synthetic study to test a joint hydrogeophysical inversion approach aimed at solving the inverse problem of estimating initial and current saltwater distribution. First, we use a 3D groundwater model for variable density flow based on discreti…
▽ More
Seawater intrusion (SWI) is a complex process, where 3D modeling is often necessary in order to monitor and manage the affected aquifers. Here, we present a synthetic study to test a joint hydrogeophysical inversion approach aimed at solving the inverse problem of estimating initial and current saltwater distribution. First, we use a 3D groundwater model for variable density flow based on discretized flow and solute mass balance equations. In addition to the groundwater model, a 3D geophysical model was developed for direct current resistivity imaging and inversion. The objective function of the coupled problem consists of data misfit and regularization terms as well as a coupling term that relates groundwater and geophysical states. We present a novel approach to solve the inverse problem using an Alternating Direction Method of Multipliers (ADMM) to minimize this coupled objective function. The sensitivities are derived analytically for the discretized system of equations, which allows us to efficiently compute the gradients in the minimization procedure and reduce the computational complexity of the problem. The method was tested on different synthetic scenarios with groundwater and geophysical data represented by solute mass fraction data and direct current resistivity data. With the ADMM approach, we were able to obtain better estimates for the solute distribution, compared to just considering each data set separately or solving with a simple coupled approach.
△ Less
Submitted 3 August, 2016;
originally announced August 2016.