Search | arXiv e-print repository

Direct mineral content prediction from drill core images via transfer learning

Authors: Romana Boiger, Sergey V. Churakov, Ignacio Ballester Llagaria, Georg Kosakowski, Raphael Wüst, Nikolaos I. Prasianakis

Abstract: Deep subsurface exploration is important for mining, oil and gas industries, as well as in the assessment of geological units for the disposal of chemical or nuclear waste, or the viability of geothermal energy systems. Typically, detailed examinations of subsurface formations or units are performed on cuttings or core materials extracted during drilling campaigns, as well as on geophysical boreho… ▽ More Deep subsurface exploration is important for mining, oil and gas industries, as well as in the assessment of geological units for the disposal of chemical or nuclear waste, or the viability of geothermal energy systems. Typically, detailed examinations of subsurface formations or units are performed on cuttings or core materials extracted during drilling campaigns, as well as on geophysical borehole data, which provide detailed information about the petrophysical properties of the rocks. Depending on the volume of rock samples and the analytical program, the laboratory analysis and diagnostics can be very time-consuming. This study investigates the potential of utilizing machine learning, specifically convolutional neural networks (CNN), to assess the lithology and mineral content solely from analysis of drill core images, aiming to support and expedite the subsurface geological exploration. The paper outlines a comprehensive methodology, encompassing data preprocessing, machine learning methods, and transfer learning techniques. The outcome reveals a remarkable 96.7% accuracy in the classification of drill core segments into distinct formation classes. Furthermore, a CNN model was trained for the evaluation of mineral content using a learning data set from multidimensional log analysis data (silicate, total clay, carbonate). When benchmarked against laboratory XRD measurements on samples from the cores, both the advanced multidimensional log analysis model and the neural network approach developed here provide equally good performance. This work demonstrates that deep learning and particularly transfer learning can support extracting petrophysical properties, including mineral content and formation classification, from drill core images, thus offering a road map for enhancing model performance and data set quality in image-based analysis of drill cores. △ Less

Submitted 27 March, 2024; originally announced March 2024.

arXiv:2308.08391 [pdf, other]

Fast Uncertainty Quantification of Spent Nuclear Fuel with Neural Networks

Authors: Arnau Albà, Andreas Adelmann, Lucas Münster, Dimitri Rochman, Romana Boiger

Abstract: The accurate calculation and uncertainty quantification of the characteristics of spent nuclear fuel (SNF) play a crucial role in ensuring the safety, efficiency, and sustainability of nuclear energy production, waste management, and nuclear safeguards. State of the art physics-based models, while reliable, are computationally intensive and time-consuming. This paper presents a surrogate modeling… ▽ More The accurate calculation and uncertainty quantification of the characteristics of spent nuclear fuel (SNF) play a crucial role in ensuring the safety, efficiency, and sustainability of nuclear energy production, waste management, and nuclear safeguards. State of the art physics-based models, while reliable, are computationally intensive and time-consuming. This paper presents a surrogate modeling approach using neural networks (NN) to predict a number of SNF characteristics with reduced computational costs compared to physics-based models. An NN is trained using data generated from CASMO5 lattice calculations. The trained NN accurately predicts decay heat and nuclide concentrations of SNF, as a function of key input parameters, such as enrichment, burnup, cooling time between cycles, mean boron concentration and fuel temperature. The model is validated against physics-based decay heat simulations and measurements of different uranium oxide fuel assemblies from two different pressurized water reactors. In addition, the NN is used to perform sensitivity analysis and uncertainty quantification. The results are in very good alignment to CASMO5, while the computational costs (taking into account the costs of generating training samples) are reduced by a factor of 10 or more. Our findings demonstrate the feasibility of using NNs as surrogate models for fast characterization of SNF, providing a promising avenue for improving computational efficiency in assessing nuclear fuel behavior and associated risks. △ Less

Submitted 16 August, 2023; originally announced August 2023.

arXiv:2210.03634 [pdf, other]

Lasso Monte Carlo, a Variation on Multi Fidelity Methods for High Dimensional Uncertainty Quantification

Authors: Arnau Albà, Romana Boiger, Dimitri Rochman, Andreas Adelmann

Abstract: Uncertainty quantification (UQ) is an active area of research, and an essential technique used in all fields of science and engineering. The most common methods for UQ are Monte Carlo and surrogate-modelling. The former method is dimensionality independent but has slow convergence, while the latter method has been shown to yield large computational speedups with respect to Monte Carlo. However, su… ▽ More Uncertainty quantification (UQ) is an active area of research, and an essential technique used in all fields of science and engineering. The most common methods for UQ are Monte Carlo and surrogate-modelling. The former method is dimensionality independent but has slow convergence, while the latter method has been shown to yield large computational speedups with respect to Monte Carlo. However, surrogate models suffer from the so-called curse of dimensionality, and become costly to train for high-dimensional problems, where UQ might become computationally prohibitive. In this paper we present a new technique, Lasso Monte Carlo (LMC), which combines a Lasso surrogate model with the multifidelity Monte Carlo technique, in order to perform UQ in high-dimensional settings, at a reduced computational cost. We provide mathematical guarantees for the unbiasedness of the method, and show that LMC can be more accurate than simple Monte Carlo. The theory is numerically tested with benchmarks on toy problems, as well as on a real example of UQ from the field of nuclear engineering. In all presented examples LMC is more accurate than simple Monte Carlo and other multifidelity methods. Thanks to LMC, computational costs are reduced by more than a factor of 5 with respect to simple MC, in relevant cases. △ Less

Submitted 31 August, 2023; v1 submitted 7 October, 2022; originally announced October 2022.

arXiv:2111.07960 [pdf, other]

Retrieval of aerosol properties from in situ, multi-angle light scattering measurements using invertible neural networks

Authors: Romana Boiger, Rob L. Modini, Alireza Moallemi, David Degen, Martin Gysel-Beer, Andreas Adelmann

Abstract: Atmospheric aerosols have a major influence on the earths climate and public health. Hence, studying their properties and recovering them from light scattering measurements is of great importance. State of the art retrieval methods such as pre-computed look-up tables and iterative, physics-based algorithms can suffer from either accuracy or speed limitations. These limitations are becoming increas… ▽ More Atmospheric aerosols have a major influence on the earths climate and public health. Hence, studying their properties and recovering them from light scattering measurements is of great importance. State of the art retrieval methods such as pre-computed look-up tables and iterative, physics-based algorithms can suffer from either accuracy or speed limitations. These limitations are becoming increasingly restrictive as instrumentation technology advances and measurement complexity increases. Machine learning algorithms offer new opportunities to overcome these problems, by being quick and precise. In this work we present a method, using invertible neural networks to retrieve aerosol properties from in situ light scattering measurements. In addition, the algorithm is capable of simulating the forward direction, from aerosol properties to measurement data. The applicability and performance of the algorithm are demonstrated with simulated measurement data, mimicking in situ laboratory and field measurements. With a retrieval time in the millisecond range and a weighted mean absolute percentage error of less than 1.5%, the algorithm turned out to be fast and accurate. By introducing Gaussian noise to the data, we further demonstrate that the method is robust with respect to measurement errors. In addition, realistic case studies are performed to demonstrate that the algorithm performs well even with missing measurement data. △ Less

Submitted 15 November, 2021; originally announced November 2021.

arXiv:2107.00060 [pdf, other]

Fast, efficient and flexible particle accelerator optimisation using densely connected and invertible neural networks

Authors: Renato Bellotti, Romana Boiger, Andreas Adelmann

Abstract: Particle accelerators are enabling tools for scientific exploration and discovery in various disciplines. Finding optimized operation points for these complex machines is a challenging task, however, due to the large number of parameters involved and the underlying non-linear dynamics. Here, we introduce two families of data-driven surrogate models, based on deep and invertible neural networks, th… ▽ More Particle accelerators are enabling tools for scientific exploration and discovery in various disciplines. Finding optimized operation points for these complex machines is a challenging task, however, due to the large number of parameters involved and the underlying non-linear dynamics. Here, we introduce two families of data-driven surrogate models, based on deep and invertible neural networks, that can replace the expensive physics computer models. These models are employed in multi-objective optimisations to find Pareto optimal operation points for two fundamentally different types of particle accelerators. Our approach reduces the time-to-solution for a multi-objective accelerator optimisation up to a factor of 640 and the computational cost up to 98%. The framework established here should pave the way for future on-line and real-time multi-objective optimisation of particle accelerators. △ Less

Submitted 30 June, 2021; originally announced July 2021.

arXiv:2011.05372 [pdf, ps, other]

doi 10.1093/imanum/dry066

Range-relaxed criteria for choosing the Lagrange multipliers in nonstationary iterated Tikhonov method

Authors: R. Boiger, A. Leitao, B. F. Svaiter

Abstract: In this article we propose a novel nonstationary iterated Tikhonov (NIT) type method for obtaining stable approximate solutions to ill-posed operator equations modeled by linear operators acting between Hilbert spaces. Geometrical properties of the problem are used to derive a new strategy for choosing the sequence of regularization parameters (Lagrange multipliers) for the NIT iteration. Converge… ▽ More In this article we propose a novel nonstationary iterated Tikhonov (NIT) type method for obtaining stable approximate solutions to ill-posed operator equations modeled by linear operators acting between Hilbert spaces. Geometrical properties of the problem are used to derive a new strategy for choosing the sequence of regularization parameters (Lagrange multipliers) for the NIT iteration. Convergence analysis for this new method is provided. Numerical experiments are presented for two distinct applications: I) A 2D elliptic parameter identification problem (Inverse Potential Problem); II) An image deblurring problem. The results obtained validate the efficiency of our method compared with standard implementations of the NIT method (where a geometrical choice is typically used for the sequence of Lagrange multipliers). △ Less

Submitted 10 November, 2020; originally announced November 2020.

Comments: 22 pages, 8 figures

MSC Class: 65J20; 47J06

Journal ref: IMA Journal of Numerical Analysis 40 (2020), no. 1, 606-627

arXiv:1604.02894 [pdf, other]

doi 10.1088/0266-5611/32/12/125009

Integration based profile likelihood calculation for PDE constrained parameter estimation problems

Authors: Romana Boiger, Jan Hasenauer, Sabrina Hross, Barbara Kaltenbacher

Abstract: Partial differential equation (PDE) models are widely used in engineering and natural sciences to describe spatio-temporal processes. The parameters of the considered processes are often unknown and have to be estimated from experimental data. Due to partial observations and measurement noise, these parameter estimates are subject to uncertainty. This uncertainty can be assessed using profile like… ▽ More Partial differential equation (PDE) models are widely used in engineering and natural sciences to describe spatio-temporal processes. The parameters of the considered processes are often unknown and have to be estimated from experimental data. Due to partial observations and measurement noise, these parameter estimates are subject to uncertainty. This uncertainty can be assessed using profile likelihoods, a reliable but computationally intensive approach. In this paper, we introduce an integration based approach for the profile likelihood calculation for inverse problems with PDE constraints. While existing approaches rely on repeated optimization, the proposed approach exploits a dynamical system evolving along the likelihood profile. We derive the dynamical system for the reduced and the full estimation problem and study its properties. To evaluate the proposed method, we compare it with state-of-the-art algorithms for a simple reaction-diffusion model for a cellular patterning process. We observe a good accuracy of the method as well as a significant speed up as compared to established methods. Integration based profile calculation facilitates rigorous uncertainty analysis for computationally demanding parameter estimation problems with PDE constraints. △ Less

Submitted 11 April, 2016; originally announced April 2016.

MSC Class: 35R30

arXiv:1506.00658 [pdf, ps, other]

doi 10.1088/0266-5611/32/4/045006

An online parameter identification method for time dependent partial differential equations

Authors: Romana Boiger, Barbara Kaltenbacher

Abstract: Online parameter identification is of importance, e.g., for model predictive control. Since the parameters have to be identified simultaneously to the process of the modeled system, dynamical update laws are used for state and parameter estimates. Most of the existing methods for infinite dimensional systems either impose strong assumptions on the model or cannot handle partial observations. There… ▽ More Online parameter identification is of importance, e.g., for model predictive control. Since the parameters have to be identified simultaneously to the process of the modeled system, dynamical update laws are used for state and parameter estimates. Most of the existing methods for infinite dimensional systems either impose strong assumptions on the model or cannot handle partial observations. Therefore we propose and analyze an online parameter identification method that is less restrictive concerning the underlying model and allows for partial observations and noisy data. The performance of our approach is illustrated by some numerical experiments. △ Less

Submitted 1 June, 2015; originally announced June 2015.

Showing 1–8 of 8 results for author: Boiger, R