-
GraLMatch: Matching Groups of Entities with Graphs and Language Models
Authors:
Fernando De Meer Pardo,
Claude Lehmann,
Dennis Gehrig,
Andrea Nagy,
Stefano Nicoli,
Branka Hadji Misheva,
Martin Braschler,
Kurt Stockinger
Abstract:
In this paper, we present an end-to-end multi-source Entity Matching problem, which we call entity group matching, where the goal is to assign to the same group, records originating from multiple data sources but representing the same real-world entity. We focus on the effects of transitively matched records, i.e. the records connected by paths in the graph G = (V,E) whose nodes and edges represen…
▽ More
In this paper, we present an end-to-end multi-source Entity Matching problem, which we call entity group matching, where the goal is to assign to the same group, records originating from multiple data sources but representing the same real-world entity. We focus on the effects of transitively matched records, i.e. the records connected by paths in the graph G = (V,E) whose nodes and edges represent the records and whether they are a match or not. We present a real-world instance of this problem, where the challenge is to match records of companies and financial securities originating from different data providers. We also introduce two new multi-source benchmark datasets that present similar matching challenges as real-world records. A distinctive characteristic of these records is that they are regularly updated following real-world events, but updates are not applied uniformly across data sources. This phenomenon makes the matching of certain groups of records only possible through the use of transitive information.
In our experiments, we illustrate how considering transitively matched records is challenging since a limited amount of false positive pairwise match predictions can throw off the group assignment of large quantities of records. Thus, we propose GraLMatch, a method that can partially detect and remove false positive pairwise predictions through graph-based properties. Finally, we showcase how fine-tuning a Transformer-based model (DistilBERT) on a reduced number of labeled samples yields a better final entity group matching than training on more samples and/or incorporating fine-tuning optimizations, illustrating how precision becomes the deciding factor in the entity group matching of large volumes of records.
△ Less
Submitted 21 June, 2024;
originally announced June 2024.
-
Regularity-Conforming Neural Networks (ReCoNNs) for solving Partial Differential Equations
Authors:
Jamie M. Taylor,
David Pardo,
Judit Muñoz-Matute
Abstract:
Whilst the Universal Approximation Theorem guarantees the existence of approximations to Sobolev functions -- the natural function spaces for PDEs -- by Neural Networks (NNs) of sufficient size, low-regularity solutions may lead to poor approximations in practice. For example, classical fully-connected feed-forward NNs fail to approximate continuous functions whose gradient is discontinuous when e…
▽ More
Whilst the Universal Approximation Theorem guarantees the existence of approximations to Sobolev functions -- the natural function spaces for PDEs -- by Neural Networks (NNs) of sufficient size, low-regularity solutions may lead to poor approximations in practice. For example, classical fully-connected feed-forward NNs fail to approximate continuous functions whose gradient is discontinuous when employing strong formulations like in Physics Informed Neural Networks (PINNs). In this article, we propose the use of regularity-conforming neural networks, where a priori information on the regularity of solutions to PDEs can be employed to construct proper architectures. We illustrate the potential of such architectures via a two-dimensional (2D) transmission problem, where the solution may admit discontinuities in the gradient across interfaces, as well as power-like singularities at certain points. In particular, we formulate the weak transmission problem in a PINNs-like strong formulation with interface and continuity conditions. Such architectures are partially explainable; discontinuities are explicitly described, allowing the introduction of novel terms into the loss function. We demonstrate via several model problems in one and two dimensions the advantages of using regularity-conforming architectures in contrast to classical architectures. The ideas presented in this article easily extend to problems in higher dimensions.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
1D Photonic Band Gap Atlas, Formula Extension and Design Applications
Authors:
Oscar D. H. Pardo,
R. R. Rey-González
Abstract:
The design and development of new photonic devices for technological applications requires a deep understanding of the effect of structural properties on the resulting band gap size and its position. Here, we perform a theoretical study of behavior of the photonic band gap sizes, positions and percentages under variations of the parameters characterizing binary (two materials), ternary (three mate…
▽ More
The design and development of new photonic devices for technological applications requires a deep understanding of the effect of structural properties on the resulting band gap size and its position. Here, we perform a theoretical study of behavior of the photonic band gap sizes, positions and percentages under variations of the parameters characterizing binary (two materials), ternary (three materials) and linear dielectric grating multilayer structures. The resulting band gap atlas show that binary systems may suffice for most applications but ternary systems may add additional flexibility in design if needed. Linear gratings show a regular pattern for all gaps studied, this regularity was able to be reproduced with only few materials involved. The position of the gaps showed a very monotonous behavior for all calculations performed. Finally, additional extensions of formulas commonly used in the design of Bragg mirrors/reflectors using binary materials were proposed with their corresponding limitations discussed. These results can be seen as a technological horizon for photonic device development.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
Reducing Spatial Discretization Error on Coarse CFD Simulations Using an OpenFOAM-Embedded Deep Learning Framework
Authors:
Jesus Gonzalez-Sieiro,
David Pardo,
Vincenzo Nava,
Victor M. Calo,
Markus Towara
Abstract:
We propose a method for reducing the spatial discretization error of coarse computational fluid dynamics (CFD) problems by enhancing the quality of low-resolution simulations using a deep learning model fed with high-quality data. We substitute the default differencing scheme for the convection term by a feed-forward neural network that interpolates velocities from cell centers to face values to p…
▽ More
We propose a method for reducing the spatial discretization error of coarse computational fluid dynamics (CFD) problems by enhancing the quality of low-resolution simulations using a deep learning model fed with high-quality data. We substitute the default differencing scheme for the convection term by a feed-forward neural network that interpolates velocities from cell centers to face values to produce velocities that approximate the fine-mesh data well. The deep learning framework incorporates the open-source CFD code OpenFOAM, resulting in an end-to-end differentiable model. We automatically differentiate the CFD physics using a discrete adjoint code version. We present a fast communication method between TensorFlow (Python) and OpenFOAM (c++) that accelerates the training process. We applied the model to the flow past a square cylinder problem, reducing the error to about 50% for simulations outside the training distribution compared to the traditional solver in the x- and y-velocity components using an 8x coarser mesh. The training is affordable in terms of time and data samples since the architecture exploits the local features of the physics while generating stable predictions for mid-term simulations.
△ Less
Submitted 22 May, 2024; v1 submitted 12 May, 2024;
originally announced May 2024.
-
Residual-based Attention Physics-informed Neural Networks for Efficient Spatio-Temporal Lifetime Assessment of Transformers Operated in Renewable Power Plants
Authors:
Ibai Ramirez,
Joel Pino,
David Pardo,
Mikel Sanz,
Luis del Rio,
Alvaro Ortiz,
Kateryna Morozovska,
Jose I. Aizpurua
Abstract:
Transformers are vital assets for the reliable and efficient operation of power and energy systems. They support the integration of renewables to the grid through improved grid stability and operation efficiency. Monitoring the health of transformers is essential to ensure grid reliability and efficiency. Thermal insulation ageing is a key transformer failure mode, which is generally tracked by mo…
▽ More
Transformers are vital assets for the reliable and efficient operation of power and energy systems. They support the integration of renewables to the grid through improved grid stability and operation efficiency. Monitoring the health of transformers is essential to ensure grid reliability and efficiency. Thermal insulation ageing is a key transformer failure mode, which is generally tracked by monitoring the hotspot temperature (HST). However, HST measurement is complex and expensive and often estimated from indirect measurements. Existing computationally-efficient HST models focus on space-agnostic thermal models, providing worst-case HST estimates. This article introduces an efficient spatio-temporal model for transformer winding temperature and ageing estimation, which leverages physics-based partial differential equations (PDEs) with data-driven Neural Networks (NN) in a Physics Informed Neural Networks (PINNs) configuration to improve prediction accuracy and acquire spatio-temporal resolution. The computational efficiency of the PINN model is improved through the implementation of the Residual-Based Attention scheme that accelerates the PINN model convergence. PINN based oil temperature predictions are used to estimate spatio-temporal transformer winding temperature values, which are validated through PDE resolution models and fiber optic sensor measurements, respectively. Furthermore, the spatio-temporal transformer ageing model is inferred, aiding transformer health management decision-making and providing insights into localized thermal ageing phenomena in the transformer insulation. Results are validated with a distribution transformer operated on a floating photovoltaic power plant.
△ Less
Submitted 10 May, 2024;
originally announced May 2024.
-
Ensemble Deep Learning for enhanced seismic data reconstruction
Authors:
Mohammad Mahdi Abedi,
David Pardo,
Tariq Alkhalifah
Abstract:
Seismic data often contain gaps due to various obstacles in the investigated area and recording instrument failures. Deep learning techniques offer promising solutions for reconstructing missing data parts by leveraging existing information. However, self-supervised methods frequently struggle with capturing under-represented features such as weaker events, crossing dips, and higher frequencies. T…
▽ More
Seismic data often contain gaps due to various obstacles in the investigated area and recording instrument failures. Deep learning techniques offer promising solutions for reconstructing missing data parts by leveraging existing information. However, self-supervised methods frequently struggle with capturing under-represented features such as weaker events, crossing dips, and higher frequencies. To address these challenges, we propose a novel ensemble deep model along with a tailored self-supervised training approach for reconstructing seismic data with consecutive missing traces. Our model comprises two branches of U-nets, each fed from distinct data transformation modules aimed at amplifying under-represented features and promoting diversity among learners. Our loss function minimizes relative errors at the outputs of individual branches and the entire model, ensuring accurate reconstruction of various features while maintaining overall data integrity. Additionally, we employ masking while training to enhance sample diversity and memory efficiency. Application on two benchmark synthetic datasets and two real datasets demonstrates improved accuracy compared to a conventional U-net, successfully reconstructing weak events, diffractions, higher frequencies, and reflections obscured by groundroll. However, our method requires a threefold of training time compared to a simple U-net. An implementation of our method with TensorFlow is also made available.
△ Less
Submitted 3 April, 2024;
originally announced April 2024.
-
Adaptive Deep Fourier Residual method via overlap** domain decomposition
Authors:
Jamie M. Taylor,
Manuela Bastidas,
Victor M. Calo,
David Pardo
Abstract:
The Deep Fourier Residual (DFR) method is a specific type of variational physics-informed neural networks (VPINNs). It provides a robust neural network-based solution to partial differential equations (PDEs). The DFR strategy is based on approximating the dual norm of the weak residual of a PDE. This is equivalent to minimizing the energy norm of the error. To compute the dual of the weak residual…
▽ More
The Deep Fourier Residual (DFR) method is a specific type of variational physics-informed neural networks (VPINNs). It provides a robust neural network-based solution to partial differential equations (PDEs). The DFR strategy is based on approximating the dual norm of the weak residual of a PDE. This is equivalent to minimizing the energy norm of the error. To compute the dual of the weak residual norm, the DFR method employs an orthonormal spectral basis of the test space, which is known for rectangles or cuboids for multiple function spaces.
In this work, we extend the DFR method with ideas of traditional domain decomposition (DD). This enables two improvements: (a) to solve problems in more general polygonal domains, and (b) to develop an adaptive refinement technique in the test space using a Dofler marking algorithm. In the former case, we show that under non-restrictive assumptions we retain the desirable equivalence between the employed loss function and the H1-error, numerically demonstrating adherence to explicit bounds in the case of the L-shaped domain problem. In the latter, we show how refinement strategies lead to potentially significant improvements against a reference, classical DFR implementation with a test function space of significantly lower dimensionality, allowing us to better approximate singular solutions at a more reasonable computational cost.
△ Less
Submitted 10 January, 2024; v1 submitted 9 January, 2024;
originally announced January 2024.
-
Domain-wall Magnetic-texture dependent Creep Motion driven by Spin-transfer Torques
Authors:
Lucas Javier Albornoz,
Rebeca Díaz Pardo,
Aristide Lemaître,
Sebastian Bustingorry,
Javier Curiale,
Vincent Jeudy
Abstract:
We explore the contributions of adiabatic and non-adiabatic spin-transfer torques (STT) of a spin-polarized current to the thermally activated creep motion of domain-walls in a thin (Ga,Mn)(As,P) film with perpendicular anisotropy. For a domain-wall transverse to current, the non-adiabatic STT is found to act as an external magnetic field. Close to the compensation between these two terms, the adi…
▽ More
We explore the contributions of adiabatic and non-adiabatic spin-transfer torques (STT) of a spin-polarized current to the thermally activated creep motion of domain-walls in a thin (Ga,Mn)(As,P) film with perpendicular anisotropy. For a domain-wall transverse to current, the non-adiabatic STT is found to act as an external magnetic field. Close to the compensation between these two terms, the adiabatic contribution is strongly enhanced. The domain-wall velocity may be both increased or reduced by the adiabatic STT, which we associate to variations of creep pinning energy barrier with domain-wall magnetic texture. Far from compensation, the contribution of adiabatic STT is negligible. Field and current driven domain-wall motion present common universal behaviors described by the quenched Edwards Wilkinson universality class.
△ Less
Submitted 24 December, 2023;
originally announced December 2023.
-
Robust Variational Physics-Informed Neural Networks
Authors:
Sergio Rojas,
Paweł Maczuga,
Judit Muñoz-Matute,
David Pardo,
Maciej Paszynski
Abstract:
We introduce a Robust version of the Variational Physics-Informed Neural Networks method (RVPINNs). As in VPINNs, we define the quadratic loss functional in terms of a Petrov-Galerkin-type variational formulation of the PDE problem: the trial space is a (Deep) Neural Network (DNN) manifold, while the test space is a finite-dimensional vector space. Whereas the VPINN's loss depends upon the selecte…
▽ More
We introduce a Robust version of the Variational Physics-Informed Neural Networks method (RVPINNs). As in VPINNs, we define the quadratic loss functional in terms of a Petrov-Galerkin-type variational formulation of the PDE problem: the trial space is a (Deep) Neural Network (DNN) manifold, while the test space is a finite-dimensional vector space. Whereas the VPINN's loss depends upon the selected basis functions of a given test space, herein, we minimize a loss based on the discrete dual norm of the residual. The main advantage of such a loss definition is that it provides a reliable and efficient estimator of the true error in the energy norm under the assumption of the existence of a local Fortin operator. We test the performance and robustness of our algorithm in several advection-diffusion problems. These numerical results perfectly align with our theoretical findings, showing that our estimates are sharp.
△ Less
Submitted 5 March, 2024; v1 submitted 31 August, 2023;
originally announced August 2023.
-
Semi-blind-trace algorithm for self-supervised attenuation of trace-wise coherent noise
Authors:
Mohammad Mahdi Abedi,
David Pardo,
Tariq Alkhalifah
Abstract:
Trace-wise noise is a type of noise often seen in seismic data, which is characterized by vertical coherency and horizontal incoherency. Using self-supervised deep learning to attenuate this type of noise, the conventional blind-trace deep learning trains a network to blindly reconstruct each trace in the data from its surrounding traces; it attenuates isolated trace-wise noise but causes signal l…
▽ More
Trace-wise noise is a type of noise often seen in seismic data, which is characterized by vertical coherency and horizontal incoherency. Using self-supervised deep learning to attenuate this type of noise, the conventional blind-trace deep learning trains a network to blindly reconstruct each trace in the data from its surrounding traces; it attenuates isolated trace-wise noise but causes signal leakage in clean and noisy traces and reconstruction errors next to each noisy trace. To reduce signal leakage and improve denoising, we propose a new loss function and masking procedure in semi-blind-trace deep learning. Our hybrid loss function has weighted active zones that cover masked and non-masked traces. Therefore, the network is not blinded to clean traces during their reconstruction. During training, we dynamically change the masks' characteristics. The goal is to train the network to learn the characteristics of the signal instead of noise. The proposed algorithm enables the designed U-net to detect and attenuate trace-wise noise without having prior information about the noise. A new hyperparameter of our method is the relative weight between the masked and non-masked traces' contribution to the loss function. Numerical experiments show that selecting a small value for this parameter is enough to significantly decrease signal leakage. The proposed algorithm is tested on synthetic and real off-shore and land datasets with different noises. The results show the superb ability of the method to attenuate trace-wise noise while preserving other events. An implementation of the proposed algorithm as a Python code is also made available.
△ Less
Submitted 23 August, 2023; v1 submitted 21 August, 2023;
originally announced August 2023.
-
Liquid-Crystal-Based Controllable Attenuators Operating in the 1-4 Terahertz Band
Authors:
Aniela Dunn,
Zhaopeng Zhang,
Michael D. Horbury,
Eleanor V. Nuttall,
Yingjun Han,
Mohammed Salih,
Lianhe Li,
Abigail Bond,
Ehab Saleh,
Russell Harris,
Diego Pardo,
Brian N. Ellison,
Andrew D. Burnett,
Helen F. Gleeson,
Alexander Valavanis
Abstract:
Liquid-crystal devices (LCDs) offer a potential route toward adaptive optical components for use in the < 2 THz band of the electromagnetic spectrum. We demonstrate LCDs using a commercially available material (E7), with unbiased birefringence values of 0.14-0.18 in the 0.3-4 THz band. We exploit the linear dichroism of the material to modulate the emission from a 3.4-THz quantum cascade laser by…
▽ More
Liquid-crystal devices (LCDs) offer a potential route toward adaptive optical components for use in the < 2 THz band of the electromagnetic spectrum. We demonstrate LCDs using a commercially available material (E7), with unbiased birefringence values of 0.14-0.18 in the 0.3-4 THz band. We exploit the linear dichroism of the material to modulate the emission from a 3.4-THz quantum cascade laser by up to 40%, dependent upon both the liquid-crystal layer thickness and the bias voltage applied.
△ Less
Submitted 13 June, 2023;
originally announced June 2023.
-
Deep Fourier Residual method for solving time-harmonic Maxwell's equations
Authors:
Jamie M. Taylor,
Manuela Bastidas,
David Pardo,
Ignacio Muga
Abstract:
Solving PDEs with machine learning techniques has become a popular alternative to conventional methods. In this context, Neural networks (NNs) are among the most commonly used machine learning tools, and in those models, the choice of an appropriate loss function is critical. In general, the main goal is to guarantee that minimizing the loss during training translates to minimizing the error in th…
▽ More
Solving PDEs with machine learning techniques has become a popular alternative to conventional methods. In this context, Neural networks (NNs) are among the most commonly used machine learning tools, and in those models, the choice of an appropriate loss function is critical. In general, the main goal is to guarantee that minimizing the loss during training translates to minimizing the error in the solution at the same rate. In this work, we focus on the time-harmonic Maxwell's equations, whose weak formulation takes H(curl) as the space of test functions. We propose a NN in which the loss function is a computable approximation of the dual norm of the weak-form PDE residual. To that end, we employ the Helmholtz decomposition of the space H(curl) and construct an orthonormal basis for this space in two and three spatial dimensions. Here, we use the Discrete Sine/Cosine Transform to accurately and efficiently compute the discrete version of our proposed loss function. Moreover, in the numerical examples we show a high correlation between the proposed loss function and the H(curl)-norm of the error, even in problems with low-regularity solutions.
△ Less
Submitted 16 May, 2023;
originally announced May 2023.
-
Machine Learning Discovery of Optimal Quadrature Rules for Isogeometric Analysis
Authors:
Tomas Teijeiro,
Jamie M. Taylor,
Ali Hashemian,
David Pardo
Abstract:
We propose the use of machine learning techniques to find optimal quadrature rules for the construction of stiffness and mass matrices in isogeometric analysis (IGA). We initially consider 1D spline spaces of arbitrary degree spanned over uniform and non-uniform knot sequences, and then the generated optimal rules are used for integration over higher-dimensional spaces using tensor product sense.…
▽ More
We propose the use of machine learning techniques to find optimal quadrature rules for the construction of stiffness and mass matrices in isogeometric analysis (IGA). We initially consider 1D spline spaces of arbitrary degree spanned over uniform and non-uniform knot sequences, and then the generated optimal rules are used for integration over higher-dimensional spaces using tensor product sense. The quadrature rule search is posed as an optimization problem and solved by a machine learning strategy based on gradient-descent. However, since the optimization space is highly non-convex, the success of the search strongly depends on the number of quadrature points and the parameter initialization. Thus, we use a dynamic programming strategy that initializes the parameters from the optimal solution over the spline space with a lower number of knots. With this method, we found optimal quadrature rules for spline spaces when using IGA discretizations with up to 50 uniform elements and polynomial degrees up to 8, showing the generality of the approach in this scenario. For non-uniform partitions, the method also finds an optimal rule in a reasonable number of test cases. We also assess the generated optimal rules in two practical case studies, namely, the eigenvalue problem of the Laplace operator and the eigenfrequency analysis of freeform curved beams, where the latter problem shows the applicability of the method to curved geometries. In particular, the proposed method results in savings with respect to traditional Gaussian integration of up to 44% in 1D, 68% in 2D, and 82% in 3D spaces.
△ Less
Submitted 4 April, 2023;
originally announced April 2023.
-
Learning quantities of interest from parametric PDEs: An efficient neural-weighted Minimal Residual approach
Authors:
Ignacio Brevis,
Ignacio Muga,
David Pardo,
Oscar Rodriguez,
Kristoffer G. van der Zee
Abstract:
The efficient approximation of parametric PDEs is of tremendous importance in science and engineering. In this paper, we show how one can train Galerkin discretizations to efficiently learn quantities of interest of solutions to a parametric PDE. The central component in our approach is an efficient neural-network-weighted Minimal-Residual formulation, which, after training, provides Galerkin-base…
▽ More
The efficient approximation of parametric PDEs is of tremendous importance in science and engineering. In this paper, we show how one can train Galerkin discretizations to efficiently learn quantities of interest of solutions to a parametric PDE. The central component in our approach is an efficient neural-network-weighted Minimal-Residual formulation, which, after training, provides Galerkin-based approximations in standard discrete spaces that have accurate quantities of interest, regardless of the coarseness of the discrete space.
△ Less
Submitted 15 February, 2024; v1 submitted 4 April, 2023;
originally announced April 2023.
-
QUICK$^3$ -- Design of a satellite-based quantum light source for quantum communication and extended physical theory tests in space
Authors:
Najme Ahmadi,
Sven Schwertfeger,
Philipp Werner,
Lukas Wiese,
Joseph Lester,
Elisa Da Ros,
Josefine Krause,
Sebastian Ritter,
Mostafa Abasifard,
Chanaprom Cholsuk,
Ria G. Krämer,
Simone Atzeni,
Mustafa Gündoğan,
Subash Sachidananda,
Daniel Pardo,
Stefan Nolte,
Alexander Lohrmann,
Alexander Ling,
Julian Bartholomäus,
Giacomo Corrielli,
Markus Krutzik,
Tobias Vogl
Abstract:
Modern quantum technologies have matured such that they can now be used in space applications, e.g., long-distance quantum communication. Here, we present the design of a compact true single photon source that can enhance the secure data rates in satellite-based quantum key distribution scenarios compared to conventional laser-based light sources. Our quantum light source is a fluorescent color ce…
▽ More
Modern quantum technologies have matured such that they can now be used in space applications, e.g., long-distance quantum communication. Here, we present the design of a compact true single photon source that can enhance the secure data rates in satellite-based quantum key distribution scenarios compared to conventional laser-based light sources. Our quantum light source is a fluorescent color center in hexagonal boron nitride. The emitter is off-resonantly excited by a diode laser and directly coupled to an integrated photonic processor that routes the photons to different experiments performed directly on-chip: (i) the characterization of the single photon source and (ii) testing a fundamental postulate of quantum mechanics, namely the relation of the probability density and the wave function (known as Born's rule). The described payload is currently being integrated into a 3U CubeSat and scheduled for launch in 2024 into low Earth orbit. We can therefore evaluate the feasibility of true single photon sources and reconfigurable photonic circuits in space. This provides a promising route toward a high-speed quantum network.
△ Less
Submitted 28 January, 2023; v1 submitted 26 January, 2023;
originally announced January 2023.
-
A Deep Double Ritz Method (D$^2$RM) for solving Partial Differential Equations using Neural Networks
Authors:
Carlos Uriarte,
David Pardo,
Ignacio Muga,
Judit Muñoz-Matute
Abstract:
Residual minimization is a widely used technique for solving Partial Differential Equations in variational form. It minimizes the dual norm of the residual, which naturally yields a saddle-point (min-max) problem over the so-called trial and test spaces. In the context of neural networks, we can address this min-max approach by employing one network to seek the trial minimum, while another network…
▽ More
Residual minimization is a widely used technique for solving Partial Differential Equations in variational form. It minimizes the dual norm of the residual, which naturally yields a saddle-point (min-max) problem over the so-called trial and test spaces. In the context of neural networks, we can address this min-max approach by employing one network to seek the trial minimum, while another network seeks the test maximizers. However, the resulting method is numerically unstable as we approach the trial solution. To overcome this, we reformulate the residual minimization as an equivalent minimization of a Ritz functional fed by optimal test functions computed from another Ritz functional minimization. We call the resulting scheme the Deep Double Ritz Method (D$^2$RM), which combines two neural networks for approximating trial functions and optimal test functions along a nested double Ritz minimization strategy. Numerical results on different diffusion and convection problems support the robustness of our method, up to the approximation properties of the networks and the training capacity of the optimizers.
△ Less
Submitted 18 January, 2023; v1 submitted 7 November, 2022;
originally announced November 2022.
-
A Deep Fourier Residual Method for solving PDEs using Neural Networks
Authors:
Jamie M. Taylor,
David Pardo,
Ignacio Muga
Abstract:
When using Neural Networks as trial functions to numerically solve PDEs, a key choice to be made is the loss function to be minimised, which should ideally correspond to a norm of the error. In multiple problems, this error norm coincides with--or is equivalent to--the $H^{-1}$-norm of the residual; however, it is often difficult to accurately compute it. This work assumes rectangular domains and…
▽ More
When using Neural Networks as trial functions to numerically solve PDEs, a key choice to be made is the loss function to be minimised, which should ideally correspond to a norm of the error. In multiple problems, this error norm coincides with--or is equivalent to--the $H^{-1}$-norm of the residual; however, it is often difficult to accurately compute it. This work assumes rectangular domains and proposes the use of a Discrete Sine/Cosine Transform to accurately and efficiently compute the $H^{-1}$ norm. The resulting Deep Fourier-based Residual (DFR) method efficiently and accurately approximate solutions to PDEs. This is particularly useful when solutions lack $H^{2}$ regularity and methods involving strong formulations of the PDE fail. We observe that the $H^1$-error is highly correlated with the discretised loss during training, which permits accurate error estimation via the loss.
△ Less
Submitted 25 October, 2022;
originally announced October 2022.
-
$r-$Adaptive Deep Learning Method for Solving Partial Differential Equations
Authors:
Ángel J. Omella,
David Pardo
Abstract:
We introduce an $r-$adaptive algorithm to solve Partial Differential Equations using a Deep Neural Network. The proposed method restricts to tensor product meshes and optimizes the boundary node locations in one dimension, from which we build two- or three-dimensional meshes. The method allows the definition of fixed interfaces to design conforming meshes, and enables changes in the topology, i.e.…
▽ More
We introduce an $r-$adaptive algorithm to solve Partial Differential Equations using a Deep Neural Network. The proposed method restricts to tensor product meshes and optimizes the boundary node locations in one dimension, from which we build two- or three-dimensional meshes. The method allows the definition of fixed interfaces to design conforming meshes, and enables changes in the topology, i.e., some nodes can jump across fixed interfaces. The method simultaneously optimizes the node locations and the PDE solution values over the resulting mesh. To numerically illustrate the performance of our proposed $r-$adaptive method, we apply it in combination with a collocation method, a Least Squares Method, and a Deep Ritz Method. We focus on the latter to solve one- and two-dimensional problems whose solutions are smooth, singular, and/or exhibit strong gradients.
△ Less
Submitted 19 October, 2022;
originally announced October 2022.
-
A Modular Framework for Reinforcement Learning Optimal Execution
Authors:
Fernando de Meer Pardo,
Christoph Auth,
Florin Dascalu
Abstract:
In this article, we develop a modular framework for the application of Reinforcement Learning to the problem of Optimal Trade Execution. The framework is designed with flexibility in mind, in order to ease the implementation of different simulation setups. Rather than focusing on agents and optimization methods, we focus on the environment and break down the necessary requirements to simulate an O…
▽ More
In this article, we develop a modular framework for the application of Reinforcement Learning to the problem of Optimal Trade Execution. The framework is designed with flexibility in mind, in order to ease the implementation of different simulation setups. Rather than focusing on agents and optimization methods, we focus on the environment and break down the necessary requirements to simulate an Optimal Trade Execution under a Reinforcement Learning framework such as data pre-processing, construction of observations, action processing, child order execution, simulation of benchmarks, reward calculations etc. We give examples of each component, explore the difficulties their individual implementations \& the interactions between them entail, and discuss the different phenomena that each component induces in the simulation, highlighting the divergences between the simulation and the behavior of a real market. We showcase our modular implementation through a setup that, following a Time-Weighted Average Price (TWAP) order submission schedule, allows the agent to exclusively place limit orders, simulates their execution via iterating over snapshots of the Limit Order Book (LOB), and calculates rewards as the \$ improvement over the price achieved by a TWAP benchmark algorithm following the same schedule. We also develop evaluation procedures that incorporate iterative re-training and evaluation of a given agent over intervals of a training horizon, mimicking how an agent may behave when being continuously retrained as new market data becomes available and emulating the monitoring practices that algorithm providers are bound to perform under current regulatory frameworks.
△ Less
Submitted 11 August, 2022;
originally announced August 2022.
-
Automated machine learning for borehole resistivity measurements
Authors:
M. Shahriari,
D. Pardo,
S. Kargaran,
T. Teijeiro
Abstract:
Deep neural networks (DNNs) offer a real-time solution for the inversion of borehole resistivity measurements to approximate forward and inverse operators. It is possible to use extremely large DNNs to approximate the operators, but it demands a considerable training time. Moreover, evaluating the network after training also requires a significant amount of memory and processing power. In addition…
▽ More
Deep neural networks (DNNs) offer a real-time solution for the inversion of borehole resistivity measurements to approximate forward and inverse operators. It is possible to use extremely large DNNs to approximate the operators, but it demands a considerable training time. Moreover, evaluating the network after training also requires a significant amount of memory and processing power. In addition, we may overfit the model. In this work, we propose a scoring function that accounts for the accuracy and size of the DNNs compared to a reference DNN that provides a good approximation for the operators. Using this scoring function, we use DNN architecture search algorithms to obtain a quasi-optimal DNN smaller than the reference network; hence, it requires less computational effort during training and evaluation. The quasi-optimal DNN delivers comparable accuracy to the original large DNN.
△ Less
Submitted 20 July, 2022;
originally announced July 2022.
-
Performance of Refined Isogeometric Analysis in Solving Quadratic Eigenvalue Problems
Authors:
Ali Hashemian,
Daniel Garcia,
David Pardo,
Victor M. Calo
Abstract:
Certain applications that analyze dam** effects require the solution of quadratic eigenvalue problems (QEPs). We use refined isogeometric analysis (rIGA) to solve quadratic eigenproblems. rIGA discretization, while conserving desirable properties of maximum-continuity isogeometric analysis (IGA), reduces the interconnection between degrees of freedom by adding low-continuity basis functions. Thi…
▽ More
Certain applications that analyze dam** effects require the solution of quadratic eigenvalue problems (QEPs). We use refined isogeometric analysis (rIGA) to solve quadratic eigenproblems. rIGA discretization, while conserving desirable properties of maximum-continuity isogeometric analysis (IGA), reduces the interconnection between degrees of freedom by adding low-continuity basis functions. This connectivity reduction in rIGA's algebraic system results in faster matrix LU factorizations when using multifrontal direct solvers. We compare computational costs of rIGA versus those of IGA when employing Krylov eigensolvers to solve quadratic eigenproblems arising in 2D vector-valued multifield problems. For large problem sizes, the eigencomputation cost is governed by the cost of LU factorization, followed by costs of several matrix-vector and vector-vector multiplications, which correspond to Krylov projections. We minimize the computational cost by introducing C^0 and C^1 separators at specific element interfaces for our rIGA generalizations of the curl-conforming Nedelec and divergence-conforming Raviart-Thomas finite elements. Let p be the polynomial degree of basis functions; the LU factorization is up to O((p-1)^2) times faster when using rIGA compared to IGA in the asymptotic regime. Thus, rIGA theoretically improves the total eigencomputation cost by O((p-1)^2) for sufficiently large problem sizes. Yet, in practical cases of moderate-size eigenproblems, the improvement rate deteriorates as the number of computed eigenvalues increases because of multiple matrix-vector and vector-vector operations. Our numerical tests show that rIGA accelerates the solution of quadratic eigensystems by O(p-1) for moderately sized problems when we seek to compute a reasonable number of eigenvalues.
△ Less
Submitted 28 December, 2021;
originally announced December 2021.
-
Deep-Learning Inversion Method for the Interpretation of Noisy Logging-While-Drilling Resistivity Measurements
Authors:
Kyubo Noh,
David Pardo,
Carlos Torres-Verdin
Abstract:
Deep Learning (DL) inversion is a promising method for real time interpretation of logging while drilling (LWD) resistivity measurements for well navigation applications. In this context, measurement noise may significantly affect inversion results. Existing publications examining the effects of measurement noise on DL inversion results are scarce. We develop a method to generate training data set…
▽ More
Deep Learning (DL) inversion is a promising method for real time interpretation of logging while drilling (LWD) resistivity measurements for well navigation applications. In this context, measurement noise may significantly affect inversion results. Existing publications examining the effects of measurement noise on DL inversion results are scarce. We develop a method to generate training data sets and construct DL architectures that enhance the robustness of DL inversion methods in the presence of noisy LWD resistivity measurements. We use two synthetic resistivity models to test three approaches that explicitly consider the presence of noise: (1) adding noise to the measurements in the training set, (2) augmenting the training set by replicating it and adding varying noise realizations, and (3) adding a noise layer in the DL architecture. Numerical results confirm that the three approaches produce a denoising effect, yielding better inversion results in both predicted earth model and measurements compared not only to the basic DL inversion but also to traditional gradient based inversion results. A combination of the second and third approaches delivers the best results. The proposed methods can be readily generalized to multi dimensional DL inversion.
△ Less
Submitted 14 November, 2021;
originally announced November 2021.
-
On quadrature rules for solving Partial Differential Equations using Neural Networks
Authors:
Jon A. Rivera,
Jamie M. Taylor,
Ángel J. Omella,
David Pardo
Abstract:
Neural Networks have been widely used to solve Partial Differential Equations. These methods require to approximate definite integrals using quadrature rules. Here, we illustrate via 1D numerical examples the quadrature problems that may arise in these applications and propose different alternatives to overcome them, namely: Monte Carlo methods, adaptive integration, polynomial approximations of t…
▽ More
Neural Networks have been widely used to solve Partial Differential Equations. These methods require to approximate definite integrals using quadrature rules. Here, we illustrate via 1D numerical examples the quadrature problems that may arise in these applications and propose different alternatives to overcome them, namely: Monte Carlo methods, adaptive integration, polynomial approximations of the Neural Network output, and the inclusion of regularization terms in the loss. We also discuss the advantages and limitations of each proposed alternative. We advocate the use of Monte Carlo methods for high dimensions (above 3 or 4), and adaptive integration or polynomial approximations for low dimensions (3 or below). The use of regularization terms is a mathematically elegant alternative that is valid for any spacial dimension, however, it requires certain regularity assumptions on the solution and complex mathematical analysis when dealing with sophisticated Neural Networks.
△ Less
Submitted 30 October, 2021;
originally announced November 2021.
-
Design of borehole resistivity measurement acquisition systems using deep learning
Authors:
M. Shahriari,
A. Hazra,
D. Pardo
Abstract:
Borehole resistivity measurements recorded with logging-while-drilling (LWD) instruments are widely used for characterizing the earth's subsurface properties. They facilitate the extraction of natural resources such as oil and gas. LWD instruments require real-time inversions of electromagnetic measurements to estimate the electrical properties of the earth's subsurface near the well and possibly…
▽ More
Borehole resistivity measurements recorded with logging-while-drilling (LWD) instruments are widely used for characterizing the earth's subsurface properties. They facilitate the extraction of natural resources such as oil and gas. LWD instruments require real-time inversions of electromagnetic measurements to estimate the electrical properties of the earth's subsurface near the well and possibly correct the well trajectory. Deep Neural Network (DNN)-based methods are suitable for the rapid inversion of borehole resistivity measurements as they approximate the forward and inverse problem offline during the training phase and they only require a fraction of a second for the evaluation (aka prediction). However, the inverse problem generally admits multiple solutions. DNNs with traditional loss functions based on data misfit are ill-equipped for solving an inverse problem. This can be partially overcome by adding regularization terms to a loss function specifically designed for encoder-decoder architectures. But adding regularization seriously limits the number of possible solutions to a set of a priori desirable physical solutions. To avoid this, we use a two-step loss function without any regularization. In addition, to guarantee an inverse solution, we need a carefully selected measurement acquisition system with a sufficient number of measurements. In this work, we propose a DNN-based iterative algorithm for designing such a measurement acquisition system. We illustrate our DNN-based iterative algorithm via several synthetic examples. Numerical results show that the obtained measurement acquisition system is sufficient to identify and characterize both resistive and conductive layers above and below the logging instrument. Numerical results are promising, although further improvements are required to make our method amenable for industrial purposes.
△ Less
Submitted 12 January, 2021;
originally announced January 2021.
-
Refined isogeometric analysis for generalized Hermitian eigenproblems
Authors:
Ali Hashemian,
David Pardo,
Victor M. Calo
Abstract:
We use the refined isogeometric analysis (rIGA) to solve generalized Hermitian eigenproblems $({Ku=λMu})$. The rIGA framework conserves the desirable properties of maximum-continuity isogeometric analysis (IGA) discretizations while reducing the computation cost of the solution through partitioning the computational domain by adding zero-continuity basis functions. As a result, rIGA enriches the a…
▽ More
We use the refined isogeometric analysis (rIGA) to solve generalized Hermitian eigenproblems $({Ku=λMu})$. The rIGA framework conserves the desirable properties of maximum-continuity isogeometric analysis (IGA) discretizations while reducing the computation cost of the solution through partitioning the computational domain by adding zero-continuity basis functions. As a result, rIGA enriches the approximation space and decreases the interconnection between degrees of freedom. We compare computational costs of rIGA versus those of IGA when employing a Lanczos eigensolver with a shift-and-invert spectral transformation. When all eigenpairs within a given interval ${[λ_s,λ_e]}$ are of interest, we select several shifts ${σ_k\in[λ_s,λ_e]}$ using a spectrum slicing technique. For each shift $σ_k$, the cost of factorization of the spectral transformation matrix ${K-σ_k M}$ drives the total computational cost of the eigensolution. Several multiplications of the operator matrices ${(K-σ_k M)^{-1} M}$ by vectors follow this factorization. Let $p$ be the polynomial degree of basis functions and assume that IGA has maximum continuity of ${p-1}$, while rIGA introduces $C^0$ separators to minimize the factorization cost. For this setup, our theoretical estimates predict computational savings to compute a fixed number of eigenpairs of up to ${O(p^2)}$ in the asymptotic regime, that is, large problem sizes. Yet, our numerical tests show that for moderately-sized eigenproblems, the total computational cost reduction is $O(p)$. Nevertheless, rIGA improves the accuracy of every eigenpair of the first $N_0$ eigenvalues and eigenfunctions. Here, we allow $N_0$ to be as large as the total number of eigenmodes of the original maximum-continuity IGA discretization.
△ Less
Submitted 17 September, 2020;
originally announced September 2020.
-
Database Generation for Deep Learning Inversion of 2.5D Borehole Electromagnetic Measurements using Refined Isogeometric Analysis
Authors:
Ali Hashemian,
Daniel Garcia,
Jon Ander Rivera,
David Pardo
Abstract:
Borehole resistivity measurements are routinely inverted in real-time during geosteering operations. The inversion process can be efficiently performed with the help of advanced artificial intelligence algorithms such as deep learning. These methods require a large dataset that relates multiple earth models with the corresponding borehole resistivity measurements. In here, we propose to use an adv…
▽ More
Borehole resistivity measurements are routinely inverted in real-time during geosteering operations. The inversion process can be efficiently performed with the help of advanced artificial intelligence algorithms such as deep learning. These methods require a large dataset that relates multiple earth models with the corresponding borehole resistivity measurements. In here, we propose to use an advanced numerical method --refined isogeometric analysis (rIGA)-- to perform rapid and accurate 2.5D simulations and generate databases when considering arbitrary 2D earth models. Numerical results show that we can generate a meaningful synthetic database composed of 100,000 earth models with the corresponding measurements in 56 hours using a workstation equipped with two CPUs.
△ Less
Submitted 17 September, 2020;
originally announced September 2020.
-
Goal-oriented adaptivity for a conforming residual minimization method in a dual discontinuous Galerkin norm
Authors:
Sergio Rojas,
David Pardo,
Pouria Behnoudfar,
Victor M. Calo
Abstract:
We propose a goal-oriented mesh-adaptive algorithm for a finite element method stabilized via residual minimization on dual discontinuous-Galerkin norms. By solving a saddle-point problem, this residual minimization delivers a stable continuous approximation to the solution on each mesh instance and a residual projection onto a broken polynomial space, which is a robust error estimator to minimize…
▽ More
We propose a goal-oriented mesh-adaptive algorithm for a finite element method stabilized via residual minimization on dual discontinuous-Galerkin norms. By solving a saddle-point problem, this residual minimization delivers a stable continuous approximation to the solution on each mesh instance and a residual projection onto a broken polynomial space, which is a robust error estimator to minimize the discrete energy norm via automatic mesh refinement. In this work, we propose and analyze a goal-oriented adaptive algorithm for this stable residual minimization. We solve the primal and adjoint problems considering the same saddle-point formulation and different right-hand sides. By solving a third stable problem, we obtain two efficient error estimates to guide goal-oriented adaptivity. We illustrate the performance of this goal-oriented adaptive strategy on advection-diffusion-reaction problems.
△ Less
Submitted 17 December, 2020; v1 submitted 17 July, 2020;
originally announced July 2020.
-
Modeling extra-deep electromagnetic logs using a deep neural network
Authors:
Sergey Alyaev,
Mostafa Shahriari,
David Pardo,
Angel Javier Omella,
David Larsen,
Nazanin Jahani,
Erich Suter
Abstract:
Modern geosteering is heavily dependent on real-time interpretation of deep electromagnetic (EM) measurements. We present a methodology to construct a deep neural network (DNN) model trained to reproduce a full set of extra-deep EM logs consisting of 22 measurements per logging position. The model is trained in a 1D layered environment consisting of up to seven layers with different resistivity va…
▽ More
Modern geosteering is heavily dependent on real-time interpretation of deep electromagnetic (EM) measurements. We present a methodology to construct a deep neural network (DNN) model trained to reproduce a full set of extra-deep EM logs consisting of 22 measurements per logging position. The model is trained in a 1D layered environment consisting of up to seven layers with different resistivity values. A commercial simulator provided by a tool vendor is used to generate a training dataset. The dataset size is limited because the simulator provided by the vendor is optimized for sequential execution. Therefore, we design a training dataset that embraces the geological rules and geosteering specifics supported by the forward model. We use this dataset to produce an EM simulator based on a DNN without access to the proprietary information about the EM tool configuration or the original simulator source code. Despite employing a relatively small training set size, the resulting DNN forward model is quite accurate for the considered examples: a multi-layer synthetic case and a section of a published historical operation from the Goliat Field. The observed average evaluation time of 0.15 ms per logging position makes it also suitable for future use as part of evaluation-hungry statistical and/or Monte-Carlo inversion algorithms within geosteering workflows.
△ Less
Submitted 13 August, 2021; v1 submitted 18 May, 2020;
originally announced May 2020.
-
Error Control and Loss Functions for the Deep Learning Inversion of Borehole Resistivity Measurements
Authors:
M. Shahriari,
D. Pardo,
J. A. Rivera,
C. Torres-Verdín,
A. Picon,
J. Del Ser,
S. Ossandón,
V. M. Calo
Abstract:
Deep learning (DL) is a numerical method that approximates functions. Recently, its use has become attractive for the simulation and inversion of multiple problems in computational mechanics, including the inversion of borehole logging measurements for oil and gas applications. In this context, DL methods exhibit two key attractive features: a) once trained, they enable to solve an inverse problem…
▽ More
Deep learning (DL) is a numerical method that approximates functions. Recently, its use has become attractive for the simulation and inversion of multiple problems in computational mechanics, including the inversion of borehole logging measurements for oil and gas applications. In this context, DL methods exhibit two key attractive features: a) once trained, they enable to solve an inverse problem in a fraction of a second, which is convenient for borehole geosteering operations as well as in other real-time inversion applications. b) DL methods exhibit a superior capability for approximating highly-complex functions across different areas of knowledge. Nevertheless, as it occurs with most numerical methods, DL also relies on expert design decisions that are problem specific to achieve reliable and robust results. Herein, we investigate two key aspects of deep neural networks (DNNs) when applied to the inversion of borehole resistivity measurements: error control and adequate selection of the loss function. As we illustrate via theoretical considerations and extensive numerical experiments, these interrelated aspects are critical to recover accurate inversion results.
△ Less
Submitted 28 May, 2020; v1 submitted 7 May, 2020;
originally announced May 2020.
-
Common universal behaviors of magnetic domain walls driven by spin-polarized electrical current and magnetic field
Authors:
R. Diaz Pardo,
N. Moisan,
L. Albornoz,
A. Lemaitre,
J. Curiale,
V. Jeudy
Abstract:
We explore universal behaviors of magnetic domain wall driven by the spin-transfer of an electrical current, in a ferromagnetic (Ga,Mn)(As,P) thin film with perpendicular magnetic anisotropy. For a current direction transverse to domain wall, the dynamics of the thermally activated creep regime and the depinning transition are found to be compatible with a self-consistent universal description of…
▽ More
We explore universal behaviors of magnetic domain wall driven by the spin-transfer of an electrical current, in a ferromagnetic (Ga,Mn)(As,P) thin film with perpendicular magnetic anisotropy. For a current direction transverse to domain wall, the dynamics of the thermally activated creep regime and the depinning transition are found to be compatible with a self-consistent universal description of magnetic field induced domain wall dynamics. This common universal behavior, characteristic of the so-called quenched Edwards-Wilkinson universality class, is confirmed by a complementary and independent analysis of domain wall roughness. However, the tilting of domain walls and the formation of facets is produced by the directionality of interaction with the current, which acts as a magnetic field only in the direction transverse to domain wall.
△ Less
Submitted 5 September, 2019; v1 submitted 5 April, 2019;
originally announced April 2019.
-
Universal dimensional crossover of domain wall dynamics in ferromagnetic films
Authors:
W. Savero Torres,
R. Diaz Pardo,
S. Bustingorry,
A. B. Kolton,
A. Lemaître,
V. Jeudy
Abstract:
The magnetic domain wall motion driven by a magnetic field is studied in (Ga,Mn)As and (Ga,Mn)(As,P) films of different thicknesses. In the thermally activated creep regime, a kink in the velocity curves and a jump of the roughness exponent evidence a dimensional crossover in the domain wall dynamics. The measured values of the roughness exponent zeta_{1d} = 0.62 +/- 0.02 and zeta_{2d} = 0.45 +/-…
▽ More
The magnetic domain wall motion driven by a magnetic field is studied in (Ga,Mn)As and (Ga,Mn)(As,P) films of different thicknesses. In the thermally activated creep regime, a kink in the velocity curves and a jump of the roughness exponent evidence a dimensional crossover in the domain wall dynamics. The measured values of the roughness exponent zeta_{1d} = 0.62 +/- 0.02 and zeta_{2d} = 0.45 +/- 0.04 are compatible with theoretical predictions for the motion of elastic line (d = 1) and surface (d = 2) in two and three dimensional media, respectively.
△ Less
Submitted 1 December, 2018; v1 submitted 26 November, 2018;
originally announced November 2018.
-
A Deep Learning Approach to the Inversion of Borehole Resistivity Measurements
Authors:
M. Shahriari,
D. Pardo,
A. Picón,
A. Galdrán,
J. Del Ser,
C. Torres-Verdín
Abstract:
We use borehole resistivity measurements to map the electrical properties of the subsurface and to increase the productivity of a reservoir. When used for geosteering purposes, it becomes essential to invert them in real time. In this work, we explore the possibility of using Deep Neural Network (DNN) to perform a rapid inversion of borehole resistivity measurements. Herein, we build a DNN that ap…
▽ More
We use borehole resistivity measurements to map the electrical properties of the subsurface and to increase the productivity of a reservoir. When used for geosteering purposes, it becomes essential to invert them in real time. In this work, we explore the possibility of using Deep Neural Network (DNN) to perform a rapid inversion of borehole resistivity measurements. Herein, we build a DNN that approximates the following inverse problem: given a set of borehole resistivity measurements, the DNN is designed to deliver a physically meaningful and data-consistent piecewise one-dimensional layered model of the surrounding subsurface. Once the DNN is built, we can perform the actual inversion of the field measurements in real time. We illustrate the performance of DNN of logging-while-drilling measurements acquired on high-angle wells via synthetic data.
△ Less
Submitted 9 January, 2019; v1 submitted 5 October, 2018;
originally announced October 2018.
-
Variational Formulations for Explicit Runge-Kutta Methods
Authors:
Judit Muñoz-Matute,
David Pardo,
Victor M. Calo,
Elisabete Alberdi
Abstract:
Variational space-time formulations for Partial Differential Equations have been of great interest in the last decades. While it is known that implicit time marching schemes have variational structure, the Galerkin formulation of explicit methods in time remains elusive. In this work, we prove that the explicit Runge-Kutta methods can be expressed as discontinuous Petrov-Galerkin methods both in s…
▽ More
Variational space-time formulations for Partial Differential Equations have been of great interest in the last decades. While it is known that implicit time marching schemes have variational structure, the Galerkin formulation of explicit methods in time remains elusive. In this work, we prove that the explicit Runge-Kutta methods can be expressed as discontinuous Petrov-Galerkin methods both in space and time. We build trial and test spaces for the linear diffusion equation that lead to one, two, and general stage explicit Runge-Kutta methods. This approach enables us to design explicit time-domain (goal-oriented) adaptive algorithms
△ Less
Submitted 20 June, 2018;
originally announced June 2018.
-
Pinning of Domain Walls in thin Ferromagnetic Films
Authors:
Vincent Jeudy,
Rebeca Diaz Pardo,
Williams Savero Torres,
Sebastian Bustingorry,
Alejandro Kolton
Abstract:
We present a quantitative investigation of magnetic domain wall pinning in thin magnets with perpendicular anisotropy. A self-consistent description exploiting the universal features of the depinning and thermally activated sub-threshold creep regimes observed in the field driven domain wall velocity, is used to determine the effective pinning parameters controlling the domain wall dynamics: the e…
▽ More
We present a quantitative investigation of magnetic domain wall pinning in thin magnets with perpendicular anisotropy. A self-consistent description exploiting the universal features of the depinning and thermally activated sub-threshold creep regimes observed in the field driven domain wall velocity, is used to determine the effective pinning parameters controlling the domain wall dynamics: the effective height of pinning barriers, the depinning threshold, and the velocity at depinning. Within this framework, the analysis of results published in the literature allows for a quantitative comparison of pinning properties for a set of magnetic materials in a wide temperature range. On the basis of scaling arguments, the microscopic parameters controlling the pinning: the correlation length of pinning, the collectively pinned domain wall length (Larkin length) and the strength of pinning disorder, are estimated from the effective pinning and the micromagnetic parameters. The analysis of thermal effects reveals a crossover between different pinning length scales and strengths at low reduced temperature.
△ Less
Submitted 17 May, 2018; v1 submitted 23 September, 2017;
originally announced September 2017.
-
Excess velocity of magnetic domain walls close to the depinning field
Authors:
Nirvana B. Caballero,
Iván Fernández Aguirre,
Lucas J. Albornoz,
Alejandro B. Kolton,
Juan Carlos Rojas-Sánchez,
Sophie Collin,
Jean Marie George,
Rebeca Diaz Pardo,
Vincent Jeudy,
Sebastian Bustingorry,
Javier Curiale
Abstract:
Magnetic field driven domain wall velocities in [Co/Ni] based multilayers thin films have been measured using polar magneto-optic Kerr effect microscopy. The low field results are shown to be consistent with the universal creep regime of domain wall motion, characterized by a stretched exponential growth of the velocity with the inverse of the applied field. Approaching the depinning field from be…
▽ More
Magnetic field driven domain wall velocities in [Co/Ni] based multilayers thin films have been measured using polar magneto-optic Kerr effect microscopy. The low field results are shown to be consistent with the universal creep regime of domain wall motion, characterized by a stretched exponential growth of the velocity with the inverse of the applied field. Approaching the depinning field from below results in an unexpected excess velocity with respect to the creep law. We analyze these results using scaling theory to show that this speeding up of domain wall motion can be interpreted as due to the increase of the size of the deterministic relaxation close to the depinning transition. We propose a phenomenological model which allows to accurately fit the observed excess velocity and to obtain characteristic values for the depinning field $H_d$, the depinning temperature $T_d$, and the characteristic velocity scale $v_0$ for each sample.
△ Less
Submitted 23 January, 2018; v1 submitted 11 August, 2017;
originally announced August 2017.
-
Fast Trajectory Optimization for Legged Robots using Vertex-based ZMP Constraints
Authors:
Alexander W Winkler,
Farbod Farshidian,
Diego Pardo,
Michael Neunert,
Jonas Buchli
Abstract:
This paper combines the fast Zero-Moment-Point (ZMP) approaches that work well in practice with the broader range of capabilities of a Trajectory Optimization formulation, by optimizing over body motion, footholds and Center of Pressure simultaneously. We introduce a vertex-based representation of the support-area constraint, which can treat arbitrarily oriented point-, line-, and area-contacts un…
▽ More
This paper combines the fast Zero-Moment-Point (ZMP) approaches that work well in practice with the broader range of capabilities of a Trajectory Optimization formulation, by optimizing over body motion, footholds and Center of Pressure simultaneously. We introduce a vertex-based representation of the support-area constraint, which can treat arbitrarily oriented point-, line-, and area-contacts uniformly. This generalization allows us to create motions such quadrupedal walking, trotting, bounding, pacing, combinations and transitions between these, lim**, bipedal walking and push-recovery all with the same approach. This formulation constitutes a minimal representation of the physical laws (unilateral contact forces) and kinematic restrictions (range of motion) in legged locomotion, which allows us to generate various motion in less than a second. We demonstrate the feasibility of the generated motions on a real quadruped robot.
△ Less
Submitted 27 May, 2017;
originally announced May 2017.
-
Universal Depinning Transition of Domain Walls in Ultrathin Ferromagnets
Authors:
Rebeca Diaz Pardo,
Williams Savero Torres,
Alejandro Kolton,
Sebastian Bustingorry,
Vincent Jeudy
Abstract:
We present a quantitative and comparative study of magnetic field driven domain wall depinning transition in different ferromagnetic ultrathin films over a wide range of temperature. We reveal a universal scaling function accounting for both drive and thermal effects on the depinning transition, including critical exponents. The consistent description we obtain for both the depinning and subthresh…
▽ More
We present a quantitative and comparative study of magnetic field driven domain wall depinning transition in different ferromagnetic ultrathin films over a wide range of temperature. We reveal a universal scaling function accounting for both drive and thermal effects on the depinning transition, including critical exponents. The consistent description we obtain for both the depinning and subthreshold thermally activated creep motion should shed light on the universal glassy dynamics of thermally fluctuating elastic objects pinned by disordered energy landscapes.
△ Less
Submitted 3 February, 2017; v1 submitted 26 November, 2016;
originally announced November 2016.
-
Sequential Linear Quadratic Optimal Control for Nonlinear Switched Systems
Authors:
Farbod Farshidian,
Maryam Kamgarpour,
Diego Pardo,
Jonas Buchli
Abstract:
In this contribution, we introduce an efficient method for solving the optimal control problem for an unconstrained nonlinear switched system with an arbitrary cost function. We assume that the sequence of the switching modes are given but the switching time in between consecutive modes remains to be optimized. The proposed method uses a two-stage approach as introduced by Xu and Antsaklis (2004)…
▽ More
In this contribution, we introduce an efficient method for solving the optimal control problem for an unconstrained nonlinear switched system with an arbitrary cost function. We assume that the sequence of the switching modes are given but the switching time in between consecutive modes remains to be optimized. The proposed method uses a two-stage approach as introduced by Xu and Antsaklis (2004) where the original optimal control problem is transcribed into an equivalent problem parametrized by the switching times and the optimal control policy is obtained based on the solution of a two-point boundary value differential equation. The main contribution of this paper is to use a Sequential Linear Quadratic approach to synthesize the optimal controller instead of solving a boundary value problem. The proposed method is numerically more efficient and scales very well to the high dimensional problems. In order to evaluate its performance, we use two numerical examples as benchmarks to compare against the baseline algorithm. In the third numerical example, we apply the proposed algorithm to the Center of Mass control problem in a quadruped robot locomotion task.
△ Less
Submitted 1 May, 2017; v1 submitted 7 September, 2016;
originally announced September 2016.
-
Symbolic Abstract Contract Synthesis in a Rewriting Framework
Authors:
María Alpuente,
Daniel Pardo,
Alicia Villanueva
Abstract:
We propose an automated technique for inferring software contracts from programs that are written in a non-trivial fragment of C, called KernelC, that supports pointer-based structures and heap manipulation. Starting from the semantic definition of KernelC in the K framework, we enrich the symbolic execution facilities recently provided by K with novel capabilities for assertion synthesis that are…
▽ More
We propose an automated technique for inferring software contracts from programs that are written in a non-trivial fragment of C, called KernelC, that supports pointer-based structures and heap manipulation. Starting from the semantic definition of KernelC in the K framework, we enrich the symbolic execution facilities recently provided by K with novel capabilities for assertion synthesis that are based on abstract subsumption. Roughly speaking, we define an abstract symbolic technique that explains the execution of a (modifier) C function by using other (observer) routines in the same program. We implemented our technique in the automated tool KindSpec 2.0, which generates logical axioms that express pre- and post-condition assertions by defining the precise input/output behaviour of the C routines.
△ Less
Submitted 19 August, 2016;
originally announced August 2016.
-
Automatic Inference of Specifications in the K Framework
Authors:
María Alpuente,
Daniel Pardo,
Alicia Villanueva
Abstract:
Despite its many unquestionable benefits, formal specifications are not widely used in industrial software development. In order to reduce the time and effort required to write formal specifications, in this paper we propose a technique for automatically discovering specifications from real code. The proposed methodology relies on the symbolic execution capabilities recently provided by the K fra…
▽ More
Despite its many unquestionable benefits, formal specifications are not widely used in industrial software development. In order to reduce the time and effort required to write formal specifications, in this paper we propose a technique for automatically discovering specifications from real code. The proposed methodology relies on the symbolic execution capabilities recently provided by the K framework that we exploit to automatically infer formal specifications from programs that are written in a non-trivial fragment of C, called KernelC. Roughly speaking, our symbolic analysis of KernelC programs explains the execution of a (modifier) function by using other (observer) routines in the program. We implemented our technique in the automated tool Kindspec 2.0, which generates axioms that describe the precise input/output behavior of C routines that handle pointer-based structures (i.e., result values and state change). We describe the implementation of our system and discuss the differences w.r.t. our previous work on inferring specifications from C code.
△ Less
Submitted 21 December, 2015;
originally announced December 2015.
-
Fourier finite element modeling of light emission in waveguides: 2.5-dimensional FEM approach
Authors:
Yangxin Ou,
David Pardo,
Yuntian Chen
Abstract:
We present a Fourier finite element modeling of light emission of dipolar emitters coupled to infinitely long waveguides. Due to the translational symmetry, the three-dimensional (3D) coupled waveguide-emitter system can be decomposed into a series of independent 2D problems (2.5D), which reduces the computational cost. Moreover, the reduced 2D problems can be extremely accurate, compared to its 3…
▽ More
We present a Fourier finite element modeling of light emission of dipolar emitters coupled to infinitely long waveguides. Due to the translational symmetry, the three-dimensional (3D) coupled waveguide-emitter system can be decomposed into a series of independent 2D problems (2.5D), which reduces the computational cost. Moreover, the reduced 2D problems can be extremely accurate, compared to its 3D counterpart. Our method can precisely quantify the total emission rates, as well as the fraction of emission rates into different modal channels for waveguides with arbitrary cross-sections. We compare our method with dyadic Green's function for the light emission in single mode metallic nanowire, which yields an excellent agreement. This method is applied in multi-mode waveguides, as well as multi-core waveguides. We further show that our method has the full capability of including dipole orientations, as illustrated via a rotating dipole, which leads to unidirectional excitation of guide modes. The 2.5D Finite Element Method (FEM) approach proposed here can be applied for various waveguides, thus it is useful to interface single-photon single-emitter in nano-structures, as well as for other scenarios involving coupled waveguide-emitters.
△ Less
Submitted 29 November, 2015;
originally announced November 2015.
-
Projection based whole body motion planning for legged robots
Authors:
Diego Pardo,
Michael Neunert,
Alexander W. Winkler,
Jonas Buchli
Abstract:
In this paper we present a new approach for dynamic motion planning for legged robots. We formulate a trajectory optimization problem based on a compact form of the robot dynamics. Such a form is obtained by projecting the rigid body dynamics onto the null space of the Constraint Jacobian. As consequence of the projection, contact forces are removed from the model but their effects are still taken…
▽ More
In this paper we present a new approach for dynamic motion planning for legged robots. We formulate a trajectory optimization problem based on a compact form of the robot dynamics. Such a form is obtained by projecting the rigid body dynamics onto the null space of the Constraint Jacobian. As consequence of the projection, contact forces are removed from the model but their effects are still taken into account. This approach permits to solve the optimal control problem of a floating base constrained multibody system while avoiding the use of an explicit contact model. We use direct transcription to numerically solve the optimization. As the contact forces are not part of the decision variables the size of the resultant discrete mathematical program is reduced and therefore solutions can be obtained in a tractable time. Using a predefined sequence of contact configurations (phases), our approach solves motions where contact switches occur. Transitions between phases are automatically resolved without using a model for switching dynamics. We present results on a hydraulic quadruped robot (HyQ), including single phase (standing, crouching) as well as multiple phase (rearing, diagonal leg balancing and step**) dynamic motions.
△ Less
Submitted 6 October, 2015;
originally announced October 2015.
-
Evaluating direct transcription and nonlinear optimization methods for robot motion planning
Authors:
Diego Pardo,
Lukas Möller,
Michael Neunert,
Alexander W. Winkler,
Jonas Buchli
Abstract:
This paper studies existing direct transcription methods for trajectory optimization applied to robot motion planning. There are diverse alternatives for the implementation of direct transcription. In this study we analyze the effects of such alternatives when solving a robotics problem. Different parameters such as integration scheme, number of discretization nodes, initialization strategies and…
▽ More
This paper studies existing direct transcription methods for trajectory optimization applied to robot motion planning. There are diverse alternatives for the implementation of direct transcription. In this study we analyze the effects of such alternatives when solving a robotics problem. Different parameters such as integration scheme, number of discretization nodes, initialization strategies and complexity of the problem are evaluated. We measure the performance of the methods in terms of computational time, accuracy and quality of the solution. Additionally, we compare two optimization methodologies frequently used to solve the transcribed problem, namely Sequential Quadratic Programming (SQP) and Interior Point Method (IPM). As a benchmark, we solve different motion tasks on an underactuated and non-minimal-phase ball-balancing robot with a 10 dimensional state space and 3 dimensional input space. Additionally, we validate the results on a simulated 3D quadrotor. Finally, as a verification of using direct transcription methods for trajectory optimization on real robots, we present hardware experiments on a motion task including path constraints and actuation limits.
△ Less
Submitted 29 January, 2016; v1 submitted 22 April, 2015;
originally announced April 2015.
-
A direct solver with reutilization of previously-computed LU factorizations for h-adaptive finite element grids with point singularities
Authors:
Maciej Paszynski,
Victor Calo,
David Pardo
Abstract:
This paper describes a direct solver algorithm for a sequence of finite element meshes that are h-refined towards one or several point singularities. For such a sequence of grids, the solver delivers linear computational cost O(N) in terms of CPU time and memory with respect to the number of unknowns N. The linear computational cost is achieved by utilizing the recursive structure provided by the…
▽ More
This paper describes a direct solver algorithm for a sequence of finite element meshes that are h-refined towards one or several point singularities. For such a sequence of grids, the solver delivers linear computational cost O(N) in terms of CPU time and memory with respect to the number of unknowns N. The linear computational cost is achieved by utilizing the recursive structure provided by the sequence of h-adaptive grids with a special construction of the elimination tree that allows for reutilization of previously computed partial LU factorizations over the entire unrefined part of the computational mesh. The reutilization technique reduces the computational cost of the entire sequence of h-refined grids from O(N^2) down to O(N). Theoretical estimates are illustrated with numerical results on two- and three-dimensional model problems exhibiting one or several point singularities.
△ Less
Submitted 10 December, 2012;
originally announced December 2012.
-
The cost of continuity: performance of iterative solvers on isogeometric finite elements
Authors:
Nathan Collier,
Lisandro Dalcin,
David Pardo,
V. M. Calo
Abstract:
In this paper we study how the use of a more continuous set of basis functions affects the cost of solving systems of linear equations resulting from a discretized Galerkin weak form. Specifically, we compare performance of linear solvers when discretizing using $C^0$ B-splines, which span traditional finite element spaces, and $C^{p-1}$ B-splines, which represent maximum continuity. We provide th…
▽ More
In this paper we study how the use of a more continuous set of basis functions affects the cost of solving systems of linear equations resulting from a discretized Galerkin weak form. Specifically, we compare performance of linear solvers when discretizing using $C^0$ B-splines, which span traditional finite element spaces, and $C^{p-1}$ B-splines, which represent maximum continuity. We provide theoretical estimates for the increase in cost of the matrix-vector product as well as for the construction and application of black-box preconditioners. We accompany these estimates with numerical results and study their sensitivity to various grid parameters such as element size $h$ and polynomial order of approximation $p$. Finally, we present timing results for a range of preconditioning options for the Laplace problem. We conclude that the matrix-vector product operation is at most $\slfrac{33p^2}{8}$ times more expensive for the more continuous space, although for moderately low $p$, this number is significantly reduced. Moreover, if static condensation is not employed, this number further reduces to at most a value of 8, even for high $p$. Preconditioning options can be up to $p^3$ times more expensive to setup, although this difference significantly decreases for some popular preconditioners such as Incomplete LU factorization.
△ Less
Submitted 13 June, 2012;
originally announced June 2012.
-
Computational complexity and memory usage for multi-frontal direct solvers in structured mesh finite elements
Authors:
Nathan Collier,
David Pardo,
Maciej Paszynski,
Victor M. Calo
Abstract:
The multi-frontal direct solver is the state-of-the-art algorithm for the direct solution of sparse linear systems. This paper provides computational complexity and memory usage estimates for the application of the multi-frontal direct solver algorithm on linear systems resulting from B-spline-based isogeometric finite elements, where the mesh is a structured grid. Specifically we provide the esti…
▽ More
The multi-frontal direct solver is the state-of-the-art algorithm for the direct solution of sparse linear systems. This paper provides computational complexity and memory usage estimates for the application of the multi-frontal direct solver algorithm on linear systems resulting from B-spline-based isogeometric finite elements, where the mesh is a structured grid. Specifically we provide the estimates for systems resulting from $C^{p-1}$ polynomial B-spline spaces and compare them to those obtained using $C^0$ spaces.
△ Less
Submitted 8 April, 2012;
originally announced April 2012.
-
Injection statistics simulator for dynamic analysis of noise in mesoscopic devices
Authors:
T. Gonzalez,
J. Mateos,
D. Pardo,
L. Varani,
L. Reggiani
Abstract:
We present a model for electron injection from thermal reservoirs which is applied to particle simulations of one-dimensional mesoscopic conductors. The statistics of injected carriers is correctly described from nondegenerate to completely degenerate conditions. The model is validated by comparing Monte Carlo simulations with existing analytical results for the case of ballistic conductors. An…
▽ More
We present a model for electron injection from thermal reservoirs which is applied to particle simulations of one-dimensional mesoscopic conductors. The statistics of injected carriers is correctly described from nondegenerate to completely degenerate conditions. The model is validated by comparing Monte Carlo simulations with existing analytical results for the case of ballistic conductors. An excellent agreement is found for average and noise characteristics, in particular, the fundamental unities of electrical and thermal conductances are exactly reproduced.
△ Less
Submitted 8 October, 1999;
originally announced October 1999.
-
Microscopic analysis of shot-noise suppression in nondegenerate diffusive conductors
Authors:
T. Gonzalez,
J. Mateos,
D. Pardo,
O. M. Bulashenko,
L. Reggiani
Abstract:
We present a theoretical investigation of shot-noise suppression due to long-range Coulomb interaction in nondegenerate diffusive conductors. Calculations make use of an ensemble Monte Carlo simulator self-consistently coupled with a one-dimensional Poisson solver. We analyze the noise in a lightly doped active region surrounded by two contacts acting as thermal reservoirs. By taking the do**…
▽ More
We present a theoretical investigation of shot-noise suppression due to long-range Coulomb interaction in nondegenerate diffusive conductors. Calculations make use of an ensemble Monte Carlo simulator self-consistently coupled with a one-dimensional Poisson solver. We analyze the noise in a lightly doped active region surrounded by two contacts acting as thermal reservoirs. By taking the do** of the injecting contacts and the applied voltage as variable parameters, the influence of elastic and inelastic scattering in the active region is investigated. The transition from ballistic to diffusive transport regimes under different contact injecting statistics is analyzed and discussed. Provided significant space-charge effects take place inside the active region, long-range Coulomb interaction is found to play an essential role in suppressing the shot noise at $qU \gg k_BT$. In the elastic diffusive regime, momentum space dimensionality is found to modify the suppression factor $γ$, which within numerical uncertainty takes values respectively of about 1/3, 1/2 and 0.7 in the 3D, 2D and 1D cases. In the inelastic diffusive regime, shot noise is suppressed to the thermal value.
△ Less
Submitted 5 November, 1998;
originally announced November 1998.
-
Universality of the 1/3 shot-noise suppression factor in nondegenerate diffusive conductors
Authors:
T. Gonzalez,
C. Gonzalez,
J. Mateos,
D. Pardo,
L. Reggiani,
O. M. Bulashenko,
J. M. Rubi
Abstract:
Shot-noise suppression is investigated in nondegenerate diffusive conductors by means of an ensemble Monte Carlo simulator. The universal 1/3 suppression value is obtained when transport occurs under elastic collision regime provided the following conditions are satisfied: (i) The applied voltage is much larger than the thermal value; (ii) the length of the device is much longer than both the el…
▽ More
Shot-noise suppression is investigated in nondegenerate diffusive conductors by means of an ensemble Monte Carlo simulator. The universal 1/3 suppression value is obtained when transport occurs under elastic collision regime provided the following conditions are satisfied: (i) The applied voltage is much larger than the thermal value; (ii) the length of the device is much longer than both the elastic mean free path and the Debye length. By fully suppressing carrier-number fluctuations, long range Coulomb interaction is essential to obtain the 1/3 value in the low-frequency limit.
△ Less
Submitted 30 March, 1998;
originally announced March 1998.
-
Electron-number statistics and shot-noise suppression by Coulomb correlation in nondegenerate ballistic transport
Authors:
O. M. Bulashenko,
J. Mateos,
D. Pardo,
T. Gonzalez,
L. Reggiani,
J. M. Rubi
Abstract:
Within a Monte Carlo simulation we investigate the statistical properties of an electron flow injected with a Poissonian distribution and transmitted under ballistic regime in the presence of long-range Coulomb interaction. Electrons are shown to exhibit a motional squeezing which tends to space them more regularly rather than strictly at random, and evidence a sub-Poissonian statistics with a s…
▽ More
Within a Monte Carlo simulation we investigate the statistical properties of an electron flow injected with a Poissonian distribution and transmitted under ballistic regime in the presence of long-range Coulomb interaction. Electrons are shown to exhibit a motional squeezing which tends to space them more regularly rather than strictly at random, and evidence a sub-Poissonian statistics with a substantially reduced Fano factor $F_n\ll 1$. The temporal (anti)correlation among carriers is demonstrated to be a collective effect which persists over the transit of several successive electrons and results in a considerable (more than one order of magnitude) shot noise suppression.
△ Less
Submitted 19 October, 1997;
originally announced October 1997.