-
Conformalized-DeepONet: A Distribution-Free Framework for Uncertainty Quantification in Deep Operator Networks
Authors:
Christian Moya,
Amirhossein Mollaali,
Zecheng Zhang,
Lu Lu,
Guang Lin
Abstract:
In this paper, we adopt conformal prediction, a distribution-free uncertainty quantification (UQ) framework, to obtain confidence prediction intervals with coverage guarantees for Deep Operator Network (DeepONet) regression. Initially, we enhance the uncertainty quantification frameworks (B-DeepONet and Prob-DeepONet) previously proposed by the authors by using split conformal prediction. By combining conformal prediction with our Prob- and B-DeepONets, we effectively quantify uncertainty by generating rigorous confidence intervals for DeepONet prediction. Additionally, we design a novel Quantile-DeepONet that allows for a more natural use of split conformal prediction. We refer to this distribution-free effective uncertainty quantification framework as split conformal Quantile-DeepONet regression. Finally, we demonstrate the effectiveness of the proposed methods on numerical examples involving ordinary and partial differential equations, as well as multi-fidelity learning.
Submitted 23 February, 2024;
originally announced February 2024.
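The split conformal step named in the abstract above can be illustrated with a generic textbook sketch. This is not the authors' implementation: the function name `split_conformal_interval`, the absolute-residual score, and the toy calibration data are all assumptions made for illustration, and any point predictor (for example, a trained DeepONet) could supply the predictions.

```python
import numpy as np

def split_conformal_interval(cal_pred, cal_true, test_pred, alpha=0.1):
    """Symmetric split conformal intervals from absolute residuals.

    cal_pred, cal_true: predictions and targets on a held-out calibration set.
    test_pred: point predictions for new inputs.
    alpha: target miscoverage level (intervals aim for 1 - alpha coverage).
    """
    cal_pred = np.asarray(cal_pred, dtype=float)
    cal_true = np.asarray(cal_true, dtype=float)
    test_pred = np.asarray(test_pred, dtype=float)

    n = cal_true.size
    # Nonconformity scores: absolute residuals on the calibration set, sorted.
    scores = np.sort(np.abs(cal_true - cal_pred))
    # Finite-sample rank ceil((n + 1) * (1 - alpha)); if it exceeds n,
    # the calibration set is too small and the interval is unbounded.
    k = int(np.ceil((n + 1) * (1 - alpha)))
    q = scores[k - 1] if k <= n else np.inf
    return test_pred - q, test_pred + q

# Hypothetical usage with toy numbers (not data from the paper).
lower, upper = split_conformal_interval(
    cal_pred=np.zeros(9),
    cal_true=np.arange(1, 10),
    test_pred=np.array([2.0]),
    alpha=0.5,
)
```

Under the usual exchangeability assumption between calibration and test points, intervals built this way cover the true value with probability at least 1 - alpha. A quantile-regression variant would replace the residual score with one based on predicted lower and upper quantiles.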
-
Magnetic nanoparticles: from the nanostructure to the physical properties
Authors:
Xavier Batlle,
Carlos Moya,
Mariona Escoda Torroellla,
Oscar Iglesias,
Arantxa Fraile Rodriguez,
Amilcar Labarta
Abstract:
Some of the synthesis methods and physical properties of iron-oxide based magnetic nanoparticles such as Fe3-xO4 and CoxFe3-xO4 are reviewed because of their interest in health, environmental applications, and ultra-high-density magnetic recording. Unlike high crystalline quality nanoparticles larger than a few nanometers that show bulk-like magnetic and electronic properties, nanostructures with increasing structural defects yield a progressive worsening of their general performance due to frozen magnetic disorder and local breaking of their crystalline symmetry. Thus, it is shown that single-crystal, monophasic nanoparticles do not exhibit significant surface or finite-size effects, such as spin canting, reduced saturation magnetization, high closure magnetic fields, hysteresis-loop shift or dead magnetic layer features which are mostly associated with crystallographic defective systems. Besides, the key role of the nanoparticle coating, surface anisotropy, and inter-particle interactions are discussed. Finally, the results of some single particle techniques -- magnetic force microscopy, X-ray photoemission electron microscopy, and electron magnetic chiral dichroism -- that allow studying individual nanoparticles down to sub-nanometer resolution with element, valence and magnetic selectivity, are presented. All in all, the intimate, fundamental correlation of the nanostructure (crystalline, chemical, magnetic) to the physical properties of the nanoparticles is ascertained.
Submitted 24 January, 2024;
originally announced January 2024.
-
Accelerating Approximate Thompson Sampling with Underdamped Langevin Monte Carlo
Authors:
Haoyang Zheng,
Wei Deng,
Christian Moya,
Guang Lin
Abstract:
Approximate Thompson sampling with Langevin Monte Carlo broadens its reach from Gaussian posterior sampling to encompass more general smooth posteriors. However, it still encounters scalability issues in high-dimensional problems when demanding high accuracy. To address this, we propose an approximate Thompson sampling strategy, utilizing underdamped Langevin Monte Carlo, where the latter is the go-to workhorse for simulations of high-dimensional posteriors. Based on the standard smoothness and log-concavity conditions, we study the accelerated posterior concentration and sampling using a specific potential function. This design improves the sample complexity for realizing logarithmic regrets from $\mathcal{\tilde O}(d)$ to $\mathcal{\tilde O}(\sqrt{d})$. The scalability and robustness of our algorithm are also empirically validated through synthetic experiments in high-dimensional bandit problems.
Submitted 20 June, 2024; v1 submitted 21 January, 2024;
originally announced January 2024.
-
B-LSTM-MIONet: Bayesian LSTM-based Neural Operators for Learning the Response of Complex Dynamical Systems to Length-Variant Multiple Input Functions
Authors:
Zhihao Kong,
Amirhossein Mollaali,
Christian Moya,
Na Lu,
Guang Lin
Abstract:
Deep Operator Network (DeepONet) is a neural network framework for learning nonlinear operators such as those from ordinary differential equations (ODEs) describing complex systems. Multiple-input deep neural operators (MIONet) extended DeepONet to allow multiple input functions in different Banach spaces. MIONet offers flexibility in training dataset grid spacing, without constraints on output location. However, it requires offline inputs and cannot handle varying sequence lengths in testing datasets, limiting its real-time application in dynamic complex systems. This work redesigns MIONet, integrating Long Short-Term Memory (LSTM) to learn neural operators from time-dependent data. This approach overcomes data discretization constraints and harnesses LSTM's capability with variable-length, real-time data. Factors affecting learning performance, such as the algorithm's extrapolation ability, are presented. The framework is enhanced with uncertainty quantification through a novel Bayesian method, sampling from MIONet parameter distributions. Consequently, we develop the B-LSTM-MIONet, incorporating LSTM's temporal strengths with Bayesian robustness, resulting in a more precise and reliable model for noisy datasets.
Submitted 29 November, 2023; v1 submitted 27 November, 2023;
originally announced November 2023.
-
A Physics-Guided Bi-Fidelity Fourier-Featured Operator Learning Framework for Predicting Time Evolution of Drag and Lift Coefficients
Authors:
Amirhossein Mollaali,
Izzet Sahin,
Iqrar Raza,
Christian Moya,
Guillermo Paniagua,
Guang Lin
Abstract:
In the pursuit of accurate experimental and computational data while minimizing effort, there is a constant need for high-fidelity results. However, achieving such results often requires significant computational resources. To address this challenge, this paper proposes a deep operator learning-based framework that requires a limited high-fidelity dataset for training. We introduce a novel physics-guided, bi-fidelity, Fourier-featured Deep Operator Network (DeepONet) framework that effectively combines low and high-fidelity datasets, leveraging the strengths of each. In our methodology, we begin by designing a physics-guided Fourier-featured DeepONet, drawing inspiration from the intrinsic physical behavior of the target solution. Subsequently, we train this network to primarily learn the low-fidelity solution, utilizing an extensive dataset. This process ensures a comprehensive grasp of the foundational solution patterns. Following this foundational learning, the low-fidelity deep operator network's output is enhanced using a physics-guided Fourier-featured residual deep operator network. This network refines the initial low-fidelity output, achieving the high-fidelity solution by employing a small high-fidelity dataset for training. Notably, in our framework, we employ the Fourier feature network as the Trunk network for the DeepONets, given its proficiency in capturing and learning the oscillatory nature of the target solution with high precision. We validate our approach using a well-known 2D benchmark cylinder problem, which aims to predict the time trajectories of lift and drag coefficients. The results highlight that the physics-guided Fourier-featured deep operator network, serving as a foundational building block of our framework, possesses superior predictive capability for the lift and drag coefficients compared to its data-driven counterparts.
Submitted 6 November, 2023;
originally announced November 2023.
-
D2NO: Efficient Handling of Heterogeneous Input Function Spaces with Distributed Deep Neural Operators
Authors:
Zecheng Zhang,
Christian Moya,
Lu Lu,
Guang Lin,
Hayden Schaeffer
Abstract:
Neural operators have been applied in various scientific fields, such as solving parametric partial differential equations, dynamical systems with control, and inverse problems. However, challenges arise when dealing with input functions that exhibit heterogeneous properties, requiring multiple sensors to handle functions with minimal regularity. To address this issue, discretization-invariant neural operators have been used, allowing the sampling of diverse input functions with different sensor locations. However, existing frameworks still require an equal number of sensors for all functions. In our study, we propose a novel distributed approach to further relax the discretization requirements and solve the heterogeneous dataset challenges. Our method involves partitioning the input function space and processing individual input functions using independent and separate neural networks. A centralized neural network is used to handle shared information across all output functions. This distributed methodology reduces the number of gradient descent back-propagation steps, improving efficiency while maintaining accuracy. We demonstrate that the corresponding neural network is a universal approximator of continuous nonlinear operators and present four numerical examples to validate its performance.
Submitted 28 October, 2023;
originally announced October 2023.
-
Bayesian deep operator learning for homogenized to fine-scale maps for multiscale PDE
Authors:
Zecheng Zhang,
Christian Moya,
Wing Tat Leung,
Guang Lin,
Hayden Schaeffer
Abstract:
We present a new framework for computing fine-scale solutions of multiscale Partial Differential Equations (PDEs) using operator learning tools. Obtaining fine-scale solutions of multiscale PDEs can be challenging, but there are many inexpensive computational methods for obtaining coarse-scale solutions. Additionally, in many real-world applications, fine-scale solutions can only be observed at a limited number of locations. In order to obtain approximations or predictions of fine-scale solutions over general regions of interest, we propose to learn the operator mapping from coarse-scale solutions to fine-scale solutions using a limited number of (possibly noisy) observations of the fine-scale solutions. The approach is to train multi-fidelity homogenization maps using mathematically motivated neural operators. The operator learning framework can efficiently obtain the solution of multiscale PDEs at any arbitrary point, making our proposed framework a mesh-free solver. We verify our results on multiple numerical examples showing that our approach is an efficient mesh-free solver for multiscale PDEs.
Submitted 27 August, 2023;
originally announced August 2023.
-
Deep Operator Learning-based Surrogate Models with Uncertainty Quantification for Optimizing Internal Cooling Channel Rib Profiles
Authors:
Izzet Sahin,
Christian Moya,
Amirhossein Mollaali,
Guang Lin,
Guillermo Paniagua
Abstract:
This paper designs surrogate models with uncertainty quantification capabilities to improve the thermal performance of rib-turbulated internal cooling channels effectively. To construct the surrogate, we use the deep operator network (DeepONet) framework, a novel class of neural networks designed to approximate mappings between infinite-dimensional spaces using relatively small datasets. The proposed DeepONet takes an arbitrary continuous rib geometry with control points as input and outputs continuous detailed information about the distribution of pressure and heat transfer around the profiled ribs. The datasets needed to train and test the proposed DeepONet framework were obtained by simulating a 2D rib-roughened internal cooling channel. To accomplish this, we continuously modified the input rib geometry by adjusting the control points according to a simple random distribution with constraints, rather than following a predefined path or sampling method. The studied channel has a hydraulic diameter, Dh, of 66.7 mm, and a length-to-hydraulic diameter ratio, L/Dh, of 10. The ratio of rib center height to hydraulic diameter (e/Dh), which was not changed during the rib profile update, was maintained at a constant value of 0.048. The ribs were placed in the channel with a pitch-to-height ratio (P/e) of 10. In addition, we provide the proposed surrogates with effective uncertainty quantification capabilities. This is achieved by converting the DeepONet framework into a Bayesian DeepONet (B-DeepONet). B-DeepONet samples from the posterior distribution of DeepONet parameters using the novel framework of stochastic gradient replica-exchange MCMC.
Submitted 1 June, 2023;
originally announced June 2023.
-
NSGA-PINN: A Multi-Objective Optimization Method for Physics-Informed Neural Network Training
Authors:
Binghang Lu,
Christian B. Moya,
Guang Lin
Abstract:
This paper presents NSGA-PINN, a multi-objective optimization framework for effective training of Physics-Informed Neural Networks (PINNs). The proposed framework uses the Non-dominated Sorting Genetic Algorithm (NSGA-II) to enable traditional stochastic gradient optimization algorithms (e.g., ADAM) to escape local minima effectively. Additionally, the NSGA-II algorithm enables satisfying the initial and boundary conditions encoded into the loss function during physics-informed training precisely. We demonstrate the effectiveness of our framework by applying NSGA-PINN to several ordinary and partial differential equation problems. In particular, we show that the proposed framework can handle challenging inverse problems with noisy data.
Submitted 6 March, 2023; v1 submitted 3 March, 2023;
originally announced March 2023.
-
On Approximating the Dynamic Response of Synchronous Generators via Operator Learning: A Step Towards Building Deep Operator-based Power Grid Simulators
Authors:
Christian Moya,
Guang Lin,
Tianqiao Zhao,
Meng Yue
Abstract:
This paper designs an Operator Learning framework to approximate the dynamic response of synchronous generators. One can use such a framework to (i) design a neural-based generator model that can interact with a numerical simulator of the rest of the power grid or (ii) shadow the generator's transient response. To this end, we design a data-driven Deep Operator Network (DeepONet) that approximates the generators' infinite-dimensional solution operator. Then, we develop a DeepONet-based numerical scheme to simulate a given generator's dynamic response over a short/medium-term horizon. The proposed numerical scheme recursively employs the trained DeepONet to simulate the response for a given multi-dimensional input, which describes the interaction between the generator and the rest of the system. Furthermore, we develop a residual DeepONet numerical scheme that incorporates information from mathematical models of synchronous generators. We accompany this residual DeepONet scheme with an estimate for the prediction's cumulative error. We also design a data aggregation (DAgger) strategy that allows (i) employing supervised learning to train the proposed DeepONets and (ii) fine-tuning the DeepONet using aggregated training data that the DeepONet is likely to encounter during interactive simulations with other grid components. Finally, as a proof of concept, we demonstrate that the proposed DeepONet frameworks can effectively approximate the transient model of a synchronous generator.
Submitted 29 January, 2023;
originally announced January 2023.
-
DeepGraphONet: A Deep Graph Operator Network to Learn and Zero-shot Transfer the Dynamic Response of Networked Systems
Authors:
Yixuan Sun,
Christian Moya,
Guang Lin,
Meng Yue
Abstract:
This paper develops a Deep Graph Operator Network (DeepGraphONet) framework that learns to approximate the dynamics of a complex system (e.g. the power grid or traffic) with an underlying sub-graph structure. We build our DeepGraphONet by fusing the ability of (i) Graph Neural Networks (GNN) to exploit spatially correlated graph information and (ii) Deep Operator Networks (DeepONet) to approximate the solution operator of dynamical systems. The resulting DeepGraphONet can then predict the dynamics within a given short/medium-term time horizon by observing a finite history of the graph state information. Furthermore, we design our DeepGraphONet to be resolution-independent. That is, we do not require the finite history to be collected at the exact/same resolution. In addition, to disseminate the results from a trained DeepGraphONet, we design a zero-shot learning strategy that enables using it on a different sub-graph. Finally, empirical results on the (i) transient stability prediction problem of power grids and (ii) traffic flow forecasting problem of a vehicular system illustrate the effectiveness of the proposed DeepGraphONet.
Submitted 21 September, 2022;
originally announced September 2022.
-
Performance characterization and near-realtime monitoring of MUSE adaptive optics modes at Paranal
Authors:
T. Wevers,
F. J. Selman,
A. Reyes,
M. Vega,
J. Hartke,
F. Bian,
O. Beltramo-Martin,
R. Fétick,
S. Kamann,
J. Kolb,
T. Kravtsov,
C. Moya,
B. Neichel,
S. Oberti,
C. Reyes,
E. Valenti
Abstract:
The Multi Unit Spectroscopic Explorer (MUSE) is an integral field spectrograph on the Very Large Telescope Unit Telescope 4, capable of laser guide star assisted and tomographic adaptive optics using the GALACSI module. Its observing capabilities include a wide field (1 square arcmin), ground layer AO mode (WFM-AO) and a narrow field (7.5"x7.5"), laser tomography AO mode (NFM-AO). The latter has had several upgrades in the 4 years since commissioning, including an optimisation of the control matrices for the AO system and a new sub-electron noise detector for its infra-red low order wavefront sensor. We set out to quantify the NFM-AO system performance by analysing $\sim$230 spectrophotometric standard star observations taken over the last 3 years. To this end we expand upon previous work, designed to facilitate analysis of the WFM-AO system performance. We briefly describe the framework that will provide a user friendly, semi-automated way for system performance monitoring during science operations. We provide the results of our performance analysis, chiefly through the measured Strehl ratio and full width at half maximum (FWHM) of the core of the point spread function (PSF) using two PSF models, and correlations with atmospheric conditions. These results will feed into a range of applications, including providing a more accurate prediction of the system performance as implemented in the exposure time calculator, and the associated optimization of the scientific output for a given set of limiting atmospheric conditions.
Submitted 15 September, 2022;
originally announced September 2022.
-
On Learning the Dynamical Response of Nonlinear Control Systems with Deep Operator Networks
Authors:
Guang Lin,
Christian Moya,
Zecheng Zhang
Abstract:
We propose a Deep Operator Network (DeepONet) framework to learn the dynamic response of continuous-time nonlinear control systems from data. To this end, we first construct and train a DeepONet that approximates the control system's local solution operator. Then, we design a numerical scheme that recursively uses the trained DeepONet to simulate the control system's long/medium-term dynamic response for given control inputs and initial conditions. We accompany the proposed scheme with an estimate for the error bound of the associated cumulative error. Furthermore, we design a data-driven Runge-Kutta (RK) explicit scheme that uses the DeepONet forward pass and automatic differentiation to better approximate the system's response when the numerical scheme's step size is sufficiently small. Numerical experiments on the predator-prey, pendulum, and cart pole systems confirm that our DeepONet framework learns to approximate the dynamic response of nonlinear control systems effectively.
Submitted 26 September, 2023; v1 submitted 13 June, 2022;
originally announced June 2022.
-
DeepONet-Grid-UQ: A Trustworthy Deep Operator Framework for Predicting the Power Grid's Post-Fault Trajectories
Authors:
Christian Moya,
Shiqi Zhang,
Meng Yue,
Guang Lin
Abstract:
This paper proposes a new data-driven method for the reliable prediction of power system post-fault trajectories. The proposed method is based on the fundamentally new concept of Deep Operator Networks (DeepONets). Compared to traditional neural networks that learn to approximate functions, DeepONets are designed to approximate nonlinear operators. Under this operator framework, we design a DeepONet to (1) take as inputs the fault-on trajectories collected, for example, via simulation or phasor measurement units, and (2) provide as outputs the predicted post-fault trajectories. In addition, we endow our method with a much-needed ability to balance efficiency with reliable/trustworthy predictions via uncertainty quantification. To this end, we propose and compare two methods that enable quantifying the predictive uncertainty. First, we propose a \textit{Bayesian DeepONet} (B-DeepONet) that uses stochastic gradient Hamiltonian Monte-Carlo to sample from the posterior distribution of the DeepONet parameters. Then, we propose a \textit{Probabilistic DeepONet} (Prob-DeepONet) that uses a probabilistic training strategy to equip DeepONets with a form of automated uncertainty quantification, at virtually no extra computational cost. Finally, we validate the predictive power and uncertainty quantification capability of the proposed B-DeepONet and Prob-DeepONet using the IEEE 16-machine 68-bus system.
Submitted 14 February, 2022;
originally announced February 2022.
-
Accelerated replica exchange stochastic gradient Langevin diffusion enhanced Bayesian DeepONet for solving noisy parametric PDEs
Authors:
Guang Lin,
Christian Moya,
Zecheng Zhang
Abstract:
The Deep Operator Network (DeepONet) is a fundamentally different class of neural networks that we train to approximate nonlinear operators, including the solution operator of parametric partial differential equations (PDE). DeepONets have shown remarkable approximation and generalization capabilities even when trained with relatively small datasets. However, the performance of DeepONets deteriorates when the training data is polluted with noise, a scenario that occurs very often in practice. To enable DeepONets training with noisy data, we propose using the Bayesian framework of replica-exchange Langevin diffusion. Such a framework uses two particles, one for exploring and another for exploiting the loss function landscape of DeepONets. We show that the proposed framework's exploration and exploitation capabilities enable (1) improved training convergence for DeepONets in noisy scenarios and (2) attaching an uncertainty estimate for the predicted solutions of parametric PDEs. In addition, we show that replica-exchange Langevin diffusion (remarkably) also improves the DeepONet's mean prediction accuracy in noisy scenarios compared with vanilla DeepONets trained with state-of-the-art gradient-based optimization algorithms (e.g. Adam). To reduce the potentially high computational cost of replica exchange, in this work, we propose an accelerated training framework for replica-exchange Langevin diffusion that exploits the neural network architecture of DeepONets to reduce its computational cost up to 25% without compromising the proposed framework's performance. Finally, we illustrate the effectiveness of the proposed Bayesian framework using a series of experiments on four parametric PDE problems.
Submitted 3 November, 2021;
originally announced November 2021.
-
DAE-PINN: A Physics-Informed Neural Network Model for Simulating Differential-Algebraic Equations with Application to Power Networks
Authors:
Christian Moya,
Guang Lin
Abstract:
Deep learning-based surrogate modeling is becoming a promising approach for learning and simulating dynamical systems. Deep-learning methods, however, find learning stiff dynamics very challenging. In this paper, we develop DAE-PINN, the first effective deep-learning framework for learning and simulating the solution trajectories of nonlinear differential-algebraic equations (DAE), which present a form of infinite stiffness and describe, for example, the dynamics of power networks. Our DAE-PINN bases its effectiveness on the synergy between implicit Runge-Kutta time-stepping schemes (designed specifically for solving DAEs) and physics-informed neural networks (PINN) (deep neural networks that we train to satisfy the dynamics of the underlying problem). Furthermore, our framework (i) enforces the neural network to satisfy the DAEs as (approximate) hard constraints using a penalty-based method and (ii) enables simulating DAEs for long-time horizons. We showcase the effectiveness and accuracy of DAE-PINN by learning and simulating the solution trajectories of a three-bus power network.
Submitted 9 September, 2021;
originally announced September 2021.
-
Discovery of two Einstein crosses from massive post--blue nugget galaxies at z>1 in KiDS
Authors:
N. R. Napolitano,
R. Li,
C. Spiniello,
C. Tortora,
A. Sergeyev,
G. D'Ago,
X. Guo,
L. Xie,
M. Radovich,
N. Roy,
L. V. E. Koopmans,
K. Kuijken,
M. Bilicki,
T. Erben,
F. Getman,
C. Heymans,
H. Hildebrandt,
C. Moya,
H. Y. Shan,
G. Vernardos,
A. H. Wright
Abstract:
We report the discovery of two Einstein Crosses (ECs) in the footprint of the Kilo-Degree Survey (KiDS): KIDS J232940-340922 and KIDS J122456+005048. Using integral field spectroscopy from MUSE@VLT, we confirm their gravitational-lens nature. In both cases, the four spectra of the source clearly show a prominence of absorption features, hence revealing an evolved stellar population with little star formation. The lensing model of the two systems, assuming a singular isothermal ellipsoid (SIE) with external shear, shows that: 1) the two crosses, located at redshift $z=0.38$ and 0.24, have Einstein radius $R_{\rm E}=5.2$ kpc and 5.4 kpc, respectively; 2) their projected dark matter fractions inside the half effective radius are 0.60 and 0.56 (Chabrier IMF); 3) the sources are ultra-compact galaxies, $R_{\rm e}\sim0.9$ kpc (at redshift $z_{\rm s}=1.59$) and $R_{\rm e}\sim0.5$ kpc ($z_{\rm s}=1.10$), respectively. These results are unaffected by the underlying mass density assumption. Due to size, blue color and absorption-dominated spectra, corroborated by low specific star-formation rates derived from optical-NIR spectral energy distribution fitting, we argue that the two lensed sources in these ECs are blue nuggets migrating toward their quenching phase.
Submitted 19 November, 2020; v1 submitted 18 November, 2020;
originally announced November 2020.
-
Magnetization process of atacamite: a case of weakly coupled $S = 1/2$ sawtooth chains
Authors:
L. Heinze,
H. O. Jeschke,
I. I. Mazin,
A. Metavitsiadis,
M. Reehuis,
R. Feyerherm,
J.-U. Hoffmann,
M. Bartkowiak,
O. Prokhnenko,
A. U. B. Wolter,
X. Ding,
V. S. Zapf,
C. Corvalán Moya,
F. Weickert,
M. Jaime,
K. C. Rule,
D. Menzel,
R. Valentí,
W. Brenig,
S. Süllow
Abstract:
We present a combined experimental and theoretical study of the mineral atacamite Cu$_2$Cl(OH)$_3$. Density functional theory yields a Hamiltonian describing anisotropic sawtooth chains with weak 3D connections. Experimentally, we fully characterize the antiferromagnetically ordered state. Magnetic order shows a complex evolution with the magnetic field, while, starting at 31.5 T, we observe a plateau-like magnetization at about $M_{\rm sat}/2$. Based on complementary theoretical approaches, we show that the latter is unrelated to the known magnetization plateau of a sawtooth chain. Instead, we provide evidence that the magnetization process in atacamite is a field-driven canting of a 3D network of weakly coupled sawtooth chains that form giant moments.
Submitted 1 April, 2021; v1 submitted 16 April, 2019;
originally announced April 2019.
-
Magnetoelastic coupling in URu2Si2: Probing multipolar correlations in the hidden order state
Authors:
Mark Wartenbe,
Ryan E. Baumbach,
Arkady Shekhter,
Gregory S. Boebinger,
Eric D. Bauer,
Carolina Corvalan Moya,
Neil Harrison,
Ross D. McDonald,
Myron B. Salamon,
Marcelo Jaime
Abstract:
Time reversal symmetry and magnetoelastic correlations are probed by means of high-resolution volume dilatometry in URu2Si2 at cryogenic temperatures and magnetic fields more than enough to suppress the hidden order state at H_HO(T = 0.66 K) approximately 35 T. We report a significant crystal lattice volume expansion at and above H_HO(T), and even above T_HO, possibly a consequence of field-induced f-electron localization, and hysteresis at some high field phase boundaries that confirm volume involvement. We investigate in detail the magnetostriction and magnetization as the temperature is reduced over two decades from 50 K where the system is paramagnetic, to 0.5 K in the realms of the hidden order state. We find a dominant quadratic-in-field dependence delta L/L proportional to H^2, a result consistent with a state that is symmetric under time reversal. The data shows, however, an incipient yet unmistakable asymptotic approach to linear (delta L/L proportional to 1-H/H_0) for 15 T < H < H_HO(0.66 K) approximately 35 T at the lowest temperatures. We discuss these results in the framework of a Ginzburg-Landau formalism that proposes a complex order parameter for the HO to model the (H,T,p) phase diagram.
Submitted 29 April, 2019; v1 submitted 6 December, 2018;
originally announced December 2018.
-
Application of Correlation Indices on Intrusion Detection Systems: Protecting the Power Grid Against Coordinated Attacks
Authors:
Christian Moya,
Junho Hong,
Jiankang Wang
Abstract:
The future power grid will be characterized by the pervasive use of heterogeneous and non-proprietary information and communication technology, which exposes the power grid to a broad scope of cyber-attacks. In particular, Monitoring-Control Attacks (MCA) --i.e., attacks in which adversaries manipulate control decisions by fabricating measurement signals in the feedback loop-- are highly threatening. This is because MCAs are (i) more likely to happen with greater attack surface and lower cost, (ii) difficult to detect by hiding in measurement signals, and (iii) capable of inflicting severe consequences by coordinating attack resources. To defend against MCAs, we have developed a semantic analysis framework for Intrusion Detection Systems (IDS) in power grids. The framework consists of two parts running in parallel: a Correlation Index Generator (CIG), which indexes correlated MCAs, and a Correlation Knowledge-Base (CKB), which is updated aperiodically with attacks' Correlation Indices (CI). The framework has the advantage of detecting MCAs and estimating attack consequences with promising runtime and detection accuracy. To evaluate the performance of the framework, we computed its false alarm rates under different attack scenarios.
Submitted 9 June, 2018;
originally announced June 2018.
-
PyMUSE: a Python package for VLT/MUSE data
Authors:
Ismael Pessa,
Nicolas Tejos,
Cristobal Moya
Abstract:
This is a companion Focus Demonstration article to the PyMUSE Python package, demonstrating its usage and utilities for VLT/MUSE data analysis, which include a wide range of options for spectra extractions, the creation of different types of images, and compatibility with some commonly used software for astronomical data analysis, among others. PyMUSE is open-source software and can be found on GitHub for free use and distribution.
Submitted 16 March, 2018; v1 submitted 13 March, 2018;
originally announced March 2018.
-
A Protection Method in Active Distribution Grids with High Penetration of Renewable Energy Sources
Authors:
J. K. Wang,
Christian Moya
Abstract:
A protection method in active distribution networks is proposed in this paper. In active distribution systems, fault currents flow in multiple directions and present a varying range of values, which poses a great challenge to maintaining coordination among protective devices on feeders. The proposed protection method addresses this challenge by simultaneously adjusting DG's output power and protection devices' settings in pre-fault networks. Compared with previous protection solutions, the proposed method considers the influences from renewable DG's intermittency, and explores the economic and protection benefits of DG's active participation. The formulation of the proposed method is decomposed into two optimization sub-problems, coupled through the constraint on fuse-recloser coordination. This decomposed mathematical structure effectively eliminates the non-linearity arising from reclosers' time-current inverse characteristics, and greatly reduces computation efforts.
Submitted 2 February, 2018;
originally announced February 2018.
-
Developing Correlation Indices to Identify Coordinated Cyber-Attacks on Power Grids
Authors:
Christian Moya,
Jiankang Wang
Abstract:
Increasing reliance on Information and Communication Technology (ICT) exposes the power grid to cyber-attacks. In particular, Coordinated Cyber-Attacks (CCAs) are considered highly threatening and difficult to defend against, because they (i) possess higher disruptiveness by integrating greater resources from multiple attack entities, and (ii) present heterogeneous traits in cyber-space and the physical grid by hitting multiple targets to achieve the attack goal. Thus, and as opposed to independent attacks, whose severity is limited by the power grid's redundancy, CCAs could inflict disastrous consequences, such as blackouts. In this paper, we propose a method to develop Correlation Indices to defend against CCAs on static control applications. These proposed indices relate the targets of CCAs with attack goals on the power grid. Compared to related works, the proposed indices present the benefits of deployment simplicity and are capable of detecting more sophisticated attacks, such as measurement attacks. We demonstrate our method using measurement attacks against Security Constrained Economic Dispatch.
△ Less
Submitted 29 May, 2018; v1 submitted 3 July, 2017;
originally announced July 2017.
-
Quantification of dipolar interactions in Fe$_{3-x}$O$_4$ nanoparticles
Authors:
Carlos Moya,
Òscar Iglesias,
Xavier Batlle,
Amílcar Labarta
Abstract:
A general method for the quantification of dipolar interactions in assemblies of nanoparticles has been developed from a model sample constituted by magnetite nanoparticles of 5 nm in diameter, in powder form with oleic acid as a surfactant so that the particles were solely separated from each other through an organic layer of about 1 nm in thickness. This quantification is based on the comparison of the distribution of energy barriers for magnetization reversal obtained from time-dependent relaxation measurements starting from either (i) an almost random orientation of the particle magnetizations or (ii) a collinear arrangement of them prepared by previously field cooling the sample. Experimental results and numerical simulations show that the mean dipolar field acting on each single particle is significantly reduced when the particle magnetizations are collinearly aligned. In addition, the intrinsic distribution of the energy barriers of anisotropy for the non-interacting case was evaluated from a reference sample where the same magnetic particles were individually coated with a thick silica shell in order to make dipolar interactions negligible. Interestingly, the results of the numerical simulations account for the relative energy shift of the experimental energy barrier distributions corresponding to the interacting and non-interacting cases, thus supporting the validity of the proposed method for the quantification of dipolar interactions.
Submitted 3 August, 2015;
originally announced August 2015.