Search | arXiv e-print repository

Conformalized-DeepONet: A Distribution-Free Framework for Uncertainty Quantification in Deep Operator Networks

Authors: Christian Moya, Amirhossein Mollaali, Zecheng Zhang, Lu Lu, Guang Lin

Abstract: In this paper, we adopt conformal prediction, a distribution-free uncertainty quantification (UQ) framework, to obtain confidence prediction intervals with coverage guarantees for Deep Operator Network (DeepONet) regression. Initially, we enhance the uncertainty quantification frameworks (B-DeepONet and Prob-DeepONet) previously proposed by the authors by using split conformal prediction. By combi… ▽ More In this paper, we adopt conformal prediction, a distribution-free uncertainty quantification (UQ) framework, to obtain confidence prediction intervals with coverage guarantees for Deep Operator Network (DeepONet) regression. Initially, we enhance the uncertainty quantification frameworks (B-DeepONet and Prob-DeepONet) previously proposed by the authors by using split conformal prediction. By combining conformal prediction with our Prob- and B-DeepONets, we effectively quantify uncertainty by generating rigorous confidence intervals for DeepONet prediction. Additionally, we design a novel Quantile-DeepONet that allows for a more natural use of split conformal prediction. We refer to this distribution-free effective uncertainty quantification framework as split conformal Quantile-DeepONet regression. Finally, we demonstrate the effectiveness of the proposed methods using various ordinary, partial differential equation numerical examples, and multi-fidelity learning. △ Less

Submitted 23 February, 2024; originally announced February 2024.

arXiv:2401.13422 [pdf]

doi 10.1016/j.jmmm.2021.168594

Magnetic nanoparticles: from the nanostructure to the physical properties

Authors: Xavier Batlle, Carlos Moya, Mariona Escoda Torroellla, Oscar Iglesias, Arantxa Fraile Rodriguez, Amilcar Labarta

Abstract: Some of the synthesis methods and physical properties of iron-oxide based magnetic nanoparticles such as Fe3-xO4 and CoxFe3-xO4 are reviewed because of their interest in health, environmental applications, and ultra-high-density magnetic recording. Unlike high crystalline quality nanoparticles larger than a few nanometers that show bulk-like magnetic and electronic properties, nanostructures with… ▽ More Some of the synthesis methods and physical properties of iron-oxide based magnetic nanoparticles such as Fe3-xO4 and CoxFe3-xO4 are reviewed because of their interest in health, environmental applications, and ultra-high-density magnetic recording. Unlike high crystalline quality nanoparticles larger than a few nanometers that show bulk-like magnetic and electronic properties, nanostructures with increasing structural defects yield a progressive worsening of their general performance due to frozen magnetic disorder and local breaking of their crystalline symmetry. Thus, it is shown that single-crystal, monophasic nanoparticles do not exhibit significant surface or finite-size effects, such as spin canting, reduced saturation magnetization, high closure magnetic fields, hysteresis-loop shift or dead magnetic layer features which are mostly associated with crystallographic defective systems. Besides, the key role of the nanoparticle coating, surface anisotropy, and inter-particle interactions are discussed. Finally, the results of some single particle techniques -- magnetic force microscopy, X-ray photoemission electron microscopy, and electron magnetic chiral dichroism -- that allow studying individual nanoparticles down to sub-nanometer resolution with element, valence and magnetic selectivity, are presented. All in all, the intimate, fundamental correlation of the nanostructure (crystalline, chemical, magnetic) to the physical properties of the nanoparticles is ascertained. △ Less

Submitted 24 January, 2024; originally announced January 2024.

Journal ref: Journal of Magnetism and Magnetic Materials 543 (2022) 168594

arXiv:2401.11665 [pdf, other]

Accelerating Approximate Thompson Sampling with Underdamped Langevin Monte Carlo

Authors: Haoyang Zheng, Wei Deng, Christian Moya, Guang Lin

Abstract: Approximate Thompson sampling with Langevin Monte Carlo broadens its reach from Gaussian posterior sampling to encompass more general smooth posteriors. However, it still encounters scalability issues in high-dimensional problems when demanding high accuracy. To address this, we propose an approximate Thompson sampling strategy, utilizing underdamped Langevin Monte Carlo, where the latter is the g… ▽ More Approximate Thompson sampling with Langevin Monte Carlo broadens its reach from Gaussian posterior sampling to encompass more general smooth posteriors. However, it still encounters scalability issues in high-dimensional problems when demanding high accuracy. To address this, we propose an approximate Thompson sampling strategy, utilizing underdamped Langevin Monte Carlo, where the latter is the go-to workhorse for simulations of high-dimensional posteriors. Based on the standard smoothness and log-concavity conditions, we study the accelerated posterior concentration and sampling using a specific potential function. This design improves the sample complexity for realizing logarithmic regrets from $\mathcal{\tilde O}(d)$ to $\mathcal{\tilde O}(\sqrt{d})$. The scalability and robustness of our algorithm are also empirically validated through synthetic experiments in high-dimensional bandit problems. △ Less

Submitted 20 June, 2024; v1 submitted 21 January, 2024; originally announced January 2024.

Comments: 52 pages, 2 figures

arXiv:2311.16519 [pdf, other]

B-LSTM-MIONet: Bayesian LSTM-based Neural Operators for Learning the Response of Complex Dynamical Systems to Length-Variant Multiple Input Functions

Authors: Zhihao Kong, Amirhossein Mollaali, Christian Moya, Na Lu, Guang Lin

Abstract: Deep Operator Network (DeepONet) is a neural network framework for learning nonlinear operators such as those from ordinary differential equations (ODEs) describing complex systems. Multiple-input deep neural operators (MIONet) extended DeepONet to allow multiple input functions in different Banach spaces. MIONet offers flexibility in training dataset grid spacing, without constraints on output lo… ▽ More Deep Operator Network (DeepONet) is a neural network framework for learning nonlinear operators such as those from ordinary differential equations (ODEs) describing complex systems. Multiple-input deep neural operators (MIONet) extended DeepONet to allow multiple input functions in different Banach spaces. MIONet offers flexibility in training dataset grid spacing, without constraints on output location. However, it requires offline inputs and cannot handle varying sequence lengths in testing datasets, limiting its real-time application in dynamic complex systems. This work redesigns MIONet, integrating Long Short Term Memory (LSTM) to learn neural operators from time-dependent data. This approach overcomes data discretization constraints and harnesses LSTM's capability with variable-length, real-time data. Factors affecting learning performance, like algorithm extrapolation ability are presented. The framework is enhanced with uncertainty quantification through a novel Bayesian method, sampling from MIONet parameter distributions. Consequently, we develop the B-LSTM-MIONet, incorporating LSTM's temporal strengths with Bayesian robustness, resulting in a more precise and reliable model for noisy datasets. △ Less

Submitted 29 November, 2023; v1 submitted 27 November, 2023; originally announced November 2023.

arXiv:2311.03639 [pdf, other]

A Physics-Guided Bi-Fidelity Fourier-Featured Operator Learning Framework for Predicting Time Evolution of Drag and Lift Coefficients

Authors: Amirhossein Mollaali, Izzet Sahin, Iqrar Raza, Christian Moya, Guillermo Paniagua, Guang Lin

Abstract: In the pursuit of accurate experimental and computational data while minimizing effort, there is a constant need for high-fidelity results. However, achieving such results often requires significant computational resources. To address this challenge, this paper proposes a deep operator learning-based framework that requires a limited high-fidelity dataset for training. We introduce a novel physics… ▽ More In the pursuit of accurate experimental and computational data while minimizing effort, there is a constant need for high-fidelity results. However, achieving such results often requires significant computational resources. To address this challenge, this paper proposes a deep operator learning-based framework that requires a limited high-fidelity dataset for training. We introduce a novel physics-guided, bi-fidelity, Fourier-featured Deep Operator Network (DeepONet) framework that effectively combines low and high-fidelity datasets, leveraging the strengths of each. In our methodology, we began by designing a physics-guided Fourier-featured DeepONet, drawing inspiration from the intrinsic physical behavior of the target solution. Subsequently, we train this network to primarily learn the low-fidelity solution, utilizing an extensive dataset. This process ensures a comprehensive grasp of the foundational solution patterns. Following this foundational learning, the low-fidelity deep operator network's output is enhanced using a physics-guided Fourier-featured residual deep operator network. This network refines the initial low-fidelity output, achieving the high-fidelity solution by employing a small high-fidelity dataset for training. Notably, in our framework, we employ the Fourier feature network as the Trunk network for the DeepONets, given its proficiency in capturing and learning the oscillatory nature of the target solution with high precision. We validate our approach using a well-known 2D benchmark cylinder problem, which aims to predict the time trajectories of lift and drag coefficients. The results highlight that the physics-guided Fourier-featured deep operator network, serving as a foundational building block of our framework, possesses superior predictive capability for the lift and drag coefficients compared to its data-driven counterparts. △ Less

Submitted 6 November, 2023; originally announced November 2023.

Comments: 24 pages, 10 figures, 5 tables- submitted to Fluid

arXiv:2310.18888 [pdf, other]

D2NO: Efficient Handling of Heterogeneous Input Function Spaces with Distributed Deep Neural Operators

Authors: Zecheng Zhang, Christian Moya, Lu Lu, Guang Lin, Hayden Schaeffer

Abstract: Neural operators have been applied in various scientific fields, such as solving parametric partial differential equations, dynamical systems with control, and inverse problems. However, challenges arise when dealing with input functions that exhibit heterogeneous properties, requiring multiple sensors to handle functions with minimal regularity. To address this issue, discretization-invariant neu… ▽ More Neural operators have been applied in various scientific fields, such as solving parametric partial differential equations, dynamical systems with control, and inverse problems. However, challenges arise when dealing with input functions that exhibit heterogeneous properties, requiring multiple sensors to handle functions with minimal regularity. To address this issue, discretization-invariant neural operators have been used, allowing the sampling of diverse input functions with different sensor locations. However, existing frameworks still require an equal number of sensors for all functions. In our study, we propose a novel distributed approach to further relax the discretization requirements and solve the heterogeneous dataset challenges. Our method involves partitioning the input function space and processing individual input functions using independent and separate neural networks. A centralized neural network is used to handle shared information across all output functions. This distributed methodology reduces the number of gradient descent back-propagation steps, improving efficiency while maintaining accuracy. We demonstrate that the corresponding neural network is a universal approximator of continuous nonlinear operators and present four numerical examples to validate its performance. △ Less

Submitted 28 October, 2023; originally announced October 2023.

arXiv:2308.14188 [pdf, other]

Bayesian deep operator learning for homogenized to fine-scale maps for multiscale PDE

Authors: Zecheng Zhang, Christian Moya, Wing Tat Leung, Guang Lin, Hayden Schaeffer

Abstract: We present a new framework for computing fine-scale solutions of multiscale Partial Differential Equations (PDEs) using operator learning tools. Obtaining fine-scale solutions of multiscale PDEs can be challenging, but there are many inexpensive computational methods for obtaining coarse-scale solutions. Additionally, in many real-world applications, fine-scale solutions can only be observed at a… ▽ More We present a new framework for computing fine-scale solutions of multiscale Partial Differential Equations (PDEs) using operator learning tools. Obtaining fine-scale solutions of multiscale PDEs can be challenging, but there are many inexpensive computational methods for obtaining coarse-scale solutions. Additionally, in many real-world applications, fine-scale solutions can only be observed at a limited number of locations. In order to obtain approximations or predictions of fine-scale solutions over general regions of interest, we propose to learn the operator map** from coarse-scale solutions to fine-scale solutions using a limited number (and possibly noisy) observations of the fine-scale solutions. The approach is to train multi-fidelity homogenization maps using mathematically motivated neural operators. The operator learning framework can efficiently obtain the solution of multiscale PDEs at any arbitrary point, making our proposed framework a mesh-free solver. We verify our results on multiple numerical examples showing that our approach is an efficient mesh-free solver for multiscale PDEs. △ Less

Submitted 27 August, 2023; originally announced August 2023.

arXiv:2306.00810 [pdf, other]

Deep Operator Learning-based Surrogate Models with Uncertainty Quantification for Optimizing Internal Cooling Channel Rib Profiles

Authors: Izzet Sahin, Christian Moya, Amirhossein Mollaali, Guang Lin, Guillermo Paniagua

Abstract: This paper designs surrogate models with uncertainty quantification capabilities to improve the thermal performance of rib-turbulated internal cooling channels effectively. To construct the surrogate, we use the deep operator network (DeepONet) framework, a novel class of neural networks designed to approximate map**s between infinite-dimensional spaces using relatively small datasets. The propo… ▽ More This paper designs surrogate models with uncertainty quantification capabilities to improve the thermal performance of rib-turbulated internal cooling channels effectively. To construct the surrogate, we use the deep operator network (DeepONet) framework, a novel class of neural networks designed to approximate map**s between infinite-dimensional spaces using relatively small datasets. The proposed DeepONet takes an arbitrary continuous rib geometry with control points as input and outputs continuous detailed information about the distribution of pressure and heat transfer around the profiled ribs. The datasets needed to train and test the proposed DeepONet framework were obtained by simulating a 2D rib-roughened internal cooling channel. To accomplish this, we continuously modified the input rib geometry by adjusting the control points according to a simple random distribution with constraints, rather than following a predefined path or sampling method. The studied channel has a hydraulic diameter, Dh, of 66.7 mm, and a length-to-hydraulic diameter ratio, L/Dh, of 10. The ratio of rib center height to hydraulic diameter (e/Dh), which was not changed during the rib profile update, was maintained at a constant value of 0.048. The ribs were placed in the channel with a pitch-to-height ratio (P/e) of 10. In addition, we provide the proposed surrogates with effective uncertainty quantification capabilities. This is achieved by converting the DeepONet framework into a Bayesian DeepONet (B-DeepONet). B-DeepONet samples from the posterior distribution of DeepONet parameters using the novel framework of stochastic gradient replica-exchange MCMC. △ Less

Submitted 1 June, 2023; originally announced June 2023.

Comments: 25 pages, 12 figures, 4 tables- submitted to the International Journal of Heat and Mass Transfer

arXiv:2303.02219 [pdf, other]

NSGA-PINN: A Multi-Objective Optimization Method for Physics-Informed Neural Network Training

Authors: Binghang Lu, Christian B. Moya, Guang Lin

Abstract: This paper presents NSGA-PINN, a multi-objective optimization framework for effective training of Physics-Informed Neural Networks (PINNs). The proposed framework uses the Non-dominated Sorting Genetic Algorithm (NSGA-II) to enable traditional stochastic gradient optimization algorithms (e.g., ADAM) to escape local minima effectively. Additionally, the NSGA-II algorithm enables satisfying the init… ▽ More This paper presents NSGA-PINN, a multi-objective optimization framework for effective training of Physics-Informed Neural Networks (PINNs). The proposed framework uses the Non-dominated Sorting Genetic Algorithm (NSGA-II) to enable traditional stochastic gradient optimization algorithms (e.g., ADAM) to escape local minima effectively. Additionally, the NSGA-II algorithm enables satisfying the initial and boundary conditions encoded into the loss function during physics-informed training precisely. We demonstrate the effectiveness of our framework by applying NSGA-PINN to several ordinary and partial differential equation problems. In particular, we show that the proposed framework can handle challenging inverse problems with noisy data. △ Less

Submitted 6 March, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

Comments: 13 pages, 35 figures

arXiv:2301.12538 [pdf, other]

On Approximating the Dynamic Response of Synchronous Generators via Operator Learning: A Step Towards Building Deep Operator-based Power Grid Simulators

Authors: Christian Moya, Guang Lin, Tianqiao Zhao, Meng Yue

Abstract: This paper designs an Operator Learning framework to approximate the dynamic response of synchronous generators. One can use such a framework to (i) design a neural-based generator model that can interact with a numerical simulator of the rest of the power grid or (ii) shadow the generator's transient response. To this end, we design a data-driven Deep Operator Network~(DeepONet) that approximates… ▽ More This paper designs an Operator Learning framework to approximate the dynamic response of synchronous generators. One can use such a framework to (i) design a neural-based generator model that can interact with a numerical simulator of the rest of the power grid or (ii) shadow the generator's transient response. To this end, we design a data-driven Deep Operator Network~(DeepONet) that approximates the generators' infinite-dimensional solution operator. Then, we develop a DeepONet-based numerical scheme to simulate a given generator's dynamic response over a short/medium-term horizon. The proposed numerical scheme recursively employs the trained DeepONet to simulate the response for a given multi-dimensional input, which describes the interaction between the generator and the rest of the system. Furthermore, we develop a residual DeepONet numerical scheme that incorporates information from mathematical models of synchronous generators. We accompany this residual DeepONet scheme with an estimate for the prediction's cumulative error. We also design a data aggregation (DAgger) strategy that allows (i) employing supervised learning to train the proposed DeepONets and (ii) fine-tuning the DeepONet using aggregated training data that the DeepONet is likely to encounter during interactive simulations with other grid components. Finally, as a proof of concept, we demonstrate that the proposed DeepONet frameworks can effectively approximate the transient model of a synchronous generator. △ Less

Submitted 29 January, 2023; originally announced January 2023.

arXiv:2209.10622 [pdf, other]

DeepGraphONet: A Deep Graph Operator Network to Learn and Zero-shot Transfer the Dynamic Response of Networked Systems

Authors: Yixuan Sun, Christian Moya, Guang Lin, Meng Yue

Abstract: This paper develops a Deep Graph Operator Network (DeepGraphONet) framework that learns to approximate the dynamics of a complex system (e.g. the power grid or traffic) with an underlying sub-graph structure. We build our DeepGraphONet by fusing the ability of (i) Graph Neural Networks (GNN) to exploit spatially correlated graph information and (ii) Deep Operator Networks~(DeepONet) to approximate… ▽ More This paper develops a Deep Graph Operator Network (DeepGraphONet) framework that learns to approximate the dynamics of a complex system (e.g. the power grid or traffic) with an underlying sub-graph structure. We build our DeepGraphONet by fusing the ability of (i) Graph Neural Networks (GNN) to exploit spatially correlated graph information and (ii) Deep Operator Networks~(DeepONet) to approximate the solution operator of dynamical systems. The resulting DeepGraphONet can then predict the dynamics within a given short/medium-term time horizon by observing a finite history of the graph state information. Furthermore, we design our DeepGraphONet to be resolution-independent. That is, we do not require the finite history to be collected at the exact/same resolution. In addition, to disseminate the results from a trained DeepGraphONet, we design a zero-shot learning strategy that enables using it on a different sub-graph. Finally, empirical results on the (i) transient stability prediction problem of power grids and (ii) traffic flow forecasting problem of a vehicular system illustrate the effectiveness of the proposed DeepGraphONet. △ Less

Submitted 21 September, 2022; originally announced September 2022.

arXiv:2209.07540 [pdf, other]

doi 10.1117/12.2630835

Performance characterization and near-realtime monitoring of MUSE adaptive optics modes at Paranal

Authors: T. Wevers, F. J. Selman, A. Reyes, M. Vega, J. Hartke, F. Bian, O. Beltramo-Martin, R. Fétick, S. Kamann, J. Kolb, T. Kravtsov, C. Moya, B. Neichel, S. Oberti, C. Reyes, E. Valenti

Abstract: The Multi Unit Spectroscopic Explorer (MUSE) is an integral field spectrograph on the Very Large Telescope Unit Telescope 4, capable of laser guide star assisted and tomographic adaptive optics using the GALACSI module. Its observing capabilities include a wide field (1 square arcmin), ground layer AO mode (WFM-AO) and a narrow field (7.5"x7.5"), laser tomography AO mode (NFM-AO). The latter has h… ▽ More The Multi Unit Spectroscopic Explorer (MUSE) is an integral field spectrograph on the Very Large Telescope Unit Telescope 4, capable of laser guide star assisted and tomographic adaptive optics using the GALACSI module. Its observing capabilities include a wide field (1 square arcmin), ground layer AO mode (WFM-AO) and a narrow field (7.5"x7.5"), laser tomography AO mode (NFM-AO). The latter has had several upgrades in the 4 years since commissioning, including an optimisation of the control matrices for the AO system and a new sub-electron noise detector for its infra-red low order wavefront sensor. We set out to quantify the NFM-AO system performance by analysing $\sim$230 spectrophotometric standard star observations taken over the last 3 years. To this end we expand upon previous work, designed to facilitate analysis of the WFM-AO system performance. We briefly describe the framework that will provide a user friendly, semi-automated way for system performance monitoring during science operations. We provide the results of our performance analysis, chiefly through the measured Strehl ratio and full width at half maximum (FWHM) of the core of the point spread function (PSF) using two PSF models, and correlations with atmospheric conditions. These results will feed into a range of applications, including providing a more accurate prediction of the system performance as implemented in the exposure time calculator, and the associated optimization of the scientific output for a given set of limiting atmospheric conditions. △ Less

Submitted 15 September, 2022; originally announced September 2022.

Comments: SPIE proceedings (2022), Observatory Operations: Strategies, Processes, and Systems IX

Journal ref: Proc. of SPIE 2022 Vol. 12186, 121860T

arXiv:2206.06536 [pdf, other]

On Learning the Dynamical Response of Nonlinear Control Systems with Deep Operator Networks

Authors: Guang Lin, Christian Moya, Zecheng Zhang

Abstract: We propose a Deep Operator Network~(DeepONet) framework to learn the dynamic response of continuous-time nonlinear control systems from data. To this end, we first construct and train a DeepONet that approximates the control system's local solution operator. Then, we design a numerical scheme that recursively uses the trained DeepONet to simulate the control system's long/medium-term dynamic respo… ▽ More We propose a Deep Operator Network~(DeepONet) framework to learn the dynamic response of continuous-time nonlinear control systems from data. To this end, we first construct and train a DeepONet that approximates the control system's local solution operator. Then, we design a numerical scheme that recursively uses the trained DeepONet to simulate the control system's long/medium-term dynamic response for given control inputs and initial conditions. We accompany the proposed scheme with an estimate for the error bound of the associated cumulative error. Furthermore, we design a data-driven Runge-Kutta~(RK) explicit scheme that uses the DeepONet forward pass and automatic differentiation to better approximate the system's response when the numerical scheme's step size is sufficiently small. Numerical experiments on the predator-prey, pendulum, and cart pole systems confirm that our DeepONet framework learns to approximate the dynamic response of nonlinear control systems effectively. △ Less

Submitted 26 September, 2023; v1 submitted 13 June, 2022; originally announced June 2022.

arXiv:2202.07176 [pdf, other]

DeepONet-Grid-UQ: A Trustworthy Deep Operator Framework for Predicting the Power Grid's Post-Fault Trajectories

Authors: Christian Moya, Shiqi Zhang, Meng Yue, Guang Lin

Abstract: This paper proposes a new data-driven method for the reliable prediction of power system post-fault trajectories. The proposed method is based on the fundamentally new concept of Deep Operator Networks (DeepONets). Compared to traditional neural networks that learn to approximate functions, DeepONets are designed to approximate nonlinear operators. Under this operator framework, we design a DeepON… ▽ More This paper proposes a new data-driven method for the reliable prediction of power system post-fault trajectories. The proposed method is based on the fundamentally new concept of Deep Operator Networks (DeepONets). Compared to traditional neural networks that learn to approximate functions, DeepONets are designed to approximate nonlinear operators. Under this operator framework, we design a DeepONet to (1) take as inputs the fault-on trajectories collected, for example, via simulation or phasor measurement units, and (2) provide as outputs the predicted post-fault trajectories. In addition, we endow our method with a much-needed ability to balance efficiency with reliable/trustworthy predictions via uncertainty quantification. To this end, we propose and compare two methods that enable quantifying the predictive uncertainty. First, we propose a \textit{Bayesian DeepONet} (B-DeepONet) that uses stochastic gradient Hamiltonian Monte-Carlo to sample from the posterior distribution of the DeepONet parameters. Then, we propose a \textit{Probabilistic DeepONet} (Prob-DeepONet) that uses a probabilistic training strategy to equip DeepONets with a form of automated uncertainty quantification, at virtually no extra computational cost. Finally, we validate the predictive power and uncertainty quantification capability of the proposed B-DeepONet and Prob-DeepONet using the IEEE 16-machine 68-bus system. △ Less

Submitted 14 February, 2022; originally announced February 2022.

arXiv:2111.02484 [pdf, other]

Accelerated replica exchange stochastic gradient Langevin diffusion enhanced Bayesian DeepONet for solving noisy parametric PDEs

Authors: Guang Lin, Christian Moya, Zecheng Zhang

Abstract: The Deep Operator Networks~(DeepONet) is a fundamentally different class of neural networks that we train to approximate nonlinear operators, including the solution operator of parametric partial differential equations (PDE). DeepONets have shown remarkable approximation and generalization capabilities even when trained with relatively small datasets. However, the performance of DeepONets deterior… ▽ More The Deep Operator Networks~(DeepONet) is a fundamentally different class of neural networks that we train to approximate nonlinear operators, including the solution operator of parametric partial differential equations (PDE). DeepONets have shown remarkable approximation and generalization capabilities even when trained with relatively small datasets. However, the performance of DeepONets deteriorates when the training data is polluted with noise, a scenario that occurs very often in practice. To enable DeepONets training with noisy data, we propose using the Bayesian framework of replica-exchange Langevin diffusion. Such a framework uses two particles, one for exploring and another for exploiting the loss function landscape of DeepONets. We show that the proposed framework's exploration and exploitation capabilities enable (1) improved training convergence for DeepONets in noisy scenarios and (2) attaching an uncertainty estimate for the predicted solutions of parametric PDEs. In addition, we show that replica-exchange Langeving Diffusion (remarkably) also improves the DeepONet's mean prediction accuracy in noisy scenarios compared with vanilla DeepONets trained with state-of-the-art gradient-based optimization algorithms (e.g. Adam). To reduce the potentially high computational cost of replica, in this work, we propose an accelerated training framework for replica-exchange Langevin diffusion that exploits the neural network architecture of DeepONets to reduce its computational cost up to 25% without compromising the proposed framework's performance. Finally, we illustrate the effectiveness of the proposed Bayesian framework using a series of experiments on four parametric PDE problems. △ Less

Submitted 3 November, 2021; originally announced November 2021.

arXiv:2109.04304 [pdf, ps, other]

DAE-PINN: A Physics-Informed Neural Network Model for Simulating Differential-Algebraic Equations with Application to Power Networks

Authors: Christian Moya, Guang Lin

Abstract: Deep learning-based surrogate modeling is becoming a promising approach for learning and simulating dynamical systems. Deep-learning methods, however, find very challenging learning stiff dynamics. In this paper, we develop DAE-PINN, the first effective deep-learning framework for learning and simulating the solution trajectories of nonlinear differential-algebraic equations (DAE), which present a… ▽ More Deep learning-based surrogate modeling is becoming a promising approach for learning and simulating dynamical systems. Deep-learning methods, however, find very challenging learning stiff dynamics. In this paper, we develop DAE-PINN, the first effective deep-learning framework for learning and simulating the solution trajectories of nonlinear differential-algebraic equations (DAE), which present a form of infinite stiffness and describe, for example, the dynamics of power networks. Our DAE-PINN bases its effectiveness on the synergy between implicit Runge-Kutta time-step** schemes (designed specifically for solving DAEs) and physics-informed neural networks (PINN) (deep neural networks that we train to satisfy the dynamics of the underlying problem). Furthermore, our framework (i) enforces the neural network to satisfy the DAEs as (approximate) hard constraints using a penalty-based method and (ii) enables simulating DAEs for long-time horizons. We showcase the effectiveness and accuracy of DAE-PINN by learning and simulating the solution trajectories of a three-bus power network. △ Less

Submitted 9 September, 2021; originally announced September 2021.

arXiv:2011.09150 [pdf, other]

doi 10.3847/2041-8213/abc95b

Discovery of two Einstein crosses from massive post--blue nugget galaxies at z>1 in KiDS

Authors: N. R. Napolitano, R. Li, C. Spiniello, C. Tortora, A. Sergeyev, G. D'Ago, X. Guo, L. Xie, M. Radovich, N. Roy, L. V. E. Koopmans, K. Kuijken, M. Bilicki, T. Erben, F. Getman, C. Heymans, H. Hildebrandt, C. Moya, H. Y. Shan, G. Vernardos, A. H. Wright

Abstract: We report the discovery of two Einstein Crosses (ECs) in the footprint of the Kilo-Degree Survey (KiDS): KIDS J232940-340922 and KIDS J122456+005048. Using integral field spectroscopy from MUSE@VLT, we confirm their gravitational-lens nature. In both cases, the four spectra of the source clearly show a prominence of absorption features, hence revealing an evolved stellar population with little sta… ▽ More We report the discovery of two Einstein Crosses (ECs) in the footprint of the Kilo-Degree Survey (KiDS): KIDS J232940-340922 and KIDS J122456+005048. Using integral field spectroscopy from MUSE@VLT, we confirm their gravitational-lens nature. In both cases, the four spectra of the source clearly show a prominence of absorption features, hence revealing an evolved stellar population with little star formation. The lensing model of the two systems, assuming a singular isothermal ellipsoid (SIE) with external shear, shows that: 1) the two crosses, located at redshift $z=0.38$ and 0.24, have Einstein radius $R_{\rm E}=5.2$ kpc and 5.4 kpc, respectively; 2) their projected dark matter fractions inside the half effective radius are 0.60 and 0.56 (Chabrier IMF); 3) the sources are ultra-compact galaxies, $R_{\rm e}\sim0.9$ kpc (at redshift $z_{\rm s}=1.59$) and $R_{\rm e}\sim0.5$ kpc ($z_{\rm s}=1.10$), respectively. These results are unaffected by the underlying mass density assumption. Due to size, blue color and absorption-dominated spectra, corroborated by low specific star-formation rates derived from optical-NIR spectral energy distribution fitting, we argue that the two lensed sources in these ECs are blue nuggets migrating toward their quenching phase. △ Less

Submitted 19 November, 2020; v1 submitted 18 November, 2020; originally announced November 2020.

Comments: Accepted for publication on APJL

arXiv:1904.07820 [pdf, other]

doi 10.1103/PhysRevLett.126.207201

Magnetization process of atacamite: a case of weakly coupled $S = 1/2$ sawtooth chains

Authors: L. Heinze, H. O. Jeschke, I. I. Mazin, A. Metavitsiadis, M. Reehuis, R. Feyerherm, J. -U. Hoffmann, M. Bartkowiak, O. Prokhnenko, A. U. B. Wolter, X. Ding, V. S. Zapf, C. Corvalán Moya, F. Weickert, M. Jaime, K. C. Rule, D. Menzel, R. Valentí, W. Brenig, S. Süllow

Abstract: We present a combined experimental and theoretical study of the mineral atacamite Cu$_2$Cl(OH)$_3$. Density functional theory yields a Hamiltonian describing anisotropic sawtooth chains with weak 3D connections. Experimentally, we fully characterize the antiferromagnetically ordered state. Magnetic order shows a complex evolution with the magnetic field, while, starting at 31.5 T, we observe a pla… ▽ More We present a combined experimental and theoretical study of the mineral atacamite Cu$_2$Cl(OH)$_3$. Density functional theory yields a Hamiltonian describing anisotropic sawtooth chains with weak 3D connections. Experimentally, we fully characterize the antiferromagnetically ordered state. Magnetic order shows a complex evolution with the magnetic field, while, starting at 31.5 T, we observe a plateau-like magnetization at about $M_{\rm sat}/2$. Based on complementary theoretical approaches, we show that the latter is unrelated to the known magnetization plateau of a sawtooth chain. Instead, we provide evidence that the magnetization process in atacamite is a field-driven canting of a 3D network of weakly coupled sawtooth chains that form giant moments. △ Less

Submitted 1 April, 2021; v1 submitted 16 April, 2019; originally announced April 2019.

Journal ref: Phys. Rev. Lett. 126, 207201 (2021)

arXiv:1812.02798 [pdf, other]

doi 10.1103/PhysRevB.99.235101

Magnetoelastic coupling in URu2Si2: Probing multipolar correlations in the hidden order state

Authors: Mark Wartenbe, Ryan E. Baumbach, Arkady Shekhter, Gregory S. Boebinger, Eric D. Bauer, Carolina Corvalan Moya, Neil Harrison, Ross D. McDonald, Myron B. Salamon, Marcelo Jaime

Abstract: Time reversal symmetry and magnetoelastic correlations are probed by means of high-resolution volume dilatometry in URu2Si2 at cryogenic temperatures and magnetic fields more than enough to suppress the hidden order state at H_HO(T = 0.66 K) approximately 35 T. We report a significant crystal lattice volume expansion at and above H_HO(T), and even above T_HO, possibly a consequence of field-induce… ▽ More Time reversal symmetry and magnetoelastic correlations are probed by means of high-resolution volume dilatometry in URu2Si2 at cryogenic temperatures and magnetic fields more than enough to suppress the hidden order state at H_HO(T = 0.66 K) approximately 35 T. We report a significant crystal lattice volume expansion at and above H_HO(T), and even above T_HO, possibly a consequence of field-induced f-electron localization, and hysteresis at some high field phase boundaries that confirm volume involvement. We investigate in detail the magnetostriction and magnetization as the temperature is reduced over two decades from 50 K where the system is paramagnetic, to 0.5 K in the realms of the hidden order state. We find a dominant quadratic-in-field dependence delta L/L proportional to H^2, a result consistent with a state that is symmetric under time reversal. The data shows, however, an incipient yet unmistakable asymptotic approach to linear (delta L/L proportional to 1-H/H_0) for 15 T < H < H_HO(0.66 K) approximately 35 T at the lowest temperatures. We discuss these results in the framework of a Ginzburg-Landau formalism that proposes a complex order parameter for the HO to model the (H,T,p) phase diagram. △ Less

Submitted 29 April, 2019; v1 submitted 6 December, 2018; originally announced December 2018.

Journal ref: Phys. Rev. B 99, 235101 (2019)

arXiv:1806.03544 [pdf, ps, other]

Application of Correlation Indices on Intrusion Detection Systems: Protecting the Power Grid Against Coordinated Attacks

Authors: Christian Moya, Junho Hong, Jiankang Wang

Abstract: The future power grid will be characterized by the pervasive use of heterogeneous and non-proprietary information and communication technology, which exposes the power grid to a broad scope of cyber-attacks. In particular, Monitoring-Control Attacks (MCA) --i.e., attacks in which adversaries manipulate control decisions by fabricating measurement signals in the feedback loop-- are highly threateni… ▽ More The future power grid will be characterized by the pervasive use of heterogeneous and non-proprietary information and communication technology, which exposes the power grid to a broad scope of cyber-attacks. In particular, Monitoring-Control Attacks (MCA) --i.e., attacks in which adversaries manipulate control decisions by fabricating measurement signals in the feedback loop-- are highly threatening. This is because, MCAs are (i) more likely to happen with greater attack surface and lower cost, (ii) difficult to detect by hiding in measurement signals, and (iii) capable of inflicting severe consequences by coordinating attack resources. To defend against MCAs, we have developed a semantic analysis framework for Intrusion Detection Systems (IDS) in power grids. The framework consists of two parts running in parallel: a Correlation Index Generator (CIG), which indexes correlated MCAs, and a Correlation Knowledge-Base~(CKB), which is updated aperiodically with attacks' Correlation Indices (CI). The framework has the advantage of detecting MCAs and estimating attack consequences with promising runtime and detection accuracy. To evaluate the performance of the framework, we computed its false alarm rates under different attack scenarios. △ Less

Submitted 9 June, 2018; originally announced June 2018.

Comments: 10 pages, 8 figures

arXiv:1803.05005 [pdf, ps, other]

PyMUSE: a Python package for VLT/MUSE data

Authors: Ismael Pessa, Nicolas Tejos, Cristobal Moya

Abstract: This is a companion Focus Demonstration article to the PyMUSE python package, demonstrating its usage and utilities for VLT/MUSE data analysis, that include a wide range of options for spectra extractions, the creation of different types of images, compatibilities with some commonly used software for astronomical data analysis, among others. PyMUSE is an open-source software and can be found on Gi… ▽ More This is a companion Focus Demonstration article to the PyMUSE python package, demonstrating its usage and utilities for VLT/MUSE data analysis, that include a wide range of options for spectra extractions, the creation of different types of images, compatibilities with some commonly used software for astronomical data analysis, among others. PyMUSE is an open-source software and can be found on Github for free use and distribution. △ Less

Submitted 16 March, 2018; v1 submitted 13 March, 2018; originally announced March 2018.

Comments: Proceedings of Astronomical Data Analysis Software and Systems XXVII conference

arXiv:1802.00881 [pdf, other]

A Protection Method in Active Distribution Grids with High Penetration of Renewable Energy Sources

Authors: J. K. Wang, Christian Moya

Abstract: A protection method in active distribution networks is proposed in this paper. In active distribution systems, fault currents flow in multiple directions and presents a varying range of value, which poses a great challenge of maintaining coordination among protective devices on feeders. The proposed protection method addresses this challenge by simultaneously adjusting DG's output power and protec… ▽ More A protection method in active distribution networks is proposed in this paper. In active distribution systems, fault currents flow in multiple directions and presents a varying range of value, which poses a great challenge of maintaining coordination among protective devices on feeders. The proposed protection method addresses this challenge by simultaneously adjusting DG's output power and protection devices' settings in pre-fault networks. Comparing to previous protection solutions, the proposed method considers the influences from renewable DG's intermittency, and explores the economic and protection benefits of DG's active participation. The formulation of proposed method is decomposed into two optimization sub-problems, coupling through the constraint on fuse-recloser coordination. This decomposed mathematical structure effectively extinguishes the non-linearity arising from reclosers' time-current inverse characteristics, and greatly reduces computation efforts. △ Less

Submitted 2 February, 2018; originally announced February 2018.

arXiv:1707.00672 [pdf, ps, other]

Develo** a Correlation Indices to Identify Coordinated Cyber-Attacks on Power Grids

Authors: Christian Moya, Jiankang Wang

Abstract: Increasing reliance on Information and Communication Technology~(ICT) exposes the power grid to cyber-attacks. In particular, Coordinated Cyber-Attacks (CCAs) are considered highly threatening and difficult to defend against, because they (i) possess higher disruptiveness by integrating greater resources from multiple attack entities, and (ii) present heterogeneous traits in cyber-space and the ph… ▽ More Increasing reliance on Information and Communication Technology~(ICT) exposes the power grid to cyber-attacks. In particular, Coordinated Cyber-Attacks (CCAs) are considered highly threatening and difficult to defend against, because they (i) possess higher disruptiveness by integrating greater resources from multiple attack entities, and (ii) present heterogeneous traits in cyber-space and the physical grid by hitting multiple targets to achieve the attack goal. Thus, and as opposed to independent attacks, whose severity is limited by the power grid's redundancy, CCAs could inflict disastrous consequences, such as blackouts. In this paper, we propose a method to develop Correlation Indices to defend against CCAs on static control applications. These proposed indices relate the targets of CCAs with attack goals on the power grid. Compared to related works, the proposed indices present the benefits of deployment simplicity and are capable of detecting more sophisticated attacks, such as measurement attacks. We demonstrate our method using measurement attacks against Security Constrained Economic Dispatch. △ Less

Submitted 29 May, 2018; v1 submitted 3 July, 2017; originally announced July 2017.

Comments: 9 pages, 6 figures

arXiv:1508.00337 [pdf]

doi 10.1021/acs.jpcc.5b07516

Quantification of dipolar interactions in Fe$_{3-x}$O$_4$ nanoparticles

Authors: Carlos Moya, Òscar Iglesias, Xavier Batlle, Amílcar Labarta

Abstract: A general method for the quantification of dipolar interactions in assemblies of nanoparticles has been developed from a model sample constituted by magnetite nanoparticles of 5 nm in diameter, in powder form with oleic acid as a surfactant so that the particles were solely separated from each other through an organic layer of about 1 nm in thickness. This quantification is based on the comparison… ▽ More A general method for the quantification of dipolar interactions in assemblies of nanoparticles has been developed from a model sample constituted by magnetite nanoparticles of 5 nm in diameter, in powder form with oleic acid as a surfactant so that the particles were solely separated from each other through an organic layer of about 1 nm in thickness. This quantification is based on the comparison of the distribution of energy barriers for magnetization reversal obtained from time-dependent relaxation measurements starting from either (i) an almost random orientation of the particles magnetizations or (ii) a collinear arrangement of them prepared by previously field cooling the sample. Experimental results and numerical simulations show that the mean dipolar field acting on each single particle is significantly reduced when particles magnetizations are collinearly aligned. Besides, the intrinsic distribution of the energy barriers of anisotropy for the non-interacting case was evaluated from a reference sample where the same magnetic particles were individually coated with a thick silica shell in order to make dipolar interactions negligible. Interestingly, the results of the numerical simulations account for the relative energy shift of the experimental energy barrier distributions corresponding to the interacting and non-interacting cases, thus supporting the validity of the proposed method for the quantification of dipolar interactions. △ Less

Submitted 3 August, 2015; originally announced August 2015.

Comments: 7 pages, 7 figures, submitted

Journal ref: J. Phys. Chem. C 119, 24142 (2015)

Showing 1–24 of 24 results for author: Moya, C