-
Vortex confinement through an unquantized magnetic flux
Authors:
Geunyong Kim,
**young Yun,
**ho Yang,
Ilkyu Yang,
Dirk Wulferding,
Roman Movshovich,
Gil Young Cho,
Ki-Seok Kim,
Garam Hahn,
Jeehoon Kim
Abstract:
Geometrically confined superconductors often experience a breakdown in the quantization of magnetic flux owing to the incomplete screening of the supercurrent against the field penetration. In this study, we report that the confinement of a magnetic field occurs regardless of the dimensionality of the system, extending even to 1D linear potential systems. By utilizing a vector-field magnetic force…
▽ More
Geometrically confined superconductors often experience a breakdown in the quantization of magnetic flux owing to the incomplete screening of the supercurrent against the field penetration. In this study, we report that the confinement of a magnetic field occurs regardless of the dimensionality of the system, extending even to 1D linear potential systems. By utilizing a vector-field magnetic force microscope, we successfully create a vortex-antivortex pair connected by a 1D unquantized magnetic flux in ultra-thin superconducting films. Through an investigation of the manipulation and thermal behavior of the vortex pair, we uncover a long-range interaction mediated by the unquantized magnetic flux. These findings suggest a universal phenomenon of unquantized magnetic flux formation, independent of the geometry of the system. Our results present an experimental route for probing the impact of confinement on superconducting properties and order parameters in unconventional superconductors characterized by extremely low dimensionality.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
Hot Schrödinger Cat States
Authors:
Ian Yang,
Thomas Agrenius,
Vasilisa Usova,
Oriol Romero-Isart,
Gerhard Kirchmair
Abstract:
The observation of quantum phenomena often necessitates sufficiently pure states, a requirement that can be challenging to achieve. In this study, our goal is to prepare a non-classical state originating from a mixed state, utilizing dynamics that preserve the initial low purity of the state. We generate a quantum superposition of displaced thermal states within a microwave cavity using only unita…
▽ More
The observation of quantum phenomena often necessitates sufficiently pure states, a requirement that can be challenging to achieve. In this study, our goal is to prepare a non-classical state originating from a mixed state, utilizing dynamics that preserve the initial low purity of the state. We generate a quantum superposition of displaced thermal states within a microwave cavity using only unitary interactions with a transmon qubit. We measure the Wigner functions of these ``hot'' Schrödinger cat states for an initial purity as low as 0.06. This corresponds to a cavity mode temperature of up to 1.8 Kelvin, sixty times hotter than the cavity's physical environment. Our realization of highly mixed quantum superposition states could be implemented with other continuous-variable systems e.g. nanomechanical oscillators, for which ground-state cooling remains challenging.
△ Less
Submitted 5 June, 2024;
originally announced June 2024.
-
Wasserstein Distributionally Robust Control and State Estimation for Partially Observable Linear Systems
Authors:
Minhyuk Jang,
Astghik Hakobyan,
Insoon Yang
Abstract:
This paper presents a novel Wasserstein distributionally robust control and state estimation algorithm for partially observable linear stochastic systems, where the probability distributions of disturbances and measurement noises are unknown. Our method consists of the control and state estimation phases to handle distributional ambiguities of system disturbances and measurement noises, respective…
▽ More
This paper presents a novel Wasserstein distributionally robust control and state estimation algorithm for partially observable linear stochastic systems, where the probability distributions of disturbances and measurement noises are unknown. Our method consists of the control and state estimation phases to handle distributional ambiguities of system disturbances and measurement noises, respectively. Leveraging tools from modern distributionally robust optimization, we consider an approximation of the control problem with an arbitrary nominal distribution and derive its closed-form optimal solution. We show that the separation principle holds, thereby allowing the state estimator to be designed separately. A novel distributionally robust Kalman filter is then proposed as an optimal solution to the state estimation problem with Gaussian nominal distributions. Our key contribution is the combination of distributionally robust control and state estimation into a unified algorithm. This is achieved by formulating a tractable semidefinite programming problem that iteratively determines the worst-case covariance matrices of all uncertainties, leading to a scalable and efficient algorithm. Our method is also shown to enjoy a guaranteed cost property as well as a probabilistic out-of-sample performance guarantee. The results of our numerical experiments demonstrate the performance and computational efficiency of the proposed method.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Large Language Models: A New Approach for Privacy Policy Analysis at Scale
Authors:
David Rodriguez,
Ian Yang,
Jose M. Del Alamo,
Norman Sadeh
Abstract:
The number and dynamic nature of web and mobile applications presents significant challenges for assessing their compliance with data protection laws. In this context, symbolic and statistical Natural Language Processing (NLP) techniques have been employed for the automated analysis of these systems' privacy policies. However, these techniques typically require labor-intensive and potentially erro…
▽ More
The number and dynamic nature of web and mobile applications presents significant challenges for assessing their compliance with data protection laws. In this context, symbolic and statistical Natural Language Processing (NLP) techniques have been employed for the automated analysis of these systems' privacy policies. However, these techniques typically require labor-intensive and potentially error-prone manually annotated datasets for training and validation. This research proposes the application of Large Language Models (LLMs) as an alternative for effectively and efficiently extracting privacy practices from privacy policies at scale. Particularly, we leverage well-known LLMs such as ChatGPT and Llama 2, and offer guidance on the optimal design of prompts, parameters, and models, incorporating advanced strategies such as few-shot learning. We further illustrate its capability to detect detailed and varied privacy practices accurately. Using several renowned datasets in the domain as a benchmark, our evaluation validates its exceptional performance, achieving an F1 score exceeding 93%. Besides, it does so with reduced costs, faster processing times, and fewer technical knowledge requirements. Consequently, we advocate for LLM-based solutions as a sound alternative to traditional NLP techniques for the automated analysis of privacy policies at scale.
△ Less
Submitted 31 May, 2024;
originally announced May 2024.
-
Approximate Thompson Sampling for Learning Linear Quadratic Regulators with $O(\sqrt{T})$ Regret
Authors:
Yeoneung Kim,
Gihun Kim,
Insoon Yang
Abstract:
We propose an approximate Thompson sampling algorithm that learns linear quadratic regulators (LQR) with an improved Bayesian regret bound of $O(\sqrt{T})$. Our method leverages Langevin dynamics with a meticulously designed preconditioner as well as a simple excitation mechanism. We show that the excitation signal induces the minimum eigenvalue of the preconditioner to grow over time, thereby acc…
▽ More
We propose an approximate Thompson sampling algorithm that learns linear quadratic regulators (LQR) with an improved Bayesian regret bound of $O(\sqrt{T})$. Our method leverages Langevin dynamics with a meticulously designed preconditioner as well as a simple excitation mechanism. We show that the excitation signal induces the minimum eigenvalue of the preconditioner to grow over time, thereby accelerating the approximate posterior sampling process. Moreover, we identify nontrivial concentration properties of the approximate posteriors generated by our algorithm. These properties enable us to bound the moments of the system state and attain an $O(\sqrt{T})$ regret bound without the unrealistic restrictive assumptions on parameter sets that are often used in the literature.
△ Less
Submitted 28 May, 2024;
originally announced May 2024.
-
MentalManip: A Dataset For Fine-grained Analysis of Mental Manipulation in Conversations
Authors:
Yuxin Wang,
Ivory Yang,
Saeed Hassanpour,
Soroush Vosoughi
Abstract:
Mental manipulation, a significant form of abuse in interpersonal conversations, presents a challenge to identify due to its context-dependent and often subtle nature. The detection of manipulative language is essential for protecting potential victims, yet the field of Natural Language Processing (NLP) currently faces a scarcity of resources and research on this topic. Our study addresses this ga…
▽ More
Mental manipulation, a significant form of abuse in interpersonal conversations, presents a challenge to identify due to its context-dependent and often subtle nature. The detection of manipulative language is essential for protecting potential victims, yet the field of Natural Language Processing (NLP) currently faces a scarcity of resources and research on this topic. Our study addresses this gap by introducing a new dataset, named ${\rm M{\small ental}M{\small anip}}$, which consists of $4,000$ annotated movie dialogues. This dataset enables a comprehensive analysis of mental manipulation, pinpointing both the techniques utilized for manipulation and the vulnerabilities targeted in victims. Our research further explores the effectiveness of leading-edge models in recognizing manipulative dialogue and its components through a series of experiments with various configurations. The results demonstrate that these models inadequately identify and categorize manipulative content. Attempts to improve their performance by fine-tuning with existing datasets on mental health and toxicity have not overcome these limitations. We anticipate that ${\rm M{\small ental}M{\small anip}}$ will stimulate further research, leading to progress in both understanding and mitigating the impact of mental manipulation in conversations.
△ Less
Submitted 26 May, 2024;
originally announced May 2024.
-
Empowering Federated Learning for Massive Models with NVIDIA FLARE
Authors:
Holger R. Roth,
Ziyue Xu,
Yuan-Ting Hsieh,
Adithya Renduchintala,
Isaac Yang,
Zhihong Zhang,
Yuhong Wen,
Sean Yang,
Kevin Lu,
Kristopher Kersten,
Camir Ricketts,
Daguang Xu,
Chester Chen,
Yan Cheng,
Andrew Feng
Abstract:
In the ever-evolving landscape of artificial intelligence (AI) and large language models (LLMs), handling and leveraging data effectively has become a critical challenge. Most state-of-the-art machine learning algorithms are data-centric. However, as the lifeblood of model performance, necessary data cannot always be centralized due to various factors such as privacy, regulation, geopolitics, copy…
▽ More
In the ever-evolving landscape of artificial intelligence (AI) and large language models (LLMs), handling and leveraging data effectively has become a critical challenge. Most state-of-the-art machine learning algorithms are data-centric. However, as the lifeblood of model performance, necessary data cannot always be centralized due to various factors such as privacy, regulation, geopolitics, copyright issues, and the sheer effort required to move vast datasets. In this paper, we explore how federated learning enabled by NVIDIA FLARE can address these challenges with easy and scalable integration capabilities, enabling parameter-efficient and full supervised fine-tuning of LLMs for natural language processing and biopharmaceutical applications to enhance their accuracy and robustness.
△ Less
Submitted 12 February, 2024;
originally announced February 2024.
-
Maximizing Consistent Force Output for Shape Memory Alloy Artificial Muscles in Soft Robots
Authors:
Meredith L. Anderson,
Ran **g,
Juan C. Pacheco Garcia,
Ilyoung Yang,
Sarah Alizadeh-Shabdiz,
Charles DeLorey,
Andrew P. Sabelhaus
Abstract:
Soft robots have immense potential given their inherent safety and adaptability, but challenges in soft actuator forces and design constraints have limited scaling up soft robots to larger sizes. Electrothermal shape memory alloy (SMA) artificial muscles have the potential to create these large forces and high displacements, but consistently using these muscles under a well-defined model, in-situ…
▽ More
Soft robots have immense potential given their inherent safety and adaptability, but challenges in soft actuator forces and design constraints have limited scaling up soft robots to larger sizes. Electrothermal shape memory alloy (SMA) artificial muscles have the potential to create these large forces and high displacements, but consistently using these muscles under a well-defined model, in-situ in a soft robot, remains an open challenge. This article provides a system for maintaining the highest-possible consistent SMA forces, over long lifetimes, by combining a fatigue testing protocol with a supervisory control system for the muscles' internal temperature state. We propose a design of a soft limb with swap-able SMA muscles, and deploy the limb in a blocked-force test to quantify the relationship between the measured maximum force at different temperatures over different lifetimes. Then, by applying an invariance-based control system to maintain temperatures under our long-life limit, we demonstrate consistent high forces in a practical task over hundreds of cycles. The method we developed allows for practical implementation of SMAs in soft robots through characterizing and controlling their behavior in-situ, and provides a method to impose limits that maximize their consistent, repeatable behavior.
△ Less
Submitted 9 February, 2024;
originally announced February 2024.
-
Computational Fluid Dynamics: its Carbon Footprint and Role in Carbon Emission Reduction
Authors:
Xiang I A Yang,
Wen Zhang,
Mahdi Abkar,
William Anderson
Abstract:
Turbulent flow physics regulates the aerodynamic properties of lifting surfaces, the thermodynamic efficiency of vapor power systems, and exchanges of natural and anthropogenic quantities between the atmosphere and ocean, to name just a few applications. The dynamics of turbulent flows are described via numerical integration of the non-linear Navier-Stokes equation -- a procedure known as computat…
▽ More
Turbulent flow physics regulates the aerodynamic properties of lifting surfaces, the thermodynamic efficiency of vapor power systems, and exchanges of natural and anthropogenic quantities between the atmosphere and ocean, to name just a few applications. The dynamics of turbulent flows are described via numerical integration of the non-linear Navier-Stokes equation -- a procedure known as computational fluid dynamics (CFD). At the dawn of scientific computing in the late 1950s, it would be many decades before terms such as ``carbon footprint'' or ``sustainability'' entered the lexicon, and longer still before these themes attained national priority throughout advanced economies. This paper introduces a framework designed to calculate the carbon footprint of CFD and its contribution to carbon emission reduction strategies. We will distinguish between "hero" and "routine" calculations, noting that the carbon footprint of hero calculations is largely determined by the energy source mix utilized. We will also review CFD of flows where turbulence effects are modeled, thus reducing the degrees of freedom. Estimates of the carbon footprint are presented for such fully- and partially-resolved simulations as functions of turbulence activity and calculation year, demonstrating a reduction in carbon emissions by two to five orders of magnitude at practical conditions. Beyond analyzing CO2 emissions, we quantify the benefits of applying CFD towards overall carbon emission reduction. The community's effort to avoid redundant calculations via turbulence databases merits particular attention, with estimates indicating that a single database could potentially reduce CO2 emissions by approximately O(1) million metric tons. Additionally, implementing CFD in the fluids industry has markedly decreased dependence on wind tunnel testing, which is anticipated to lead to CO2 emission reduction.
△ Less
Submitted 8 February, 2024;
originally announced February 2024.
-
Generating High-Precision Force Fields for Molecular Dynamics Simulations to Study Chemical Reaction Mechanisms using Molecular Configuration Transformer
Authors:
Sihao Yuan,
Xu Han,
Jun Zhang,
Zhaoxin Xie,
Cheng Fan,
Yunlong Xiao,
Yi Qin Gao,
Yi Isaac Yang
Abstract:
Theoretical studies on chemical reaction mechanisms have been crucial in organic chemistry. Traditionally, calculating the manually constructed molecular conformations of transition states for chemical reactions using quantum chemical calculations is the most commonly used method. However, this way is heavily dependent on individual experience and chemical intuition. In our previous study, we prop…
▽ More
Theoretical studies on chemical reaction mechanisms have been crucial in organic chemistry. Traditionally, calculating the manually constructed molecular conformations of transition states for chemical reactions using quantum chemical calculations is the most commonly used method. However, this way is heavily dependent on individual experience and chemical intuition. In our previous study, we proposed a research paradigm that uses enhanced sampling in molecular dynamics simulations to study chemical reactions. This approach can directly simulate the entire process of a chemical reaction. However, the computational speed limits the use of high-precision potential energy functions for simulations. To address this issue, we present a scheme for training high-precision force fields for molecular modeling using a previously developed graph-neural-network-based molecular model, molecular configuration transformer. This potential energy function allows for highly accurate simulations at a low computational cost, leading to more precise calculations of the mechanism of chemical reactions. We applied this approach to study a Claisen rearrangement reaction and a Carbonyl insertion reaction catalyzed by Manganese.
△ Less
Submitted 11 April, 2024; v1 submitted 31 December, 2023;
originally announced January 2024.
-
On Task-Relevant Loss Functions in Meta-Reinforcement Learning and Online LQR
Authors:
Jaeuk Shin,
Giho Kim,
Howon Lee,
Joonho Han,
Insoon Yang
Abstract:
Designing a competent meta-reinforcement learning (meta-RL) algorithm in terms of data usage remains a central challenge to be tackled for its successful real-world applications. In this paper, we propose a sample-efficient meta-RL algorithm that learns a model of the system or environment at hand in a task-directed manner. As opposed to the standard model-based approaches to meta-RL, our method e…
▽ More
Designing a competent meta-reinforcement learning (meta-RL) algorithm in terms of data usage remains a central challenge to be tackled for its successful real-world applications. In this paper, we propose a sample-efficient meta-RL algorithm that learns a model of the system or environment at hand in a task-directed manner. As opposed to the standard model-based approaches to meta-RL, our method exploits the value information in order to rapidly capture the decision-critical part of the environment. The key component of our method is the loss function for learning the task inference module and the system model that systematically couples the model discrepancy and the value estimate, thereby facilitating the learning of the policy and the task inference module with a significantly smaller amount of data compared to the existing meta-RL algorithms. The idea is also extended to a non-meta-RL setting, namely an online linear quadratic regulator (LQR) problem, where our method can be simplified to reveal the essence of the strategy. The proposed method is evaluated in high-dimensional robotic control and online LQR problems, empirically verifying its effectiveness in extracting information indispensable for solving the tasks from observations in a sample efficient manner.
△ Less
Submitted 8 December, 2023;
originally announced December 2023.
-
Incorporating basic calibrations in existing machine-learned turbulence modeling
Authors:
Jiaqi J. L. Li,
Yuanwei Bin,
George P. Huang,
Xiang I. A. Yang
Abstract:
This work aims to incorporate basic calibrations of Reynolds-averaged Navier-Stokes (RANS) models as part of machine learning (ML) frameworks. The ML frameworks considered are tensor-basis neural network (TBNN), physics-informed machine learning (PIML), and field inversion & machine learning (FIML) in J. Fluid Mech., 2016, 807, 155-166, Phys. Rev. Fluids, 2017, 2(3), 034603 and J. Comp. Phys., 201…
▽ More
This work aims to incorporate basic calibrations of Reynolds-averaged Navier-Stokes (RANS) models as part of machine learning (ML) frameworks. The ML frameworks considered are tensor-basis neural network (TBNN), physics-informed machine learning (PIML), and field inversion & machine learning (FIML) in J. Fluid Mech., 2016, 807, 155-166, Phys. Rev. Fluids, 2017, 2(3), 034603 and J. Comp. Phys., 2016, 305, 758-774, and the baseline RANS models are the one-equation Spalart-Allmaras model, the two-equation $k$-$ω$ model, and the seven-equation Reynolds stress transport models. ML frameworks are trained against plane channel flow and shear-layer flow data. We compare the ML frameworks and study whether the machine-learned augmentations are detrimental outside the training set. The findings are summarized as follows. The augmentations due to TBNN are detrimental. PIML leads to augmentations that are beneficial inside the training dataset but detrimental outside it. These results are not affected by the baseline RANS model. FIML's augmentations to the two eddy viscosity models, where an inner-layer treatment already exists, are largely neutral. Its augmentation to the seven-equation model, where an inner-layer treatment does not exist, improves the mean flow prediction in a channel. Furthermore, these FIML augmentations are mostly non-detrimental outside the training dataset. In addition to reporting these results, the paper offers physical explanations of the results. Last, we note that the conclusions drawn here are confined to the ML frameworks and the flows considered in this study. More detailed comparative studies and validation & verification studies are needed to account for developments in recent years.
△ Less
Submitted 14 November, 2023; v1 submitted 6 November, 2023;
originally announced November 2023.
-
Risk-Aware Wasserstein Distributionally Robust Control of Vessels in Natural Waterways
Authors:
Juan Moreno Nadales,
Astghik Hakobyan,
David Muñoz de la Peña,
Daniel Limon,
Insoon Yang
Abstract:
In the realm of maritime transportation, autonomous vessel navigation in natural inland waterways faces persistent challenges due to unpredictable natural factors. Existing scheduling algorithms fall short in handling these uncertainties, compromising both safety and efficiency. Moreover, these algorithms are primarily designed for non-autonomous vessels, leading to labor-intensive operations vuln…
▽ More
In the realm of maritime transportation, autonomous vessel navigation in natural inland waterways faces persistent challenges due to unpredictable natural factors. Existing scheduling algorithms fall short in handling these uncertainties, compromising both safety and efficiency. Moreover, these algorithms are primarily designed for non-autonomous vessels, leading to labor-intensive operations vulnerable to human error. To address these issues, this study proposes a risk-aware motion control approach for vessels that accounts for the dynamic and uncertain nature of tide islands in a distributionally robust manner. Specifically, a model predictive control method is employed to follow the reference trajectory in the time-space map while incorporating a risk constraint to prevent grounding accidents. To address uncertainties in tide islands, a novel modeling technique represents them as stochastic polytopes. Additionally, potential inaccuracies in waterway depth are addressed through a risk constraint that considers the worst-case uncertainty distribution within a Wasserstein ambiguity set around the empirical distribution. Using sensor data collected in the Guadalquivir River, we empirically demonstrate the performance of the proposed method through simulations on a vessel. As a result, the vessel successfully navigates the waterway while avoiding grounding accidents, even with a limited dataset of observations. This stands in contrast to existing non-robust controllers, highlighting the robustness and practical applicability of the proposed approach.
△ Less
Submitted 21 October, 2023;
originally announced October 2023.
-
Constrained re-calibration of Reynolds-averaged Navier-Stokes models
Authors:
Yuanwei Bin,
George Huang,
Robert Kunz,
Xiang I A Yang
Abstract:
The constants and functions in Reynolds-averaged Navier Stokes (RANS) turbulence models are coupled. Consequently, modifications of a RANS model often negatively impact its basic calibrations, which is why machine-learned augmentations are often detrimental outside the training dataset. A solution to this is to identify the degrees of freedom that do not affect the basic calibrations and only modi…
▽ More
The constants and functions in Reynolds-averaged Navier Stokes (RANS) turbulence models are coupled. Consequently, modifications of a RANS model often negatively impact its basic calibrations, which is why machine-learned augmentations are often detrimental outside the training dataset. A solution to this is to identify the degrees of freedom that do not affect the basic calibrations and only modify these identified degrees of freedom when re-calibrating the baseline model to accommodate a specific application. This approach is colloquially known as the "rubber-band" approach, which we formally call "constrained model re-calibration" in this article. To illustrate the efficacy of the approach, we identify the degrees of freedom in the Spalart-Allmaras (SA) model that do not affect the log law calibration. By subsequently interfacing data-based methods with these degrees of freedom, we train models to solve historically challenging flow scenarios, including the round-jet/plane-jet anomaly, airfoil stall, secondary flow separation, and recovery after separation. In addition to good performance inside the training dataset, the trained models yield similar performance as the baseline model outside the training dataset.
△ Less
Submitted 13 October, 2023;
originally announced October 2023.
-
Large-eddy simulation of separated flows on unconventionally coarse grids
Authors:
Yuanwei Bin,
George I. Park,
Yu Lv,
Xiang I. A. Yang
Abstract:
We examine and benchmark the emerging idea of applying the large-eddy simulation (LES) formalism to unconventionally coarse grids where RANS would be considered more appropriate at first glance. We distinguish this idea from very-large-eddy-simulation (VLES) and detached-eddy-simulation (DES), which require switching between RANS and LES formalism. LES on RANS grid is appealing because first, it r…
▽ More
We examine and benchmark the emerging idea of applying the large-eddy simulation (LES) formalism to unconventionally coarse grids where RANS would be considered more appropriate at first glance. We distinguish this idea from very-large-eddy-simulation (VLES) and detached-eddy-simulation (DES), which require switching between RANS and LES formalism. LES on RANS grid is appealing because first, it requires minimal changes to a production code; second, it is more cost-effective than LES; third, it converges to LES; and most importantly, it accurately predicts flows with separation. This work quantifies the benefit of LES on RANS-like grids as compared to RANS on the same grids. Three canonical cases are considered: periodic hill, backward-facing step, and jet in cross flow. We conduct direct numerical simulation (DNS), proper LES on LES grids, LES on RANS-quality grids, and RANS. We show that while the LES solutions on the RANS-quality grids are not grid converged, they are twice as accurate as the RANS on the same grids.
△ Less
Submitted 13 October, 2023;
originally announced October 2023.
-
A priori screening of data-enabled turbulence models
Authors:
Peng E S Chen,
Yuanwei Bin,
Xiang I A Yang,
Yipeng Shi,
Mahdi Abkar,
George I. Park
Abstract:
Assessing the compliance of a white-box turbulence model with known turbulent knowledge is straightforward. It enables users to screen conventional turbulence models and identify apparent inadequacies, thereby allowing for a more focused and fruitful validation and verification. However, comparing a black-box machine-learning model to known empirical scalings is not straightforward. Unless one imp…
▽ More
Assessing the compliance of a white-box turbulence model with known turbulent knowledge is straightforward. It enables users to screen conventional turbulence models and identify apparent inadequacies, thereby allowing for a more focused and fruitful validation and verification. However, comparing a black-box machine-learning model to known empirical scalings is not straightforward. Unless one implements and tests the model, it would not be clear if a machine-learning model, trained at finite Reynolds numbers preserves the known high Reynolds number limit. This is inconvenient, particularly because model implementation involves retraining and re-interfacing. This work attempts to address this issue, allowing fast a priori screening of machine-learning models that are based on feed-forward neural networks (FNN). The method leverages the mathematical theorems we present in the paper. These theorems offer estimates of a network's limits even when the exact weights and biases are unknown. For demonstration purposes, we screen existing machine-learning wall models and RANS models for their compliance with the log layer physics and the viscous layer physics in a priori manner. In addition, the theorems serve as essential guidelines for future machine-learning models.
△ Less
Submitted 13 October, 2023;
originally announced October 2023.
-
Progressive augmentation of Reynolds stress tensor models for secondary flow prediction by computational fluid dynamics driven surrogate optimisation
Authors:
M. J. Rincón,
A. Amarloo,
M. Reclari,
X. I. A. Yang,
M. Abkar
Abstract:
Generalisability and the consistency of the a posteriori results are the most critical points of view regarding data-driven turbulence models. This study presents a progressive improvement of turbulence models using simulation-driven surrogate optimisation based on Kriging. We aim for the augmentation of secondary-flow reconstruction capability in a linear eddy-viscosity model without violating it…
▽ More
Generalisability and the consistency of the a posteriori results are the most critical points of view regarding data-driven turbulence models. This study presents a progressive improvement of turbulence models using simulation-driven surrogate optimisation based on Kriging. We aim for the augmentation of secondary-flow reconstruction capability in a linear eddy-viscosity model without violating its original performance on canonical cases e.g. channel flow. Explicit algebraic Reynolds stress correction models (EARSCMs) for $k-ω$ SST turbulence model are obtained to predict the secondary flow which the standard model fails to capture. The optimisation of the models is achieved by a multi-objective approach based on duct flow quantities, and numerical verification of the developed models is performed for various test cases. The results of testing new models on channel flow cases guarantee that new models preserve the performance of the original $k-ω$ SST model. Regarding the generalisability of the new models, results of unseen test cases demonstrate a significant improvement in the prediction of secondary flows and streamwise velocity. These results highlight the potential of the progressive approach to enhance the performance of data-driven turbulence models for fluid flow simulation while preserving the robustness and stability of the solver.
△ Less
Submitted 3 November, 2023; v1 submitted 24 August, 2023;
originally announced August 2023.
-
Elastic Modulus of Polycrystalline Halide Perovskite Thin Films on Substrates
Authors:
Madhuja Layek,
In Seok Yang,
Zhenghong Dai,
Anush Ranka,
Truong Cai,
Brian W. Sheldon,
Eric Chason,
Nitin P. Padture
Abstract:
Using an innovative combination of multi-beam-optical stress-sensor (MOSS) curvature and X-ray diffraction (XRD) techniques, the Young's modulus (E) of polycrystalline MAPbI3 metal-halide perovskite (MHP) thin films attached to Si substrates is estimated to be 10.2 +/- 3.4 GPa. This is comparable to the E of corresponding MAPbI3 single-crystals. This generic method could be applied to other system…
▽ More
Using an innovative combination of multi-beam-optical stress-sensor (MOSS) curvature and X-ray diffraction (XRD) techniques, the Young's modulus (E) of polycrystalline MAPbI3 metal-halide perovskite (MHP) thin films attached to Si substrates is estimated to be 10.2 +/- 3.4 GPa. This is comparable to the E of corresponding MAPbI3 single-crystals. This generic method could be applied to other systems to estimate hard-to-measure E of thin films.
△ Less
Submitted 23 October, 2023; v1 submitted 13 July, 2023;
originally announced July 2023.
-
Extension of the law of the wall exploiting weak similarity of velocity fluctuations in turbulent channels
Authors:
Christoffer Hansen,
Jens N. Sørensen,
Xiang I. A. Yang,
Mahdi Abkar
Abstract:
This paper explores the similarity of the streamwise velocity fluctuations in a channel. In the analysis, we employ a one-dimensional scalar variant of the proper orthogonal decomposition (POD). This approach naturally motivates the introduction of two different levels of similarity which we will refer to as strong and weak similarity. Strong similarity requires that the two-point correlation, and…
▽ More
This paper explores the similarity of the streamwise velocity fluctuations in a channel. In the analysis, we employ a one-dimensional scalar variant of the proper orthogonal decomposition (POD). This approach naturally motivates the introduction of two different levels of similarity which we will refer to as strong and weak similarity. Strong similarity requires that the two-point correlation, and thus, all POD modes, show Reynolds number similarity, while weak similarity only requires that the first few POD modes show similarity. As POD concerns information at more than one location, these similarities are more general than various similarities found in the literature concerning single-point flow statistics. We examine flows at $Re_τ=$180, 540, 1000, and 5200. Strong similarity is observed in the viscous layer and the wake region, and weak similarity is found in both the viscous wall region and the outer part of the logarithmic layer. The presence of weak similarity suggests the existence of an extension to the law of the wall (LoW). We propose such an extension based on the results from the one-dimensional POD analysis. The usefulness of the LoW extension is then assessed by comparing flow reconstructions according to the conventional equilibrium LoW and the extended LoW. We show that the extended LoW provides accurate flow reconstructions in the wall layer, capturing fine-scale motions that are entirely missed by the equilibrium LoW.
△ Less
Submitted 29 December, 2023; v1 submitted 29 June, 2023;
originally announced June 2023.
-
A Generalized Nucleation Theory for Ice Crystalline
Authors:
Maodong Li,
Yupeng Huang,
Yijie Xia,
Dechin Chen,
Cheng Fan,
Lijiang Yang,
Yi Qin Gao,
Yi Isaac Yang
Abstract:
Despite the simplicity of the water molecule, the kinetics of ice nucleation under natural conditions can be complex. We investigated spontaneously grown ice nuclei using all-atom molecular dynamics simulations and found significant differences between the kinetics of ice formation through spontaneously formed and ideal nuclei. Since classical nucleation theory can only provide a good description…
▽ More
Despite the simplicity of the water molecule, the kinetics of ice nucleation under natural conditions can be complex. We investigated spontaneously grown ice nuclei using all-atom molecular dynamics simulations and found significant differences between the kinetics of ice formation through spontaneously formed and ideal nuclei. Since classical nucleation theory can only provide a good description of ice nucleation in ideal conditions, we propose a generalized nucleation theory that can better characterize the kinetics of ice crystal nucleation in general conditions. This study provides an explanation on why previous experimental and computational studies have yielded widely varying critical nucleation sizes.
△ Less
Submitted 9 April, 2024; v1 submitted 9 June, 2023;
originally announced June 2023.
-
Distributionally Robust Differential Dynamic Programming with Wasserstein Distance
Authors:
Astghik Hakobyan,
Insoon Yang
Abstract:
Differential dynamic programming (DDP) is a popular technique for solving nonlinear optimal control problems with locally quadratic approximations. However, existing DDP methods are not designed for stochastic systems with unknown disturbance distributions. To address this limitation, we propose a novel DDP method that approximately solves the Wasserstein distributionally robust control (WDRC) pro…
▽ More
Differential dynamic programming (DDP) is a popular technique for solving nonlinear optimal control problems with locally quadratic approximations. However, existing DDP methods are not designed for stochastic systems with unknown disturbance distributions. To address this limitation, we propose a novel DDP method that approximately solves the Wasserstein distributionally robust control (WDRC) problem, where the true disturbance distribution is unknown but a disturbance sample dataset is given. Our approach aims to develop a practical and computationally efficient DDP solution. To achieve this, we use the Kantrovich duality principle to decompose the value function in a novel way and derive closed-form expressions of the distributionally robust control and worst-case distribution policies to be used in each iteration of our DDP algorithm. This characterization makes our method tractable and scalable without the need for numerically solving any minimax optimization problems. The superior out-of-sample performance and scalability of our algorithm are demonstrated through kinematic car navigation and coupled oscillator problems.
△ Less
Submitted 16 May, 2023;
originally announced May 2023.
-
Searching for anomalous quartic gauge couplings at muon colliders using principle component analysis
Authors:
Yi-Fei Dong,
Ying-Chen Mao,
i-Chong Yang
Abstract:
Searching for new physics~(NP) is one of the areas of high-energy physics that requires the most processing of large amounts of data. At the same time, quantum computing has huge potential advantages when dealing with large amounts of data. The principal component analysis~(PCA) algorithm may be one of the bridges connecting these two aspects. On the one hand, it can be used for anomaly detection,…
▽ More
Searching for new physics~(NP) is one of the areas of high-energy physics that requires the most processing of large amounts of data. At the same time, quantum computing has huge potential advantages when dealing with large amounts of data. The principal component analysis~(PCA) algorithm may be one of the bridges connecting these two aspects. On the one hand, it can be used for anomaly detection, and on the other hand, there are corresponding quantum algorithms for PCA. In this paper, we investigate how to use PCA to search for NP. Taking the example of anomalous quartic gauge couplings in the tri-photon process at muon colliders, we find that PCA can be used to search for NP. Compared with the traditional event selection strategy, the expected constraints on the operator coefficients obtained by PCA based event selection strategy are even better.
△ Less
Submitted 20 October, 2023; v1 submitted 3 April, 2023;
originally announced April 2023.
-
Log-law recovery through reinforcement-learning wall model for large-eddy simulation
Authors:
Aurélien Vadrot,
Xiang I. A. Yang,
H. Jane Bae,
Mahdi Abkar
Abstract:
This paper focuses on the use of reinforcement learning (RL) as a machine-learning (ML) modeling tool for near-wall turbulence. RL has demonstrated its effectiveness in solving high-dimensional problems, especially in domains such as games. Despite its potential, RL is still not widely used for turbulence modeling and is primarily used for flow control and optimization purposes. A new RL wall mode…
▽ More
This paper focuses on the use of reinforcement learning (RL) as a machine-learning (ML) modeling tool for near-wall turbulence. RL has demonstrated its effectiveness in solving high-dimensional problems, especially in domains such as games. Despite its potential, RL is still not widely used for turbulence modeling and is primarily used for flow control and optimization purposes. A new RL wall model (WM) called VYBA23 is developed in this work, which uses agents dispersed in the flow near the wall. The model is trained on a single Reynolds number ($Re_τ= 10^4$) and does not rely on high-fidelity data, as the back-propagation process is based on a reward rather than output error. The states of the RLWM, which are the representation of the environment by the agents, are normalized to remove dependence on the Reynolds number. The model is tested and compared to another RLWM (BK22) and to an equilibrium wall model, in a half-channel flow at eleven different Reynolds numbers ($Re_τ\in [180;10^{10}]$). The effects of varying agents' parameters such as actions range, time-step, and spacing are also studied. The results are promising, showing little effect on the average flow field but some effect on wall-shear stress fluctuations and velocity fluctuations. This work offers positive prospects for develo** RLWMs that can recover physical laws, and for extending this type of ML models to more complex flows in the future.
△ Less
Submitted 2 May, 2023; v1 submitted 28 February, 2023;
originally announced February 2023.
-
POD-mode-augmented wall model and its applications to flows at non-equilibrium conditions
Authors:
Christoffer Hansen,
Xiang IA Yang,
Mahdi Abkar
Abstract:
Insights gained from modal analysis are invoked for predictive large-eddy simulation (LES) wall modeling. Specifically, we augment the law of the wall (LoW) by an additional mode based on a one-dimensional proper orthogonal decomposition (POD) applied to a 2D turbulent channel. The constructed wall model contains two modes, i.e., the LoW mode and the POD-based mode, and the model matches with the…
▽ More
Insights gained from modal analysis are invoked for predictive large-eddy simulation (LES) wall modeling. Specifically, we augment the law of the wall (LoW) by an additional mode based on a one-dimensional proper orthogonal decomposition (POD) applied to a 2D turbulent channel. The constructed wall model contains two modes, i.e., the LoW mode and the POD-based mode, and the model matches with the LES at two, instead of one, off-wall locations. To show that the proposed model captures non-equilibrium effects, we perform a-priori and a-posteriori tests in the context of both equilibrium and non-equilibrium flows. The a-priori tests show that the proposed wall model captures extreme wall-shear stress events better than the equilibrium wall model. The model also captures non-equilibrium effects due to adverse pressure gradients. The a-posteriori tests show that the wall model captures the rapid decrease and the initial decrease of the streamwise wall-shear stress in channels subjected to suddenly imposed adverse and transverse pressure gradients, respectively, both of which are missed by currently available wall models. These results show promise in applying modal analysis for turbulence wall modeling. In particular, the results show that employing multiple modes helps in the modeling of non-equilibrium flows.
△ Less
Submitted 16 October, 2023; v1 submitted 17 January, 2023;
originally announced January 2023.
-
Unifying Nesterov's Accelerated Gradient Methods for Convex and Strongly Convex Objective Functions: From Continuous-Time Dynamics to Discrete-Time Algorithms
Authors:
Jungbin Kim,
Insoon Yang
Abstract:
Although Nesterov's accelerated gradient (NAG) methods have been studied from various perspectives, it remains unclear why the most popular forms of NAG must handle convex and strongly convex objective functions separately. Motivated by this inconsistency, we propose an NAG method that unifies the existing ones for the convex and strongly convex cases. We first design a Lagrangian function that co…
▽ More
Although Nesterov's accelerated gradient (NAG) methods have been studied from various perspectives, it remains unclear why the most popular forms of NAG must handle convex and strongly convex objective functions separately. Motivated by this inconsistency, we propose an NAG method that unifies the existing ones for the convex and strongly convex cases. We first design a Lagrangian function that continuously extends the first Bregman Lagrangian to the strongly convex setting. As a specific case of the Euler--Lagrange equation for this Lagrangian, we derive an ordinary differential equation (ODE) model, which we call the unified NAG ODE, that bridges the gap between the ODEs that model NAG for convex and strongly convex objective functions. We then design the unified NAG, a novel momentum method whereby the continuous-time limit corresponds to the unified ODE. The coefficients and the convergence rates of the unified NAG and unified ODE are continuous in the strong convexity parameter $μ$ on $[0, +\infty)$. Unlike the existing popular algorithm and ODE for strongly convex objective functions, the unified NAG and the unified NAG ODE always have superior convergence guarantees compared to the known algorithms and ODEs for non-strongly convex objective functions. This property is beneficial in practical perspective when considering strongly convex objective functions with small $μ$. Furthermore, we extend our unified dynamics and algorithms to the higher-order setting. Last but not least, we propose the unified NAG-G ODE, a novel ODE model for minimizing the gradient norm of strongly convex objective functions. Our unified Lagrangian framework is crucial in the process of constructing this ODE. Fascinatingly, using our novel tool, called the differential kernel, we observe that the unified NAG ODE and the unified NAG-G ODE have an anti-transpose relationship.
△ Less
Submitted 9 January, 2023;
originally announced January 2023.
-
Using affine policies to reformulate two-stage Wasserstein distributionally robust linear programs to be independent of sample size
Authors:
Youngchae Cho,
Insoon Yang
Abstract:
Intensively studied in theory as a promising data-driven tool for decision-making under ambiguity, two-stage distributionally robust optimization (DRO) problems over Wasserstein balls are not necessarily easy to solve in practice. This is partly due to large sample size. In this article, we study a generic two-stage distributionally robust linear program (2-DRLP) over a 1-Wasserstein ball using an…
▽ More
Intensively studied in theory as a promising data-driven tool for decision-making under ambiguity, two-stage distributionally robust optimization (DRO) problems over Wasserstein balls are not necessarily easy to solve in practice. This is partly due to large sample size. In this article, we study a generic two-stage distributionally robust linear program (2-DRLP) over a 1-Wasserstein ball using an affine policy. The 2-DRLP has right-hand-side uncertainty with a rectangular support. Our main contribution is to show that the 2-DRLP problem has a tractable reformulation with a scale independent of sample size. The reformulated problem can be solved within a pre-defined optimality tolerance using robust optimization techniques. To reduce the inevitable conservativeness of the affine policy while preserving independence of sample size, we further develop a method for constructing an uncertainty set with a probabilistic guarantee over which the Wasserstein ball is re-defined. As an application, we present a novel unit commitment model for power systems under uncertainty of renewable energy generation to examine the effectiveness of the proposed 2-DRLP technique. Extensive numerical experiments demonstrate that our model leads to better out-of-sample performance on average than other state-of-the-art distributionally robust unit commitment models while staying computationally competent.
△ Less
Submitted 31 December, 2022;
originally announced January 2023.
-
Wasserstein Distributionally Robust Control of Partially Observable Linear Stochastic Systems
Authors:
Astghik Hakobyan,
Insoon Yang
Abstract:
Distributionally robust control (DRC) aims to effectively manage distributional ambiguity in stochastic systems. While most existing works address inaccurate distributional information in fully observable settings, we consider a partially observable DRC problem for discrete-time linear systems using the Wasserstein metric. For a tractable solution, we propose a novel approximation method exploitin…
▽ More
Distributionally robust control (DRC) aims to effectively manage distributional ambiguity in stochastic systems. While most existing works address inaccurate distributional information in fully observable settings, we consider a partially observable DRC problem for discrete-time linear systems using the Wasserstein metric. For a tractable solution, we propose a novel approximation method exploiting the Gelbrich bound of Wasserstein distance. Using techniques from modern distributionally robust optimization, we derive a closed-form expression for the optimal control policy and a tractable semidefinite programming problem for the worst-case distribution policy in both finite-horizon and infinite-horizon average-cost settings. The proposed method features several salient theoretical properties, such as a guaranteed cost property and a probabilistic out-of-sample performance guarantee, demonstrating the distributional robustness of our controller. Furthermore, the resulting controller is shown to ensure the closed-loop stability of the mean-state system. The empirical performance of our method is tested through numerical experiments on a power system frequency control problem.
△ Less
Submitted 21 December, 2022; v1 submitted 8 December, 2022;
originally announced December 2022.
-
Anderson Acceleration for Partially Observable Markov Decision Processes: A Maximum Entropy Approach
Authors:
Mingyu Park,
Jaeuk Shin,
Insoon Yang
Abstract:
Partially observable Markov decision processes (POMDPs) is a rich mathematical framework that embraces a large class of complex sequential decision-making problems under uncertainty with limited observations. However, the complexity of POMDPs poses various computational challenges, motivating the need for an efficient algorithm that rapidly finds a good enough suboptimal solution. In this paper, w…
▽ More
Partially observable Markov decision processes (POMDPs) is a rich mathematical framework that embraces a large class of complex sequential decision-making problems under uncertainty with limited observations. However, the complexity of POMDPs poses various computational challenges, motivating the need for an efficient algorithm that rapidly finds a good enough suboptimal solution. In this paper, we propose a novel accelerated offline POMDP algorithm exploiting Anderson acceleration (AA) that is capable of efficiently solving fixed-point problems using previous solution estimates. Our algorithm is based on the Q-function approximation (QMDP) method to alleviate the scalability issue inherent in POMDPs. Inspired by the quasi-Newton interpretation of AA, we propose a maximum entropy variant of QMDP, which we call soft QMDP, to fully benefit from AA. We prove that the overall algorithm converges to the suboptimal solution obtained by soft QMDP. Our algorithm can also be implemented in a model-free manner using simulation data. Provable error bounds on the residual and the solution are provided to examine how the simulation errors are propagated through the proposed algorithm. Finally, the performance of our algorithm is tested on several benchmark problems. According to the results of our experiments, the proposed algorithm converges significantly faster without degrading the solution quality compared to its standard counterparts.
△ Less
Submitted 27 November, 2022;
originally announced November 2022.
-
Survey of machine learning wall models for large eddy simulation
Authors:
Aurélien Vadrot,
Xiang I. A. Yang,
Mahdi Abkar
Abstract:
This survey investigates wall modeling in large eddy simulations (LES) using data-driven machine learning (ML) techniques. To this end, we implement three ML wall models in an open-source code and compare their performances with the equilibrium wall model in LES of half-channel flow at eleven friction Reynolds numbers between $180$ and $10^{10}$. The three models have ''seen'' flows at only a few…
▽ More
This survey investigates wall modeling in large eddy simulations (LES) using data-driven machine learning (ML) techniques. To this end, we implement three ML wall models in an open-source code and compare their performances with the equilibrium wall model in LES of half-channel flow at eleven friction Reynolds numbers between $180$ and $10^{10}$. The three models have ''seen'' flows at only a few Reynolds numbers. We test if these ML wall models can extrapolate to unseen Reynolds numbers. Among the three models, two are supervised ML models, and one is a reinforcement learning ML model. The two supervised ML models are trained against direct numerical simulation (DNS) data, whereas the reinforcement learning ML model is trained in the context of a wall-modeled LES with no access to high-fidelity data. The two supervised ML models capture the law of the wall at both seen and unseen Reynolds numbers--although one model requires re-training and predicts a smaller von Kármán constant. The reinforcement learning model captures the law of the wall reasonably well but has errors at both low ($Re_τ<10^3$) and high Reynolds numbers ($Re_τ>10^6$). In addition to documenting the results, we try to ''understand'' why the ML models behave the way they behave. Analysis shows that the errors of the supervised ML model is a result of the network design and the errors in the reinforcement learning model arise due to the present choice of the ''states'' and the mismatch between the neutral line and the line separating the action map. In all, we see promises in data-driven machine learning models.
△ Less
Submitted 22 May, 2023; v1 submitted 7 November, 2022;
originally announced November 2022.
-
MONAI: An open-source framework for deep learning in healthcare
Authors:
M. Jorge Cardoso,
Wenqi Li,
Richard Brown,
Nic Ma,
Eric Kerfoot,
Yiheng Wang,
Benjamin Murrey,
Andriy Myronenko,
Can Zhao,
Dong Yang,
Vishwesh Nath,
Yufan He,
Ziyue Xu,
Ali Hatamizadeh,
Andriy Myronenko,
Wentao Zhu,
Yun Liu,
Mingxin Zheng,
Yucheng Tang,
Isaac Yang,
Michael Zephyr,
Behrooz Hashemian,
Sachidanand Alle,
Mohammad Zalbagi Darestani,
Charlie Budd
, et al. (32 additional authors not shown)
Abstract:
Artificial Intelligence (AI) is having a tremendous impact across most areas of science. Applications of AI in healthcare have the potential to improve our ability to detect, diagnose, prognose, and intervene on human disease. For AI models to be used clinically, they need to be made safe, reproducible and robust, and the underlying software framework must be aware of the particularities (e.g. geo…
▽ More
Artificial Intelligence (AI) is having a tremendous impact across most areas of science. Applications of AI in healthcare have the potential to improve our ability to detect, diagnose, prognose, and intervene on human disease. For AI models to be used clinically, they need to be made safe, reproducible and robust, and the underlying software framework must be aware of the particularities (e.g. geometry, physiology, physics) of medical data being processed. This work introduces MONAI, a freely available, community-supported, and consortium-led PyTorch-based framework for deep learning in healthcare. MONAI extends PyTorch to support medical data, with a particular focus on imaging, and provide purpose-specific AI model architectures, transformations and utilities that streamline the development and deployment of medical AI models. MONAI follows best practices for software-development, providing an easy-to-use, robust, well-documented, and well-tested software framework. MONAI preserves the simple, additive, and compositional approach of its underlying PyTorch libraries. MONAI is being used by and receiving contributions from research, clinical and industrial teams from around the world, who are pursuing applications spanning nearly every aspect of healthcare.
△ Less
Submitted 4 November, 2022;
originally announced November 2022.
-
NVIDIA FLARE: Federated Learning from Simulation to Real-World
Authors:
Holger R. Roth,
Yan Cheng,
Yuhong Wen,
Isaac Yang,
Ziyue Xu,
Yuan-Ting Hsieh,
Kristopher Kersten,
Ahmed Harouni,
Can Zhao,
Kevin Lu,
Zhihong Zhang,
Wenqi Li,
Andriy Myronenko,
Dong Yang,
Sean Yang,
Nicola Rieke,
Abood Quraini,
Chester Chen,
Daguang Xu,
Nic Ma,
Prerna Dogra,
Mona Flores,
Andrew Feng
Abstract:
Federated learning (FL) enables building robust and generalizable AI models by leveraging diverse datasets from multiple collaborators without centralizing the data. We created NVIDIA FLARE as an open-source software development kit (SDK) to make it easier for data scientists to use FL in their research and real-world applications. The SDK includes solutions for state-of-the-art FL algorithms and…
▽ More
Federated learning (FL) enables building robust and generalizable AI models by leveraging diverse datasets from multiple collaborators without centralizing the data. We created NVIDIA FLARE as an open-source software development kit (SDK) to make it easier for data scientists to use FL in their research and real-world applications. The SDK includes solutions for state-of-the-art FL algorithms and federated machine learning approaches, which facilitate building workflows for distributed learning across enterprises and enable platform developers to create a secure, privacy-preserving offering for multiparty collaboration utilizing homomorphic encryption or differential privacy. The SDK is a lightweight, flexible, and scalable Python package. It allows researchers to apply their data science workflows in any training libraries (PyTorch, TensorFlow, XGBoost, or even NumPy) in real-world FL settings. This paper introduces the key design principles of NVFlare and illustrates some use cases (e.g., COVID analysis) with customizable FL workflows that implement different privacy-preserving algorithms.
Code is available at https://github.com/NVIDIA/NVFlare.
△ Less
Submitted 28 April, 2023; v1 submitted 24 October, 2022;
originally announced October 2022.
-
Robust Preference Learning for Storytelling via Contrastive Reinforcement Learning
Authors:
Louis Castricato,
Alexander Havrilla,
Shahbuland Matiana,
Michael Pieler,
Anbang Ye,
Ian Yang,
Spencer Frazier,
Mark Riedl
Abstract:
Controlled automated story generation seeks to generate natural language stories satisfying constraints from natural language critiques or preferences. Existing methods to control for story preference utilize prompt engineering which is labor intensive and often inconsistent. They may also use logit-manipulation methods which require annotated datasets to exist for the desired attributes. To addre…
▽ More
Controlled automated story generation seeks to generate natural language stories satisfying constraints from natural language critiques or preferences. Existing methods to control for story preference utilize prompt engineering which is labor intensive and often inconsistent. They may also use logit-manipulation methods which require annotated datasets to exist for the desired attributes. To address these issues, we first train a contrastive bi-encoder model to align stories with corresponding human critiques, named CARP, building a general purpose preference model. This is subsequently used as a reward function to fine-tune a generative language model via reinforcement learning. However, simply fine-tuning a generative language model with a contrastive reward model does not always reliably result in a story generation system capable of generating stories that meet user preferences. To increase story generation robustness we further fine-tune the contrastive reward model using a prompt-learning technique. A human participant study is then conducted comparing generations from our full system, ablations, and two baselines. We show that the full fine-tuning pipeline results in a story generator preferred over a LLM 20x as large as well as logit-based methods. This motivates the use of contrastive learning for general purpose human preference modeling.
△ Less
Submitted 15 December, 2022; v1 submitted 14 October, 2022;
originally announced October 2022.
-
Unsupervisedly Prompting AlphaFold2 for Few-Shot Learning of Accurate Folding Landscape and Protein Structure Prediction
Authors:
Jun Zhang,
Sirui Liu,
Mengyun Chen,
Haotian Chu,
Min Wang,
Zidong Wang,
Jialiang Yu,
Ningxi Ni,
Fan Yu,
Diqing Chen,
Yi Isaac Yang,
Boxin Xue,
Lijiang Yang,
Yuan Liu,
Yi Qin Gao
Abstract:
Data-driven predictive methods which can efficiently and accurately transform protein sequences into biologically active structures are highly valuable for scientific research and medical development. Determining accurate folding landscape using co-evolutionary information is fundamental to the success of modern protein structure prediction methods. As the state of the art, AlphaFold2 has dramatic…
▽ More
Data-driven predictive methods which can efficiently and accurately transform protein sequences into biologically active structures are highly valuable for scientific research and medical development. Determining accurate folding landscape using co-evolutionary information is fundamental to the success of modern protein structure prediction methods. As the state of the art, AlphaFold2 has dramatically raised the accuracy without performing explicit co-evolutionary analysis. Nevertheless, its performance still shows strong dependence on available sequence homologs. Based on the interrogation on the cause of such dependence, we presented EvoGen, a meta generative model, to remedy the underperformance of AlphaFold2 for poor MSA targets. By prompting the model with calibrated or virtually generated homologue sequences, EvoGen helps AlphaFold2 fold accurately in low-data regime and even achieve encouraging performance with single-sequence predictions. Being able to make accurate predictions with few-shot MSA not only generalizes AlphaFold2 better for orphan sequences, but also democratizes its use for high-throughput applications. Besides, EvoGen combined with AlphaFold2 yields a probabilistic structure generation method which could explore alternative conformations of protein sequences, and the task-aware differentiable algorithm for sequence generation will benefit other related tasks including protein design.
△ Less
Submitted 8 October, 2023; v1 submitted 20 August, 2022;
originally announced August 2022.
-
Unique Ferroelectric Fatigue Behavior and Exceptional High Temperature Retention in Al0.93B0.07N Films
Authors:
Wanlin Zhu,
Fan He,
John Hayden,
Quyen Tran,
Jung In Yang,
Pannawit Tipsawat,
Brian Foley,
Thomas N. Jackson,
Jon-Paul Maria,
Susan Trolier-McKinstry
Abstract:
This paper reports the fatigue and retention behavior for Al1-xBxN thin films, a member of the novel family of wurtzite ferroelectrics, with an emphasis on the role of capacitor architecture. By modifying the capacitor architecture, and thus thermal and electrical boundary conditions, we create insight regarding the relative importance of intrinsic and extrinsic contributors to the degradation ten…
▽ More
This paper reports the fatigue and retention behavior for Al1-xBxN thin films, a member of the novel family of wurtzite ferroelectrics, with an emphasis on the role of capacitor architecture. By modifying the capacitor architecture, and thus thermal and electrical boundary conditions, we create insight regarding the relative importance of intrinsic and extrinsic contributors to the degradation tendencies. Our experiments suggest that bipolar cycling of metal (Pt/W)/Al0.93B0.07N/W/Al2O3 film stacks first induced wake-up, then a region of constant switchable polarization. On additional cycling, the film leakage current increased, and then films underwent dielectric breakdown. For unpatterned first generation Al0.93B0.07N films with 100 nm thick Pt top electrodes survive ~104 bipolar cycles, whereas films with 1000 nm W top electrodes survive ~10^5 cycles before thermal dielectric breakdown. Sentaurus modeling was used to design an SU8 field plate which improved the performance to ~10^6 fatigue cycles. It was found that the thermal failures during fatigue were not due to surface flashover events but were associated with hard breakdown events in the dielectric. The films showed excellent retention of the stored polarization state. As expected, data retention was slightly inferior in the opposite state (OS) measurements. However, it is noted that even after 3.6x10^6 sec (1000 hr). at 200°C, the OS signal margin still exceeded 200 uC/cm2. The predicted OS retention is 82% after 10 years baking at 200oC.
△ Less
Submitted 12 August, 2022;
originally announced August 2022.
-
On representation formulas for optimal control: A Lagrangian perspective
Authors:
Yeoneung Kim,
Insoon Yang
Abstract:
In this paper, we study representation formulas for finite-horizon optimal control problems with or without state constraints, unifying two different viewpoints: the Lagrangian and dynamic programming (DP) frameworks. In a recent work [1], the generalized Lax formula is obtained via DP for optimal control problems with state constraints and nonlinear systems. We revisit the formula from the Lagran…
▽ More
In this paper, we study representation formulas for finite-horizon optimal control problems with or without state constraints, unifying two different viewpoints: the Lagrangian and dynamic programming (DP) frameworks. In a recent work [1], the generalized Lax formula is obtained via DP for optimal control problems with state constraints and nonlinear systems. We revisit the formula from the Lagrangian perspective to provide a unified framework for understanding and implementing the nontrivial representation of the value function. Our simple derivation makes direct use of the Lagrangian formula from the theory of Hamilton-Jacobi (HJ) equations. We also discuss a rigorous way to construct an optimal control using a $δ$-net, as well as a numerical scheme for controller synthesis via convex optimization.
△ Less
Submitted 5 April, 2022;
originally announced April 2022.
-
Wasserstein Distributionally Robust Control of Partially Observable Linear Systems: Tractable Approximation and Performance Guarantee
Authors:
Astghik Hakobyan,
Insoon Yang
Abstract:
Wasserstein distributionally robust control (WDRC) is an effective method for addressing inaccurate distribution information about disturbances in stochastic systems. It provides various salient features, such as an out-of-sample performance guarantee, while most of the existing methods use full-state observations. In this paper, we develop a computationally tractable WDRC method for discrete-time…
▽ More
Wasserstein distributionally robust control (WDRC) is an effective method for addressing inaccurate distribution information about disturbances in stochastic systems. It provides various salient features, such as an out-of-sample performance guarantee, while most of the existing methods use full-state observations. In this paper, we develop a computationally tractable WDRC method for discrete-time partially observable linear-quadratic (LQ) control problems. The key idea is to reformulate the WDRC problem as a novel minimax control problem with an approximate Wasserstein penalty. We derive a closed-form expression of the optimal control policy of the approximate problem using a nontrivial Riccati equation. We further show the guaranteed cost property of the resulting controller and identify a provable bound for the optimality gap. Finally, we evaluate the performance of our method through numerical experiments using both Gaussian and non-Gaussian disturbances.
△ Less
Submitted 7 September, 2022; v1 submitted 31 March, 2022;
originally announced March 2022.
-
On Affine Policies for Wasserstein Distributionally Robust Unit Commitment
Authors:
Youngchae Cho,
Insoon Yang
Abstract:
This paper proposes a unit commitment (UC) model based on data-driven Wasserstein distributionally robust optimization (WDRO) for power systems under uncertainty of renewable generation as well as its tractable exact reformulation. The proposed model is formulated as a WDRO problem relying on an affine policy, which nests an infinite-dimensional worst-case expectation problem and satisfies the non…
▽ More
This paper proposes a unit commitment (UC) model based on data-driven Wasserstein distributionally robust optimization (WDRO) for power systems under uncertainty of renewable generation as well as its tractable exact reformulation. The proposed model is formulated as a WDRO problem relying on an affine policy, which nests an infinite-dimensional worst-case expectation problem and satisfies the non-anticipativity constraint. To reduce conservativeness, we develop a novel technique that defines a subset of the uncertainty set with a probabilistic guarantee. Subsequently, the proposed model is recast as a semi-infinite programming problem that can be efficiently solved using existing algorithms. Notably, the scale of this reformulation is invariant with the sample size. As a result, a number of samples are easily incorporated without using sophisticated decomposition algorithms. Numerical simulations on 6- and 24-bus test systems demonstrate the economic and computational efficiency of the proposed model.
△ Less
Submitted 16 August, 2022; v1 submitted 29 March, 2022;
originally announced March 2022.
-
On the energy of Schwarzschild spacetime with the post-Newtonian approximation
Authors:
I-Ching Yang
Abstract:
With the post-Newtonian approxination, the energy of Schwarzschild spacetime in the Weinberg prescription is obtained. The energy for the first post-Newtonian approximation $E^{(1)} = m$ gives the Newtonian treatment of Schwarzschild spacetime. However, for the second post-Newtonian approximation, the erergy is shown that $E^{(1)}$ adds extra terms $E^{(2)}$ which consist of the energy stored in t…
▽ More
With the post-Newtonian approxination, the energy of Schwarzschild spacetime in the Weinberg prescription is obtained. The energy for the first post-Newtonian approximation $E^{(1)} = m$ gives the Newtonian treatment of Schwarzschild spacetime. However, for the second post-Newtonian approximation, the erergy is shown that $E^{(1)}$ adds extra terms $E^{(2)}$ which consist of the energy stored in the configuration $E_{\rm config}$, in the gravitational field $E_{\rm field}$. and a term of surface integral. These extra terms gives post-Newtonian corrections to the Newtonian treatment.
△ Less
Submitted 17 February, 2022;
originally announced February 2022.
-
Nesterov Acceleration for Riemannian Optimization
Authors:
Jungbin Kim,
Insoon Yang
Abstract:
In this paper, we generalize the Nesterov accelerated gradient (NAG) method to solve Riemannian optimization problems in a computationally tractable manner. The iteration complexity of our algorithm matches that of the NAG method on the Euclidean space when the objective functions are geodesically convex or geodesically strongly convex. To the best of our knowledge, the proposed algorithm is the f…
▽ More
In this paper, we generalize the Nesterov accelerated gradient (NAG) method to solve Riemannian optimization problems in a computationally tractable manner. The iteration complexity of our algorithm matches that of the NAG method on the Euclidean space when the objective functions are geodesically convex or geodesically strongly convex. To the best of our knowledge, the proposed algorithm is the first fully accelerated method for geodesically convex optimization problems without requiring strong convexity. Our convergence rate analysis exploits novel metric distortion lemmas as well as carefully designed potential functions. We also identify a connection with the continuous-time dynamics for modeling Riemannian acceleration in Alimisis et al. [1] to understand the accelerated convergence of our scheme through the lens of continuous-time flows.
△ Less
Submitted 4 February, 2022;
originally announced February 2022.
-
Atomistic View of Homogeneous Nucleation of Water into Polymorphic Ices
Authors:
Maodong Li,
Jun Zhang,
Niu Haiyang,
Yao Kun Lei,
Xu Han,
Lijiang Yang,
Zhiqiang Ye,
Yi Isaac Yang,
Yi Qin Gao
Abstract:
Water is one of the most abundant substances on Earth, and ice, i.e., solid water, has more than 18 known phases. Normally ice in nature exists only as Ice Ih, Ice Ic, or a stacking disordered mixture of both. Although many theoretical efforts have been devoted to understanding the thermodynamics of different ice phases at ambient temperature and pressure, there still remains many puzzles. We simu…
▽ More
Water is one of the most abundant substances on Earth, and ice, i.e., solid water, has more than 18 known phases. Normally ice in nature exists only as Ice Ih, Ice Ic, or a stacking disordered mixture of both. Although many theoretical efforts have been devoted to understanding the thermodynamics of different ice phases at ambient temperature and pressure, there still remains many puzzles. We simulated the reversible transitions between water and different ice phases by performing full atom molecular dynamics simulations. Using the enhanced sampling method MetaITS with the two selected X-ray diffraction peak intensities as collective variables, the ternary phase diagrams of liquid water, ice Ih, ice Ic at multiple were obtained. We also present a simple physical model which successfully explains the thermodynamic stability of ice. Our results agree with experiments and leads to a deeper understanding of the ice nucleation mechanism.
△ Less
Submitted 23 November, 2021;
originally announced November 2021.
-
The Einstein and Møller energy-momentum complexes in post-Newtonian approximation
Authors:
I-Ching Yang
Abstract:
In the first and second post-Newtonian approximation of the Schwarzschild metric, I obtain the energy component of the Einstein and Møller energy-momentum complex. Both energies involve the rest-mass energy $m$, the energy stored in the configuration and that in the gravitational field, but the energies of Schwarzschild spacetime in the Einstein and Møller prescriptions are the total mass-energy…
▽ More
In the first and second post-Newtonian approximation of the Schwarzschild metric, I obtain the energy component of the Einstein and Møller energy-momentum complex. Both energies involve the rest-mass energy $m$, the energy stored in the configuration and that in the gravitational field, but the energies of Schwarzschild spacetime in the Einstein and Møller prescriptions are the total mass-energy $M$. First, for general relativity, the rest-mass energy $m$ in the flat spacetime behaves like the bare mass, and the total mass-energy $M$ in the curved spacetime behaves like the experimentally observed mass. Second, the zero-potential surface is important condition for defining the energy of gravitational field, and plays an important role in the energy-momentum localization of general relativity.
△ Less
Submitted 22 November, 2021;
originally announced November 2021.
-
Improved Regret Analysis for Variance-Adaptive Linear Bandits and Horizon-Free Linear Mixture MDPs
Authors:
Yeoneung Kim,
Insoon Yang,
Kwang-Sung Jun
Abstract:
In online learning problems, exploiting low variance plays an important role in obtaining tight performance guarantees yet is challenging because variances are often not known a priori. Recently, considerable progress has been made by Zhang et al. (2021) where they obtain a variance-adaptive regret bound for linear bandits without knowledge of the variances and a horizon-free regret bound for line…
▽ More
In online learning problems, exploiting low variance plays an important role in obtaining tight performance guarantees yet is challenging because variances are often not known a priori. Recently, considerable progress has been made by Zhang et al. (2021) where they obtain a variance-adaptive regret bound for linear bandits without knowledge of the variances and a horizon-free regret bound for linear mixture Markov decision processes (MDPs). In this paper, we present novel analyses that improve their regret bounds significantly. For linear bandits, we achieve $\tilde O(\min\{d\sqrt{K}, d^{1.5}\sqrt{\sum_{k=1}^K σ_k^2}\} + d^2)$ where $d$ is the dimension of the features, $K$ is the time horizon, and $σ_k^2$ is the noise variance at time step $k$, and $\tilde O$ ignores polylogarithmic dependence, which is a factor of $d^3$ improvement. For linear mixture MDPs with the assumption of maximum cumulative reward in an episode being in $[0,1]$, we achieve a horizon-free regret bound of $\tilde O(d \sqrt{K} + d^2)$ where $d$ is the number of base models and $K$ is the number of episodes. This is a factor of $d^{3.5}$ improvement in the leading term and $d^7$ in the lower order term. Our analysis critically relies on a novel peeling-based regret analysis that leverages the elliptical potential `count' lemma.
△ Less
Submitted 4 February, 2023; v1 submitted 5 November, 2021;
originally announced November 2021.
-
Training Wasserstein GANs without gradient penalties
Authors:
Dohyun Kwon,
Yeoneung Kim,
Guido Montúfar,
Insoon Yang
Abstract:
We propose a stable method to train Wasserstein generative adversarial networks. In order to enhance stability, we consider two objective functions using the $c$-transform based on Kantorovich duality which arises in the theory of optimal transport. We experimentally show that this algorithm can effectively enforce the Lipschitz constraint on the discriminator while other standard methods fail to…
▽ More
We propose a stable method to train Wasserstein generative adversarial networks. In order to enhance stability, we consider two objective functions using the $c$-transform based on Kantorovich duality which arises in the theory of optimal transport. We experimentally show that this algorithm can effectively enforce the Lipschitz constraint on the discriminator while other standard methods fail to do so. As a consequence, our method yields an accurate estimation for the optimal discriminator and also for the Wasserstein distance between the true distribution and the generated one. Our method requires no gradient penalties nor corresponding hyperparameter tuning and is computationally more efficient than other methods. At the same time, it yields competitive generators of synthetic images based on the MNIST, F-MNIST, and CIFAR-10 datasets.
△ Less
Submitted 26 October, 2021;
originally announced October 2021.
-
Infusing model predictive control into meta-reinforcement learning for mobile robots in dynamic environments
Authors:
Jaeuk Shin,
Astghik Hakobyan,
Mingyu Park,
Yeoneung Kim,
Gihun Kim,
Insoon Yang
Abstract:
The successful operation of mobile robots requires them to adapt rapidly to environmental changes. To develop an adaptive decision-making tool for mobile robots, we propose a novel algorithm that combines meta-reinforcement learning (meta-RL) with model predictive control (MPC). Our method employs an off-policy meta-RL algorithm as a baseline to train a policy using transition samples generated by…
▽ More
The successful operation of mobile robots requires them to adapt rapidly to environmental changes. To develop an adaptive decision-making tool for mobile robots, we propose a novel algorithm that combines meta-reinforcement learning (meta-RL) with model predictive control (MPC). Our method employs an off-policy meta-RL algorithm as a baseline to train a policy using transition samples generated by MPC when the robot detects certain events that can be effectively handled by MPC, with its explicit use of robot dynamics. The key idea of our method is to switch between the meta-learned policy and the MPC controller in a randomized and event-triggered fashion to make up for suboptimal MPC actions caused by the limited prediction horizon. During meta-testing, the MPC module is deactivated to significantly reduce computation time in motion control. We further propose an online adaptation scheme that enables the robot to infer and adapt to a new task within a single trajectory. The performance of our method has been demonstrated through simulations using a nonlinear car-like vehicle model with (i) synthetic movements of obstacles, and (ii) real-world pedestrian motion data. The simulation results indicate that our method outperforms other algorithms in terms of learning efficiency and navigation quality.
△ Less
Submitted 7 July, 2022; v1 submitted 15 September, 2021;
originally announced September 2021.
-
Angularly quantized spin rotations in hexagonal LuMnO3
Authors:
Seung Kim,
Jiyeon Nam,
Xianghan Xu,
Sang-Wook Cheong,
In-Sang Yang
Abstract:
Optical control of the spin degree of freedom is often desired in application of the spin technology. Here we report spin-rotational excitations observed through inelastic light scattering of the hexagonal LuMnO3 in the antiferromagnetically (AFM) ordered state. We propose a model based on the spin-spin interaction Hamiltonian associated with the spin rotation of the Mn ions, and find that the spi…
▽ More
Optical control of the spin degree of freedom is often desired in application of the spin technology. Here we report spin-rotational excitations observed through inelastic light scattering of the hexagonal LuMnO3 in the antiferromagnetically (AFM) ordered state. We propose a model based on the spin-spin interaction Hamiltonian associated with the spin rotation of the Mn ions, and find that the spin rotations are angularly quantized by 60, 120, and 180 degrees. Angular quantization is considered to be a consequence of the symmetry of the triangular lattice of the Mn-ion plane in the hexagonal LuMnO3. These angularly-quantized spin excitations may be pictured as isolated flat bubbles in the sea of the ground state, which may lead to high-density information storage if applied to spin devices. Optically pumped and detected spin-excitation bubbles would bring about the advanced technology of optical control of the spin degree of freedom in multiferroic materials.
△ Less
Submitted 3 February, 2022; v1 submitted 19 August, 2021;
originally announced August 2021.
-
On the energy density of linearly polarized, plane gravitational wave
Authors:
I-Ching Yang
Abstract:
In this article, the energy density of plane gravitational wave is studied by using Einstein and Møller's prescription of energy-momentum pseudotensors. The linearly polarized plan gravitational wave solution of Einstein field equation, which has been defined by Bondi et al., is represented by four kinds of different coodrinates. The energy distribution of gravitational wave solution in Einstein a…
▽ More
In this article, the energy density of plane gravitational wave is studied by using Einstein and Møller's prescription of energy-momentum pseudotensors. The linearly polarized plan gravitational wave solution of Einstein field equation, which has been defined by Bondi et al., is represented by four kinds of different coodrinates. The energy distribution of gravitational wave solution in Einstein and Møller's prescription are obtained. Particularly the energy component is zero in null coordinates.
△ Less
Submitted 27 May, 2021;
originally announced May 2021.
-
LES wall modeling for heat transfer at high speeds
Authors:
Peng E. S. Chen,
Yu Lv,
Haosen H. A. Xu,
Yipeng Shi,
Xiang I. A. Yang
Abstract:
A practical application of universal wall scalings is near-wall turbulence modeling. In this paper, we exploit temperature's semi-local scaling [Patel, Boersma, and Pecnik, {Scalar statistics in variable property turbulent channel flows}, Phys. Rev. Fluids, 2017, 2(8), 084604] and derive an eddy conductivity closure for wall-modeled large-eddy simulation of high-speed flows. We show that while the…
▽ More
A practical application of universal wall scalings is near-wall turbulence modeling. In this paper, we exploit temperature's semi-local scaling [Patel, Boersma, and Pecnik, {Scalar statistics in variable property turbulent channel flows}, Phys. Rev. Fluids, 2017, 2(8), 084604] and derive an eddy conductivity closure for wall-modeled large-eddy simulation of high-speed flows. We show that while the semi-local scaling does not collapse high-speed direct numerical simulation (DNS) data, the resulting eddy conductivity and the wall model work fairly well. The paper attempts to answer the following outstanding question: why the semi-local scaling fails but the resulting eddy conductivity works well. We conduct DNSs of Couette flows at Mach numbers from $M=1.4$ to 6. We add a source term in the energy equation to get a cold, a close-to-adiabatic wall, and a hot wall. Detailed analysis of the flows' energy budgets shows that aerodynamic heating is the answer to our question: aerodynamic heating is not accounted for in Patel et al.'s semi-local scaling but is modeled in the equilibrium wall model. We incorporate aerodynamic heating in semi-local scaling and show that the new scaling successfully collapses the high-speed DNS data. We also show that incorporating aerodynamic heating or not, the semi-local scaling gives rise to the exact same eddy conductivity, thereby answering the outstanding question.
△ Less
Submitted 25 May, 2021;
originally announced May 2021.
-
Distributionally robust risk map for learning-based motion planning and control: A semidefinite programming approach
Authors:
Astghik Hakobyan,
Insoon Yang
Abstract:
This paper proposes a novel safety specification tool, called the distributionally robust risk map (DR-risk map), for a mobile robot operating in a learning-enabled environment. Given the robot's position, the map aims to reliably assess the conditional value-at-risk (CVaR) of collision with obstacles whose movements are inferred by Gaussian process regression (GPR). Unfortunately, the inferred di…
▽ More
This paper proposes a novel safety specification tool, called the distributionally robust risk map (DR-risk map), for a mobile robot operating in a learning-enabled environment. Given the robot's position, the map aims to reliably assess the conditional value-at-risk (CVaR) of collision with obstacles whose movements are inferred by Gaussian process regression (GPR). Unfortunately, the inferred distribution is subject to errors, making it difficult to accurately evaluate the CVaR of collision. To overcome this challenge, this tool measures the risk under the worst-case distribution in a so-called ambiguity set that characterizes allowable distribution errors. To resolve the infinite-dimensionality issue inherent in the construction of the DR-risk map, we derive a tractable semidefinite programming formulation that provides an upper bound of the risk, exploiting techniques from modern distributionally robust optimization. As a concrete application for motion planning, a distributionally robust RRT* algorithm is considered using the risk map that addresses distribution errors caused by GPR. Furthermore, a motion control method is devised using the DR-risk map in a learning-based model predictive control (MPC) formulation. In particular, a neural network approximation of the risk map is proposed to reduce the computational cost in solving the MPC problem. The performance and utility of the proposed risk map are demonstrated through simulation studies that show its ability to ensure the safety of mobile robots despite learning errors.
△ Less
Submitted 3 May, 2021;
originally announced May 2021.
-
On Anderson acceleration for partially observable Markov decision processes
Authors:
Melike Ermis,
Mingyu Park,
Insoon Yang
Abstract:
This paper proposes an accelerated method for approximately solving partially observable Markov decision process (POMDP) problems offline. Our method carefully combines two existing tools: Anderson acceleration (AA) and the fast informed bound (FIB) method. Adopting AA, our method rapidly solves an approximate Bellman equation with an efficient combination of previous solution estimates. Furthermo…
▽ More
This paper proposes an accelerated method for approximately solving partially observable Markov decision process (POMDP) problems offline. Our method carefully combines two existing tools: Anderson acceleration (AA) and the fast informed bound (FIB) method. Adopting AA, our method rapidly solves an approximate Bellman equation with an efficient combination of previous solution estimates. Furthermore, the use of FIB alleviates the scalability issue inherent in POMDPs. We show the convergence of the overall algorithm to the suboptimal solution obtained by FIB. We further consider a simulation-based method and prove that the approximation error is bounded explicitly. The performance of our algorithm is evaluated on several benchmark problems. The results of our experiments demonstrate that the proposed algorithm converges significantly faster without degrading the quality of the solution compared to its standard counterpart.
△ Less
Submitted 28 March, 2021;
originally announced March 2021.
-
Distributional robustness in minimax linear quadratic control with Wasserstein distance
Authors:
Kihyun Kim,
Insoon Yang
Abstract:
To address the issue of inaccurate distributions in practical stochastic systems, a minimax linear-quadratic control method is proposed using the Wasserstein metric. Our method aims to construct a control policy that is robust against errors in an empirical distribution of underlying uncertainty, by adopting an adversary that selects the worst-case distribution. The opponent receives a Wasserstein…
▽ More
To address the issue of inaccurate distributions in practical stochastic systems, a minimax linear-quadratic control method is proposed using the Wasserstein metric. Our method aims to construct a control policy that is robust against errors in an empirical distribution of underlying uncertainty, by adopting an adversary that selects the worst-case distribution. The opponent receives a Wasserstein penalty proportional to the amount of deviation from the empirical distribution. A closed-form expression of the finite-horizon optimal policy pair is derived using a Riccati equation. The result is then extended to the infinite-horizon average cost setting by identifying conditions under which the Riccati recursion converges to the unique positive semi-definite solution to an algebraic Riccati equation. Our method is shown to possess several salient features including closed-loop stability, and an out-of-sample performance guarantee. We also discuss how to optimize the penalty parameter for enhancing the distributional robustness of our control policy. Last but not least, a theoretical connection to the classical $H_\infty$-method is identified from the perspective of distributional robustness.
△ Less
Submitted 25 February, 2021;
originally announced February 2021.