-
Simulation Model Calibration with Dynamic Stratification and Adaptive Sampling
Authors:
Pranav Jain,
Sara Shashaani,
Eunshin Byon
Abstract:
Calibrating simulation models that take large quantities of multi-dimensional data as input is a hard simulation optimization problem. Existing adaptive sampling strategies offer a methodological solution. However, they may not sufficiently reduce the computational cost for estimation and solution algorithm's progress within a limited budget due to extreme noise levels and heteroskedasticity of system responses. We propose integrating stratification with adaptive sampling for the purpose of efficiency in optimization. Stratification can exploit local dependence in the simulation inputs and outputs. Yet, the state-of-the-art does not provide a full capability to adaptively stratify the data as different solution alternatives are evaluated. We devise two procedures for data-driven calibration problems that involve a large dataset with multiple covariates to calibrate models within a fixed overall simulation budget. The first approach dynamically stratifies the input data using binary trees, while the second approach uses closed-form solutions based on linearity assumptions between the objective function and concomitant variables. We find that dynamically adjusting the stratification structure accelerates optimization and reduces run-to-run variability in generated solutions. Our case study for calibrating a wind power simulation model, widely used in the wind industry, using the proposed stratified adaptive sampling, shows better-calibrated parameters under a limited budget.
Submitted 25 January, 2024;
originally announced January 2024.
-
Distributionally Robust Stratified Sampling for Stochastic Simulations with Multiple Uncertain Input Models
Authors:
Seung Min Baik,
Eunshin Byon,
Young Myoung Ko
Abstract:
This paper presents a robust version of the stratified sampling method when multiple uncertain input models are considered for stochastic simulation. Various variance reduction techniques have demonstrated their superior performance in accelerating simulation processes. Nevertheless, they often use a single input model and further assume that the input model is exactly known and fixed. We consider more general cases in which it is necessary to assess a simulation's response to a variety of input models, such as when evaluating the reliability of wind turbines under nonstationary wind conditions or the operation of a service system when the distribution of customer inter-arrival time is heterogeneous at different times. Moreover, the estimation variance may be considerably impacted by uncertainty in input models. To address such nonstationary and uncertain input models, we offer a distributionally robust (DR) stratified sampling approach with the goal of minimizing the maximum of worst-case estimator variances among plausible but uncertain input models. Specifically, we devise a bi-level optimization framework for formulating DR stochastic problems with different ambiguity set designs, based on the $L_2$-norm, 1-Wasserstein distance, parametric family of distributions, and distribution moments. To cope with the non-convexity of the objective function, we present a solution approach that uses Bayesian optimization. Numerical experiments and the wind turbine case study demonstrate the robustness of the proposed approach.
Submitted 15 June, 2023;
originally announced June 2023.
-
The Internet of Federated Things (IoFT): A Vision for the Future and In-depth Survey of Data-driven Approaches for Federated Learning
Authors:
Raed Kontar,
Naichen Shi,
Xubo Yue,
Seokhyun Chung,
Eunshin Byon,
Mosharaf Chowdhury,
Judy Jin,
Wissam Kontar,
Neda Masoud,
Maher Noueihed,
Chinedum E. Okwudire,
Garvesh Raskutti,
Romesh Saigal,
Karandeep Singh,
Zhisheng Ye
Abstract:
The Internet of Things (IoT) is on the verge of a major paradigm shift. In the IoT system of the future, IoFT, the cloud will be substituted by the crowd where model training is brought to the edge, allowing IoT devices to collaboratively extract knowledge and build smart analytics/models while keeping their personal data stored locally. This paradigm shift was set into motion by the tremendous increase in computational power on IoT devices and the recent advances in decentralized and privacy-preserving model training, coined as federated learning (FL). This article provides a vision for IoFT and a systematic overview of current efforts towards realizing this vision. Specifically, we first introduce the defining characteristics of IoFT and discuss FL data-driven approaches, opportunities, and challenges that allow decentralized inference within three dimensions: (i) a global model that maximizes utility across all IoT devices, (ii) a personalized model that borrows strengths across all devices yet retains its own model, (iii) a meta-learning model that quickly adapts to new devices or learning tasks. We end by describing the vision and challenges of IoFT in reshaping different industries through the lens of domain experts. Those industries include manufacturing, transportation, energy, healthcare, quality & reliability, business, and computing.
Submitted 9 November, 2021;
originally announced November 2021.
-
An Alternative Data-Driven Prediction Approach Based on Real Option Theories
Authors:
Abdullah AlShelahi,
Jingxing Wang,
Mingdi You,
Eunshin Byon,
Romesh Saigal
Abstract:
This paper presents a new prediction model for time series data by integrating a time-varying Geometric Brownian Motion model with a pricing mechanism used in financial engineering. Typical time series models such as Auto-Regressive Integrated Moving Average assume a linear correlation structure in time series data. When a stochastic process is highly volatile, such an assumption can be easily violated, leading to inaccurate predictions. We develop a new prediction model that can flexibly characterize a time-varying volatile process without assuming linearity. We formulate the prediction problem as an optimization problem with unequal overestimation and underestimation costs. Based on real option theories developed in finance, we solve the optimization problem and obtain a predicted value that minimizes the expected prediction cost. We evaluate the proposed approach using multiple datasets obtained from real-life applications including manufacturing, finance, and environment. The numerical results demonstrate that the proposed model shows competitive prediction capability, compared with alternative approaches.
Submitted 19 April, 2019;
originally announced April 2019.
-
Integrative Density Forecast and Uncertainty Quantification of Wind Power Generation
Authors:
Jingxing Wang,
Abdullah Alshelahi,
Mingdi You,
Eunshin Byon,
Romesh Saigal
Abstract:
The volatile nature of wind power generation creates challenges in achieving secure power grid operations. It is, therefore, necessary to predict wind power accurately and to quantify the uncertainty of that prediction. Wind power forecasting usually depends on wind speed prediction and the wind-to-power conversion process. However, most current wind power prediction models only consider portions of the uncertainty. This paper develops an integrative framework for predicting wind power density, considering uncertainties arising from both wind speed prediction and the wind-to-power conversion process. Specifically, we model wind speed using the inhomogeneous Geometric Brownian Motion and convert the wind speed prediction density into the wind power density in closed form. The resulting wind power density allows quantifying prediction uncertainties through prediction intervals. To forecast the power output, we minimize the expected prediction cost with (unequal) penalties on the overestimation and underestimation. We show the predictive power of the proposed approach using data from multiple operating wind farms located at different sites.
Submitted 27 September, 2020; v1 submitted 22 August, 2018;
originally announced August 2018.
-
Bayesian spline method for assessing extreme loads on wind turbines
Authors:
Giwhyun Lee,
Eunshin Byon,
Lewis Ntaimo,
Yu Ding
Abstract:
This study presents a Bayesian parametric model for the purpose of estimating the extreme load on a wind turbine. The extreme load is the highest stress level imposed on a turbine structure that the turbine would experience during its service lifetime. A wind turbine should be designed to resist such a high load to avoid catastrophic structural failures. To assess the extreme load, turbine structural responses are evaluated by conducting field measurement campaigns or performing aeroelastic simulation studies. In general, data obtained in either case are not sufficient to represent various loading responses under all possible weather conditions. An appropriate extrapolation is necessary to characterize the structural loads in a turbine's service life. This study devises a Bayesian spline method for this extrapolation purpose, using load data collected in a period much shorter than a turbine's service life. The spline method is applied to three sets of turbine load response data to estimate the corresponding extreme loads at the roots of the turbine blades. Compared to the current industry practice, the spline method appears to provide better extreme load assessment.
Submitted 13 January, 2014;
originally announced January 2014.