-
Asset Bundling for Wind Power Forecasting
Authors:
Hanyu Zhang,
Mathieu Tanneau,
Chaofan Huang,
V. Roshan Joseph,
Shangkun Wang,
Pascal Van Hentenryck
Abstract:
The growing penetration of intermittent, renewable generation in US power grids, especially wind and solar generation, results in increased operational uncertainty. In that context, accurate forecasts are critical, especially for wind generation, which exhibits large variability and is historically harder to predict. To overcome this challenge, this work proposes a novel Bundle-Predict-Reconcile (…
▽ More
The growing penetration of intermittent, renewable generation in US power grids, especially wind and solar generation, results in increased operational uncertainty. In that context, accurate forecasts are critical, especially for wind generation, which exhibits large variability and is historically harder to predict. To overcome this challenge, this work proposes a novel Bundle-Predict-Reconcile (BPR) framework that integrates asset bundling, machine learning, and forecast reconciliation techniques. The BPR framework first learns an intermediate hierarchy level (the bundles), then predicts wind power at the asset, bundle, and fleet level, and finally reconciles all forecasts to ensure consistency. This approach effectively introduces an auxiliary learning task (predicting the bundle-level time series) to help the main learning tasks. The paper also introduces new asset-bundling criteria that capture the spatio-temporal dynamics of wind power time series. Extensive numerical experiments are conducted on an industry-size dataset of 283 wind farms in the MISO footprint. The experiments consider short-term and day-ahead forecasts, and evaluates a large variety of forecasting models that include weather predictions as covariates. The results demonstrate the benefits of BPR, which consistently and significantly improves forecast accuracy over baselines, especially at the fleet level.
△ Less
Submitted 28 September, 2023;
originally announced September 2023.
-
Changes in Commuter Behavior from COVID-19 Lockdowns in the Atlanta Metropolitan Area
Authors:
Tejas Santanam,
Anthony Trasatti,
Hanyu Zhang,
Connor Riley,
Pascal Van Hentenryck,
Ramayya Krishnan
Abstract:
This paper analyzes the impact of COVID-19 related lockdowns in the Atlanta, Georgia metropolitan area by examining commuter patterns in three periods: prior to, during, and after the pandemic lockdown. A cellular phone location dataset is utilized in a novel pipeline to infer the home and work locations of thousands of users from the Density-based Spatial Clustering of Applications with Noise (DB…
▽ More
This paper analyzes the impact of COVID-19 related lockdowns in the Atlanta, Georgia metropolitan area by examining commuter patterns in three periods: prior to, during, and after the pandemic lockdown. A cellular phone location dataset is utilized in a novel pipeline to infer the home and work locations of thousands of users from the Density-based Spatial Clustering of Applications with Noise (DBSCAN) algorithm. The coordinates derived from the clustering are put through a reverse geocoding process from which word embeddings are extracted in order to categorize the industry of each work place based on the workplace name and Point of Interest (POI) map**. Frequencies of commute from home locations to work locations are analyzed in and across all three time periods. Public health and economic factors are discussed to explain potential reasons for the observed changes in commuter patterns.
△ Less
Submitted 26 February, 2023;
originally announced February 2023.
-
Multi-resolution spatio-temporal prediction with application to wind power generation
Authors:
Zheng Dong,
Hanyu Zhang,
Shixiang Zhu,
Yao Xie,
Pascal Van Hentenryck
Abstract:
Wind energy is becoming an increasingly crucial component of a sustainable grid, but its inherent variability and limited predictability present challenges for grid operators. The energy sector needs novel forecasting techniques that can precisely predict the generation of renewable power and offer precise quantification of prediction uncertainty. This will facilitate well-informed decision-making…
▽ More
Wind energy is becoming an increasingly crucial component of a sustainable grid, but its inherent variability and limited predictability present challenges for grid operators. The energy sector needs novel forecasting techniques that can precisely predict the generation of renewable power and offer precise quantification of prediction uncertainty. This will facilitate well-informed decision-making by operators who wish to integrate renewable energy into the power grid. This paper presents a novel approach to wind speed prediction with uncertainty quantification using a multi-resolution spatio-temporal Gaussian process. By leveraging information from multiple sources of predictions with varying accuracies and uncertainties, the joint framework provides a more accurate and robust prediction of wind speed while measuring the uncertainty in these predictions. We assess the effectiveness of our proposed framework using real-world wind data obtained from the Midwest region of the United States. Our results demonstrate that the framework enables predictors with varying data resolutions to learn from each other, leading to an enhancement in overall predictive performance. The proposed framework shows a superior performance compared to other state-of-the-art methods. The goal of this research is to improve grid operation and management by aiding system operators and policymakers in making better-informed decisions related to energy demand management, energy storage system deployment, and energy supply scheduling. This results in potentially further integration of renewable energy sources into the existing power systems.
△ Less
Submitted 2 December, 2023; v1 submitted 30 August, 2021;
originally announced August 2021.
-
Differentially Private and Fair Deep Learning: A Lagrangian Dual Approach
Authors:
Cuong Tran,
Ferdinando Fioretto,
Pascal Van Hentenryck
Abstract:
A critical concern in data-driven decision making is to build models whose outcomes do not discriminate against some demographic groups, including gender, ethnicity, or age. To ensure non-discrimination in learning tasks, knowledge of the sensitive attributes is essential, while, in practice, these attributes may not be available due to legal and ethical requirements. To address this challenge, th…
▽ More
A critical concern in data-driven decision making is to build models whose outcomes do not discriminate against some demographic groups, including gender, ethnicity, or age. To ensure non-discrimination in learning tasks, knowledge of the sensitive attributes is essential, while, in practice, these attributes may not be available due to legal and ethical requirements. To address this challenge, this paper studies a model that protects the privacy of the individuals sensitive information while also allowing it to learn non-discriminatory predictors. The method relies on the notion of differential privacy and the use of Lagrangian duality to design neural networks that can accommodate fairness constraints while guaranteeing the privacy of sensitive attributes. The paper analyses the tension between accuracy, privacy, and fairness and the experimental evaluation illustrates the benefits of the proposed model on several prediction tasks.
△ Less
Submitted 26 September, 2020;
originally announced September 2020.
-
Spatio-Temporal Point Processes with Attention for Traffic Congestion Event Modeling
Authors:
Shixiang Zhu,
Ruyi Ding,
Minghe Zhang,
Pascal Van Hentenryck,
Yao Xie
Abstract:
We present a novel framework for modeling traffic congestion events over road networks. Using multi-modal data by combining count data from traffic sensors with police reports that report traffic incidents, we aim to capture two types of triggering effect for congestion events. Current traffic congestion at one location may cause future congestion over the road network, and traffic incidents may c…
▽ More
We present a novel framework for modeling traffic congestion events over road networks. Using multi-modal data by combining count data from traffic sensors with police reports that report traffic incidents, we aim to capture two types of triggering effect for congestion events. Current traffic congestion at one location may cause future congestion over the road network, and traffic incidents may cause spread traffic congestion. To model the non-homogeneous temporal dependence of the event on the past, we use a novel attention-based mechanism based on neural networks embedding for point processes. To incorporate the directional spatial dependence induced by the road network, we adapt the "tail-up" model from the context of spatial statistics to the traffic network setting. We demonstrate our approach's superior performance compared to the state-of-the-art methods for both synthetic and real data.
△ Less
Submitted 31 May, 2021; v1 submitted 15 May, 2020;
originally announced May 2020.
-
Lagrangian Duality for Constrained Deep Learning
Authors:
Ferdinando Fioretto,
Pascal Van Hentenryck,
Terrence WK Mak,
Cuong Tran,
Federico Baldo,
Michele Lombardi
Abstract:
This paper explores the potential of Lagrangian duality for learning applications that feature complex constraints. Such constraints arise in many science and engineering domains, where the task amounts to learning optimization problems which must be solved repeatedly and include hard physical and operational constraints. The paper also considers applications where the learning task must enforce c…
▽ More
This paper explores the potential of Lagrangian duality for learning applications that feature complex constraints. Such constraints arise in many science and engineering domains, where the task amounts to learning optimization problems which must be solved repeatedly and include hard physical and operational constraints. The paper also considers applications where the learning task must enforce constraints on the predictor itself, either because they are natural properties of the function to learn or because it is desirable from a societal standpoint to impose them. This paper demonstrates experimentally that Lagrangian duality brings significant benefits for these applications. In energy domains, the combination of Lagrangian duality and deep learning can be used to obtain state-of-the-art results to predict optimal power flows, in energy systems, and optimal compressor settings, in gas networks. In transprecision computing, Lagrangian duality can complement deep learning to impose monotonicity constraints on the predictor without sacrificing accuracy. Finally, Lagrangian duality can be used to enforce fairness constraints on a predictor and obtain state-of-the-art results when minimizing disparate treatments.
△ Less
Submitted 6 April, 2020; v1 submitted 25 January, 2020;
originally announced January 2020.
-
Distilling Black-Box Travel Mode Choice Model for Behavioral Interpretation
Authors:
Xilei Zhao,
Zhengze Zhou,
Xiang Yan,
Pascal Van Hentenryck
Abstract:
Machine learning has proved to be very successful for making predictions in travel behavior modeling. However, most machine-learning models have complex model structures and offer little or no explanation as to how they arrive at these predictions. Interpretations about travel behavior models are essential for decision makers to understand travelers' preferences and plan policy interventions accor…
▽ More
Machine learning has proved to be very successful for making predictions in travel behavior modeling. However, most machine-learning models have complex model structures and offer little or no explanation as to how they arrive at these predictions. Interpretations about travel behavior models are essential for decision makers to understand travelers' preferences and plan policy interventions accordingly. Therefore, this paper proposes to apply and extend the model distillation approach, a model-agnostic machine-learning interpretation method, to explain how a black-box travel mode choice model makes predictions for the entire population and subpopulations of interest. Model distillation aims at compressing knowledge from a complex model (teacher) into an understandable and interpretable model (student). In particular, the paper integrates model distillation with market segmentation to generate more insights by accounting for heterogeneity. Furthermore, the paper provides a comprehensive comparison of student models with the benchmark model (decision tree) and the teacher model (gradient boosting trees) to quantify the fidelity and accuracy of the students' interpretations.
△ Less
Submitted 30 October, 2019;
originally announced October 2019.
-
Predicting AC Optimal Power Flows: Combining Deep Learning and Lagrangian Dual Methods
Authors:
Ferdinando Fioretto,
Terrence W. K. Mak,
Pascal Van Hentenryck
Abstract:
The Optimal Power Flow (OPF) problem is a fundamental building block for the optimization of electrical power systems. It is nonlinear and nonconvex and computes the generator setpoints for power and voltage, given a set of load demands. It is often needed to be solved repeatedly under various conditions, either in real-time or in large-scale studies. This need is further exacerbated by the increa…
▽ More
The Optimal Power Flow (OPF) problem is a fundamental building block for the optimization of electrical power systems. It is nonlinear and nonconvex and computes the generator setpoints for power and voltage, given a set of load demands. It is often needed to be solved repeatedly under various conditions, either in real-time or in large-scale studies. This need is further exacerbated by the increasing stochasticity of power systems due to renewable energy sources in front and behind the meter. To address these challenges, this paper presents a deep learning approach to the OPF. The learning model exploits the information available in the prior states of the system (which is commonly available in practical applications), as well as a dual Lagrangian method to satisfy the physical and engineering constraints present in the OPF. The proposed model is evaluated on a large collection of realistic power systems. The experimental results show that its predictions are highly accurate with average errors as low as 0.2%. Additionally, the proposed approach is shown to improve the accuracy of widely adopted OPF linear DC approximation by at least two orders of magnitude.
△ Less
Submitted 3 December, 2019; v1 submitted 18 September, 2019;
originally announced September 2019.
-
Modeling Heterogeneity in Mode-Switching Behavior Under a Mobility-on-Demand Transit System: An Interpretable Machine Learning Approach
Authors:
Xilei Zhao,
Xiang Yan,
Pascal Van Hentenryck
Abstract:
Recent years have witnessed an increased focus on interpretability and the use of machine learning to inform policy analysis and decision making. This paper applies machine learning to examine travel behavior and, in particular, on modeling changes in travel modes when individuals are presented with a novel (on-demand) mobility option. It addresses the following question: Can machine learning be a…
▽ More
Recent years have witnessed an increased focus on interpretability and the use of machine learning to inform policy analysis and decision making. This paper applies machine learning to examine travel behavior and, in particular, on modeling changes in travel modes when individuals are presented with a novel (on-demand) mobility option. It addresses the following question: Can machine learning be applied to model individual taste heterogeneity (preference heterogeneity for travel modes and response heterogeneity to travel attributes) in travel mode choice? This paper first develops a high-accuracy classifier to predict mode-switching behavior under a hypothetical Mobility-on-Demand Transit system (i.e., stated-preference data), which represents the case study underlying this research. We show that this classifier naturally captures individual heterogeneity available in the data. Moreover, the paper derives insights on heterogeneous switching behaviors through the generation of marginal effects and elasticities by current travel mode, partial dependence plots, and individual conditional expectation plots. The paper also proposes two new model-agnostic interpretation tools for machine learning, i.e., conditional partial dependence plots and conditional individual partial dependence plots, specifically designed to examine response heterogeneity. The results on the case study show that the machine-learning classifier, together with model-agnostic interpretation tools, provides valuable insights on travel mode switching behavior for different individuals and population segments. For example, the existing drivers are more sensitive to additional pickups than people using other travel modes, and current transit users are generally willing to share rides but reluctant to take any additional transfers.
△ Less
Submitted 7 February, 2019;
originally announced February 2019.
-
Mobility-on-demand versus fixed-route transit systems: an evaluation of traveler preferences in low-income communities
Authors:
Xiang Yan,
Xilei Zhao,
Yuan Han,
Pascal Van Hentenryck,
Tawanna Dillahunt
Abstract:
Emerging transportation technologies, such as ride-hailing and autonomous vehicles, are disrupting the transportation sector and transforming public transit. Some transit observers envision future public transit to be integrated transit systems with fixed-route services running along major corridors and on-demand ridesharing services covering lower-density areas. A switch from a conventional fixed…
▽ More
Emerging transportation technologies, such as ride-hailing and autonomous vehicles, are disrupting the transportation sector and transforming public transit. Some transit observers envision future public transit to be integrated transit systems with fixed-route services running along major corridors and on-demand ridesharing services covering lower-density areas. A switch from a conventional fixed-route service model to this kind of integrated mobility-on-demand transit system, however, may elicit varied responses from local residents. This paper evaluates traveler preferences for a proposed integrated mobility-on-demand transit system versus the existing fixed-route system, with a particular focus on disadvantaged travelers. We conducted a survey in two low-resource communities in the United States, namely, Detroit and Ypsilanti, Michigan. A majority of survey respondents preferred a mobility-on-demand transit system over a fixed-route one. Based on ordered logit model outputs, we found a stronger preference for mobility-on-demand transit among males, college graduates, individuals who have never heard of or used ride-hailing before, and individuals who currently receive inferior transit services. By contrast, preferences varied little by age, income, race, or disability status. The most important benefit of a mobility-on-demand transit system perceived by the survey respondents is enhanced transit accessibility to different destinations, whereas their major concerns include the need to actively request rides, possible transit-fare increases, and potential technological failures. Addressing the concerns of female riders and accommodating the needs of less technology-proficient individuals should be major priorities for transit agencies that are considering mobility-on-demand initiatives.
△ Less
Submitted 22 January, 2019;
originally announced January 2019.
-
Modeling Stated Preference for Mobility-on-Demand Transit: A Comparison of Machine Learning and Logit Models
Authors:
Xilei Zhao,
Xiang Yan,
Alan Yu,
Pascal Van Hentenryck
Abstract:
Logit models are usually applied when studying individual travel behavior, i.e., to predict travel mode choice and to gain behavioral insights on traveler preferences. Recently, some studies have applied machine learning to model travel mode choice and reported higher out-of-sample predictive accuracy than traditional logit models (e.g., multinomial logit). However, little research focuses on comp…
▽ More
Logit models are usually applied when studying individual travel behavior, i.e., to predict travel mode choice and to gain behavioral insights on traveler preferences. Recently, some studies have applied machine learning to model travel mode choice and reported higher out-of-sample predictive accuracy than traditional logit models (e.g., multinomial logit). However, little research focuses on comparing the interpretability of machine learning with logit models. In other words, how to draw behavioral insights from the high-performance "black-box" machine-learning models remains largely unsolved in the field of travel behavior modeling.
This paper aims at providing a comprehensive comparison between the two approaches by examining the key similarities and differences in model development, evaluation, and behavioral interpretation between logit and machine-learning models for travel mode choice modeling. To complement the theoretical discussions, the paper also empirically evaluates the two approaches on the stated-preference survey data for a new type of transit system integrating high-frequency fixed-route services and ridesourcing. The results show that machine learning can produce significantly higher predictive accuracy than logit models. Moreover, machine learning and logit models largely agree on many aspects of behavioral interpretations. In addition, machine learning can automatically capture the nonlinear relationship between the input features and choice outcomes. The paper concludes that there is great potential in merging ideas from machine learning and conventional statistical methods to develop refined models for travel behavior research and suggests some new research directions.
△ Less
Submitted 1 April, 2019; v1 submitted 3 November, 2018;
originally announced November 2018.
-
Graphical Models and Belief Propagation-hierarchy for Optimal Physics-Constrained Network Flows
Authors:
Michael Chertkov,
Sidhant Misra,
Marc Vuffray,
Dvijotham Krishnamurty,
Pascal Van Hentenryck
Abstract:
In this manuscript we review new ideas and first results on application of the Graphical Models approach, originated from Statistical Physics, Information Theory, Computer Science and Machine Learning, to optimization problems of network flow type with additional constraints related to the physics of the flow. We illustrate the general concepts on a number of enabling examples from power system an…
▽ More
In this manuscript we review new ideas and first results on application of the Graphical Models approach, originated from Statistical Physics, Information Theory, Computer Science and Machine Learning, to optimization problems of network flow type with additional constraints related to the physics of the flow. We illustrate the general concepts on a number of enabling examples from power system and natural gas transmission (continental scale) and distribution (district scale) systems.
△ Less
Submitted 7 February, 2017;
originally announced February 2017.