-
Grid Monitoring and Protection with Continuous Point-on-Wave Measurements and Generative AI
Authors:
Lang Tong,
Xinyi Wang,
Qing Zhao
Abstract:
Purpose This article presents a case for a next-generation grid monitoring and control system, leveraging recent advances in generative artificial intelligence (AI), machine learning, and statistical inference. Advancing beyond earlier generations of wide-area monitoring systems built upon supervisory control and data acquisition (SCADA) and synchrophasor technologies, we argue for a monitoring an…
▽ More
Purpose This article presents a case for a next-generation grid monitoring and control system, leveraging recent advances in generative artificial intelligence (AI), machine learning, and statistical inference. Advancing beyond earlier generations of wide-area monitoring systems built upon supervisory control and data acquisition (SCADA) and synchrophasor technologies, we argue for a monitoring and control framework based on the streaming of continuous point-on-wave (CPOW) measurements with AI-powered data compression and fault detection.
Methods and Results: The architecture of the proposed design originates from the Wiener-Kallianpur innovation representation of a random process that transforms causally a stationary random process into an innovation sequence with independent and identically distributed random variables. This work presents a generative AI approach that (i) learns an innovation autoencoder that extracts innovation sequence from CPOW time series, (ii) compresses the CPOW streaming data with innovation autoencoder and subband coding, and (iii) detects unknown faults and novel trends via nonparametric sequential hypothesis testing.
Conclusion: This work argues that conventional monitoring using SCADA and phasor measurement unit (PMU) technologies is ill-suited for a future grid with deep penetration of inverter-based renewable generations and distributed energy resources. A monitoring system based on CPOW data streaming and AI data analytics should be the basic building blocks for situational awareness of a highly dynamic future grid.
△ Less
Submitted 11 March, 2024;
originally announced March 2024.
-
Generative Probabilistic Time Series Forecasting and Applications in Grid Operations
Authors:
Xinyi Wang,
Lang Tong,
Qing Zhao
Abstract:
Generative probabilistic forecasting produces future time series samples according to the conditional probability distribution given past time series observations. Such techniques are essential in risk-based decision-making and planning under uncertainty with broad applications in grid operations, including electricity price forecasting, risk-based economic dispatch, and stochastic optimizations.…
▽ More
Generative probabilistic forecasting produces future time series samples according to the conditional probability distribution given past time series observations. Such techniques are essential in risk-based decision-making and planning under uncertainty with broad applications in grid operations, including electricity price forecasting, risk-based economic dispatch, and stochastic optimizations. Inspired by Wiener and Kallianpur's innovation representation, we propose a weak innovation autoencoder architecture and a learning algorithm to extract independent and identically distributed innovation sequences from nonparametric stationary time series. We show that the weak innovation sequence is Bayesian sufficient, which makes the proposed weak innovation autoencoder a canonical architecture for generative probabilistic forecasting. The proposed technique is applied to forecasting highly volatile real-time electricity prices, demonstrating superior performance across multiple forecasting measures over leading probabilistic and point forecasting techniques.
△ Less
Submitted 21 February, 2024;
originally announced February 2024.
-
Multinomial Link Models
Authors:
Tianmeng Wang,
Li** Tong,
Jie Yang
Abstract:
We propose a unified multinomial link model for analyzing categorical responses. It not only covers the existing multinomial logistic models and their extensions as special cases, but also includes new models that can incorporate the observations with NA or Unknown responses in the data analysis. We provide explicit formulae and detailed algorithms for finding the maximum likelihood estimates of t…
▽ More
We propose a unified multinomial link model for analyzing categorical responses. It not only covers the existing multinomial logistic models and their extensions as special cases, but also includes new models that can incorporate the observations with NA or Unknown responses in the data analysis. We provide explicit formulae and detailed algorithms for finding the maximum likelihood estimates of the model parameters and computing the Fisher information matrix. Our algorithms solve the infeasibility issue of existing statistical software on estimating parameters of cumulative link models. The applications to real datasets show that the new models can fit the data significantly better, and the corresponding data analysis may correct the misleading conclusions due to missing responses.
△ Less
Submitted 18 June, 2024; v1 submitted 26 December, 2023;
originally announced December 2023.
-
Novelty Detection in Time Series via Weak Innovations Representation: A Deep Learning Approach
Authors:
Xinyi Wang,
Mei-jen Lee,
Qing Zhao,
Lang Tong
Abstract:
We consider novelty detection in time series with unknown and nonparametric probability structures. A deep learning approach is proposed to causally extract an innovations sequence consisting of novelty samples statistically independent of all past samples of the time series. A novelty detection algorithm is developed for the online detection of novel changes in the probability structure in the in…
▽ More
We consider novelty detection in time series with unknown and nonparametric probability structures. A deep learning approach is proposed to causally extract an innovations sequence consisting of novelty samples statistically independent of all past samples of the time series. A novelty detection algorithm is developed for the online detection of novel changes in the probability structure in the innovations sequence. A minimax optimality under a Bayes risk measure is established for the proposed novelty detection method, and its robustness and efficacy are demonstrated in experiments using real and synthetic datasets.
△ Less
Submitted 24 October, 2022;
originally announced October 2022.
-
Constrained D-optimal Design for Paid Research Study
Authors:
Yifei Huang,
Li** Tong,
Jie Yang
Abstract:
We consider constrained sampling problems in paid research studies or clinical trials. When qualified volunteers are more than the budget allowed, we recommend a D-optimal sampling strategy based on the optimal design theory and develop a constrained lift-one algorithm to find the optimal allocation. Unlike the literature which mainly deals with linear models, our solution solves the constrained s…
▽ More
We consider constrained sampling problems in paid research studies or clinical trials. When qualified volunteers are more than the budget allowed, we recommend a D-optimal sampling strategy based on the optimal design theory and develop a constrained lift-one algorithm to find the optimal allocation. Unlike the literature which mainly deals with linear models, our solution solves the constrained sampling problem under fairly general statistical models, including generalized linear models and multinomial logistic models, and with more general constraints. We justify theoretically the optimality of our sampling strategy and show by simulation studies and real-world examples the advantages over simple random sampling and proportionally stratified sampling strategies.
△ Less
Submitted 24 May, 2024; v1 submitted 11 July, 2022;
originally announced July 2022.
-
Contrasting random and learned features in deep Bayesian linear regression
Authors:
Jacob A. Zavatone-Veth,
William L. Tong,
Cengiz Pehlevan
Abstract:
Understanding how feature learning affects generalization is among the foremost goals of modern deep learning theory. Here, we study how the ability to learn representations affects the generalization performance of a simple class of models: deep Bayesian linear neural networks trained on unstructured Gaussian data. By comparing deep random feature models to deep networks in which all layers are t…
▽ More
Understanding how feature learning affects generalization is among the foremost goals of modern deep learning theory. Here, we study how the ability to learn representations affects the generalization performance of a simple class of models: deep Bayesian linear neural networks trained on unstructured Gaussian data. By comparing deep random feature models to deep networks in which all layers are trained, we provide a detailed characterization of the interplay between width, depth, data density, and prior mismatch. We show that both models display sample-wise double-descent behavior in the presence of label noise. Random feature models can also display model-wise double-descent if there are narrow bottleneck layers, while deep networks do not show these divergences. Random feature models can have particular widths that are optimal for generalization at a given data density, while making neural networks as wide or as narrow as possible is always optimal. Moreover, we show that the leading-order correction to the kernel-limit learning curve cannot distinguish between random feature models and deep networks in which all layers are trained. Taken together, our findings begin to elucidate how architectural details affect generalization performance in this simple class of deep regression models.
△ Less
Submitted 16 June, 2022; v1 submitted 1 March, 2022;
originally announced March 2022.
-
Innovations Autoencoder and its Application in One-class Anomalous Sequence Detection
Authors:
Xinyi Wang,
Lang Tong
Abstract:
An innovations sequence of a time series is a sequence of independent and identically distributed random variables with which the original time series has a causal representation. The innovation at a time is statistically independent of the history of the time series. As such, it represents the new information contained at present but not in the past. Because of its simple probability structure, a…
▽ More
An innovations sequence of a time series is a sequence of independent and identically distributed random variables with which the original time series has a causal representation. The innovation at a time is statistically independent of the history of the time series. As such, it represents the new information contained at present but not in the past. Because of its simple probability structure, an innovations sequence is the most efficient signature of the original. Unlike the principle or independent component analysis representations, an innovations sequence preserves not only the complete statistical properties but also the temporal order of the original time series. An long-standing open problem is to find a computationally tractable way to extract an innovations sequence of non-Gaussian processes. This paper presents a deep learning approach, referred to as Innovations Autoencoder (IAE), that extracts innovations sequences using a causal convolutional neural network. An application of IAE to the one-class anomalous sequence detection problem with unknown anomaly and anomaly-free models is also presented.
△ Less
Submitted 15 July, 2021; v1 submitted 23 June, 2021;
originally announced June 2021.
-
Joint Dynamic Models and Statistical Inference for Recurrent Competing Risks, Longitudinal Marker, and Health Status
Authors:
Lili Tong,
Piaomu Liu,
Edsel Pena
Abstract:
Consider a subject or unit in a longitudinal biomedical, public health, engineering, economic or social science study which is being monitored over a possibly random duration. Over time this unit experiences recurrent events of several types and a longitudinal marker transitions over a discrete state-space. In addition, its "health" status also transitions over a discrete state-space with at least…
▽ More
Consider a subject or unit in a longitudinal biomedical, public health, engineering, economic or social science study which is being monitored over a possibly random duration. Over time this unit experiences recurrent events of several types and a longitudinal marker transitions over a discrete state-space. In addition, its "health" status also transitions over a discrete state-space with at least one absorbing state. A vector of covariates will also be associated with this unit. Of major interest for this unit is the time-to-absorption of its health status process, which could be viewed as the unit's lifetime. Aside from being affected by its covariate vector, there could be associations among the recurrent competing risks processes, the longitudinal marker process, and the health status process in the sense that the time-evolution of each process is associated with the other processes. To obtain more realistic models and enhance inferential performance, a joint dynamic stochastic model for these components is proposed and statistical inference methods are developed. This joint model, formulated via counting processes and continuous-time Markov chains, has the potential of facilitating `personalized' interventions. This could enhance, for example, the implementation and adoption of precision medicine in medical settings. Semi-parametric and likelihood-based inferential methods for the model parameters are developed when a sample of these units is available. Finite-sample and asymptotic properties of estimators of model parameters, both finite- and infinite-dimensional, are obtained analytically or through simulation studies. The developed procedures are illustrated using a real data set.
△ Less
Submitted 29 January, 2022; v1 submitted 23 March, 2021;
originally announced March 2021.
-
Infer-AVAE: An Attribute Inference Model Based on Adversarial Variational Autoencoder
Authors:
Yadong Zhou,
Zhihao Ding,
Xiaoming Liu,
Chao Shen,
Lingling Tong,
Xiaohong Guan
Abstract:
User attributes, such as gender and education, face severe incompleteness in social networks. In order to make this kind of valuable data usable for downstream tasks like user profiling and personalized recommendation, attribute inference aims to infer users' missing attribute labels based on observed data. Recently, variational autoencoder (VAE), an end-to-end deep generative model, has shown pro…
▽ More
User attributes, such as gender and education, face severe incompleteness in social networks. In order to make this kind of valuable data usable for downstream tasks like user profiling and personalized recommendation, attribute inference aims to infer users' missing attribute labels based on observed data. Recently, variational autoencoder (VAE), an end-to-end deep generative model, has shown promising performance by handling the problem in a semi-supervised way. However, VAEs can easily suffer from over-fitting and over-smoothing when applied to attribute inference. To be specific, VAE implemented with multi-layer perceptron (MLP) can only reconstruct input data but fail in inferring missing parts. While using the trending graph neural networks (GNNs) as encoder has the problem that GNNs aggregate redundant information from neighborhood and generate indistinguishable user representations, which is known as over-smoothing. In this paper, we propose an attribute \textbf{Infer}ence model based on \textbf{A}dversarial \textbf{VAE} (Infer-AVAE) to cope with these issues. Specifically, to overcome over-smoothing, Infer-AVAE unifies MLP and GNNs in encoder to learn positive and negative latent representations respectively. Meanwhile, an adversarial network is trained to distinguish the two representations and GNNs are trained to aggregate less noise for more robust representations through adversarial training. Finally, to relieve over-fitting, mutual information constraint is introduced as a regularizer for decoder, so that it can make better use of auxiliary information in representations and generate outputs not limited by observations. We evaluate our model on 4 real-world social network datasets, experimental results demonstrate that our model averagely outperforms baselines by 7.0$\%$ in accuracy.
△ Less
Submitted 29 May, 2021; v1 submitted 29 December, 2020;
originally announced December 2020.
-
Towards Robustness against Unsuspicious Adversarial Examples
Authors:
Liang Tong,
Minzhe Guo,
Atul Prakash,
Yevgeniy Vorobeychik
Abstract:
Despite the remarkable success of deep neural networks, significant concerns have emerged about their robustness to adversarial perturbations to inputs. While most attacks aim to ensure that these are imperceptible, physical perturbation attacks typically aim for being unsuspicious, even if perceptible. However, there is no universal notion of what it means for adversarial examples to be unsuspici…
▽ More
Despite the remarkable success of deep neural networks, significant concerns have emerged about their robustness to adversarial perturbations to inputs. While most attacks aim to ensure that these are imperceptible, physical perturbation attacks typically aim for being unsuspicious, even if perceptible. However, there is no universal notion of what it means for adversarial examples to be unsuspicious. We propose an approach for modeling suspiciousness by leveraging cognitive salience. Specifically, we split an image into foreground (salient region) and background (the rest), and allow significantly larger adversarial perturbations in the background, while ensuring that cognitive salience of background remains low. We describe how to compute the resulting non-salience-preserving dual-perturbation attacks on classifiers. We then experimentally demonstrate that our attacks indeed do not significantly change perceptual salience of the background, but are highly effective against classifiers robust to conventional attacks. Furthermore, we show that adversarial training with dual-perturbation attacks yields classifiers that are more robust to these than state-of-the-art robust learning approaches, and comparable in terms of robustness to conventional attacks.
△ Less
Submitted 8 October, 2020; v1 submitted 8 May, 2020;
originally announced May 2020.
-
Universal Data Anomaly Detection via Inverse Generative Adversary Network
Authors:
Kursat Rasim Mestav,
Lang Tong
Abstract:
The problem of detecting data anomaly is considered. Under the null hypothesis that models anomaly-free data, measurements are assumed to be from an unknown distribution with some authenticated historical samples. Under the composite alternative hypothesis, measurements are from an unknown distribution positive distance away from the distribution under the null hypothesis. No training data are ava…
▽ More
The problem of detecting data anomaly is considered. Under the null hypothesis that models anomaly-free data, measurements are assumed to be from an unknown distribution with some authenticated historical samples. Under the composite alternative hypothesis, measurements are from an unknown distribution positive distance away from the distribution under the null hypothesis. No training data are available for the distribution of anomaly data. A semi-supervised deep learning technique based on an inverse generative adversary network is proposed.
△ Less
Submitted 23 January, 2020;
originally announced January 2020.
-
Defending Against Physically Realizable Attacks on Image Classification
Authors:
Tong Wu,
Liang Tong,
Yevgeniy Vorobeychik
Abstract:
We study the problem of defending deep neural network approaches for image classification from physically realizable attacks. First, we demonstrate that the two most scalable and effective methods for learning robust models, adversarial training with PGD attacks and randomized smoothing, exhibit very limited effectiveness against three of the highest profile physical attacks. Next, we propose a ne…
▽ More
We study the problem of defending deep neural network approaches for image classification from physically realizable attacks. First, we demonstrate that the two most scalable and effective methods for learning robust models, adversarial training with PGD attacks and randomized smoothing, exhibit very limited effectiveness against three of the highest profile physical attacks. Next, we propose a new abstract adversarial model, rectangular occlusion attacks, in which an adversary places a small adversarially crafted rectangle in an image, and develop two approaches for efficiently computing the resulting adversarial examples. Finally, we demonstrate that adversarial training using our new attack yields image classification models that exhibit high robustness against the physically realizable attacks we study, offering the first effective generic defense against such attacks.
△ Less
Submitted 14 February, 2020; v1 submitted 20 September, 2019;
originally announced September 2019.
-
Cost-sensitive Boosting Pruning Trees for depression detection on Twitter
Authors:
Lei Tong,
Zhihua Liu,
Zheheng Jiang,
Feixiang Zhou,
Long Chen,
Jialin Lyu,
Xiangrong Zhang,
Qianni Zhang,
Abdul Sadka Senior,
Yinhai Wang,
Ling Li,
Huiyu Zhou
Abstract:
Depression is one of the most common mental health disorders, and a large number of depressed people commit suicide each year. Potential depression sufferers usually do not consult psychological doctors because they feel ashamed or are unaware of any depression, which may result in severe delay of diagnosis and treatment. In the meantime, evidence shows that social media data provides valuable clu…
▽ More
Depression is one of the most common mental health disorders, and a large number of depressed people commit suicide each year. Potential depression sufferers usually do not consult psychological doctors because they feel ashamed or are unaware of any depression, which may result in severe delay of diagnosis and treatment. In the meantime, evidence shows that social media data provides valuable clues about physical and mental health conditions. In this paper, we argue that it is feasible to identify depression at an early stage by mining online social behaviours. Our approach, which is innovative to the practice of depression detection, does not rely on the extraction of numerous or complicated features to achieve accurate depression detection. Instead, we propose a novel classifier, namely, Cost-sensitive Boosting Pruning Trees (CBPT), which demonstrates a strong classification ability on two publicly accessible Twitter depression detection datasets. To comprehensively evaluate the classification capability of the CBPT, we use additional three datasets from the UCI machine learning repository and the CBPT obtains appealing classification results against several state of the arts boosting algorithms. Finally, we comprehensively explore the influence factors of model prediction, and the results manifest that our proposed framework is promising for identifying Twitter users with depression.
△ Less
Submitted 21 January, 2022; v1 submitted 2 June, 2019;
originally announced June 2019.
-
Predicting Urban Dispersal Events: A Two-Stage Framework through Deep Survival Analysis on Mobility Data
Authors:
Amin Vahedian,
Xun Zhou,
Ling Tong,
W. Nick Street,
Yanhua Li
Abstract:
Urban dispersal events are processes where an unusually large number of people leave the same area in a short period. Early prediction of dispersal events is important in mitigating congestion and safety risks and making better dispatching decisions for taxi and ride-sharing fleets. Existing work mostly focuses on predicting taxi demand in the near future by learning patterns from historical data.…
▽ More
Urban dispersal events are processes where an unusually large number of people leave the same area in a short period. Early prediction of dispersal events is important in mitigating congestion and safety risks and making better dispatching decisions for taxi and ride-sharing fleets. Existing work mostly focuses on predicting taxi demand in the near future by learning patterns from historical data. However, they fail in case of abnormality because dispersal events with abnormally high demand are non-repetitive and violate common assumptions such as smoothness in demand change over time. Instead, in this paper we argue that dispersal events follow a complex pattern of trips and other related features in the past, which can be used to predict such events. Therefore, we formulate the dispersal event prediction problem as a survival analysis problem. We propose a two-stage framework (DILSA), where a deep learning model combined with survival analysis is developed to predict the probability of a dispersal event and its demand volume. We conduct extensive case studies and experiments on the NYC Yellow taxi dataset from 2014-2016. Results show that DILSA can predict events in the next 5 hours with F1-score of 0.7 and with average time error of 18 minutes. It is orders of magnitude better than the state-ofthe-art deep learning approaches for taxi demand prediction.
△ Less
Submitted 11 July, 2019; v1 submitted 3 May, 2019;
originally announced May 2019.
-
Bayesian State Estimation for Unobservable Distribution Systems via Deep Learning
Authors:
Kursat Rasim Mestav,
Jaime Luengo-Rozas,
Lang Tong
Abstract:
The problem of state estimation for unobservable distribution systems is considered. A deep learning approach to Bayesian state estimation is proposed for real-time applications. The proposed technique consists of distribution learning of stochastic power injection, a Monte Carlo technique for the training of a deep neural network for state estimation, and a Bayesian bad-data detection and filteri…
▽ More
The problem of state estimation for unobservable distribution systems is considered. A deep learning approach to Bayesian state estimation is proposed for real-time applications. The proposed technique consists of distribution learning of stochastic power injection, a Monte Carlo technique for the training of a deep neural network for state estimation, and a Bayesian bad-data detection and filtering algorithm. Structural characteristics of the deep neural networks are investigated. Simulations illustrate the accuracy of Bayesian state estimation for unobservable systems and demonstrate the benefit of employing a deep neural network. Numerical results show the robustness of Bayesian state estimation against modeling and estimation errors and the presence of bad and missing data. Comparing with pseudo-measurement techniques, direct Bayesian state estimation via deep learning neural network outperforms existing benchmarks.
△ Less
Submitted 24 February, 2019; v1 submitted 6 November, 2018;
originally announced November 2018.
-
IVUS-Net: An Intravascular Ultrasound Segmentation Network
Authors:
Ji Yang,
Lin Tong,
Mehdi Faraji,
Anup Basu
Abstract:
IntraVascular UltraSound (IVUS) is one of the most effective imaging modalities that provides assistance to experts in order to diagnose and treat cardiovascular diseases. We address a central problem in IVUS image analysis with Fully Convolutional Network (FCN): automatically delineate the lumen and media-adventitia borders in IVUS images, which is crucial to shorten the diagnosis process or bene…
▽ More
IntraVascular UltraSound (IVUS) is one of the most effective imaging modalities that provides assistance to experts in order to diagnose and treat cardiovascular diseases. We address a central problem in IVUS image analysis with Fully Convolutional Network (FCN): automatically delineate the lumen and media-adventitia borders in IVUS images, which is crucial to shorten the diagnosis process or benefits a faster and more accurate 3D reconstruction of the artery. Particularly, we propose an FCN architecture, called IVUS-Net, followed by a post-processing contour extraction step, in order to automatically segments the interior (lumen) and exterior (media-adventitia) regions of the human arteries. We evaluated our IVUS-Net on the test set of a standard publicly available dataset containing 326 IVUS B-mode images with two measurements, namely Jaccard Measure (JM) and Hausdorff Distances (HD). The evaluation result shows that IVUS-Net outperforms the state-of-the-art lumen and media segmentation methods by 4% to 20% in terms of HD distance. IVUS-Net performs well on images in the test set that contain a significant amount of major artifacts such as bifurcations, shadows, and side branches that are not common in the training set. Furthermore, using a modern GPU, IVUS-Net segments each IVUS frame only in 0.15 seconds. The proposed work, to the best of our knowledge, is the first deep learning based method for segmentation of both the lumen and the media vessel walls in 20 MHz IVUS B-mode images that achieves the best results without any manual intervention. Code is available at https://github.com/Kulbear/ivus-segmentation-icsm2018
△ Less
Submitted 14 June, 2018; v1 submitted 10 June, 2018;
originally announced June 2018.
-
Adversarial Regression with Multiple Learners
Authors:
Liang Tong,
Sixie Yu,
Scott Alfeld,
Yevgeniy Vorobeychik
Abstract:
Despite the considerable success enjoyed by machine learning techniques in practice, numerous studies demonstrated that many approaches are vulnerable to attacks. An important class of such attacks involves adversaries changing features at test time to cause incorrect predictions. Previous investigations of this problem pit a single learner against an adversary. However, in many situations an adve…
▽ More
Despite the considerable success enjoyed by machine learning techniques in practice, numerous studies demonstrated that many approaches are vulnerable to attacks. An important class of such attacks involves adversaries changing features at test time to cause incorrect predictions. Previous investigations of this problem pit a single learner against an adversary. However, in many situations an adversary's decision is aimed at a collection of learners, rather than specifically targeted at each independently. We study the problem of adversarial linear regression with multiple learners. We approximate the resulting game by exhibiting an upper bound on learner loss functions, and show that the resulting game has a unique symmetric equilibrium. We present an algorithm for computing this equilibrium, and show through extensive experiments that equilibrium models are significantly more robust than conventional regularized linear regression.
△ Less
Submitted 6 June, 2018;
originally announced June 2018.
-
Multi-Area Interchange Scheduling under Uncertainty
Authors:
Yuting Ji,
Lang Tong
Abstract:
The problem of multi-area interchange scheduling under system uncertainty is considered. A new scheduling technique is proposed for a multi-proxy bus system based on stochastic optimization that captures uncertainty in renewable generation and stochastic load. In particular, the proposed algorithm iteratively optimizes the interface flows using a multidimensional demand and supply functions. Optim…
▽ More
The problem of multi-area interchange scheduling under system uncertainty is considered. A new scheduling technique is proposed for a multi-proxy bus system based on stochastic optimization that captures uncertainty in renewable generation and stochastic load. In particular, the proposed algorithm iteratively optimizes the interface flows using a multidimensional demand and supply functions. Optimality and convergence are guaranteed for both synchronous and asynchronous scheduling under nominal assumptions.
△ Less
Submitted 15 November, 2016;
originally announced November 2016.
-
Probabilistic Forecasting and Simulation of Electricity Markets via Online Dictionary Learning
Authors:
Weisi Deng,
Yuting Ji,
Lang Tong
Abstract:
The problem of probabilistic forecasting and online simulation of real-time electricity market with stochastic generation and demand is considered. By exploiting the parametric structure of the direct current optimal power flow, a new technique based on online dictionary learning (ODL) is proposed. The ODL approach incorporates real-time measurements and historical traces to produce forecasts of j…
▽ More
The problem of probabilistic forecasting and online simulation of real-time electricity market with stochastic generation and demand is considered. By exploiting the parametric structure of the direct current optimal power flow, a new technique based on online dictionary learning (ODL) is proposed. The ODL approach incorporates real-time measurements and historical traces to produce forecasts of joint and marginal probability distributions of future locational marginal prices, power flows, and dispatch levels, conditional on the system state at the time of forecasting. Compared with standard Monte Carlo simulation techniques, the ODL approach offers several orders of magnitude improvement in computation time, making it feasible for online forecasting of market operations. Numerical simulations on large and moderate size power systems illustrate its performance and complexity features and its potential as a tool for system operators.
△ Less
Submitted 24 June, 2016;
originally announced June 2016.
-
Probabilistic Forecast of Real-Time LMP and Network Congestion
Authors:
Yuting Ji,
Robert J. Thomas,
Lang Tong
Abstract:
The short-term forecasting of real-time locational marginal price (LMP) and network congestion is considered from a system operator perspective. A new probabilistic forecasting technique is proposed based on a multiparametric programming formulation that partitions the uncertainty parameter space into critical regions from which the conditional probability distribution of the real-time LMP/congest…
▽ More
The short-term forecasting of real-time locational marginal price (LMP) and network congestion is considered from a system operator perspective. A new probabilistic forecasting technique is proposed based on a multiparametric programming formulation that partitions the uncertainty parameter space into critical regions from which the conditional probability distribution of the real-time LMP/congestion is obtained. The proposed method incorporates load/generation forecast, time varying operation constraints, and contingency models. By shifting the computation cost associated with multiparametric programs offline, the online computation cost is significantly reduced. An online simulation technique by generating critical regions dynamically is also proposed, which results in several orders of magnitude improvement in the computational cost over standard Monte Carlo methods.
△ Less
Submitted 24 June, 2016; v1 submitted 20 March, 2015;
originally announced March 2015.
-
PMU based Detection of Imbalance in Three-Phase Power Systems
Authors:
Tirza Routtenberg,
Yao Xie,
Rebecca M. Willett,
Lang Tong
Abstract:
The problem of imbalance detection in a three-phase power system using a phasor measurement unit (PMU) is considered. A general model for the zero, positive, and negative sequences from a PMU measurement at off-nominal frequencies is presented and a hypothesis testing framework is formulated. The new formulation takes into account the fact that minor degree of imbalance in the system is acceptable…
▽ More
The problem of imbalance detection in a three-phase power system using a phasor measurement unit (PMU) is considered. A general model for the zero, positive, and negative sequences from a PMU measurement at off-nominal frequencies is presented and a hypothesis testing framework is formulated. The new formulation takes into account the fact that minor degree of imbalance in the system is acceptable and does not indicate subsequent interruptions, failures, or degradation of physical components. A generalized likelihood ratio test (GLRT) is developed and shown to be a function of the negative-sequence phasor estimator and the acceptable level of imbalances for nominal system operations. As a by-product to the proposed detection method, a constrained estimation of the positive and negative phasors and the frequency deviation is obtained for both balanced and unbalanced situations. The theoretical and numerical performance analyses show improved performance over benchmark techniques and robustness to the presence of additional harmonics.
△ Less
Submitted 19 September, 2014;
originally announced September 2014.
-
A Subspace Technique for The Identification of Switched Affine Models
Authors:
Liang Li,
Wei Dong,
Yindong Ji,
Lang Tong
Abstract:
The problem of estimating parameters of switched affine systems with noisy input-output observations is considered. The switched affine models is transformed into a switched linear one by removing its intersection subspace, which is estimated from observations. A subspace technique is proposed to exploit the observations' permutation structure, which transforms the problem of associating observati…
▽ More
The problem of estimating parameters of switched affine systems with noisy input-output observations is considered. The switched affine models is transformed into a switched linear one by removing its intersection subspace, which is estimated from observations. A subspace technique is proposed to exploit the observations' permutation structure, which transforms the problem of associating observations with subsystems into one of de-permutating a block diagonal matrix, referred as adjacency matrix. Then a normalized spectral clustering algorithm is presented to recover the block structure of adjacency matrix, from which each observation is related to a particular subsystem. With the labelled observations, parameters of the submodel are estimated via the total least squares (TLS) estimator. The proposed technique is applicable to switched affine systems with arbitrarily shaped domain partitions, and it offers significantly improved performance and lowered computation complexity than existing techniques.
△ Less
Submitted 13 September, 2013;
originally announced September 2013.
-
Spectral Clustering on Subspace for Parameter Estimation of Jump Linear Models
Authors:
Liang Li,
Wei Dong,
Yindong Ji,
Lang Tong
Abstract:
The problem of estimating parameters of a deterministic jump or piecewise linear model is considered. A subspace technique referred to as spectral clustering on subspace (SCS) algorithm is proposed to estimate a set of linear model parameters, the model input, and the set of switching epochs. The SCS algorithm exploits a block diagonal structure of the system input subspace, which partitions the o…
▽ More
The problem of estimating parameters of a deterministic jump or piecewise linear model is considered. A subspace technique referred to as spectral clustering on subspace (SCS) algorithm is proposed to estimate a set of linear model parameters, the model input, and the set of switching epochs. The SCS algorithm exploits a block diagonal structure of the system input subspace, which partitions the observation space into separate subspaces, each corresponding to one and only one linear submodel. A spectral clustering technique is used to label the noisy observations for each submodel, which generates estimates of switching time epoches. A total least squares technique is used to estimate model parameters and the model input. It is shown that, in the absence of observation noise, the SCS algorithm provides exact parameter identification. At high signal to noise ratios, SCS attains a clairvoyant Cramér-Rao bound computed by assuming the labeling of observation samples is perfect.
△ Less
Submitted 1 July, 2013;
originally announced July 2013.
-
Analytic Solutions for D-optimal Factorial Designs under Generalized Linear Models
Authors:
Li** Tong,
Hans W. Volkmer,
Jie Yang
Abstract:
We develop two analytic approaches to solve D-optimal approximate designs under generalized linear models. The first approach provides analytic D-optimal allocations for generalized linear models with two factors, which include as a special case the $2^2$ main-effects model considered by Yang, Mandal and Majumdar (2012). The second approach leads to explicit solutions for a class of generalized li…
▽ More
We develop two analytic approaches to solve D-optimal approximate designs under generalized linear models. The first approach provides analytic D-optimal allocations for generalized linear models with two factors, which include as a special case the $2^2$ main-effects model considered by Yang, Mandal and Majumdar (2012). The second approach leads to explicit solutions for a class of generalized linear models with more than two factors. With the aid of the analytic solutions, we provide a necessary and sufficient condition under which a D-optimal design with two quantitative factors could be constructed on the boundary points only. It bridges the gap between D-optimal factorial designs and D-optimal designs with continuous factors.
△ Less
Submitted 11 October, 2013; v1 submitted 22 June, 2013;
originally announced June 2013.
-
Maximum Likelihood Fusion of Stochastic Maps
Authors:
Brandon Jones,
Mark Campbell,
Lang Tong
Abstract:
The fusion of independently obtained stochastic maps by collaborating mobile agents is considered. The proposed approach includes two parts: matching of stochastic maps and maximum likelihood alignment. In particular, an affine invariant hypergraph is constructed for each stochastic map, and a bipartite matching via a linear program is used to establish landmark correspondence between stochastic m…
▽ More
The fusion of independently obtained stochastic maps by collaborating mobile agents is considered. The proposed approach includes two parts: matching of stochastic maps and maximum likelihood alignment. In particular, an affine invariant hypergraph is constructed for each stochastic map, and a bipartite matching via a linear program is used to establish landmark correspondence between stochastic maps. A maximum likelihood alignment procedure is proposed to determine rotation and translation between common landmarks in order to construct a global map within a common frame of reference. A main feature of the proposed approach is its scalability with respect to the number of landmarks: the matching step has polynomial complexity and the maximum likelihood alignment is obtained in closed form. Experimental validation of the proposed fusion approach is performed using the Victoria Park benchmark dataset.
△ Less
Submitted 25 March, 2013;
originally announced March 2013.
-
Efficient Calculation of P-value and Power for Quadratic Form Statistics in Multilocus Association Testing
Authors:
Li** Tong,
Jie Yang,
Richard S. Cooper
Abstract:
We address the asymptotic and approximate distributions of a large class of test statistics with quadratic forms used in association studies. The statistics of interest do not necessarily follow a chi-square distribution and take the general form $D=X^T A X$, where $X$ follows the multivariate normal distribution, and $A$ is a general similarity matrix which may or may not be positive semi-defin…
▽ More
We address the asymptotic and approximate distributions of a large class of test statistics with quadratic forms used in association studies. The statistics of interest do not necessarily follow a chi-square distribution and take the general form $D=X^T A X$, where $X$ follows the multivariate normal distribution, and $A$ is a general similarity matrix which may or may not be positive semi-definite. We show that $D$ can be written as a linear combination of independent chi-square random variables, whose distribution can be approximated by a chi-square or the difference of two chi-square distributions. In the setting of association testing, our methods are especially useful in two situations. First, for a genome screen, the required significance level is much smaller than 0.05 due to multiple comparisons, and estimation of p-values using permutation procedures is particularly challenging. An efficient and accurate estimation procedure would therefore be useful. Second, in a candidate gene study based on haplotypes when phase is unknown a computationally expensive method-the EM algorithm-is usually required to infer haplotype frequencies. Because the EM algorithm is needed for each permutation, this results in a substantial computational burden, which can be eliminated with our mathematical solution. We assess the practical utility of our method using extensive simulation studies based on two example statistics and apply it to find the sample size needed for a typical candidate gene association study when phase information is not available. Our method can be applied to any quadratic form statistic and therefore should be of general interest.
△ Less
Submitted 22 September, 2009;
originally announced September 2009.
-
A Large-Deviation Analysis of the Maximum-Likelihood Learning of Markov Tree Structures
Authors:
Vincent Y. F. Tan,
Animashree Anandkumar,
Lang Tong,
Alan S. Willsky
Abstract:
The problem of maximum-likelihood (ML) estimation of discrete tree-structured distributions is considered. Chow and Liu established that ML-estimation reduces to the construction of a maximum-weight spanning tree using the empirical mutual information quantities as the edge weights. Using the theory of large-deviations, we analyze the exponent associated with the error probability of the event tha…
▽ More
The problem of maximum-likelihood (ML) estimation of discrete tree-structured distributions is considered. Chow and Liu established that ML-estimation reduces to the construction of a maximum-weight spanning tree using the empirical mutual information quantities as the edge weights. Using the theory of large-deviations, we analyze the exponent associated with the error probability of the event that the ML-estimate of the Markov tree structure differs from the true tree structure, given a set of independently drawn samples. By exploiting the fact that the output of ML-estimation is a tree, we establish that the error exponent is equal to the exponential rate of decay of a single dominant crossover event. We prove that in this dominant crossover event, a non-neighbor node pair replaces a true edge of the distribution that is along the path of edges in the true tree graph connecting the nodes in the non-neighbor pair. Using ideas from Euclidean information theory, we then analyze the scenario of ML-estimation in the very noisy learning regime and show that the error exponent can be approximated as a ratio, which is interpreted as the signal-to-noise ratio (SNR) for learning tree distributions. We show via numerical experiments that in this regime, our SNR approximation is accurate.
△ Less
Submitted 21 November, 2010; v1 submitted 6 May, 2009;
originally announced May 2009.