Search | arXiv e-print repository

Grid Monitoring and Protection with Continuous Point-on-Wave Measurements and Generative AI

Authors: Lang Tong, Xinyi Wang, Qing Zhao

Abstract: Purpose This article presents a case for a next-generation grid monitoring and control system, leveraging recent advances in generative artificial intelligence (AI), machine learning, and statistical inference. Advancing beyond earlier generations of wide-area monitoring systems built upon supervisory control and data acquisition (SCADA) and synchrophasor technologies, we argue for a monitoring an… ▽ More Purpose This article presents a case for a next-generation grid monitoring and control system, leveraging recent advances in generative artificial intelligence (AI), machine learning, and statistical inference. Advancing beyond earlier generations of wide-area monitoring systems built upon supervisory control and data acquisition (SCADA) and synchrophasor technologies, we argue for a monitoring and control framework based on the streaming of continuous point-on-wave (CPOW) measurements with AI-powered data compression and fault detection. Methods and Results: The architecture of the proposed design originates from the Wiener-Kallianpur innovation representation of a random process that transforms causally a stationary random process into an innovation sequence with independent and identically distributed random variables. This work presents a generative AI approach that (i) learns an innovation autoencoder that extracts innovation sequence from CPOW time series, (ii) compresses the CPOW streaming data with innovation autoencoder and subband coding, and (iii) detects unknown faults and novel trends via nonparametric sequential hypothesis testing. Conclusion: This work argues that conventional monitoring using SCADA and phasor measurement unit (PMU) technologies is ill-suited for a future grid with deep penetration of inverter-based renewable generations and distributed energy resources. A monitoring system based on CPOW data streaming and AI data analytics should be the basic building blocks for situational awareness of a highly dynamic future grid. △ Less

Submitted 11 March, 2024; originally announced March 2024.

arXiv:2402.13870 [pdf, ps, other]

Generative Probabilistic Time Series Forecasting and Applications in Grid Operations

Authors: Xinyi Wang, Lang Tong, Qing Zhao

Abstract: Generative probabilistic forecasting produces future time series samples according to the conditional probability distribution given past time series observations. Such techniques are essential in risk-based decision-making and planning under uncertainty with broad applications in grid operations, including electricity price forecasting, risk-based economic dispatch, and stochastic optimizations.… ▽ More Generative probabilistic forecasting produces future time series samples according to the conditional probability distribution given past time series observations. Such techniques are essential in risk-based decision-making and planning under uncertainty with broad applications in grid operations, including electricity price forecasting, risk-based economic dispatch, and stochastic optimizations. Inspired by Wiener and Kallianpur's innovation representation, we propose a weak innovation autoencoder architecture and a learning algorithm to extract independent and identically distributed innovation sequences from nonparametric stationary time series. We show that the weak innovation sequence is Bayesian sufficient, which makes the proposed weak innovation autoencoder a canonical architecture for generative probabilistic forecasting. The proposed technique is applied to forecasting highly volatile real-time electricity prices, demonstrating superior performance across multiple forecasting measures over leading probabilistic and point forecasting techniques. △ Less

Submitted 21 February, 2024; originally announced February 2024.

Comments: Accepted at CISS 2024. arXiv admin note: text overlap with arXiv:2306.03782

arXiv:2312.16260 [pdf, other]

Multinomial Link Models

Authors: Tianmeng Wang, Li** Tong, Jie Yang

Abstract: We propose a unified multinomial link model for analyzing categorical responses. It not only covers the existing multinomial logistic models and their extensions as special cases, but also includes new models that can incorporate the observations with NA or Unknown responses in the data analysis. We provide explicit formulae and detailed algorithms for finding the maximum likelihood estimates of t… ▽ More We propose a unified multinomial link model for analyzing categorical responses. It not only covers the existing multinomial logistic models and their extensions as special cases, but also includes new models that can incorporate the observations with NA or Unknown responses in the data analysis. We provide explicit formulae and detailed algorithms for finding the maximum likelihood estimates of the model parameters and computing the Fisher information matrix. Our algorithms solve the infeasibility issue of existing statistical software on estimating parameters of cumulative link models. The applications to real datasets show that the new models can fit the data significantly better, and the corresponding data analysis may correct the misleading conclusions due to missing responses. △ Less

Submitted 18 June, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

Comments: 39 pages, 5 figures

arXiv:2210.13358 [pdf, ps, other]

Novelty Detection in Time Series via Weak Innovations Representation: A Deep Learning Approach

Authors: Xinyi Wang, Mei-jen Lee, Qing Zhao, Lang Tong

Abstract: We consider novelty detection in time series with unknown and nonparametric probability structures. A deep learning approach is proposed to causally extract an innovations sequence consisting of novelty samples statistically independent of all past samples of the time series. A novelty detection algorithm is developed for the online detection of novel changes in the probability structure in the in… ▽ More We consider novelty detection in time series with unknown and nonparametric probability structures. A deep learning approach is proposed to causally extract an innovations sequence consisting of novelty samples statistically independent of all past samples of the time series. A novelty detection algorithm is developed for the online detection of novel changes in the probability structure in the innovations sequence. A minimax optimality under a Bayes risk measure is established for the proposed novelty detection method, and its robustness and efficacy are demonstrated in experiments using real and synthetic datasets. △ Less

Submitted 24 October, 2022; originally announced October 2022.

arXiv:2207.05281 [pdf, ps, other]

Constrained D-optimal Design for Paid Research Study

Authors: Yifei Huang, Li** Tong, Jie Yang

Abstract: We consider constrained sampling problems in paid research studies or clinical trials. When qualified volunteers are more than the budget allowed, we recommend a D-optimal sampling strategy based on the optimal design theory and develop a constrained lift-one algorithm to find the optimal allocation. Unlike the literature which mainly deals with linear models, our solution solves the constrained s… ▽ More We consider constrained sampling problems in paid research studies or clinical trials. When qualified volunteers are more than the budget allowed, we recommend a D-optimal sampling strategy based on the optimal design theory and develop a constrained lift-one algorithm to find the optimal allocation. Unlike the literature which mainly deals with linear models, our solution solves the constrained sampling problem under fairly general statistical models, including generalized linear models and multinomial logistic models, and with more general constraints. We justify theoretically the optimality of our sampling strategy and show by simulation studies and real-world examples the advantages over simple random sampling and proportionally stratified sampling strategies. △ Less

Submitted 24 May, 2024; v1 submitted 11 July, 2022; originally announced July 2022.

Comments: 30 pages

arXiv:2203.00573 [pdf, other]

doi 10.1103/PhysRevE.105.064118

Contrasting random and learned features in deep Bayesian linear regression

Authors: Jacob A. Zavatone-Veth, William L. Tong, Cengiz Pehlevan

Abstract: Understanding how feature learning affects generalization is among the foremost goals of modern deep learning theory. Here, we study how the ability to learn representations affects the generalization performance of a simple class of models: deep Bayesian linear neural networks trained on unstructured Gaussian data. By comparing deep random feature models to deep networks in which all layers are t… ▽ More Understanding how feature learning affects generalization is among the foremost goals of modern deep learning theory. Here, we study how the ability to learn representations affects the generalization performance of a simple class of models: deep Bayesian linear neural networks trained on unstructured Gaussian data. By comparing deep random feature models to deep networks in which all layers are trained, we provide a detailed characterization of the interplay between width, depth, data density, and prior mismatch. We show that both models display sample-wise double-descent behavior in the presence of label noise. Random feature models can also display model-wise double-descent if there are narrow bottleneck layers, while deep networks do not show these divergences. Random feature models can have particular widths that are optimal for generalization at a given data density, while making neural networks as wide or as narrow as possible is always optimal. Moreover, we show that the leading-order correction to the kernel-limit learning curve cannot distinguish between random feature models and deep networks in which all layers are trained. Taken together, our findings begin to elucidate how architectural details affect generalization performance in this simple class of deep regression models. △ Less

Submitted 16 June, 2022; v1 submitted 1 March, 2022; originally announced March 2022.

Comments: 35 pages, 7 figures. v2: minor typos corrected and references added; published in PRE

Journal ref: Physical Review E 105, 064118 (2022)

arXiv:2106.12382 [pdf, ps, other]

Innovations Autoencoder and its Application in One-class Anomalous Sequence Detection

Authors: Xinyi Wang, Lang Tong

Abstract: An innovations sequence of a time series is a sequence of independent and identically distributed random variables with which the original time series has a causal representation. The innovation at a time is statistically independent of the history of the time series. As such, it represents the new information contained at present but not in the past. Because of its simple probability structure, a… ▽ More An innovations sequence of a time series is a sequence of independent and identically distributed random variables with which the original time series has a causal representation. The innovation at a time is statistically independent of the history of the time series. As such, it represents the new information contained at present but not in the past. Because of its simple probability structure, an innovations sequence is the most efficient signature of the original. Unlike the principle or independent component analysis representations, an innovations sequence preserves not only the complete statistical properties but also the temporal order of the original time series. An long-standing open problem is to find a computationally tractable way to extract an innovations sequence of non-Gaussian processes. This paper presents a deep learning approach, referred to as Innovations Autoencoder (IAE), that extracts innovations sequences using a causal convolutional neural network. An application of IAE to the one-class anomalous sequence detection problem with unknown anomaly and anomaly-free models is also presented. △ Less

Submitted 15 July, 2021; v1 submitted 23 June, 2021; originally announced June 2021.

Journal ref: The Journal of Machine Learning Research, Vol 23, Issue 1, pp. 2347-2373, 2022

arXiv:2103.12903 [pdf, other]

Joint Dynamic Models and Statistical Inference for Recurrent Competing Risks, Longitudinal Marker, and Health Status

Authors: Lili Tong, Piaomu Liu, Edsel Pena

Abstract: Consider a subject or unit in a longitudinal biomedical, public health, engineering, economic or social science study which is being monitored over a possibly random duration. Over time this unit experiences recurrent events of several types and a longitudinal marker transitions over a discrete state-space. In addition, its "health" status also transitions over a discrete state-space with at least… ▽ More Consider a subject or unit in a longitudinal biomedical, public health, engineering, economic or social science study which is being monitored over a possibly random duration. Over time this unit experiences recurrent events of several types and a longitudinal marker transitions over a discrete state-space. In addition, its "health" status also transitions over a discrete state-space with at least one absorbing state. A vector of covariates will also be associated with this unit. Of major interest for this unit is the time-to-absorption of its health status process, which could be viewed as the unit's lifetime. Aside from being affected by its covariate vector, there could be associations among the recurrent competing risks processes, the longitudinal marker process, and the health status process in the sense that the time-evolution of each process is associated with the other processes. To obtain more realistic models and enhance inferential performance, a joint dynamic stochastic model for these components is proposed and statistical inference methods are developed. This joint model, formulated via counting processes and continuous-time Markov chains, has the potential of facilitating `personalized' interventions. This could enhance, for example, the implementation and adoption of precision medicine in medical settings. Semi-parametric and likelihood-based inferential methods for the model parameters are developed when a sample of these units is available. Finite-sample and asymptotic properties of estimators of model parameters, both finite- and infinite-dimensional, are obtained analytically or through simulation studies. The developed procedures are illustrated using a real data set. △ Less

Submitted 29 January, 2022; v1 submitted 23 March, 2021; originally announced March 2021.

arXiv:2012.15005 [pdf, other]

Infer-AVAE: An Attribute Inference Model Based on Adversarial Variational Autoencoder

Authors: Yadong Zhou, Zhihao Ding, Xiaoming Liu, Chao Shen, Lingling Tong, Xiaohong Guan

Abstract: User attributes, such as gender and education, face severe incompleteness in social networks. In order to make this kind of valuable data usable for downstream tasks like user profiling and personalized recommendation, attribute inference aims to infer users' missing attribute labels based on observed data. Recently, variational autoencoder (VAE), an end-to-end deep generative model, has shown pro… ▽ More User attributes, such as gender and education, face severe incompleteness in social networks. In order to make this kind of valuable data usable for downstream tasks like user profiling and personalized recommendation, attribute inference aims to infer users' missing attribute labels based on observed data. Recently, variational autoencoder (VAE), an end-to-end deep generative model, has shown promising performance by handling the problem in a semi-supervised way. However, VAEs can easily suffer from over-fitting and over-smoothing when applied to attribute inference. To be specific, VAE implemented with multi-layer perceptron (MLP) can only reconstruct input data but fail in inferring missing parts. While using the trending graph neural networks (GNNs) as encoder has the problem that GNNs aggregate redundant information from neighborhood and generate indistinguishable user representations, which is known as over-smoothing. In this paper, we propose an attribute \textbf{Infer}ence model based on \textbf{A}dversarial \textbf{VAE} (Infer-AVAE) to cope with these issues. Specifically, to overcome over-smoothing, Infer-AVAE unifies MLP and GNNs in encoder to learn positive and negative latent representations respectively. Meanwhile, an adversarial network is trained to distinguish the two representations and GNNs are trained to aggregate less noise for more robust representations through adversarial training. Finally, to relieve over-fitting, mutual information constraint is introduced as a regularizer for decoder, so that it can make better use of auxiliary information in representations and generate outputs not limited by observations. We evaluate our model on 4 real-world social network datasets, experimental results demonstrate that our model averagely outperforms baselines by 7.0$\%$ in accuracy. △ Less

Submitted 29 May, 2021; v1 submitted 29 December, 2020; originally announced December 2020.

arXiv:2005.04272 [pdf, other]

Towards Robustness against Unsuspicious Adversarial Examples

Authors: Liang Tong, Minzhe Guo, Atul Prakash, Yevgeniy Vorobeychik

Abstract: Despite the remarkable success of deep neural networks, significant concerns have emerged about their robustness to adversarial perturbations to inputs. While most attacks aim to ensure that these are imperceptible, physical perturbation attacks typically aim for being unsuspicious, even if perceptible. However, there is no universal notion of what it means for adversarial examples to be unsuspici… ▽ More Despite the remarkable success of deep neural networks, significant concerns have emerged about their robustness to adversarial perturbations to inputs. While most attacks aim to ensure that these are imperceptible, physical perturbation attacks typically aim for being unsuspicious, even if perceptible. However, there is no universal notion of what it means for adversarial examples to be unsuspicious. We propose an approach for modeling suspiciousness by leveraging cognitive salience. Specifically, we split an image into foreground (salient region) and background (the rest), and allow significantly larger adversarial perturbations in the background, while ensuring that cognitive salience of background remains low. We describe how to compute the resulting non-salience-preserving dual-perturbation attacks on classifiers. We then experimentally demonstrate that our attacks indeed do not significantly change perceptual salience of the background, but are highly effective against classifiers robust to conventional attacks. Furthermore, we show that adversarial training with dual-perturbation attacks yields classifiers that are more robust to these than state-of-the-art robust learning approaches, and comparable in terms of robustness to conventional attacks. △ Less

Submitted 8 October, 2020; v1 submitted 8 May, 2020; originally announced May 2020.

Comments: v2.0

arXiv:2001.08809 [pdf, ps, other]

Universal Data Anomaly Detection via Inverse Generative Adversary Network

Authors: Kursat Rasim Mestav, Lang Tong

Abstract: The problem of detecting data anomaly is considered. Under the null hypothesis that models anomaly-free data, measurements are assumed to be from an unknown distribution with some authenticated historical samples. Under the composite alternative hypothesis, measurements are from an unknown distribution positive distance away from the distribution under the null hypothesis. No training data are ava… ▽ More The problem of detecting data anomaly is considered. Under the null hypothesis that models anomaly-free data, measurements are assumed to be from an unknown distribution with some authenticated historical samples. Under the composite alternative hypothesis, measurements are from an unknown distribution positive distance away from the distribution under the null hypothesis. No training data are available for the distribution of anomaly data. A semi-supervised deep learning technique based on an inverse generative adversary network is proposed. △ Less

Submitted 23 January, 2020; originally announced January 2020.

Comments: 5 pages, letter

arXiv:1909.09552 [pdf, other]

Defending Against Physically Realizable Attacks on Image Classification

Authors: Tong Wu, Liang Tong, Yevgeniy Vorobeychik

Abstract: We study the problem of defending deep neural network approaches for image classification from physically realizable attacks. First, we demonstrate that the two most scalable and effective methods for learning robust models, adversarial training with PGD attacks and randomized smoothing, exhibit very limited effectiveness against three of the highest profile physical attacks. Next, we propose a ne… ▽ More We study the problem of defending deep neural network approaches for image classification from physically realizable attacks. First, we demonstrate that the two most scalable and effective methods for learning robust models, adversarial training with PGD attacks and randomized smoothing, exhibit very limited effectiveness against three of the highest profile physical attacks. Next, we propose a new abstract adversarial model, rectangular occlusion attacks, in which an adversary places a small adversarially crafted rectangle in an image, and develop two approaches for efficiently computing the resulting adversarial examples. Finally, we demonstrate that adversarial training using our new attack yields image classification models that exhibit high robustness against the physically realizable attacks we study, offering the first effective generic defense against such attacks. △ Less

Submitted 14 February, 2020; v1 submitted 20 September, 2019; originally announced September 2019.

Comments: camera-ready

arXiv:1906.00398 [pdf, other]

Cost-sensitive Boosting Pruning Trees for depression detection on Twitter

Authors: Lei Tong, Zhihua Liu, Zheheng Jiang, Feixiang Zhou, Long Chen, Jialin Lyu, Xiangrong Zhang, Qianni Zhang, Abdul Sadka Senior, Yinhai Wang, Ling Li, Huiyu Zhou

Abstract: Depression is one of the most common mental health disorders, and a large number of depressed people commit suicide each year. Potential depression sufferers usually do not consult psychological doctors because they feel ashamed or are unaware of any depression, which may result in severe delay of diagnosis and treatment. In the meantime, evidence shows that social media data provides valuable clu… ▽ More Depression is one of the most common mental health disorders, and a large number of depressed people commit suicide each year. Potential depression sufferers usually do not consult psychological doctors because they feel ashamed or are unaware of any depression, which may result in severe delay of diagnosis and treatment. In the meantime, evidence shows that social media data provides valuable clues about physical and mental health conditions. In this paper, we argue that it is feasible to identify depression at an early stage by mining online social behaviours. Our approach, which is innovative to the practice of depression detection, does not rely on the extraction of numerous or complicated features to achieve accurate depression detection. Instead, we propose a novel classifier, namely, Cost-sensitive Boosting Pruning Trees (CBPT), which demonstrates a strong classification ability on two publicly accessible Twitter depression detection datasets. To comprehensively evaluate the classification capability of the CBPT, we use additional three datasets from the UCI machine learning repository and the CBPT obtains appealing classification results against several state of the arts boosting algorithms. Finally, we comprehensively explore the influence factors of model prediction, and the results manifest that our proposed framework is promising for identifying Twitter users with depression. △ Less

Submitted 21 January, 2022; v1 submitted 2 June, 2019; originally announced June 2019.

Comments: 15 pages, 7 figures, Accepted by IEEE transactions on Affective Computing

arXiv:1905.01281 [pdf, other]

Predicting Urban Dispersal Events: A Two-Stage Framework through Deep Survival Analysis on Mobility Data

Authors: Amin Vahedian, Xun Zhou, Ling Tong, W. Nick Street, Yanhua Li

Abstract: Urban dispersal events are processes where an unusually large number of people leave the same area in a short period. Early prediction of dispersal events is important in mitigating congestion and safety risks and making better dispatching decisions for taxi and ride-sharing fleets. Existing work mostly focuses on predicting taxi demand in the near future by learning patterns from historical data.… ▽ More Urban dispersal events are processes where an unusually large number of people leave the same area in a short period. Early prediction of dispersal events is important in mitigating congestion and safety risks and making better dispatching decisions for taxi and ride-sharing fleets. Existing work mostly focuses on predicting taxi demand in the near future by learning patterns from historical data. However, they fail in case of abnormality because dispersal events with abnormally high demand are non-repetitive and violate common assumptions such as smoothness in demand change over time. Instead, in this paper we argue that dispersal events follow a complex pattern of trips and other related features in the past, which can be used to predict such events. Therefore, we formulate the dispersal event prediction problem as a survival analysis problem. We propose a two-stage framework (DILSA), where a deep learning model combined with survival analysis is developed to predict the probability of a dispersal event and its demand volume. We conduct extensive case studies and experiments on the NYC Yellow taxi dataset from 2014-2016. Results show that DILSA can predict events in the next 5 hours with F1-score of 0.7 and with average time error of 18 minutes. It is orders of magnitude better than the state-ofthe-art deep learning approaches for taxi demand prediction. △ Less

Submitted 11 July, 2019; v1 submitted 3 May, 2019; originally announced May 2019.

Comments: To appear in AAAI-19 proceedings. The reason for the replacement was the misspelled author name in the meta-data field. Author name was corrected from "Ynahua Li" to "Yanhua Li". The author list in the paper was correct and remained unchanged

arXiv:1811.02756 [pdf, other]

Bayesian State Estimation for Unobservable Distribution Systems via Deep Learning

Authors: Kursat Rasim Mestav, Jaime Luengo-Rozas, Lang Tong

Abstract: The problem of state estimation for unobservable distribution systems is considered. A deep learning approach to Bayesian state estimation is proposed for real-time applications. The proposed technique consists of distribution learning of stochastic power injection, a Monte Carlo technique for the training of a deep neural network for state estimation, and a Bayesian bad-data detection and filteri… ▽ More The problem of state estimation for unobservable distribution systems is considered. A deep learning approach to Bayesian state estimation is proposed for real-time applications. The proposed technique consists of distribution learning of stochastic power injection, a Monte Carlo technique for the training of a deep neural network for state estimation, and a Bayesian bad-data detection and filtering algorithm. Structural characteristics of the deep neural networks are investigated. Simulations illustrate the accuracy of Bayesian state estimation for unobservable systems and demonstrate the benefit of employing a deep neural network. Numerical results show the robustness of Bayesian state estimation against modeling and estimation errors and the presence of bad and missing data. Comparing with pseudo-measurement techniques, direct Bayesian state estimation via deep learning neural network outperforms existing benchmarks. △ Less

Submitted 24 February, 2019; v1 submitted 6 November, 2018; originally announced November 2018.

arXiv:1806.03583 [pdf, other]

IVUS-Net: An Intravascular Ultrasound Segmentation Network

Authors: Ji Yang, Lin Tong, Mehdi Faraji, Anup Basu

Abstract: IntraVascular UltraSound (IVUS) is one of the most effective imaging modalities that provides assistance to experts in order to diagnose and treat cardiovascular diseases. We address a central problem in IVUS image analysis with Fully Convolutional Network (FCN): automatically delineate the lumen and media-adventitia borders in IVUS images, which is crucial to shorten the diagnosis process or bene… ▽ More IntraVascular UltraSound (IVUS) is one of the most effective imaging modalities that provides assistance to experts in order to diagnose and treat cardiovascular diseases. We address a central problem in IVUS image analysis with Fully Convolutional Network (FCN): automatically delineate the lumen and media-adventitia borders in IVUS images, which is crucial to shorten the diagnosis process or benefits a faster and more accurate 3D reconstruction of the artery. Particularly, we propose an FCN architecture, called IVUS-Net, followed by a post-processing contour extraction step, in order to automatically segments the interior (lumen) and exterior (media-adventitia) regions of the human arteries. We evaluated our IVUS-Net on the test set of a standard publicly available dataset containing 326 IVUS B-mode images with two measurements, namely Jaccard Measure (JM) and Hausdorff Distances (HD). The evaluation result shows that IVUS-Net outperforms the state-of-the-art lumen and media segmentation methods by 4% to 20% in terms of HD distance. IVUS-Net performs well on images in the test set that contain a significant amount of major artifacts such as bifurcations, shadows, and side branches that are not common in the training set. Furthermore, using a modern GPU, IVUS-Net segments each IVUS frame only in 0.15 seconds. The proposed work, to the best of our knowledge, is the first deep learning based method for segmentation of both the lumen and the media vessel walls in 20 MHz IVUS B-mode images that achieves the best results without any manual intervention. Code is available at https://github.com/Kulbear/ivus-segmentation-icsm2018 △ Less

Submitted 14 June, 2018; v1 submitted 10 June, 2018; originally announced June 2018.

Comments: 7 pages, 3 figures, accepted to be published in International Conference of Smart Multimedia. The final authenticated publication is available online at https://doi.org/

arXiv:1806.02256 [pdf, other]

Adversarial Regression with Multiple Learners

Authors: Liang Tong, Sixie Yu, Scott Alfeld, Yevgeniy Vorobeychik

Abstract: Despite the considerable success enjoyed by machine learning techniques in practice, numerous studies demonstrated that many approaches are vulnerable to attacks. An important class of such attacks involves adversaries changing features at test time to cause incorrect predictions. Previous investigations of this problem pit a single learner against an adversary. However, in many situations an adve… ▽ More Despite the considerable success enjoyed by machine learning techniques in practice, numerous studies demonstrated that many approaches are vulnerable to attacks. An important class of such attacks involves adversaries changing features at test time to cause incorrect predictions. Previous investigations of this problem pit a single learner against an adversary. However, in many situations an adversary's decision is aimed at a collection of learners, rather than specifically targeted at each independently. We study the problem of adversarial linear regression with multiple learners. We approximate the resulting game by exhibiting an upper bound on learner loss functions, and show that the resulting game has a unique symmetric equilibrium. We present an algorithm for computing this equilibrium, and show through extensive experiments that equilibrium models are significantly more robust than conventional regularized linear regression. △ Less

Submitted 6 June, 2018; originally announced June 2018.

Comments: Accepted by ICML'18

arXiv:1611.05012 [pdf, other]

Multi-Area Interchange Scheduling under Uncertainty

Authors: Yuting Ji, Lang Tong

Abstract: The problem of multi-area interchange scheduling under system uncertainty is considered. A new scheduling technique is proposed for a multi-proxy bus system based on stochastic optimization that captures uncertainty in renewable generation and stochastic load. In particular, the proposed algorithm iteratively optimizes the interface flows using a multidimensional demand and supply functions. Optim… ▽ More The problem of multi-area interchange scheduling under system uncertainty is considered. A new scheduling technique is proposed for a multi-proxy bus system based on stochastic optimization that captures uncertainty in renewable generation and stochastic load. In particular, the proposed algorithm iteratively optimizes the interface flows using a multidimensional demand and supply functions. Optimality and convergence are guaranteed for both synchronous and asynchronous scheduling under nominal assumptions. △ Less

Submitted 15 November, 2016; originally announced November 2016.

arXiv:1606.07855 [pdf, ps, other]

Probabilistic Forecasting and Simulation of Electricity Markets via Online Dictionary Learning

Authors: Weisi Deng, Yuting Ji, Lang Tong

Abstract: The problem of probabilistic forecasting and online simulation of real-time electricity market with stochastic generation and demand is considered. By exploiting the parametric structure of the direct current optimal power flow, a new technique based on online dictionary learning (ODL) is proposed. The ODL approach incorporates real-time measurements and historical traces to produce forecasts of j… ▽ More The problem of probabilistic forecasting and online simulation of real-time electricity market with stochastic generation and demand is considered. By exploiting the parametric structure of the direct current optimal power flow, a new technique based on online dictionary learning (ODL) is proposed. The ODL approach incorporates real-time measurements and historical traces to produce forecasts of joint and marginal probability distributions of future locational marginal prices, power flows, and dispatch levels, conditional on the system state at the time of forecasting. Compared with standard Monte Carlo simulation techniques, the ODL approach offers several orders of magnitude improvement in computation time, making it feasible for online forecasting of market operations. Numerical simulations on large and moderate size power systems illustrate its performance and complexity features and its potential as a tool for system operators. △ Less

Submitted 24 June, 2016; originally announced June 2016.

Comments: 8 pages, 6 figures, Hawaii International Conference on System Sciences 2017 (HICSS-50)

MSC Class: 47N10

arXiv:1503.06171 [pdf, ps, other]

Probabilistic Forecast of Real-Time LMP and Network Congestion

Authors: Yuting Ji, Robert J. Thomas, Lang Tong

Abstract: The short-term forecasting of real-time locational marginal price (LMP) and network congestion is considered from a system operator perspective. A new probabilistic forecasting technique is proposed based on a multiparametric programming formulation that partitions the uncertainty parameter space into critical regions from which the conditional probability distribution of the real-time LMP/congest… ▽ More The short-term forecasting of real-time locational marginal price (LMP) and network congestion is considered from a system operator perspective. A new probabilistic forecasting technique is proposed based on a multiparametric programming formulation that partitions the uncertainty parameter space into critical regions from which the conditional probability distribution of the real-time LMP/congestion is obtained. The proposed method incorporates load/generation forecast, time varying operation constraints, and contingency models. By shifting the computation cost associated with multiparametric programs offline, the online computation cost is significantly reduced. An online simulation technique by generating critical regions dynamically is also proposed, which results in several orders of magnitude improvement in the computational cost over standard Monte Carlo methods. △ Less

Submitted 24 June, 2016; v1 submitted 20 March, 2015; originally announced March 2015.

arXiv:1409.5530 [pdf, ps, other]

doi 10.1109/TPWRS.2014.2359630

PMU based Detection of Imbalance in Three-Phase Power Systems

Authors: Tirza Routtenberg, Yao Xie, Rebecca M. Willett, Lang Tong

Abstract: The problem of imbalance detection in a three-phase power system using a phasor measurement unit (PMU) is considered. A general model for the zero, positive, and negative sequences from a PMU measurement at off-nominal frequencies is presented and a hypothesis testing framework is formulated. The new formulation takes into account the fact that minor degree of imbalance in the system is acceptable… ▽ More The problem of imbalance detection in a three-phase power system using a phasor measurement unit (PMU) is considered. A general model for the zero, positive, and negative sequences from a PMU measurement at off-nominal frequencies is presented and a hypothesis testing framework is formulated. The new formulation takes into account the fact that minor degree of imbalance in the system is acceptable and does not indicate subsequent interruptions, failures, or degradation of physical components. A generalized likelihood ratio test (GLRT) is developed and shown to be a function of the negative-sequence phasor estimator and the acceptable level of imbalances for nominal system operations. As a by-product to the proposed detection method, a constrained estimation of the positive and negative phasors and the frequency deviation is obtained for both balanced and unbalanced situations. The theoretical and numerical performance analyses show improved performance over benchmark techniques and robustness to the presence of additional harmonics. △ Less

Submitted 19 September, 2014; originally announced September 2014.

Comments: Accepted to IEEE Trans. on Power System Sep. 2014

arXiv:1309.3387 [pdf, ps, other]

A Subspace Technique for The Identification of Switched Affine Models

Authors: Liang Li, Wei Dong, Yindong Ji, Lang Tong

Abstract: The problem of estimating parameters of switched affine systems with noisy input-output observations is considered. The switched affine models is transformed into a switched linear one by removing its intersection subspace, which is estimated from observations. A subspace technique is proposed to exploit the observations' permutation structure, which transforms the problem of associating observati… ▽ More The problem of estimating parameters of switched affine systems with noisy input-output observations is considered. The switched affine models is transformed into a switched linear one by removing its intersection subspace, which is estimated from observations. A subspace technique is proposed to exploit the observations' permutation structure, which transforms the problem of associating observations with subsystems into one of de-permutating a block diagonal matrix, referred as adjacency matrix. Then a normalized spectral clustering algorithm is presented to recover the block structure of adjacency matrix, from which each observation is related to a particular subsystem. With the labelled observations, parameters of the submodel are estimated via the total least squares (TLS) estimator. The proposed technique is applicable to switched affine systems with arbitrarily shaped domain partitions, and it offers significantly improved performance and lowered computation complexity than existing techniques. △ Less

Submitted 13 September, 2013; originally announced September 2013.

Comments: 13 pages, 14 figures. arXiv admin note: text overlap with arXiv:1307.0326

arXiv:1307.0326 [pdf, ps, other]

Spectral Clustering on Subspace for Parameter Estimation of Jump Linear Models

Authors: Liang Li, Wei Dong, Yindong Ji, Lang Tong

Abstract: The problem of estimating parameters of a deterministic jump or piecewise linear model is considered. A subspace technique referred to as spectral clustering on subspace (SCS) algorithm is proposed to estimate a set of linear model parameters, the model input, and the set of switching epochs. The SCS algorithm exploits a block diagonal structure of the system input subspace, which partitions the o… ▽ More The problem of estimating parameters of a deterministic jump or piecewise linear model is considered. A subspace technique referred to as spectral clustering on subspace (SCS) algorithm is proposed to estimate a set of linear model parameters, the model input, and the set of switching epochs. The SCS algorithm exploits a block diagonal structure of the system input subspace, which partitions the observation space into separate subspaces, each corresponding to one and only one linear submodel. A spectral clustering technique is used to label the noisy observations for each submodel, which generates estimates of switching time epoches. A total least squares technique is used to estimate model parameters and the model input. It is shown that, in the absence of observation noise, the SCS algorithm provides exact parameter identification. At high signal to noise ratios, SCS attains a clairvoyant Cramér-Rao bound computed by assuming the labeling of observation samples is perfect. △ Less

Submitted 1 July, 2013; originally announced July 2013.

Comments: 11 pages with 11 figures

arXiv:1306.5289 [pdf, ps, other]

Analytic Solutions for D-optimal Factorial Designs under Generalized Linear Models

Authors: Li** Tong, Hans W. Volkmer, Jie Yang

Abstract: We develop two analytic approaches to solve D-optimal approximate designs under generalized linear models. The first approach provides analytic D-optimal allocations for generalized linear models with two factors, which include as a special case the $2^2$ main-effects model considered by Yang, Mandal and Majumdar (2012). The second approach leads to explicit solutions for a class of generalized li… ▽ More We develop two analytic approaches to solve D-optimal approximate designs under generalized linear models. The first approach provides analytic D-optimal allocations for generalized linear models with two factors, which include as a special case the $2^2$ main-effects model considered by Yang, Mandal and Majumdar (2012). The second approach leads to explicit solutions for a class of generalized linear models with more than two factors. With the aid of the analytic solutions, we provide a necessary and sufficient condition under which a D-optimal design with two quantitative factors could be constructed on the boundary points only. It bridges the gap between D-optimal factorial designs and D-optimal designs with continuous factors. △ Less

Submitted 11 October, 2013; v1 submitted 22 June, 2013; originally announced June 2013.

Comments: 28 pages, 3 figures

arXiv:1303.6170 [pdf, other]

doi 10.1109/TSP.2014.2304435

Maximum Likelihood Fusion of Stochastic Maps

Authors: Brandon Jones, Mark Campbell, Lang Tong

Abstract: The fusion of independently obtained stochastic maps by collaborating mobile agents is considered. The proposed approach includes two parts: matching of stochastic maps and maximum likelihood alignment. In particular, an affine invariant hypergraph is constructed for each stochastic map, and a bipartite matching via a linear program is used to establish landmark correspondence between stochastic m… ▽ More The fusion of independently obtained stochastic maps by collaborating mobile agents is considered. The proposed approach includes two parts: matching of stochastic maps and maximum likelihood alignment. In particular, an affine invariant hypergraph is constructed for each stochastic map, and a bipartite matching via a linear program is used to establish landmark correspondence between stochastic maps. A maximum likelihood alignment procedure is proposed to determine rotation and translation between common landmarks in order to construct a global map within a common frame of reference. A main feature of the proposed approach is its scalability with respect to the number of landmarks: the matching step has polynomial complexity and the maximum likelihood alignment is obtained in closed form. Experimental validation of the proposed fusion approach is performed using the Victoria Park benchmark dataset. △ Less

Submitted 25 March, 2013; originally announced March 2013.

Comments: 10 pages, 8 figures, submitted to IEEE Transactions on Signal Processing on 24-March-2013

arXiv:0909.4073 [pdf, ps, other]

Efficient Calculation of P-value and Power for Quadratic Form Statistics in Multilocus Association Testing

Authors: Li** Tong, Jie Yang, Richard S. Cooper

Abstract: We address the asymptotic and approximate distributions of a large class of test statistics with quadratic forms used in association studies. The statistics of interest do not necessarily follow a chi-square distribution and take the general form $D=X^T A X$, where $X$ follows the multivariate normal distribution, and $A$ is a general similarity matrix which may or may not be positive semi-defin… ▽ More We address the asymptotic and approximate distributions of a large class of test statistics with quadratic forms used in association studies. The statistics of interest do not necessarily follow a chi-square distribution and take the general form $D=X^T A X$, where $X$ follows the multivariate normal distribution, and $A$ is a general similarity matrix which may or may not be positive semi-definite. We show that $D$ can be written as a linear combination of independent chi-square random variables, whose distribution can be approximated by a chi-square or the difference of two chi-square distributions. In the setting of association testing, our methods are especially useful in two situations. First, for a genome screen, the required significance level is much smaller than 0.05 due to multiple comparisons, and estimation of p-values using permutation procedures is particularly challenging. An efficient and accurate estimation procedure would therefore be useful. Second, in a candidate gene study based on haplotypes when phase is unknown a computationally expensive method-the EM algorithm-is usually required to infer haplotype frequencies. Because the EM algorithm is needed for each permutation, this results in a substantial computational burden, which can be eliminated with our mathematical solution. We assess the practical utility of our method using extensive simulation studies based on two example statistics and apply it to find the sample size needed for a typical candidate gene association study when phase information is not available. Our method can be applied to any quadratic form statistic and therefore should be of general interest. △ Less

Submitted 22 September, 2009; originally announced September 2009.

arXiv:0905.0940 [pdf, ps, other]

doi 10.1109/TIT.2011.2104513

A Large-Deviation Analysis of the Maximum-Likelihood Learning of Markov Tree Structures

Authors: Vincent Y. F. Tan, Animashree Anandkumar, Lang Tong, Alan S. Willsky

Abstract: The problem of maximum-likelihood (ML) estimation of discrete tree-structured distributions is considered. Chow and Liu established that ML-estimation reduces to the construction of a maximum-weight spanning tree using the empirical mutual information quantities as the edge weights. Using the theory of large-deviations, we analyze the exponent associated with the error probability of the event tha… ▽ More The problem of maximum-likelihood (ML) estimation of discrete tree-structured distributions is considered. Chow and Liu established that ML-estimation reduces to the construction of a maximum-weight spanning tree using the empirical mutual information quantities as the edge weights. Using the theory of large-deviations, we analyze the exponent associated with the error probability of the event that the ML-estimate of the Markov tree structure differs from the true tree structure, given a set of independently drawn samples. By exploiting the fact that the output of ML-estimation is a tree, we establish that the error exponent is equal to the exponential rate of decay of a single dominant crossover event. We prove that in this dominant crossover event, a non-neighbor node pair replaces a true edge of the distribution that is along the path of edges in the true tree graph connecting the nodes in the non-neighbor pair. Using ideas from Euclidean information theory, we then analyze the scenario of ML-estimation in the very noisy learning regime and show that the error exponent can be approximated as a ratio, which is interpreted as the signal-to-noise ratio (SNR) for learning tree distributions. We show via numerical experiments that in this regime, our SNR approximation is accurate. △ Less

Submitted 21 November, 2010; v1 submitted 6 May, 2009; originally announced May 2009.

Comments: Accepted to the IEEE Transactions on Information Theory on Nov 18, 2010

Showing 1–27 of 27 results for author: Tong, L