-
Dynamic Data Pruning for Automatic Speech Recognition
Authors:
Qiao Xiao,
Pingchuan Ma,
Adriana Fernandez-Lopez,
Boqian Wu,
Lu Yin,
Stavros Petridis,
Mykola Pechenizkiy,
Maja Pantic,
Decebal Constantin Mocanu,
Shiwei Liu
Abstract:
The recent success of Automatic Speech Recognition (ASR) is largely attributed to the ever-growing amount of training data. However, this trend has made model training prohibitively costly and imposed computational demands. While data pruning has been proposed to mitigate this issue by identifying a small subset of relevant data, its application in ASR has been barely explored, and existing works often entail significant overhead to achieve meaningful results. To fill this gap, this paper presents the first investigation of dynamic data pruning for ASR, finding that we can reach the full-data performance by dynamically selecting 70% of data. Furthermore, we introduce Dynamic Data Pruning for ASR (DDP-ASR), which offers several fine-grained pruning granularities specifically tailored for speech-related datasets, going beyond the conventional pruning of entire time sequences. Our intensive experiments show that DDP-ASR can save up to 1.6x training time with negligible performance loss.
Submitted 26 June, 2024;
originally announced June 2024.
-
CT Synthesis with Conditional Diffusion Models for Abdominal Lymph Node Segmentation
Authors:
Yongrui Yu,
Hanyu Chen,
Zitian Zhang,
Qiong Xiao,
Wenhui Lei,
Linrui Dai,
Yu Fu,
Hui Tan,
Guan Wang,
Peng Gao,
Xiaofan Zhang
Abstract:
Despite the significant success achieved by deep learning methods in medical image segmentation, researchers still struggle in the computer-aided diagnosis of abdominal lymph nodes due to the complex abdominal environment, small and indistinguishable lesions, and limited annotated data. To address these problems, we present a pipeline that integrates the conditional diffusion model for lymph node generation and the nnU-Net model for lymph node segmentation to improve the segmentation performance of abdominal lymph nodes through synthesizing a diversity of realistic abdominal lymph node data. We propose LN-DDPM, a conditional denoising diffusion probabilistic model (DDPM) for lymph node (LN) generation. LN-DDPM utilizes lymph node masks and anatomical structure masks as model conditions. These conditions work in two conditioning mechanisms: global structure conditioning and local detail conditioning, to distinguish between lymph nodes and their surroundings and better capture lymph node characteristics. The obtained paired abdominal lymph node images and masks are used for the downstream segmentation task. Experimental results on the abdominal lymph node datasets demonstrate that LN-DDPM outperforms other generative methods in the abdominal lymph node image synthesis and better assists the downstream abdominal lymph node segmentation task.
Submitted 26 March, 2024;
originally announced March 2024.
-
Conditional Score-Based Diffusion Model for Cortical Thickness Trajectory Prediction
Authors:
Qing Xiao,
Siyeop Yoon,
Hui Ren,
Matthew Tivnan,
Lichao Sun,
Quanzheng Li,
Tianming Liu,
Yu Zhang,
Xiang Li
Abstract:
Alzheimer's Disease (AD) is a neurodegenerative condition characterized by diverse progression rates among individuals, with changes in cortical thickness (CTh) closely linked to its progression. Accurately forecasting CTh trajectories can significantly enhance early diagnosis and intervention strategies, providing timely care. However, the longitudinal data essential for these studies often suffer from temporal sparsity and incompleteness, presenting substantial challenges in modeling the disease's progression accurately. Existing methods are limited, focusing primarily on datasets without missing entries or requiring predefined assumptions about CTh progression. To overcome these obstacles, we propose a conditional score-based diffusion model specifically designed to generate CTh trajectories with the given baseline information, such as age, sex, and initial diagnosis. Our conditional diffusion model utilizes all available data during the training phase to make predictions based solely on baseline information during inference without needing prior history about CTh progression. The prediction accuracy of the proposed CTh prediction pipeline using a conditional score-based model was compared for sub-groups consisting of cognitively normal, mild cognitive impairment, and AD subjects. The Bland-Altman analysis shows our diffusion-based prediction model has a near-zero bias with a narrow 95% confidence interval compared to the ground-truth CTh in 6-36 months. In addition, because our conditional diffusion model is stochastic and generative, we demonstrate an uncertainty analysis of patient-specific CTh prediction through multiple realizations.
Submitted 11 March, 2024;
originally announced March 2024.
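For readers unfamiliar with the Bland-Altman analysis mentioned in the abstract above, the following is a minimal sketch of the standard computation: the bias is the mean of the paired differences, and the 95% limits of agreement are the bias plus or minus 1.96 sample standard deviations. The function name and the cortical thickness values are hypothetical and are not taken from the paper.

```python
from statistics import mean, stdev


def bland_altman(predicted, reference):
    """Return (bias, lower_limit, upper_limit) for paired measurements."""
    diffs = [p - r for p, r in zip(predicted, reference)]
    bias = mean(diffs)               # systematic offset between the two methods
    spread = 1.96 * stdev(diffs)     # half-width of the 95% limits of agreement
    return bias, bias - spread, bias + spread


# Hypothetical cortical thickness values in millimetres (illustration only).
pred = [2.5, 2.4, 2.6, 2.3]
true = [2.4, 2.5, 2.6, 2.2]
bias, lower, upper = bland_altman(pred, true)
print(f"bias={bias:.3f} mm, limits of agreement=({lower:.3f}, {upper:.3f}) mm")
```

A near-zero bias with narrow limits, as the abstract reports, would indicate that predictions neither systematically over- nor under-estimate the reference values.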
-
Assessment of Transmission-level Fault Impacts on 3-phase and 1-phase Distribution IBR Operation
Authors:
Qi Xiao,
Jongha Woo,
Lidong Song,
Bei Xu,
David Lubkeman,
Ning Lu,
Abdul Shafae Mohammed,
Johan Enslin,
Cara De Coste Chacko,
Kat Sico,
Steven G. Whisenant
Abstract:
The widespread deployment of inverter-based resources (IBRs) renders distribution systems susceptible to transmission-level faults. This paper presents a comprehensive analysis of the impact of transmission-level faults on 3-phase and 1-phase distribution IBR operation. To evaluate distributed IBR tripping across various phases and locations on a distribution feeder, we conduct simulations of both symmetrical and unsymmetrical transmission faults at progressively greater electrical distances on a real-time transmission and distribution (T&D) co-simulation platform. The IBR power-to-load ratios (PLRs) at 50%, 100%, and 300% are considered to emulate low, medium, and high IBR conditions. Our results indicate that, while 1-phase and 2-phase faults typically trigger fewer IBR trips when compared to 3-phase faults, a significant power imbalance arises from the tripping of 1-phase IBRs on the affected phases. The imbalance can result in significant power quality problems and unintended equipment tripping. It may be necessary to design fault-ride-through mechanisms specifically tailored to 1-phase IBRs to help mitigate the power imbalances caused by unbalanced faults.
Submitted 1 April, 2024; v1 submitted 19 November, 2023;
originally announced November 2023.
-
Multimodal Indoor Localization Using Crowdsourced Radio Maps
Authors:
Zhaoguang Yi,
Xiangyu Wen,
Qiyue Xia,
Peize Li,
Francisco Zampella,
Firas Alsehly,
Chris Xiaoxuan Lu
Abstract:
Indoor Positioning Systems (IPS) traditionally rely on odometry and building infrastructures like WiFi, often supplemented by building floor plans for increased accuracy. However, the limitation of floor plans in terms of availability and timeliness of updates challenges their wide applicability. In contrast, the proliferation of smartphones and WiFi-enabled robots has made crowdsourced radio maps - databases pairing locations with their corresponding Received Signal Strengths (RSS) - increasingly accessible. These radio maps not only provide WiFi fingerprint-location pairs but also encode movement regularities akin to the constraints imposed by floor plans. This work investigates the possibility of leveraging these radio maps as a substitute for floor plans in multimodal IPS. We introduce a new framework to address the challenges of radio map inaccuracies and sparse coverage. Our proposed system integrates an uncertainty-aware neural network model for WiFi localization and a bespoke Bayesian fusion technique for optimal fusion. Extensive evaluations on multiple real-world sites indicate a significant performance enhancement, with results showing ~25% improvement over the best baseline.
Submitted 12 March, 2024; v1 submitted 17 November, 2023;
originally announced November 2023.
-
GCS-ICHNet: Assessment of Intracerebral Hemorrhage Prognosis using Self-Attention with Domain Knowledge Integration
Authors:
Xuhao Shan,
Xinyang Li,
Ruiquan Ge,
Shibin Wu,
Ahmed Elazab,
Jichao Zhu,
Lingyan Zhang,
Gangyong Jia,
Qingying Xiao,
Xiang Wan,
Changmiao Wang
Abstract:
Intracerebral Hemorrhage (ICH) is a severe condition resulting from the rupture of damaged blood vessels in the brain, often leading to complications and fatalities. Timely and accurate prognosis and management are essential due to its high mortality rate. However, conventional methods heavily rely on subjective clinician expertise, which can lead to inaccurate diagnoses and delays in treatment. Artificial intelligence (AI) models have been explored to assist clinicians, but many prior studies focused on model modification without considering domain knowledge. This paper introduces a novel deep learning algorithm, GCS-ICHNet, which integrates multimodal brain CT image data and the Glasgow Coma Scale (GCS) score to improve ICH prognosis. The algorithm utilizes a transformer-based fusion module for assessment. GCS-ICHNet demonstrates high sensitivity (81.03%) and specificity (91.59%), outperforming average clinicians and other state-of-the-art methods.
Submitted 8 November, 2023;
originally announced November 2023.
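The sensitivity and specificity figures quoted in the abstract above follow the usual confusion-matrix definitions. The sketch below shows those definitions only; the function name and the label vectors are hypothetical and do not reproduce the paper's data or evaluation code.

```python
def sensitivity_specificity(y_true, y_pred):
    """Binary labels: 1 = positive class (e.g. poor prognosis), 0 = negative."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    # Assumes both classes occur in y_true; otherwise a denominator is zero.
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical labels for eight cases (illustration only).
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 0, 0, 0, 0, 1]
sens, spec = sensitivity_specificity(y_true, y_pred)
print(f"sensitivity={sens:.2%}, specificity={spec:.2%}")
```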
-
Under-frequency Load Shedding for Power Reserve Management in Islanded Microgrids
Authors:
Bei Xu,
Victor Paduani,
Qi Xiao,
Lidong Song,
David Lubkeman,
Ning Lu
Abstract:
This paper introduces under-frequency load shedding (UFLS) schemes specially designed to fulfill the power reserve requirements in islanded microgrids (MGs), where only one grid-forming resource is available for frequency regulation. When the power consumption of the MG exceeds a pre-defined threshold, the MG frequency will be lowered to various setpoints, thereby triggering UFLS for different levels of load reduction. Three types of controllable devices are considered for executing UFLS: sectionalizers, smart meters, and controllable appliances. To avoid unnecessary UFLS activation, various time delay settings are analyzed, allowing short-lived power spikes caused by events like motor startups or cold-load pickups to be disregarded. We tested the proposed UFLS schemes on a modified IEEE 123-bus system on the OPAL-RT eMEGASIM platform. Simulation results verify the efficacy of the proposed approaches in restoring power reserves, maintaining phase power balance, and effectively handling short-lived power fluctuations. Furthermore, in comparison to sectionalizer-based UFLS, using smart meters or controllable loads for UFLS allows for a more accurate per-phase load shedding in a progressive manner. As a result, it leads to better balanced three-phase voltage and serves more loads.
Submitted 6 September, 2023; v1 submitted 3 September, 2023;
originally announced September 2023.
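The time-delay idea in the abstract above (ignoring short-lived events before activating load shedding) can be illustrated with a simple persistence check. This is a sketch under stated assumptions, not the authors' scheme: the function name, the 59.5 Hz setpoint, the delay of three samples, and the frequency trace are all hypothetical.

```python
def ufls_trigger_index(freq_hz, setpoint_hz, delay_samples):
    """Return the first sample index at which shedding would trigger, or None.

    Shedding triggers only after the frequency has stayed below the setpoint
    for `delay_samples` consecutive samples, so brief dips are disregarded.
    """
    consecutive = 0
    for i, f in enumerate(freq_hz):
        consecutive = consecutive + 1 if f < setpoint_hz else 0
        if consecutive >= delay_samples:
            return i
    return None


# Hypothetical frequency trace: one brief dip, then a sustained dip.
trace = [60.0, 59.4, 60.0, 59.4, 59.4, 59.4]
print(ufls_trigger_index(trace, setpoint_hz=59.5, delay_samples=3))
```

In this trace the single-sample dip resets the counter, and only the sustained dip at the end satisfies the delay.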
-
Bootstrapping Contrastive Learning Enhanced Music Cold-Start Matching
Authors:
Xinping Zhao,
Ying Zhang,
Qiang Xiao,
Yuming Ren,
Yingchun Yang
Abstract:
We study a particular matching task we call Music Cold-Start Matching. In short, given a cold-start song request, we expect to retrieve songs with similar audiences and then quickly push the cold-start song to the audiences of the retrieved songs to warm it up. However, there are hardly any studies done on this task. Therefore, in this paper, we formalize the problem of Music Cold-Start Matching in detail and give a scheme. During the offline training, we attempt to learn high-quality song representations based on song content features. However, we find that supervision signals typically follow a power-law distribution, causing skewed representation learning. To address this issue, we propose a novel contrastive learning paradigm named Bootstrapping Contrastive Learning (BCL) to enhance the quality of learned representations by exerting contrastive regularization. During the online serving, to locate the target audiences more accurately, we propose Clustering-based Audience Targeting (CAT), which clusters audience representations to acquire a few cluster centroids and then locates the target audiences by measuring the relevance between the audience representations and the cluster centroids. Extensive experiments on the offline dataset and online system demonstrate the effectiveness and efficiency of our method. Currently, we have deployed it on NetEase Cloud Music, affecting millions of users. Code will be released in the future.
Submitted 5 August, 2023;
originally announced August 2023.
-
Segmentation and Vascular Vectorization for Coronary Artery by Geometry-based Cascaded Neural Network
Authors:
Xiaoyu Yang,
Lijian Xu,
Simon Yu,
Qing Xia,
Hongsheng Li,
Shaoting Zhang
Abstract:
Segmentation of the coronary artery is an important task for the quantitative analysis of coronary computed tomography angiography (CCTA) images and has been advanced by deep learning. However, the complex structure of the coronary artery, with its tiny and narrow branches, poses a great challenge. Coupled with the medical image limitations of low resolution and poor contrast, fragmentations of segmented vessels frequently occur in the prediction. Therefore, a geometry-based cascaded segmentation method is proposed for the coronary artery, which has the following innovations: 1) Integrating geometric deformation networks, we design a cascaded network for segmenting the coronary artery and vectorizing results. The generated meshes of the coronary artery are continuous and accurate for twisted and sophisticated coronary artery structures, without fragmentations. 2) Different from mesh annotations generated by the traditional marching cube method from voxel-based labels, a finer vectorized mesh of the coronary artery is reconstructed with the regularized morphology. The novel mesh annotation benefits the geometry-based segmentation network, avoiding bifurcation adhesion and point cloud dispersion in intricate branches. 3) A dataset named CCA-200 is collected, consisting of 200 CCTA images with coronary artery disease. The ground truths of 200 cases are coronary internal diameter annotations by professional radiologists. Extensive experiments verify our method on our collected dataset CCA-200 and the public ASOCA dataset, with a Dice of 0.778 on CCA-200 and 0.895 on ASOCA, showing superior results. In particular, our geometry-based model generates an accurate, intact and smooth coronary artery, devoid of any fragmentations of segmented vessels.
Submitted 7 May, 2023;
originally announced May 2023.
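The Dice scores reported in the abstract above measure the overlap between a predicted and a reference segmentation. The following is a minimal sketch of the standard Dice coefficient on flattened binary masks; the function name and the toy masks are hypothetical, and real evaluations operate on 3D voxel volumes.

```python
def dice_coefficient(pred_mask, true_mask):
    """Dice = 2 * |A ∩ B| / (|A| + |B|) for flattened 0/1 masks."""
    intersection = sum(p * t for p, t in zip(pred_mask, true_mask))
    total = sum(pred_mask) + sum(true_mask)
    # Convention assumed here: two empty masks count as a perfect match.
    return 1.0 if total == 0 else 2.0 * intersection / total


# Hypothetical five-voxel masks (illustration only).
pred = [1, 1, 0, 0, 1]
true = [1, 0, 0, 1, 1]
print(round(dice_coefficient(pred, true), 3))
```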
-
Non-Iterative Solution for Coordinated Optimal Dispatch via Equivalent Projection-Part II: Method and Applications
Authors:
Zhenfei Tan,
Zheng Yan,
Haiwang Zhong,
Qing Xia
Abstract:
This two-part paper develops a non-iterative coordinated optimal dispatch framework, i.e., free of iterative information exchange, via the innovation of the equivalent projection (EP) theory. The EP eliminates internal variables from technical and economic operation constraints of the subsystem and obtains an equivalent model with reduced scale, which is the key to the non-iterative coordinated optimization. In Part II of this paper, a novel projection algorithm with the explicit error guarantee measured by the Hausdorff distance is proposed, which characterizes the EP model by the convex hull of its vertices. This algorithm is proven to yield a conservative approximation within the prespecified error tolerance and can obtain the exact EP model if the error tolerance is set to zero, which provides flexibility to balance the computation accuracy and effort. Applications of the EP-based coordinated dispatch are demonstrated based on the multi-area coordination and transmission-distribution coordination. Case studies with a wide range of system scales verify the superiority of the proposed projection algorithm in terms of computational efficiency and scalability, and validate the effectiveness of the EP-based coordinated dispatch in comparison with the joint optimization.
Submitted 26 February, 2023;
originally announced February 2023.
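The abstract above measures approximation error with the Hausdorff distance. As a reminder of the definition, the sketch below computes it between two finite point sets. This is a simplification: the paper applies the metric to regions described by the convex hull of vertices, and the distance between vertex sets generally differs from the distance between the hulls they span. The function names and the example points are hypothetical.

```python
from math import dist  # Euclidean distance between two points (Python 3.8+)


def directed_hausdorff(a, b):
    """Largest distance from a point in `a` to its nearest point in `b`."""
    return max(min(dist(p, q) for q in b) for p in a)


def hausdorff(a, b):
    """Symmetric Hausdorff distance between two finite point sets."""
    return max(directed_hausdorff(a, b), directed_hausdorff(b, a))


# Hypothetical vertex sets: a unit square and a set missing one corner.
exact = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
approx = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
print(hausdorff(exact, approx))
```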
-
Non-Iterative Solution for Coordinated Optimal Dispatch via Equivalent Projection-Part I: Theory
Authors:
Zhenfei Tan,
Zheng Yan,
Haiwang Zhong,
Qing Xia
Abstract:
Coordinated optimal dispatch is of utmost importance for the efficient and secure operation of hierarchically structured power systems. Conventional coordinated optimization methods, such as the Lagrangian relaxation and Benders decomposition, require iterative information exchange among subsystems. Iterative coordination methods have drawbacks including slow convergence, risk of oscillation and divergence, and an inability to handle multi-level optimization problems. To this end, this paper develops a non-iterative coordinated optimization method for hierarchical power systems. The theory of the equivalent projection (EP) is proposed, which constructs an external equivalent of the optimal dispatch model of the subsystem. Based on the EP theory, a coordinated optimization framework is developed, where each subsystem submits the EP model as a substitute for its original model to participate in the cross-system coordination. The proposed coordination framework is proven to guarantee the same optimality as the joint optimization, with the additional benefits of avoiding iterative information exchange, protecting privacy, remaining compatible with practical dispatch schemes, and handling multi-level problems.
Submitted 26 February, 2023;
originally announced February 2023.
-
Optimal Control Design for Operating a Hybrid PV Plant with Robust Power Reserves for Fast Frequency Regulation Services
Authors:
Victor Paduani,
Qi Xiao,
Bei Xu,
David Lubkeman,
Ning Lu
Abstract:
This paper presents an optimal control strategy for operating a solar hybrid system consisting of solar photovoltaic (PV) and a high-power, low-storage battery energy storage system (BESS). A state-space model of the hybrid PV plant is first derived, based on which an adaptive model predictive controller is designed. The controller's objective is to control the PV and BESS to follow power setpoints sent to the hybrid system while maintaining desired power reserves and meeting system operational constraints. Furthermore, an extended Kalman filter (EKF) is implemented for estimating the battery SOC, and an error sensitivity analysis is performed to assess its limitations. To validate the proposed strategy, detailed EMT models of the hybrid system are developed so that losses and control limits can be quantified accurately. Day-long simulations are performed in an OPAL-RT real-time simulator using second-by-second actual PV farm data as inputs. Results verify that the proposed method can follow power setpoints while maintaining power reserves in days of high irradiance intermittency even with a small BESS storage.
Submitted 7 December, 2022;
originally announced December 2022.
-
Improving Sample Efficiency of Deep Learning Models in Electricity Market
Authors:
Guangchun Ruan,
Jianxiao Wang,
Haiwang Zhong,
Qing Xia,
Chongqing Kang
Abstract:
The superior performance of deep learning relies heavily on a large collection of sample data, but the data insufficiency problem turns out to be relatively common in global electricity markets. How to prevent overfitting in this case becomes a fundamental challenge when training deep learning models in different market applications. With this in mind, we propose a general framework, namely Knowledge-Augmented Training (KAT), to improve the sample efficiency, and the main idea is to incorporate domain knowledge into the training procedures of deep learning models. Specifically, we propose a novel data augmentation technique to generate some synthetic data, which are later processed by an improved training strategy. This KAT methodology follows and realizes the idea of combining analytical and deep learning models. Modern learning theories demonstrate the effectiveness of our method in terms of effective prediction error feedback, a reliable loss function, and rich gradient noise. Finally, we study two popular applications in detail: user modeling and probabilistic price forecasting. The proposed method outperforms other competitors in all numerical tests, and the underlying reasons are explained by further statistical and visualization results.
Submitted 11 October, 2022;
originally announced October 2022.
-
Sharp-MAML: Sharpness-Aware Model-Agnostic Meta Learning
Authors:
Momin Abbas,
Quan Xiao,
Lisha Chen,
Pin-Yu Chen,
Tianyi Chen
Abstract:
Model-agnostic meta learning (MAML) is currently one of the dominant approaches for few-shot meta-learning. Despite its effectiveness, the optimization of MAML can be challenging due to the innate bilevel problem structure. Specifically, the loss landscape of MAML is much more complex with possibly more saddle points and local minimizers than its empirical risk minimization counterpart. To address this challenge, we leverage the recently invented sharpness-aware minimization and develop a sharpness-aware MAML approach that we term Sharp-MAML. We empirically demonstrate that Sharp-MAML and its computation-efficient variant can outperform the plain-vanilla MAML baseline (e.g., $+3\%$ accuracy on Mini-Imagenet). We complement the empirical study with the convergence rate analysis and the generalization bound of Sharp-MAML. To the best of our knowledge, this is the first empirical and theoretical study on sharpness-aware minimization in the context of bilevel learning. The code is available at https://github.com/mominabbass/Sharp-MAML.
Submitted 14 August, 2022; v1 submitted 8 June, 2022;
originally announced June 2022.
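The sharpness-aware minimization (SAM) update that the abstract above builds on can be sketched on a toy single-level problem: perturb the weights along the normalized gradient by a radius rho, then descend using the gradient evaluated at the perturbed point. This is not Sharp-MAML itself, which applies the idea within MAML's bilevel structure; the quadratic loss, the step size, and rho below are assumptions chosen only for illustration.

```python
from math import sqrt


def loss(w):
    # Toy quadratic loss, assumed for illustration: 0.5 * (w0^2 + 10 * w1^2).
    return 0.5 * (w[0] ** 2 + 10.0 * w[1] ** 2)


def grad(w):
    # Analytic gradient of the toy loss above.
    return [w[0], 10.0 * w[1]]


def sam_step(w, lr=0.05, rho=0.05):
    g = grad(w)
    norm = sqrt(sum(x * x for x in g)) or 1.0  # avoid division by zero
    # Ascent step: move to the approximate worst-case point within radius rho.
    w_adv = [wi + rho * gi / norm for wi, gi in zip(w, g)]
    # Descent step: update the original weights with the perturbed gradient.
    g_adv = grad(w_adv)
    return [wi - lr * gi for wi, gi in zip(w, g_adv)]


w = [1.0, 1.0]
for _ in range(100):
    w = sam_step(w)
print(loss(w))
```

Each SAM step costs two gradient evaluations, which is presumably why the abstract mentions a computation-efficient variant.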
-
Solution to Morgan Problem
Authors:
Qianghui Xiao
Abstract:
In this paper, some preliminaries about the Morgan problem, signal flow graphs, and the controllable linear time-invariant standard system are first introduced in detail. To synthesize the necessary and sufficient condition for decoupling a system, the first and second necessary conditions and a sufficient condition for decoupling a controllable linear time-invariant system are then analyzed in turn. Therefore, the nonregular static state feedback expression for decoupling a system, which also ensures the internal stability of the uncontrollable subsystem of the decoupled system, is deduced. Then the pole assignment for the controllable subsystem of a decoupled system while ensuring its decoupled state is introduced. Finally, two examples combined with their corresponding signal flow graphs show the simplicity and feasibility of the necessary and sufficient condition for decoupling a system described in the paper.
Submitted 11 April, 2022; v1 submitted 8 January, 2022;
originally announced January 2022.
-
Open-Access Data and Toolbox for Tracking COVID-19 Impact on Power Systems
Authors:
Guangchun Ruan,
Zekuan Yu,
Shutong Pu,
Songtao Zhou,
Haiwang Zhong,
Le Xie,
Qing Xia,
Chongqing Kang
Abstract:
Intervention policies against COVID-19 have caused large-scale disruptions globally, and led to a series of pattern changes in the power system operation. Analyzing these pandemic-induced patterns is imperative to identify the potential risks and impacts of this extreme event. With this purpose, we developed an open-access data hub (COVID-EMDA+), an open-source toolbox (CoVEMDA), and a few evaluation methods to explore what the U.S. power systems are experiencing during COVID-19. These resources could be broadly used for research, public policy, and educational purposes. Technically, our data hub harmonizes a variety of raw data such as generation mix, demand profiles, electricity price, weather observations, mobility, confirmed cases and deaths. Typical methods are reformulated and standardized in our toolbox, including baseline estimation, regression analysis, and scientific visualization. Here the fluctuation index and probabilistic baseline are proposed for the first time to consider data fluctuation and estimation uncertainty. Based on these, we conduct three empirical studies on the U.S. power systems, and share new solutions and unexpected findings to address the issues of public concerns. This conveys a more complete picture of the pandemic's impacts, and also opens up several attractive topics for future work. Python, Matlab source codes, and user manuals are all publicly shared on a Github repository.
Submitted 15 May, 2022; v1 submitted 9 December, 2021;
originally announced December 2021.
-
SleepPriorCL: Contrastive Representation Learning with Prior Knowledge-based Positive Mining and Adaptive Temperature for Sleep Staging
Authors:
Hongjun Zhang,
Jing Wang,
Qinfeng Xiao,
Jiaoxue Deng,
Youfang Lin
Abstract:
The objective of this paper is to learn semantic representations for sleep stage classification from raw physiological time series. Although supervised methods have gained remarkable performance, they are limited in clinical situations due to the requirement of fully labeled data. Self-supervised learning (SSL) based on contrasting semantically similar (positive) and dissimilar (negative) pairs of…
▽ More
The objective of this paper is to learn semantic representations for sleep stage classification from raw physiological time series. Although supervised methods have gained remarkable performance, they are limited in clinical situations due to the requirement of fully labeled data. Self-supervised learning (SSL) based on contrasting semantically similar (positive) and dissimilar (negative) pairs of samples have achieved promising success. However, existing SSL methods suffer the problem that many semantically similar positives are still uncovered and even treated as negatives. In this paper, we propose a novel SSL approach named SleepPriorCL to alleviate the above problem. Advances of our approach over existing SSL methods are two-fold: 1) by incorporating prior domain knowledge into the training regime of SSL, more semantically similar positives are discovered without accessing ground-truth labels; 2) via investigating the influence of the temperature in contrastive loss, an adaptive temperature mechanism for each sample according to prior domain knowledge is further proposed, leading to better performance. Extensive experiments demonstrate that our method achieves state-of-the-art performance and consistently outperforms baselines.
Submitted 15 October, 2021;
originally announced October 2021.
-
Estimating Demand Flexibility Using Siamese LSTM Neural Networks
Authors:
Guangchun Ruan,
Daniel S. Kirschen,
Haiwang Zhong,
Qing Xia,
Chongqing Kang
Abstract:
There is an opportunity in modern power systems to explore demand flexibility by incentivizing consumers with dynamic prices. In this paper, we quantify demand flexibility using an efficient tool called time-varying elasticity, whose value may change depending on the prices and decision dynamics. This tool is particularly useful for evaluating the demand response potential and system reliability. Recent empirical evidence has highlighted some abnormal features when studying demand flexibility, such as delayed responses and vanishing elasticities after price spikes. Existing methods fail to capture these complicated features because they heavily rely on some predefined (often over-simplified) regression expressions. Instead, this paper proposes a model-free methodology to automatically and accurately derive the optimal estimation pattern. We further develop a two-stage estimation process with Siamese long short-term memory (LSTM) networks. Here, one LSTM network encodes the price response, while the other network estimates the time-varying elasticities. In the case study, the proposed framework and models are validated to achieve higher overall estimation accuracy and a better description of various abnormal features when compared with the state-of-the-art methods.
Submitted 2 September, 2021;
originally announced September 2021.
-
Improving Lesion Segmentation for Diabetic Retinopathy using Adversarial Learning
Authors:
Qiqi Xiao,
Jiaxu Zou,
Muqiao Yang,
Alex Gaudio,
Kris Kitani,
Asim Smailagic,
Pedro Costa,
Min Xu
Abstract:
Diabetic Retinopathy (DR) is a leading cause of blindness in working-age adults. DR lesions can be challenging to identify in fundus images, and automatic DR detection systems can offer strong clinical value. Of the publicly available labeled datasets for DR, the Indian Diabetic Retinopathy Image Dataset (IDRiD) presents retinal fundus images with pixel-level annotations of four distinct lesions: microaneurysms, hemorrhages, soft exudates and hard exudates. We utilize the HEDNet edge detector to solve a semantic segmentation task on this dataset, and then propose an end-to-end system for pixel-level segmentation of DR lesions by incorporating HEDNet into a Conditional Generative Adversarial Network (cGAN). We design a loss function that adds adversarial loss to segmentation loss. Our experiments show that the addition of the adversarial loss improves the lesion segmentation performance over the baseline.
Submitted 27 July, 2020;
originally announced July 2020.
-
A Global Benchmark of Algorithms for Segmenting Late Gadolinium-Enhanced Cardiac Magnetic Resonance Imaging
Authors:
Zhaohan Xiong,
Qing Xia,
Zhiqiang Hu,
Ning Huang,
Cheng Bian,
Yefeng Zheng,
Sulaiman Vesal,
Nishant Ravikumar,
Andreas Maier,
Xin Yang,
Pheng-Ann Heng,
Dong Ni,
Caizi Li,
Qianqian Tong,
Weixin Si,
Elodie Puybareau,
Younes Khoudli,
Thierry Geraud,
Chen Chen,
Wenjia Bai,
Daniel Rueckert,
Lingchao Xu,
Xiahai Zhuang,
Xinzhe Luo,
Shuman Jia
, et al. (19 additional authors not shown)
Abstract:
Segmentation of cardiac images, particularly late gadolinium-enhanced magnetic resonance imaging (LGE-MRI) widely used for visualizing diseased cardiac structures, is a crucial first step for clinical diagnosis and treatment. However, direct segmentation of LGE-MRIs is challenging due to their attenuated contrast. Since most clinical studies have relied on manual and labor-intensive approaches, automatic methods are of high interest, particularly optimized machine learning approaches. To address this, we organized the "2018 Left Atrium Segmentation Challenge" using 154 3D LGE-MRIs, currently the world's largest cardiac LGE-MRI dataset, and associated labels of the left atrium segmented by three medical experts, ultimately attracting the participation of 27 international teams. In this paper, extensive analysis of the submitted algorithms using technical and biological metrics was performed through subgroup analysis and hyper-parameter analysis, offering an overall picture of the major design choices of convolutional neural networks (CNNs) and practical considerations for achieving state-of-the-art left atrium segmentation. Results show the top method achieved a Dice score of 93.2% and a mean surface-to-surface distance of 0.7 mm, significantly outperforming the prior state-of-the-art. In particular, our analysis demonstrated that double, sequentially used CNNs, in which a first CNN is used for automatic region-of-interest localization and a subsequent CNN is used for refined regional segmentation, achieved far better results than traditional methods and pipelines containing single CNNs. This large-scale benchmarking study makes a significant step towards much-improved segmentation methods for cardiac LGE-MRIs, and will serve as an important benchmark for evaluating and comparing future works in the field.
Submitted 7 May, 2020; v1 submitted 26 April, 2020;
originally announced April 2020.
-
SenseCare: A Research Platform for Medical Image Informatics and Interactive 3D Visualization
Authors:
Qi Duan,
Guotai Wang,
Rui Wang,
Chao Fu,
Xinjun Li,
Na Wang,
Yechong Huang,
Xiaodi Huang,
Tao Song,
Liang Zhao,
Xinglong Liu,
Qing Xia,
Zhiqiang Hu,
Yinan Chen,
Shaoting Zhang
Abstract:
Clinical research on smart health has an increasing demand for intelligent and clinic-oriented medical image computing algorithms and platforms that support various applications. To this end, we have developed the SenseCare research platform, which is designed to facilitate translational research on intelligent diagnosis and treatment planning in various clinical scenarios. To enable clinical research with Artificial Intelligence (AI), SenseCare provides a range of AI toolkits for different tasks, including image segmentation, registration, and lesion and landmark detection from various image modalities ranging from radiology to pathology. In addition, SenseCare is clinic-oriented and supports a wide range of clinical applications such as diagnosis and surgical planning for lung cancer, pelvic tumor, coronary artery disease, etc. SenseCare provides several appealing functions and features such as advanced 3D visualization, concurrent and efficient web-based access, fast data synchronization and high data security, multi-center deployment, support for collaborative research, etc. In this report, we present an overview of SenseCare as an efficient platform providing comprehensive toolkits and high extensibility for intelligent image analysis and clinical research in different application scenarios. We also summarize the research outcomes from collaboration with multiple hospitals.
Submitted 2 September, 2022; v1 submitted 2 April, 2020;
originally announced April 2020.
-
Object-Based Image Coding: A Learning-Driven Revisit
Authors:
Qi Xia,
Haojie Liu,
Zhan Ma
Abstract:
Object-Based Image Coding (OBIC), which was extensively studied about two decades ago, promised broad application prospects for both ultra-low-bitrate communication and high-level semantic content understanding, but it had rarely been used due to the inefficient compact representation of objects with arbitrary shapes. A fundamental underlying issue is how to efficiently process arbitrary-shaped objects at a fine granularity (e.g., feature element or pixel wise). To address this, we propose element-wise masking and compression by devising an object segmentation network for image layer decomposition, and parallel convolution-based neural image compression networks to process masked foreground objects and background scene separately. All components are optimized in an end-to-end learning framework to intelligently weigh their (e.g., object and background) contributions for visually pleasant reconstruction. We have conducted comprehensive experiments to evaluate the performance on the PASCAL VOC dataset in a very low bitrate scenario (e.g., $\lesssim$0.1 bits per pixel, bpp), which demonstrate noticeable subjective quality improvement compared with JPEG2K, HEVC-based BPG and another learned image compression method. All relevant materials are made publicly accessible at https://njuvision.github.io/Neural-Object-Coding/.
Submitted 18 March, 2020;
originally announced March 2020.
-
Transmission Expansion Planning with Seasonal Network Optimization
Authors:
Xingpeng Li,
Qianxue Xia
Abstract:
Transmission expansion planning (TEP) is critical for the power grid to meet fast-growing demand in the future. The traditional TEP model does not utilize the flexibility of the transmission network, which is treated as a static asset. However, as the load profile may have different seasonal patterns, the optimal network configuration could be very different for different seasons in the planning horizon. Therefore, this paper proposes to incorporate seasonal network optimization (SNO) into the traditional TEP model. SNO dynamically optimizes the network for each season of each planning epoch. Two TEP-SNO models are proposed to investigate the benefits of optimizing the status of (i) existing branches, and (ii) existing and new branches, respectively. Numerical simulations demonstrate the effectiveness of the proposed TEP-SNO models. It is shown that SNO can improve system operational efficiency, defer investment in new transmission elements, and reduce the total cost.
Submitted 29 November, 2019;
originally announced November 2019.
-
Stochastic Optimal Power Flow with Network Reconfiguration: Congestion Management and Facilitating Grid Integration of Renewables
Authors:
Xingpeng Li,
Qianxue Xia
Abstract:
There has been significant growth of variable renewable generation in the power grid today. However, the industry still uses deterministic optimization to model and solve the optimal power flow (OPF) problem for real-time generation dispatch, which ignores the uncertainty associated with intermittent renewable power. Thus, it is necessary to study stochastic OPF (SOPF), which can better handle uncertainty since it is able to consider the probabilistic forecasting information of intermittent renewables. Transmission network congestion is one of the main reasons for renewable energy curtailment. Prior efforts in the literature show that utilizing transmission network reconfiguration can relieve congestion and resolve congestion-induced issues. This paper enhances SOPF by incorporating network reconfiguration into the dispatch model. Numerical simulations show that renewable curtailment can be avoided with the proposed network reconfiguration scheme, which relieves transmission congestion in post-contingency situations. It is also shown that network reconfiguration can substantially reduce congestion cost, especially the contingency-case congestion cost.
Submitted 29 November, 2019;
originally announced November 2019.