-
MIMO Capacity Analysis and Channel Estimation for Electromagnetic Information Theory
Authors:
Jieao Zhu,
Vincent Y. F. Tan,
Linglong Dai
Abstract:
Electromagnetic information theory (EIT) is an interdisciplinary subject that integrates deterministic electromagnetic theory with Shannon's stochastic information theory. Existing EIT analysis operates in the continuous space domain, which is not aligned with practical algorithms that work in the discrete space domain. This mismatch makes it significantly difficult to apply EIT methodologies to practical discrete space systems; we call it the discrete-continuous gap in this paper. To bridge this gap, we establish the discrete-continuous correspondence with a prolate spheroidal wave function (PSWF)-based ergodic capacity analysis framework. Specifically, we state and prove some discrete-continuous correspondence lemmas to establish a firm theoretical connection between discrete information-theoretic quantities and their continuous counterparts. With these lemmas, we apply the PSWF ergodic capacity bound to advanced MIMO architectures such as continuous-aperture MIMO (CAP-MIMO) and extremely large-scale MIMO (XL-MIMO). From this PSWF capacity bound, we discover the capacity saturation phenomenon both theoretically and empirically. Although the growth of MIMO performance is fundamentally limited in this EIT-based analysis framework, we reveal new opportunities in MIMO channel estimation by exploiting the EIT knowledge about the channel. Inspired by the PSWF capacity bound, we utilize continuous PSWFs to improve the pilot design of discrete MIMO channel estimators, yielding what we call the PSWF channel estimator (PSWF-CE). Simulation results demonstrate improved performance of the proposed PSWF-CE compared to traditional minimum mean squared error (MMSE) and compressed sensing-based estimators.
Submitted 7 June, 2024;
originally announced June 2024.
-
Resilient control of networked switched systems subject to deception attack and DoS attack
Authors:
Rui Zhao,
Zhiqiang Zuo,
Ying Tan,
Yi**g Wang,
Wentao Zhang
Abstract:
In this paper, the resilient control for switched systems in the presence of deception attack and denial-of-service (DoS) attack is addressed. Due to the interaction of two kinds of attacks and the asynchronous phenomenon of controller mode and subsystem mode, the system dynamics becomes much more complex. A criterion is derived to ensure the mean square security level of the closed-loop system. This in turn reveals the balance of system resilience and control performance. Furthermore, a mixed-switching control strategy is put forward to make the system globally asymptotically stable. It is shown that the system will still converge to the equilibrium even if the deception attack occurs. Finally, simulations are carried out to verify the effectiveness of the theoretical results.
Submitted 9 May, 2024;
originally announced May 2024.
-
Multimodal Physical Fitness Monitoring (PFM) Framework Based on TimeMAE-PFM in Wearable Scenarios
Authors:
Junjie Zhang,
Zheming Zhang,
Huachen Xiang,
Yangquan Tan,
Linnan Huo,
Fengyi Wang
Abstract:
Physical function monitoring (PFM) plays a crucial role in healthcare especially for the elderly. Traditional assessment methods such as the Short Physical Performance Battery (SPPB) have failed to capture the full dynamic characteristics of physical function. Wearable sensors such as smart wristbands offer a promising solution to this issue. However, challenges exist, such as the computational complexity of machine learning methods and inadequate information capture. This paper proposes a multi-modal PFM framework based on an improved TimeMAE, which compresses time-series data into a low-dimensional latent space and integrates a self-enhanced attention module. This framework achieves effective monitoring of physical health, providing a solution for real-time and personalized assessment. The method is validated using the NHATS dataset, and the results demonstrate an accuracy of 70.6% and an AUC of 82.20%, surpassing other state-of-the-art time-series classification models.
Submitted 25 March, 2024;
originally announced April 2024.
-
Tunable Superconducting Magnetic Levitation with Self-Stability
Authors:
Qi Xu,
Yi Lin,
Yunfei Tan,
Jianzhao Geng
Abstract:
Magnetic levitation based on the flux pinning nature of type II superconductors has the merit of self-stability, making it appealing for applications such as high speed bearings, maglev trains, space generators, etc. However, such levitation systems physically rely on the superconductor pre-capturing magnetic flux (i.e. a field cooling process) before establishing the levitation state, which is nonadjustable afterwards. Moreover, practical type II superconductors in the levitation system inevitably suffer from various sources of energy loss, leading to continuous levitation force decay. These intrinsic drawbacks make superconducting maglev inflexible and impractical for long term operation. Here we propose and demonstrate a new form of superconducting maglev which is tunable and self-stable. The maglev system uses a closed-loop type II superconducting coil to lock the flux of a magnet, establishing self-stable levitation between the two objects. A flux pump is used to modulate the total magnetic flux of the coil without breaking its superconductivity, thus flexibly tuning levitation force and height while maintaining self-stability. For the first time, we experimentally demonstrate a self-stable type II superconducting maglev system which is able to: counteract long term levitation force decay, adjust levitation force and equilibrium position, and establish levitation under zero field cooling conditions. These breakthroughs may bridge the gap between demonstrations and practical applications of type II superconducting maglevs.
Submitted 28 March, 2024;
originally announced March 2024.
-
EEG Based Generative Depression Discriminator
Authors:
Ziming Mao,
Hao Wu,
Yongxi Tan,
Yuhe **
Abstract:
Depression is a very common but serious mood disorder. In this paper, we built a generative detection network (GDN) in accordance with three physiological laws. Our aim is for the neural network to learn the relevant brain activity from the EEG signal and, at the same time, to regenerate the target electrode signal from that brain activity. We trained two generators: the first learns the characteristics of depressed brain activity, and the second learns the characteristics of the control group's brain activity. At test time, a segment of EEG signal is fed into the two generators separately; if the relationship between the EEG signal and brain activity conforms to the characteristics of a certain category, then the signal generated by the generator of that category is more consistent with the original signal. Thus it is possible to determine the category of a given segment of EEG signal. We obtained an accuracy of 92.30% on the MODMA dataset and 86.73% on the HUSM dataset. Moreover, this model is able to output explainable information, which can help the user discover possible misjudgments of the network. Our code will be released.
Submitted 19 January, 2024;
originally announced February 2024.
-
Automated Detection of Myopic Maculopathy in MMAC 2023: Achievements in Classification, Segmentation, and Spherical Equivalent Prediction
Authors:
Yihao Li,
Philippe Zhang,
Yubo Tan,
**g Zhang,
Zhihan Wang,
Weili Jiang,
Pierre-Henri Conze,
Mathieu Lamard,
Gwenolé Quellec,
Mostafa El Habib Daho
Abstract:
Myopic macular degeneration is the most common complication of myopia and the primary cause of vision loss in individuals with pathological myopia. Early detection and prompt treatment are crucial in preventing vision impairment due to myopic maculopathy. This was the focus of the Myopic Maculopathy Analysis Challenge (MMAC), in which we participated. In task 1, classification of myopic maculopathy, we employed the contrastive learning framework, specifically SimCLR, to enhance classification accuracy by effectively capturing enriched features from unlabeled data. This approach not only improved the intrinsic understanding of the data but also elevated the performance of our classification model. For Task 2 (segmentation of myopic maculopathy plus lesions), we have developed independent segmentation models tailored for different lesion segmentation tasks and implemented a test-time augmentation strategy to further enhance the model's performance. As for Task 3 (prediction of spherical equivalent), we have designed a deep regression model based on the data distribution of the dataset and employed an integration strategy to enhance the model's prediction accuracy. The results we obtained are promising and have allowed us to position ourselves in the Top 6 of the classification task, the Top 2 of the segmentation task, and the Top 1 of the prediction task. The code is available at \url{https://github.com/liyihao76/MMAC_LaTIM_Solution}.
Submitted 7 January, 2024;
originally announced January 2024.
-
Multi-Objective Complementary Control
Authors:
Jiapeng Xu,
Xiang Chen,
Ying Tan,
Kemin Zhou
Abstract:
This paper proposes a novel multi-objective control framework for linear time-invariant systems in which performance and robustness can be achieved in a complementary way instead of trade-off. In particular, a state-space solution is first established for a new stabilizing control structure consisting of two independently designed controllers coordinated with a Youla-type operator ${\bm Q}$. It is then shown by performance analysis that these two independently designed controllers operate in a naturally complementary way for a tracking control system, due to coordination function of ${\bm Q}$ driven by the residual signal of a Luenberger observer. Moreover, it is pointed out that ${\bm Q}$ could be further optimized with an additional gain factor to achieve improved performance, through a data-driven methodology for a measured cost function.
Submitted 14 December, 2023;
originally announced December 2023.
-
Adaptive Event-triggered Control For Strict-feedback Systems With Time-varying Parameters
Authors:
Yan Tan,
Liucang Wu,
Wenqi Liu
Abstract:
In this article, we develop a new adaptive event-triggered asymptotic control scheme for strict-feedback systems with fast time-varying parameters. To deal with time-varying parameters with unknown variation boundaries in the feedback path and the input path, we construct three adaptive laws for parameter estimation, two for the uncertain parameters in the feedback path and one for the uncertain parameters in the input path. In particular, two sets of tuning functions are introduced to avoid over-parametrization. Additionally, an event-triggering mechanism is embedded in this adaptive control framework to reduce the data transmission from the controller to the actuator. We also introduce a soft sign function to handle the perturbations caused by sampling errors to achieve asymptotic stability and avoid the so-called parameter drift. The stability analysis shows that the closed-loop system is globally uniformly asymptotically stable and the Zeno behavior can be excluded. Simulation results verify the effectiveness and performance of the proposed adaptive scheme.
Submitted 11 December, 2023;
originally announced December 2023.
-
Stain Consistency Learning: Handling Stain Variation for Automatic Digital Pathology Segmentation
Authors:
Michael Yeung,
Todd Watts,
Sean YW Tan,
Pedro F. Ferreira,
Andrew D. Scott,
Sonia Nielles-Vallespin,
Guang Yang
Abstract:
Stain variation is a unique challenge associated with automated analysis of digital pathology. Numerous methods have been developed to improve the robustness of machine learning methods to stain variation, but comparative studies have demonstrated limited benefits to performance. Moreover, methods to handle stain variation were largely developed for H&E stained data, with evaluation generally limited to classification tasks. Here we propose Stain Consistency Learning, a novel framework combining stain-specific augmentation with a stain consistency loss function to learn stain colour invariant features. We perform the first, extensive comparison of methods to handle stain variation for segmentation tasks, comparing ten methods on Masson's trichrome and H&E stained cell and nuclei datasets, respectively. We observed that stain normalisation methods resulted in equivalent or worse performance, while stain augmentation or stain adversarial methods demonstrated improved performance, with the best performance consistently achieved by our proposed approach. The code is available at: https://github.com/mlyg/stain_consistency_learning
Submitted 11 November, 2023;
originally announced November 2023.
-
Concealed Electronic Countermeasures of Radar Signal with Adversarial Examples
Authors:
Ruinan Ma,
Canjie Zhu,
Mingfeng Lu,
Yunjie Li,
Yu-an Tan,
Ruibin Zhang,
Ran Tao
Abstract:
Electronic countermeasures involving radar signals are an important aspect of modern warfare. Traditional electronic countermeasure techniques typically add large-scale interference signals to ensure interference effects, which can make attacks too obvious. In recent years, AI-based attack methods have emerged that can effectively solve this problem, but the attack scenarios are currently limited to time-domain radar signal classification. In this paper, we focus on the time-frequency image classification scenario for radar signals. We first propose an attack pipeline for the time-frequency image scenario and the DITIMI-FGSM attack algorithm, which has high transferability. Then, we propose the STFT-based time domain signal attack (STDS) algorithm to solve the problem of non-invertibility in time-frequency analysis, thus obtaining the time-domain representation of the interference signal. A large number of experiments show that our attack pipeline is feasible and that the proposed attack method has a high success rate.
Submitted 12 October, 2023;
originally announced October 2023.
-
Learning Regularized Monotone Graphon Mean-Field Games
Authors:
Fengzhuo Zhang,
Vincent Y. F. Tan,
Zhaoran Wang,
Zhuoran Yang
Abstract:
This paper studies two fundamental problems in regularized Graphon Mean-Field Games (GMFGs). First, we establish the existence of a Nash Equilibrium (NE) of any $λ$-regularized GMFG (for $λ\geq 0$). This result relies on weaker conditions than those in previous works for analyzing both unregularized GMFGs ($λ=0$) and $λ$-regularized MFGs, which are special cases of GMFGs. Second, we propose provably efficient algorithms to learn the NE in weakly monotone GMFGs, motivated by Lasry and Lions [2007]. Previous literature either only analyzed continuous-time algorithms or required extra conditions to analyze discrete-time algorithms. In contrast, we design a discrete-time algorithm and derive its convergence rate solely under weakly monotone conditions. Furthermore, we develop and analyze the action-value function estimation procedure during the online learning process, which is absent from algorithms for monotone GMFGs. This serves as a sub-module in our optimization algorithm. The efficiency of the designed algorithm is corroborated by empirical evaluations.
Submitted 12 October, 2023;
originally announced October 2023.
-
Deep Unrolling for Nonconvex Robust Principal Component Analysis
Authors:
Elizabeth Z. C. Tan,
Caroline Chaux,
Emmanuel Soubies,
Vincent Y. F. Tan
Abstract:
We design algorithms for Robust Principal Component Analysis (RPCA) which consists in decomposing a matrix into the sum of a low rank matrix and a sparse matrix. We propose a deep unrolled algorithm based on an accelerated alternating projection algorithm which aims to solve RPCA in its nonconvex form. The proposed procedure combines benefits of deep neural networks and the interpretability of the original algorithm and it automatically learns hyperparameters. We demonstrate the unrolled algorithm's effectiveness on synthetic datasets and also on a face modeling problem, where it leads to both better numerical and visual performances.
Submitted 11 July, 2023;
originally announced July 2023.
-
Dictionary Learning under Symmetries via Group Representations
Authors:
Subhroshekhar Ghosh,
Aaron Y. R. Low,
Yong Sheng Soh,
Zhuohang Feng,
Brendan K. Y. Tan
Abstract:
The dictionary learning problem can be viewed as a data-driven process to learn a suitable transformation so that data is sparsely represented directly from example data. In this paper, we examine the problem of learning a dictionary that is invariant under a pre-specified group of transformations. Natural settings include Cryo-EM, multi-object tracking, synchronization, pose estimation, etc. We specifically study this problem under the lens of mathematical representation theory. Leveraging the power of non-abelian Fourier analysis for functions over compact groups, we prescribe an algorithmic recipe for learning dictionaries that obey such invariances. We relate the dictionary learning problem in the physical domain, which is naturally modelled as being infinite dimensional, with the associated computational problem, which is necessarily finite dimensional. We establish that the dictionary learning problem can be effectively understood as an optimization instance over certain matrix orbitopes having a particular block-diagonal structure governed by the irreducible representations of the group of symmetries. This perspective enables us to introduce a band-limiting procedure which obtains dimensionality reduction in applications. We provide guarantees for our computational ansatz to provide a desirable dictionary learning outcome. We apply our paradigm to investigate the dictionary learning problem for the groups SO(2) and SO(3). While the SO(2)-orbitope admits an exact spectrahedral description, substantially less is understood about the SO(3)-orbitope. We describe a tractable spectrahedral outer approximation of the SO(3)-orbitope, and contribute an alternating minimization paradigm to perform optimization in this setting. We provide numerical experiments to highlight the efficacy of our approach in learning SO(3)-invariant dictionaries, both on synthetic and on real world data.
Submitted 25 July, 2023; v1 submitted 31 May, 2023;
originally announced May 2023.
-
Sentence Embedder Guided Utterance Encoder (SEGUE) for Spoken Language Understanding
Authors:
Yi Xuan Tan,
Navonil Majumder,
Soujanya Poria
Abstract:
The pre-trained speech encoder wav2vec 2.0 performs very well on various spoken language understanding (SLU) tasks. However, on many tasks, it trails behind text encoders with textual input. To improve the understanding capability of SLU encoders, various studies have used knowledge distillation to transfer knowledge from natural language understanding (NLU) encoders. We use a very simple method of distilling from a textual sentence embedder directly into wav2vec 2.0 as pre-training, utilizing paired audio-text datasets. We observed that this method is indeed capable of improving SLU task performance in fine-tuned settings, as well as full-data and few-shot transfer on a frozen encoder. However, the model performs worse on certain tasks highlighting the strengths and weaknesses of our approach.
Submitted 20 May, 2023;
originally announced May 2023.
-
Model-driven CT reconstruction algorithm for nano-resolution X-ray phase contrast imaging
Authors:
Xuebao Cai,
Yuhang Tan,
Ting Su,
Dong Liang,
Hairong Zheng,
**you Xu,
Pei** Zhu,
Yongshuai Ge
Abstract:
The low-density imaging performance of a zone plate based nano-resolution hard X-ray computed tomography (CT) system can be significantly improved by incorporating a grating-based Lau interferometer. Due to diffraction, however, the acquired nano-resolution phase signal may suffer from a splitting problem, which impedes the direct reconstruction of phase contrast CT (nPCT) images. To overcome this, a new model-driven nPCT image reconstruction algorithm is developed in this study. In it, the diffraction procedure is mathematically modeled as a matrix B, from which the projections without signal splitting can be generated by inversion. Furthermore, a penalized weighted least-square model with total variation (PWLS-TV) is employed to denoise these projections, from which nPCT images with high accuracy are directly reconstructed. Numerical and physical experiments demonstrate that this new algorithm is able to work with phase projections having any splitting distance. Results also reveal that nPCT images with a higher signal-to-noise ratio (SNR) can be reconstructed from projections with larger signal splittings. In conclusion, a novel model-driven nPCT image reconstruction algorithm with high accuracy and robustness is verified for Lau interferometer based hard X-ray nano-resolution phase contrast imaging.
Submitted 13 October, 2023; v1 submitted 14 May, 2023;
originally announced May 2023.
-
Robust Tracking Control for Nonlinear Systems: Performance optimization via extremum seeking
Authors:
Jiapeng Xu,
Ying Tan,
Xiang Chen
Abstract:
This paper presents a controller design and optimization framework for nonlinear dynamic systems to track a given reference signal in the presence of disturbances when the task is repeated over a finite-time interval. This novel framework mainly consists of two steps. The first step is to design a robust linear quadratic tracking controller based on the existing control structure with a Youla-type filter $\tilde Q$. The second step adds an extra degree of freedom, a parameterization in terms of $\tilde Q$, to this design framework. This extra design parameter is tuned iteratively from the measured tracking cost function under the given disturbances and modeling uncertainties to achieve the best transient performance. The proposed method is validated with simulations on a Furuta inverted pendulum, showing significant tracking performance improvement.
Submitted 31 March, 2023;
originally announced April 2023.
-
Backdoor Attacks Against Deep Image Compression via Adaptive Frequency Trigger
Authors:
Yi Yu,
Yufei Wang,
Wenhan Yang,
Shijian Lu,
Yap-peng Tan,
Alex C. Kot
Abstract:
Recent deep-learning-based compression methods have achieved superior performance compared with traditional approaches. However, deep learning models have proven to be vulnerable to backdoor attacks, where some specific trigger patterns added to the input can lead to malicious behavior of the models. In this paper, we present a novel backdoor attack with multiple triggers against learned image compression models. Motivated by the widely used discrete cosine transform (DCT) in existing compression systems and standards, we propose a frequency-based trigger injection model that adds triggers in the DCT domain. In particular, we design several attack objectives for various attacking scenarios, including: 1) attacking compression quality in terms of bit-rate and reconstruction quality; 2) attacking task-driven measures, such as down-stream face recognition and semantic segmentation. Moreover, a novel simple dynamic loss is designed to balance the influence of different loss terms adaptively, which helps achieve more efficient training. Extensive experiments show that with our trained trigger injection models and simple modification of encoder parameters (of the compression model), the proposed attack can successfully inject several backdoors with corresponding triggers in a single image compression model.
Submitted 28 February, 2023;
originally announced February 2023.
-
Event-triggered Hybrid Energy-aware Scheduling in Manufacturing Systems
Authors:
Zhean Shao,
Wen Li,
Ying Tan
Abstract:
Incorporating renewable energy sources (RESs) into manufacturing systems has been an active research area in order to address many challenges originating from the unpredictable nature of RESs such as photovoltaics.In the energy-aware scheduling for manufacturing systems, the traditional off-line scheduling techniques cannot always work well due to their lack of robustness with respect to uncertain…
▽ More
Incorporating renewable energy sources (RESs) into manufacturing systems has been an active research area in order to address many challenges originating from the unpredictable nature of RESs such as photovoltaics.In the energy-aware scheduling for manufacturing systems, the traditional off-line scheduling techniques cannot always work well due to their lack of robustness with respect to uncertainties coming from imprecise models or unexpected situations. On the other hand, on-line scheduling or rescheduling, which can improve the robustness by using the model and the latest measurements simultaneously, suffer from a high computational cost. This work proposes a hybrid scheduling framework, which combines the advantages of both off-line scheduling and on-line scheduling, to provide a balanced solution between robustness and computational cost. A novel concept of partially-dispatchable state is introduced. It can be treated as a constant in scheduling when the model works well. When the model does not work well, it is triggered as the variable to tune to improve the performance. Such an event-triggered structure can reduce the number of rescheduling and computational costs while achieving a reasonable performance and enhancing system robustness. Moreover, the choice of partially-dispatchable state also provides an extra design freedom in achieving green manufacturing. Simulation examples on a manufacturing system, of which consists a 100-kW solar photovoltaic system, a 10-machine flow shop production line, a 50-kWh energy storage system, a 100-kW gas turbine, and the grid for power supply, demonstrating the validity and applicability of this event-triggered hybrid scheduling (ETHS) framework.
Submitted 1 February, 2023;
originally announced February 2023.
-
A Comprehensive Survey on Heart Sound Analysis in the Deep Learning Era
Authors:
Zhao Ren,
Yi Chang,
Thanh Tam Nguyen,
Yang Tan,
Kun Qian,
Björn W. Schuller
Abstract:
Heart sound auscultation has been applied in clinical practice for early screening of cardiovascular diseases. Due to the high demand for auscultation expertise, automatic auscultation can help with auxiliary diagnosis and reduce the burden of training professional clinicians. Nevertheless, there is a limit to classic machine learning's performance improvement in the era of big data. Deep learning has outperformed classic machine learning in many research fields, as it employs more complex model architectures with a stronger capability of extracting effective representations. Moreover, it has been successfully applied to heart sound analysis in the past years. As most review works about heart sound analysis were carried out before 2017, the present survey is the first to provide a comprehensive overview summarising papers on heart sound analysis with deep learning published in 2017-2022. This work introduces both classic machine learning and deep learning for comparison, and further offers insights into the advances and future research directions in deep learning for heart sound analysis. Our repository is publicly available at https://github.com/zhaoren91/awesome-heart-sound-analysis.
Submitted 11 May, 2024; v1 submitted 23 January, 2023;
originally announced January 2023.
-
Finding the Most Transferable Tasks for Brain Image Segmentation
Authors:
Yicong Li,
Yang Tan,
Jingyun Yang,
Yang Li,
Xiao-Ping Zhang
Abstract:
Although many studies have successfully applied transfer learning to medical image segmentation, very few of them have investigated the selection strategy when multiple source tasks are available for transfer. In this paper, we propose a prior-knowledge-guided and transferability-based framework to select the best source tasks among a collection of brain image segmentation tasks, to improve the transfer learning performance on the given target task. The framework consists of modality analysis, RoI (region of interest) analysis, and transferability estimation, such that the source task selection can be refined step by step. Specifically, we adapt the state-of-the-art analytical transferability estimation metrics to medical image segmentation tasks and further show that their performance can be significantly boosted by filtering candidate source tasks based on modality and RoI characteristics. Our experiments on brain matter, brain tumor, and white matter hyperintensities segmentation datasets reveal that transferring from different tasks under the same modality is often more successful than transferring from the same task under different modalities. Furthermore, within the same modality, transferring from the source task that has stronger RoI shape similarity with the target task can significantly improve the final transfer performance. Such similarity can be captured using the Structural Similarity index in the label space.
Submitted 2 January, 2023;
originally announced January 2023.
-
On Robust Observer Design for System Motion on SE(3) Using Onboard Visual Sensors
Authors:
Tong Zhang,
Ying Tan,
Xiang Chen,
Zike Lei
Abstract:
Onboard visual sensing has been widely used in unmanned ground vehicles (UGVs) and/or unmanned aerial vehicles (UAVs), which can be modeled as dynamic systems on SE(3). The onboard sensing outputs of the dynamic system can usually be used to derive the relative position between the feature marks and the system, but they are subject to an explicit geometrical constraint. Such a visual geometrical constraint makes the design of the visual observer on SE(3) very challenging, as it causes a time-varying or switching visible set due to the varying number of feature marks in this set along different trajectories. Moreover, the possibility of having mis-identified feature marks and modeling uncertainties might result in a divergent estimation error. This paper proposes a new robust observer design method that can accommodate these uncertainties from onboard visual sensing. The key design idea for this observer is to estimate the visible set and identify the mis-identified features from the measurements. Based on the identified uncertainties, a switching strategy is proposed to ensure bounded estimation error for any given trajectory over a fixed time interval. Simulation results are provided to demonstrate the effectiveness of the proposed robust observer.
Submitted 21 March, 2023; v1 submitted 29 November, 2022;
originally announced November 2022.
-
Segmentation, Classification, and Quality Assessment of UW-OCTA Images for the Diagnosis of Diabetic Retinopathy
Authors:
Yihao Li,
Rachid Zeghlache,
Ikram Brahim,
Hui Xu,
Yubo Tan,
Pierre-Henri Conze,
Mathieu Lamard,
Gwenolé Quellec,
Mostafa El Habib Daho
Abstract:
Diabetic Retinopathy (DR) is a severe complication of diabetes that can cause blindness. Although effective treatments exist (notably laser) to slow the progression of the disease and prevent blindness, the best treatment remains prevention through regular check-ups (at least once a year) with an ophthalmologist. Optical Coherence Tomography Angiography (OCTA) allows for the visualization of the retinal vascularization and the choroid at the microvascular level in great detail. This allows doctors to diagnose DR with more precision. In recent years, algorithms for DR diagnosis have emerged along with the development of deep learning and the improvement of computer hardware. However, these usually focus on retina photography. There are currently no methods that can automatically analyze DR using Ultra-Wide OCTA (UW-OCTA). The Diabetic Retinopathy Analysis Challenge 2022 (DRAC22) provides a standardized UW-OCTA dataset to train and test the effectiveness of various algorithms on three tasks: lesion segmentation, quality assessment, and DR grading. In this paper, we present our solutions for the three tasks of the DRAC22 challenge. The obtained results are promising and have allowed us to position ourselves in the TOP 5 of the segmentation task, the TOP 4 of the quality assessment task, and the TOP 3 of the DR grading task. The code is available at https://github.com/Mostafa-EHD/Diabetic_Retinopathy_OCTA.
Submitted 21 November, 2022;
originally announced November 2022.
-
Robust output regulation of linear system subject to modeled and unmodeled uncertainty
Authors:
Zhicheng Zhang,
Zhiqiang Zuo,
Xiang Chen,
Ying Tan,
Yijing Wang
Abstract:
In this paper, a novel robust output regulation control framework is proposed for systems subject to noise, modeled disturbance, and unmodeled disturbance, in order to achieve tracking performance and robustness simultaneously. The output regulation scheme is utilized in the framework to track the reference in the presence of modeled disturbance, and the effect of unmodeled disturbance is reduced by an $\mathcal{H}_\infty$ compensator. A Kalman filter can also be introduced in the stabilization loop to deal with white noise. Furthermore, the tracking error in the presence/absence of noise and disturbance is estimated. The effectiveness and performance of the proposed control framework are verified in a numerical example by applying it to the Furuta inverted pendulum system.
Submitted 26 October, 2022;
originally announced October 2022.
-
Are Macula or Optic Nerve Head Structures better at Diagnosing Glaucoma? An Answer using AI and Wide-Field Optical Coherence Tomography
Authors:
Charis Y. N. Chiang,
Fabian Braeu,
Thanadet Chuangsuwanich,
Royston K. Y. Tan,
Jacqueline Chua,
Leopold Schmetterer,
Alexandre Thiery,
Martin Buist,
Michaël J. A. Girard
Abstract:
Purpose: (1) To develop a deep learning algorithm to automatically segment structures of the optic nerve head (ONH) and macula in 3D wide-field optical coherence tomography (OCT) scans; (2) To assess whether 3D macula or ONH structures (or the combination of both) provide the best diagnostic power for glaucoma. Methods: A cross-sectional comparative study was performed which included wide-field swept-source OCT scans from 319 glaucoma subjects and 298 non-glaucoma subjects. All scans were compensated to improve deep-tissue visibility. We developed a deep learning algorithm to automatically label all major ONH tissue structures by using 270 manually annotated B-scans for training. The performance of our algorithm was assessed using the Dice coefficient (DC). A glaucoma classification algorithm (3D CNN) was then designed using a combination of 500 OCT volumes and their corresponding automatically segmented masks. This algorithm was trained and tested on 3 datasets: OCT scans cropped to contain the macular tissues only, those cropped to contain the ONH tissues only, and the full wide-field OCT scans. The classification performance for each dataset was reported using the AUC. Results: Our segmentation algorithm was able to segment ONH and macular tissues with a DC of 0.94 $\pm$ 0.003. The classification algorithm was best able to diagnose glaucoma using wide-field 3D-OCT volumes with an AUC of 0.99 $\pm$ 0.01, followed by ONH volumes with an AUC of 0.93 $\pm$ 0.06, and finally macular volumes with an AUC of 0.91 $\pm$ 0.11. Conclusions: This study showed that using wide-field OCT, as compared to the typical OCT images containing just the ONH or macula, may allow for a significantly improved glaucoma diagnosis. This may encourage the mainstream adoption of 3D wide-field OCT scans. For clinical AI studies that use traditional machines, we would recommend the use of ONH scans as opposed to macula scans.
Submitted 12 October, 2022;
originally announced October 2022.
-
Six-center Assessment of CNN-Transformer with Belief Matching Loss for Patient-independent Seizure Detection in EEG
Authors:
Wei Yan Peh,
Prasanth Thangavel,
Yuanyuan Yao,
John Thomas,
Yee Leng Tan,
Justin Dauwels
Abstract:
Neurologists typically identify epileptic seizures from electroencephalograms (EEGs) by visual inspection. This process is often time-consuming, especially for EEG recordings that last hours or days. To expedite the process, a reliable, automated, and patient-independent seizure detector is essential. However, developing a patient-independent seizure detector is challenging as seizures exhibit diverse characteristics across patients and recording devices. In this study, we propose a patient-independent seizure detector to automatically detect seizures in both scalp EEG and intracranial EEG (iEEG). First, we deploy a convolutional neural network with transformers and belief matching loss to detect seizures in single-channel EEG segments. Next, we extract regional features from the channel-level outputs to detect seizures in multi-channel EEG segments. Then, we apply postprocessing filters to the segment-level outputs to determine seizures' start and end points in multi-channel EEGs. Finally, we introduce the minimum overlap evaluation scoring as an evaluation metric that accounts for minimum overlap between the detection and seizure, improving upon existing assessment metrics. We trained the seizure detector on the Temple University Hospital Seizure (TUH-SZ) dataset and evaluated it on five independent EEG datasets. We evaluate the systems with the following metrics: sensitivity (SEN), precision (PRE), and average and median false positive rate per hour (aFPR/h and mFPR/h). Across four adult scalp EEG and iEEG datasets, we obtained SEN of 0.617-1.00, PRE of 0.534-1.00, aFPR/h of 0.425-2.002, and mFPR/h of 0-1.003. The proposed seizure detector can detect seizures in adult EEGs and takes less than 15 s for a 30-minute EEG. Hence, this system could aid clinicians in reliably identifying seizures expeditiously, allocating more time for devising proper treatment.
Submitted 22 November, 2022; v1 submitted 29 July, 2022;
originally announced August 2022.
-
Asymptotic Nash Equilibrium for the $M$-ary Sequential Adversarial Hypothesis Testing Game
Authors:
Jiachun Pan,
Yonglong Li,
Vincent Y. F. Tan
Abstract:
In this paper, we consider a novel $M$-ary sequential hypothesis testing problem in which an adversary is present and perturbs the distributions of the samples before the decision maker observes them. This problem is formulated as a sequential adversarial hypothesis testing game played between the decision maker and the adversary. This game is a zero-sum and strategic one. We assume the adversary is active under \emph{all} hypotheses and knows the underlying distribution of observed samples. We adopt this framework as it is the worst-case scenario from the perspective of the decision maker. The goal of the decision maker is to minimize the expectation of the stopping time to ensure that the test is as efficient as possible; the adversary's goal is, instead, to maximize the stopping time. We derive a pair of strategies under which the asymptotic Nash equilibrium of the game is attained. We also consider the case in which the adversary is not aware of the underlying hypothesis and hence is constrained to apply the same strategy regardless of which hypothesis is in effect. Numerical results corroborate our theoretical findings.
Submitted 20 June, 2022;
originally announced June 2022.
-
Extremely Low-light Image Enhancement with Scene Text Restoration
Authors:
Pohao Hsu,
Che-Tsung Lin,
Chun Chet Ng,
Jie-Long Kew,
Mei Yih Tan,
Shang-Hong Lai,
Chee Seng Chan,
Christopher Zach
Abstract:
Deep learning-based methods have made impressive progress in enhancing extremely low-light images; the image quality of the reconstructed images has generally improved. However, we found that most of these methods could not sufficiently recover image details, for instance, the text in the scene. In this paper, a novel image enhancement framework is proposed to precisely restore the scene text while simultaneously improving the overall quality of the image under extremely low-light conditions. Mainly, we employed a self-regularised attention map, an edge map, and a novel text detection loss. In addition, leveraging synthetic low-light images is beneficial for image enhancement on the genuine ones in terms of text detection. The quantitative and qualitative experimental results have shown that the proposed model outperforms state-of-the-art methods in image restoration, text detection, and text spotting on the See In the Dark and ICDAR15 datasets.
Submitted 1 April, 2022;
originally announced April 2022.
-
Optimization of Directional Landmark Deployment for Visual Observer on SE(3)
Authors:
Zike Lei,
Xi Chen,
Ying Tan,
Xiang Chen,
Li Chai
Abstract:
An optimization method is proposed in this paper for the novel deployment of a given number of directional landmarks (location and pose) within a given region in the 3-D task space. This new deployment technique is built on the geometric models of both the landmarks and the monocular camera. In particular, a new concept of Multiple Coverage Probability (MCP) is defined to characterize the probability of at least n landmarks being covered simultaneously by a camera at a fixed position. The optimization is conducted with respect to the position and pose of the given number of landmarks to maximize MCP through global exploration of the given 3-D space. By adopting the elimination genetic algorithm, the global optimal solutions can be obtained, which are then applied to improve the convergence performance of the visual observer on SE(3) as a demonstration example. Both simulation and experimental results are presented to validate the effectiveness of the proposed landmark deployment optimization method.
Submitted 28 March, 2022;
originally announced March 2022.
-
A distributionally robust optimization approach to two-sided chance constrained stochastic model predictive control with unknown noise distribution
Authors:
Yuan Tan,
Jun Yang,
Wen-Hua Chen,
Shihua Li
Abstract:
In this work, we propose a distributionally robust stochastic model predictive control (DR-SMPC) algorithm to address the problem of two-sided chance-constrained discrete-time linear systems corrupted by additive noise. The prevalent mechanism to cope with two-sided chance constraints is the so-called risk allocation approach, which conservatively approximates the two-sided chance constraints with two single chance constraints by applying Boole's inequality. In the proposed DR-SMPC framework, an exact tractable second-order cone (SOC) approach is adopted to abstract the two-sided chance constraints by considering the first and second moments of the noise. The proposed DR-SMPC algorithm is able to guarantee that the worst-case probability of violating both the upper and lower limits of safety constraints is within the pre-specified maximum probability (PsMP). By flexibly adjusting this PsMP, the feasible region of the initial states can be increased for the SMPC problem. The recursive feasibility and convergence of the proposed DR-SMPC are established rigorously by introducing a binary initialization strategy for the nominal state. Simulation studies of two practical cases are conducted to demonstrate the effectiveness of the proposed DR-SMPC algorithm.
Submitted 16 March, 2022;
originally announced March 2022.
-
Adversarial amplitude swap towards robust image classifiers
Authors:
Chun Yang Tan,
Kazuhiko Kawamoto,
Hiroshi Kera
Abstract:
The vulnerability of convolutional neural networks (CNNs) to image perturbations such as common corruptions and adversarial perturbations has recently been investigated from the perspective of frequency. In this study, we investigate the effect of the amplitude and phase spectra of adversarial images on the robustness of CNN classifiers. Extensive experiments revealed that the images generated by combining the amplitude spectrum of adversarial images and the phase spectrum of clean images accommodate moderate and general perturbations, and that training with these images equips a CNN classifier with more general robustness, performing well under both common corruptions and adversarial perturbations. We also found that two types of overfitting (catastrophic overfitting and robust overfitting) can be circumvented by the aforementioned spectrum recombination. We believe that these results contribute to the understanding and the training of truly robust classifiers.
Submitted 1 April, 2022; v1 submitted 14 March, 2022;
originally announced March 2022.
-
Multi-step dual control for exploration and exploitation in autonomous search with convergence guarantee
Authors:
Yuan Tan,
Jun Yang,
Wen-Hua Chen,
Shihua Li
Abstract:
Motivated by the recently proposed dual control for exploration and exploitation (DCEE) concept, this paper presents a Multi-Step DCEE (MS-DCEE) framework with guaranteed convergence for autonomous search of a source of airborne dispersion. Unlike the existing stochastic model predictive control (SMPC) algorithm and informative path planning (IPP) approaches, the proposed MS-DCEE approach uses the current and future inputs not only to drive the agent towards the estimated source location (exploitation) but also to reduce its estimation uncertainty (exploration) by actively learning the operational environment. An unknown source position, together with an unknown environment, imposes significant challenges in establishing the recursive feasibility and the convergence of the proposed algorithm. To address them, with the help of the properties of Bayesian estimation, we develop a two-step approach where the unbiasedness of the mean estimate is assumed first and then the randomness of the mean estimate under each collected information sequence is accounted for. Based on that, we develop an MS-DCEE scheme with suitable terminal ingredients where recursive feasibility and convergence are guaranteed. Two simulation scenarios are conducted, which show that the proposed MS-DCEE algorithm outperforms the SMPC, the IPP, and the single-step DCEE approaches in terms of search success rate and efficiency.
Submitted 12 March, 2022;
originally announced March 2022.
-
Towards Adversarially Robust Deep Image Denoising
Authors:
Hanshu Yan,
Jingfeng Zhang,
Jiashi Feng,
Masashi Sugiyama,
Vincent Y. F. Tan
Abstract:
This work systematically investigates the adversarial robustness of deep image denoisers (DIDs), i.e., how well DIDs can recover the ground truth from noisy observations degraded by adversarial perturbations. Firstly, to evaluate DIDs' robustness, we propose a novel adversarial attack, namely Observation-based Zero-mean Attack (ObsAtk), to craft adversarial zero-mean perturbations on given noisy images. We find that existing DIDs are vulnerable to the adversarial noise generated by ObsAtk. Secondly, to robustify DIDs, we propose an adversarial training strategy, hybrid adversarial training (HAT), that jointly trains DIDs with adversarial and non-adversarial noisy data to ensure that the reconstruction quality is high and the denoisers around non-adversarial data are locally smooth. The resultant DIDs can effectively remove various types of synthetic and adversarial noise. We also uncover that the robustness of DIDs benefits their generalization capability on unseen real-world noise. Indeed, HAT-trained DIDs can recover high-quality clean images from real-world noise even without training on real noisy data. Extensive experiments on benchmark datasets, including Set68, PolyU, and SIDD, corroborate the effectiveness of ObsAtk and HAT.
Submitted 13 January, 2022; v1 submitted 12 January, 2022;
originally announced January 2022.
-
Verifying Switched System Stability With Logic
Authors:
Yong Kiam Tan,
Stefan Mitsch,
André Platzer
Abstract:
Switched systems are known to exhibit subtle (in)stability behaviors requiring system designers to carefully analyze the stability of closed-loop systems that arise from their proposed switching control laws. This paper presents a formal approach for verifying switched system stability that blends classical ideas from the controls and verification literature using differential dynamic logic (dL), a logic for deductive verification of hybrid systems. From controls, we use standard stability notions for various classes of switching mechanisms and their corresponding Lyapunov function-based analysis techniques. From verification, we use dL's ability to verify quantified properties of hybrid systems and dL models of switched systems as looping hybrid programs whose stability can be formally specified and proven by finding appropriate loop invariants, i.e., properties that are preserved across each loop iteration. This blend of ideas enables a trustworthy implementation of switched system stability verification in the KeYmaera X prover based on dL. For standard classes of switching mechanisms, the implementation provides fully automated stability proofs, including searching for suitable Lyapunov functions. Moreover, the generality of the deductive approach also enables verification of switching control laws that require non-standard stability arguments through the design of loop invariants that suitably express specific intuitions behind those control laws. This flexibility is demonstrated on three case studies: a model for longitudinal flight control by Branicky, an automatic cruise controller, and Brockett's nonholonomic integrator.
Submitted 8 April, 2022; v1 submitted 2 November, 2021;
originally announced November 2021.
-
Rheumatoid Arthritis: Automated Scoring of Radiographic Joint Damage
Authors:
Yan Ming Tan,
Raphael Quek Hao Chong,
Carol Anne Hargreaves
Abstract:
Rheumatoid arthritis is an autoimmune disease that causes joint damage due to inflammation in the soft tissue lining the joints, known as the synovium. It is vital to identify joint damage as soon as possible to provide necessary treatment early and prevent further damage to the bone structures. Radiographs are often used to assess the extent of the joint damage. Currently, the scoring of joint damage from the radiograph takes expertise, effort, and time. Joint damage associated with rheumatoid arthritis is also not quantitated in clinical practice, and subjective descriptors are used instead. In this work, we describe a pipeline of deep learning models to automatically identify and score rheumatoid arthritic joint damage from a radiographic image. Our automatic tool was shown to produce scores with extremely high balanced accuracy within a couple of minutes, and using it would remove the subjectivity of the scores between human reviewers.
Submitted 17 October, 2021;
originally announced October 2021.
-
Energy Management Strategy for Unmanned Tracked Vehicles Based on Local Speed Planning
Authors:
Tianxing Sun,
Shaohang Xu,
Zirui Li,
Yingqi Tan,
Huiyan Chen
Abstract:
The hybrid electric system has good potential for unmanned tracked vehicles due to its excellent power and economy. Because unmanned tracked vehicles have no traditional driving devices and their driving cycle is uncertain, they pose new challenges to conventional energy management strategies. This paper proposes a novel energy management strategy for unmanned tracked vehicles based on local speed planning. The contributions are threefold. Firstly, a local speed planning algorithm is adopted as the input of driving cycle prediction to avoid the dependence of traditional vehicles on the driver's operation. Secondly, a prediction model based on Convolutional Neural Networks and Long Short-Term Memory (CNN-LSTM) is proposed, which is used to process both the planned and the historical velocity series to improve the prediction accuracy. Finally, based on the prediction results, the model predictive control algorithm is used to realize the real-time optimization of energy management. The validity of the method is verified by simulation using data collected from actual field experiments with our unmanned tracked vehicle. Compared with multi-step neural networks, the prediction model based on CNN-LSTM improves the prediction accuracy by 20%. Compared with the traditional regular energy management strategy, the energy management strategy based on model predictive control reduces fuel consumption by 7%.
Submitted 4 July, 2021;
originally announced July 2021.
-
A Weak Monotonicity Based Muscle Fatigue Detection Algorithm for a Short-Duration Poor Posture Using sEMG Measurements
Authors:
Xinliang Guo,
Lei Lu,
Mark Robinson,
Ying Tan,
Kusal Goonewardena,
Denny Oetomo
Abstract:
Muscle fatigue is usually defined as a decrease in the ability to produce force. The surface electromyography (sEMG) signals have been widely used to provide information about muscle activities including detecting muscle fatigue by various data-driven techniques such as machine learning and statistical approaches. However, it is well known that sEMG signals are weak (low amplitude) with a low signal-to-noise ratio, and data-driven techniques cannot work well when the quality of the data is poor. In particular, the existing methods are unable to detect muscle fatigue coming from static poses. This work exploits the concept of weak monotonicity, which has been observed in the process of fatigue, to robustly detect muscle fatigue in the presence of measurement noises and human variations. Such a population trend methodology has shown its potential in muscle fatigue detection as demonstrated by the experiment of a static pose.
Submitted 18 June, 2021;
originally announced June 2021.
-
A stochastic metapopulation state-space approach to modeling and estimating Covid-19 spread
Authors:
Yukun Tan,
Durward Cator III,
Martial Ndeffo-Mbah,
Ulisses Braga-Neto
Abstract:
Mathematical models are widely recognized as an important tool for analyzing and understanding the dynamics of infectious disease outbreaks, predicting their future trends, and evaluating public health intervention measures for disease control and elimination. We propose a novel stochastic metapopulation state-space model for COVID-19 transmission, based on a discrete-time spatio-temporal susceptible/exposed/infected/recovered/deceased (SEIRD) model. The proposed framework allows the hidden SEIRD states and unknown transmission parameters to be estimated from noisy, incomplete time series of reported epidemiological data, by application of unscented Kalman filtering (UKF), maximum-likelihood adaptive filtering, and metaheuristic optimization. Experiments using both synthetic data and real data from the Fall 2020 COVID-19 wave in the state of Texas demonstrate the effectiveness of the proposed model.
Submitted 15 June, 2021;
originally announced June 2021.
-
Exact Recovery in the General Hypergraph Stochastic Block Model
Authors:
Qiaosheng Zhang,
Vincent Y. F. Tan
Abstract:
This paper investigates fundamental limits of exact recovery in the general d-uniform hypergraph stochastic block model (d-HSBM), wherein n nodes are partitioned into k disjoint communities with relative sizes (p1,..., pk). Each subset of nodes with cardinality d is generated independently as an order-d hyperedge with a certain probability that depends on the ground-truth communities that the d nodes belong to. The goal is to exactly recover the k hidden communities based on the observed hypergraph. We show that there exists a sharp threshold such that exact recovery is achievable above the threshold and impossible below the threshold (apart from a small regime of parameters that will be specified precisely). This threshold is represented in terms of a quantity which we term the generalized Chernoff-Hellinger divergence between communities. Our result for this general model recovers prior results for the standard SBM and d-HSBM with two symmetric communities as special cases. En route to proving our achievability results, we develop a polynomial-time two-stage algorithm that meets the threshold. The first stage adopts a certain hypergraph spectral clustering method to obtain a coarse estimate of communities, and the second stage refines each node individually via local refinement steps to ensure exact recovery.
Submitted 9 September, 2022; v1 submitted 10 May, 2021;
originally announced May 2021.
-
Domestic activities clustering from audio recordings using convolutional capsule autoencoder network
Authors:
Ziheng Lin,
Yanxiong Li,
Zhangjin Huang,
Wenhao Zhang,
Yufeng Tan,
Yichun Chen,
Qianhua He
Abstract:
Recent efforts have been made on domestic activities classification from audio recordings, especially the works submitted to the challenge of DCASE (Detection and Classification of Acoustic Scenes and Events) since 2018. In contrast, few studies have been done on domestic activities clustering, which is a newly emerging problem. Domestic activities clustering from audio recordings aims at merging audio clips which belong to the same class of domestic activity into a single cluster. Domestic activities clustering is an effective way for unsupervised estimation of daily activities performed in the home environment. In this study, we propose a method for domestic activities clustering using a convolutional capsule autoencoder network (CCAN). In the method, the deep embeddings are learned by the autoencoder in the CCAN, while the deep embeddings which belong to the same class of domestic activities are merged into a single cluster by a clustering layer in the CCAN. Evaluated on a public dataset adopted in DCASE-2018 Task 5, the results show that the proposed method outperforms state-of-the-art methods in terms of the metrics of clustering accuracy and normalized mutual information.
Submitted 7 May, 2021;
originally announced May 2021.
-
Frequency Superposition -- A Multi-Frequency Stimulation Method in SSVEP-based BCIs
Authors:
Jing Mu,
David B. Grayden,
Ying Tan,
Denny Oetomo
Abstract:
The steady-state visual evoked potential (SSVEP) is one of the most widely used modalities in brain-computer interfaces (BCIs) due to its many advantages. However, the existence of harmonics and the limited range of responsive frequencies in SSVEP make it challenging to further expand the number of targets without sacrificing other aspects of the interface or putting additional constraints on the system. This paper introduces a novel multi-frequency stimulation method for SSVEP and investigates its potential to effectively and efficiently increase the number of targets presented. The proposed stimulation method, obtained by the superposition of the stimulation signals at different frequencies, is size-efficient, allows single-step target identification, puts no strict constraints on the usable frequency range, can be suited to self-paced BCIs, and does not require specific light sources. In addition to the stimulus frequencies and their harmonics, the evoked SSVEP waveforms include frequencies that are integer linear combinations of the stimulus frequencies. Results of decoding SSVEPs collected from nine subjects using canonical correlation analysis (CCA) with only the frequencies and harmonics as reference also demonstrate the potential of using such a stimulation paradigm in SSVEP-based BCIs.
Submitted 11 August, 2021; v1 submitted 25 April, 2021;
originally announced April 2021.
-
Adversarially-Trained Nonnegative Matrix Factorization
Authors:
Ting Cai,
Vincent Y. F. Tan,
Cédric Févotte
Abstract:
We consider an adversarially-trained version of the nonnegative matrix factorization, a popular latent dimensionality reduction technique. In our formulation, an attacker adds an arbitrary matrix of bounded norm to the given data matrix. We design efficient algorithms inspired by adversarial training to optimize for dictionary and coefficient matrices with enhanced generalization abilities. Extensive simulations on synthetic and benchmark datasets demonstrate the superior predictive performance on matrix completion tasks of our proposed method compared to state-of-the-art competitors, including other variants of adversarial nonnegative matrix factorization.
Submitted 22 June, 2021; v1 submitted 10 April, 2021;
originally announced April 2021.
-
Switched Systems as Hybrid Programs
Authors:
Yong Kiam Tan,
André Platzer
Abstract:
Real world systems of interest often feature interactions between discrete and continuous dynamics. Various hybrid system formalisms have been used to model and analyze this combination of dynamics, ranging from mathematical descriptions, e.g., using impulsive differential equations and switching, to automata-theoretic and language-based approaches. This paper bridges two such formalisms by showing how various classes of switched systems can be modeled using the language of hybrid programs from differential dynamic logic (dL). The resulting models enable the formal specification and verification of switched systems using dL and its existing deductive verification tools such as KeYmaera X. Switched systems also provide a natural avenue for the generalization of dL's deductive proof theory for differential equations. The completeness results for switched system invariants proved in this paper enable effective safety verification of those systems in dL.
Submitted 29 April, 2021; v1 submitted 15 January, 2021;
originally announced January 2021.
-
Signal Sets on Time Scales with Application to Hybrid Systems
Authors:
Ti-Chung Lee,
Ying Tan,
Iven Mareels
Abstract:
Recently, time scales calculus has been developed to unify continuous and discrete analysis. By extending the definition of time scales properly, this paper introduces the concept of a signal set as well as its stability properties in terms of the so-called pseudo distance measure. This leads to more general Lyapunov-like conditions to check stability properties of systems with hybrid nature. By way of examples, the proposed framework is used to model hybrid systems with simplicity and flexibility to characterize trajectories in the behavior of hybrid systems.
Submitted 25 November, 2020;
originally announced November 2020.
-
Multi-Frequency Canonical Correlation Analysis (MFCCA): A Generalised Decoding Algorithm for Multi-Frequency SSVEP
Authors:
Jing Mu,
Ying Tan,
David B. Grayden,
Denny Oetomo
Abstract:
Stimulation methods that utilise more than one stimulation frequency have been developed for steady-state visual evoked potential (SSVEP) brain-computer interfaces (BCIs) with the purpose of increasing the number of targets that can be presented simultaneously. However, there is no unified decoding algorithm that can be used without training for each individual user or case, and applied to a large class of multi-frequency stimulated SSVEP settings. This paper extends the widely used canonical correlation analysis (CCA) decoder to explicitly accommodate multi-frequency SSVEP by exploiting the interactions between the multiple stimulation frequencies. A concept of order, defined as the sum of the absolute values of the coefficients in the linear combination of the input frequencies, was introduced to assist the design of Multi-Frequency CCA (MFCCA). The probability distribution of the order in the resulting SSVEP response was then used to improve decoding accuracy. Results show that, compared to the standard CCA formulation, the proposed MFCCA has a 20% improvement in decoding accuracy on average at order 2, while keeping its generality and training-free characteristics.
Submitted 11 August, 2021; v1 submitted 27 October, 2020;
originally announced November 2020.
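The notion of order in the MFCCA abstract above (the sum of the absolute values of the integer coefficients in a linear combination of the stimulation frequencies) can be illustrated with a minimal sketch. It only enumerates candidate frequencies by order for two stimulation frequencies and is not the MFCCA decoder itself. The function name `combination_frequencies` and the example frequencies of 7 Hz and 9 Hz are illustrative assumptions, not taken from the paper.

```python
from itertools import product


def combination_frequencies(f1, f2, max_order):
    """Group positive frequencies a*f1 + b*f2 by order |a| + |b|.

    Hypothetical helper for illustration only: it enumerates integer
    coefficient pairs (a, b) and keeps the positive combinations whose
    order is between 1 and max_order.
    """
    by_order = {}
    coeffs = range(-max_order, max_order + 1)
    for a, b in product(coeffs, repeat=2):
        order = abs(a) + abs(b)
        if order == 0 or order > max_order:
            continue
        freq = a * f1 + b * f2
        if freq > 0:  # negative and zero frequencies carry no extra information
            by_order.setdefault(order, set()).add(freq)
    return {order: sorted(freqs) for order, freqs in sorted(by_order.items())}


# Assumed example: stimulation at 7 Hz and 9 Hz, combinations up to order 2.
refs = combination_frequencies(7, 9, 2)
print(refs)
```

Order 1 contains the stimulation frequencies themselves, while order 2 adds the second harmonics together with the sum and difference frequencies.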
-
Intelligent Omni-Surface: Ubiquitous Wireless Transmission by Reflective-Transmissive Metasurface
Authors:
Shuhang Zhang,
Hongliang Zhang,
Boya Di,
Yunhua Tan,
Marco Di Renzo,
Zhu Han,
H. Vincent Poor,
Lingyang Song
Abstract:
Intelligent reflecting surface (IRS), which is capable of adjusting propagation conditions by controlling phase shifts of the reflected waves that impinge on the surface, has been widely analyzed for enhancing the performance of wireless systems. However, the reflective properties of widely studied IRSs restrict the service coverage to only one side of the surface. In this paper, to extend the wireless coverage of communication systems, we introduce the concept of intelligent omni-surface (IOS)-assisted communication. More precisely, IOS is an important instance of reconfigurable intelligent surface (RIS) that is capable of providing service coverage to the mobile users (MUs) in a reflective and a transmissive manner. We consider a downlink IOS-assisted communication system, where a multi-antenna small base station (SBS) and an IOS perform beamforming jointly, to improve the received power of multiple MUs on both sides of the IOS, through different reflective/transmissive channels. To maximize the sum-rate, we formulate a joint IOS phase shift design and SBS beamforming optimization problem, and propose an iterative algorithm to solve the resulting non-convex program efficiently. Both theoretical analysis and simulation results show that an IOS significantly extends the service coverage of the SBS when compared to an IRS.
Submitted 27 June, 2021; v1 submitted 2 November, 2020;
originally announced November 2020.
-
Multi-center validation study of automated classification of pathological slowing in adult scalp electroencephalograms via frequency features
Authors:
Wei Yan Peh,
John Thomas,
Elham Bagheri,
Rima Chaudhari,
Sagar Karia,
Rahul Rathakrishnan,
Vinay Saini,
Nilesh Shah,
Rohit Srivastava,
Yee-Leng Tan,
Justin Dauwels
Abstract:
Pathological slowing in the electroencephalogram (EEG) is widely investigated for the diagnosis of neurological disorders. Currently, the gold standard for slowing detection is the visual inspection of the EEG by experts, which is time-consuming and subjective. To address those issues, we propose three automated approaches to detect slowing in EEG: Threshold-based Detecting System (TDS), Shallow Learning-based Detecting System (SLDS), and Deep Learning-based Detecting System (DLDS). These systems are evaluated on channel-, segment- and EEG-level. The TDS, SLDS, and DLDS perform prediction via detecting slowing at individual channels, and those detections are arranged in histograms for detection of slowing at the segment- and EEG-level. We evaluate the systems through Leave-One-Subject-Out (LOSO) cross-validation (CV) and Leave-One-Institution-Out (LOIO) CV on four datasets from the US, Singapore, and India. The DLDS achieved the best overall results: LOIO CV mean balanced accuracy (BAC) of 71.9%, 75.5%, and 82.0% at channel-, segment- and EEG-level, and LOSO CV mean BAC of 73.6%, 77.2%, and 81.8% at channel-, segment-, and EEG-level. The channel- and segment-level performance is comparable to the intra-rater agreement (IRA) of an expert of 72.4% and 82%. The DLDS can process a 30-minute EEG in 4 seconds and can be deployed to assist clinicians in interpreting EEGs.
Submitted 26 January, 2021; v1 submitted 28 September, 2020;
originally announced September 2020.
-
Beyond Intelligent Reflecting Surfaces: Reflective-Transmissive Metasurface Aided Communications for Full-dimensional Coverage Extension
Authors:
Shuhang Zhang,
Hongliang Zhang,
Boya Di,
Yunhua Tan,
Zhu Han,
Lingyang Song
Abstract:
In this paper, we study an intelligent omni-surface (IOS)-assisted downlink communication system, where the link quality of a mobile user (MU) can be improved with a proper IOS phase shift design. Unlike the intelligent reflecting surface (IRS) in most existing works that only forwards the signals in a reflective way, the IOS is capable of forwarding the received signals to the MU in either a reflective or a transmissive manner, thereby enhancing the wireless coverage. We formulate an IOS phase shift optimization problem to maximize the downlink spectral efficiency (SE) of the MU. The optimal phase shift of the IOS is analysed, and a branch-and-bound based algorithm is proposed to design the IOS phase shift in a finite set. Simulation results show that the IOS-assisted system can extend the coverage significantly when compared to the IRS-assisted system with only reflective signals.
Submitted 15 September, 2020;
originally announced September 2020.
-
Generalizing Fault Detection Against Domain Shifts Using Stratification-Aware Cross-Validation
Authors:
Yingshui Tan,
Baihong Jin,
Qiushi Cui,
Xiangyu Yue,
Alberto Sangiovanni Vincentelli
Abstract:
Incipient anomalies present milder symptoms compared to severe ones, and are more difficult to detect and diagnose due to their close resemblance to normal operating conditions. The lack of incipient anomaly examples in the training data can pose severe risks to anomaly detection methods that are built upon Machine Learning (ML) techniques, because these anomalies can be easily mistaken for normal operating conditions. To address this challenge, we propose to utilize the uncertainty information available from ensemble learning to identify potentially misclassified incipient anomalies. We show in this paper that ensemble learning methods can give improved performance on incipient anomalies and identify common pitfalls in these models through extensive experiments on two real-world datasets. Then, we discuss how to design more effective ensemble models for detecting incipient anomalies.
Submitted 19 August, 2020;
originally announced August 2020.
-
Using Ensemble Classifiers to Detect Incipient Anomalies
Authors:
Baihong Jin,
Yingshui Tan,
Albert Liu,
Xiangyu Yue,
Yuxin Chen,
Alberto Sangiovanni Vincentelli
Abstract:
Incipient anomalies present milder symptoms compared to severe ones, and are more difficult to detect and diagnose due to their close resemblance to normal operating conditions. The lack of incipient anomaly examples in the training data can pose severe risks to anomaly detection methods that are built upon Machine Learning (ML) techniques, because these anomalies can be easily mistaken for normal operating conditions. To address this challenge, we propose to utilize the uncertainty information available from ensemble learning to identify potentially misclassified incipient anomalies. We show in this paper that ensemble learning methods can give improved performance on incipient anomalies and identify common pitfalls in these models through extensive experiments on two real-world datasets. Then, we discuss how to design more effective ensemble models for detecting incipient anomalies.
Submitted 19 August, 2020;
originally announced August 2020.
-
On the Error Exponent of Approximate Sufficient Statistics for M-ary Hypothesis Testing
Authors:
Jiachun Pan,
Yonglong Li,
Vincent Y. F. Tan,
Yonina C. Eldar
Abstract:
Consider the problem of detecting one of M i.i.d. Gaussian signals corrupted in white Gaussian noise. Conventionally, matched filters are used for detection. We first show that the outputs of the matched filter form a set of asymptotically optimal sufficient statistics in the sense of maximizing the error exponent of detecting the true signal. In practice, however, M may be large which motivates the design and analysis of a reduced set of N statistics which we term approximate sufficient statistics. Our construction of these statistics is based on a small set of filters that project the outputs of the matched filters onto a lower-dimensional vector using a sensing matrix. We consider a sequence of sensing matrices that has the desiderata of row orthonormality and low coherence. We analyze the performance of the resulting maximum likelihood (ML) detector, which leads to an achievable bound on the error exponent based on the approximate sufficient statistics; this bound recovers the original error exponent when N = M. We compare this to a bound that we obtain by analyzing a modified form of the Reduced Dimensionality Detector (RDD) proposed by Xie, Eldar, and Goldsmith [IEEE Trans. on Inform. Th., 59(6):3858-3874, 2013]. We show that by setting the sensing matrices to be column-normalized group Hadamard matrices, the exponents derived are ensemble-tight, i.e., our analysis is tight on the exponential scale given the sensing matrices and the decoding rule. Finally, we derive some properties of the exponents, showing, in particular, that they increase linearly in the compression ratio N/M.
Submitted 17 August, 2020;
originally announced August 2020.