Search | arXiv e-print repository

Improving the generalization via coupled tensor norm regularization

Authors: Ying Gao, Yunfei Qu, Chunfeng Cui, Deren Han

Abstract: In this paper, we propose a coupled tensor norm regularization that could enable the model output feature and the data input to lie in a low-dimensional manifold, which helps us to reduce overfitting. We show this regularization term is convex, differentiable, and gradient Lipschitz continuous for logistic regression, while nonconvex and nonsmooth for deep neural networks. We further analyze the c… ▽ More In this paper, we propose a coupled tensor norm regularization that could enable the model output feature and the data input to lie in a low-dimensional manifold, which helps us to reduce overfitting. We show this regularization term is convex, differentiable, and gradient Lipschitz continuous for logistic regression, while nonconvex and nonsmooth for deep neural networks. We further analyze the convergence of the first-order method for solving this model. The numerical experiments demonstrate that our method is efficient. △ Less

Submitted 22 February, 2023; originally announced February 2023.

Comments: Operations Research Letters

arXiv:2301.12829 [pdf, other]

doi 10.1109/TSC.2023.3330175

Identifying the Key Attributes in an Unlabeled Event Log for Automated Process Discovery

Authors: Kentaroh Toyoda, Rachel Gan Kai Ying, Allan NengSheng Zhang, Tan Puay Siew

Abstract: Process mining discovers and analyzes a process model from historical event logs. The prior art methods use the key attributes of case-id, activity, and timestamp hidden in an event log as clues to discover a process model. However, a user needs to specify them manually, and this can be an exhaustive task. In this paper, we propose a two-stage key attribute identification method to avoid such a ma… ▽ More Process mining discovers and analyzes a process model from historical event logs. The prior art methods use the key attributes of case-id, activity, and timestamp hidden in an event log as clues to discover a process model. However, a user needs to specify them manually, and this can be an exhaustive task. In this paper, we propose a two-stage key attribute identification method to avoid such a manual investigation, and thus this is a step toward fully automated process discovery. One of the challenging tasks is how to avoid exhaustive computation due to combinatorial explosion. For this, we narrow down candidates for each key attribute by using supervised machine learning in the first stage and identify the best combination of the key attributes by discovering process models and evaluating them in the second stage. Our computational complexity can be reduced from $\mathcal{O}(N^3)$ to $\mathcal{O}(k^3)$ where $N$ and $k$ are the numbers of columns and candidates we keep in the first stage, respectively, and usually $k$ is much smaller than $N$. We evaluated our method with 14 open datasets and showed that our method could identify the key attributes even with $k = 2$ for about 20 seconds for many datasets. △ Less

Submitted 16 November, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

Comments: IEEE Transactions on Services Computing (Early Access version)

arXiv:2111.15097 [pdf, other]

EAGAN: Efficient Two-stage Evolutionary Architecture Search for GANs

Authors: Guohao Ying, Xin He, Bin Gao, Bo Han, Xiaowen Chu

Abstract: Generative adversarial networks (GANs) have proven successful in image generation tasks. However, GAN training is inherently unstable. Although many works try to stabilize it by manually modifying GAN architecture, it requires much expertise. Neural architecture search (NAS) has become an attractive solution to search GANs automatically. The early NAS-GANs search only generators to reduce search c… ▽ More Generative adversarial networks (GANs) have proven successful in image generation tasks. However, GAN training is inherently unstable. Although many works try to stabilize it by manually modifying GAN architecture, it requires much expertise. Neural architecture search (NAS) has become an attractive solution to search GANs automatically. The early NAS-GANs search only generators to reduce search complexity but lead to a sub-optimal GAN. Some recent works try to search both generator (G) and discriminator (D), but they suffer from the instability of GAN training. To alleviate the instability, we propose an efficient two-stage evolutionary algorithm-based NAS framework to search GANs, namely EAGAN. We decouple the search of G and D into two stages, where stage-1 searches G with a fixed D and adopts the many-to-one training strategy, and stage-2 searches D with the optimal G found in stage-1 and adopts the one-to-one training and weight-resetting strategies to enhance the stability of GAN training. Both stages use the non-dominated sorting method to produce Pareto-front architectures under multiple objectives (e.g., model size, Inception Score (IS), and Fréchet Inception Distance (FID)). EAGAN is applied to the unconditional image generation task and can efficiently finish the search on the CIFAR-10 dataset in 1.2 GPU days. Our searched GANs achieve competitive results (IS=8.81$\pm$0.10, FID=9.91) on the CIFAR-10 dataset and surpass prior NAS-GANs on the STL-10 dataset (IS=10.44$\pm$0.087, FID=22.18). Source code: https://github.com/marsggbo/EAGAN. △ Less

Submitted 12 July, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

Comments: Accepted in ECCV2022, Guohao Yin and Xin He contributed equally

arXiv:2107.06809 [pdf]

Resonantly pumped bright-triplet exciton lasing in caesium lead bromide perovskites

Authors: Guanhua Ying, Tristan Farrow, Atanu Jana, Hanbo Shao, Hyunsik Im, Vitaly Osokin, Seung Bin Baek, Mutibah Alanazi, Sanjit Karmakar, Manas Mukherjee, Youngsin Park, Robert A. Taylor

Abstract: The surprising recent observation of highly emissive triplet-states in lead halide perovskites accounts for their orders-of-magnitude brighter optical signals and high quantum efficiencies compared to other semiconductors. This makes them attractive for future optoelectronic applications, especially in bright low-threshold nano-lasers. Whilst non-resonantly pumped lasing from all-inorganic lead-ha… ▽ More The surprising recent observation of highly emissive triplet-states in lead halide perovskites accounts for their orders-of-magnitude brighter optical signals and high quantum efficiencies compared to other semiconductors. This makes them attractive for future optoelectronic applications, especially in bright low-threshold nano-lasers. Whilst non-resonantly pumped lasing from all-inorganic lead-halide perovskites is now well-established as an attractive pathway to scalable low-power laser sources for nano-optoelectronics, here we showcase a resonant optical pum** scheme on a fast triplet-state in CsPbBr3 nanocrystals. The scheme allows us to realize a polarized triplet-laser source that dramatically enhances the coherent signal by one order of magnitude whilst suppressing non-coherent contributions. The result is a source with highly attractive technological characteristics including a bright and polarized signal, and a high stimulated-to-spontaneous emission signal contrast that can be filtered to enhance spectral purity. The emission is generated by pum** selectively on a weakly-confined excitonic state with a Bohr radius ~10 nm in the nanocrystals. The exciton fine-structure is revealed by the energy-splitting resulting from confinement in nanocrystals with tetragonal symmetry. We use a linear polarizer to resolve two-fold non-degenerate sub-levels in the triplet exciton and use photoluminescence excitation spectroscopy to determine the energy of the state before pum** it resonantly. △ Less

Submitted 14 July, 2021; originally announced July 2021.

Comments: 19 pages, 9 figures

arXiv:2101.10667 [pdf, other]

Evolutionary Multi-objective Architecture Search Framework: Application to COVID-19 3D CT Classification

Authors: Xin He, Guohao Ying, Jiyong Zhang, Xiaowen Chu

Abstract: The COVID-19 pandemic has threatened global health. Many studies have applied deep convolutional neural networks (CNN) to recognize COVID-19 based on chest 3D computed tomography (CT). Recent works show that no model generalizes well across CT datasets from different countries, and manually designing models for specific datasets requires expertise; thus, neural architecture search (NAS) that aims… ▽ More The COVID-19 pandemic has threatened global health. Many studies have applied deep convolutional neural networks (CNN) to recognize COVID-19 based on chest 3D computed tomography (CT). Recent works show that no model generalizes well across CT datasets from different countries, and manually designing models for specific datasets requires expertise; thus, neural architecture search (NAS) that aims to search models automatically has become an attractive solution. To reduce the search cost on large 3D CT datasets, most NAS-based works use the weight-sharing (WS) strategy to make all models share weights within a supernet; however, WS inevitably incurs search instability, leading to inaccurate model estimation. In this work, we propose an efficient Evolutionary Multi-objective ARchitecture Search (EMARS) framework. We propose a new objective, namely potential, which can help exploit promising models to indirectly reduce the number of models involved in weights training, thus alleviating search instability. We demonstrate that under objectives of accuracy and potential, EMARS can balance exploitation and exploration, i.e., reducing search time and finding better models. Our searched models are small and perform better than prior works on three public COVID-19 3D CT datasets. △ Less

Submitted 8 July, 2022; v1 submitted 26 January, 2021; originally announced January 2021.

Comments: accepted in MICCAI2022, Neural Architecture Search, Evolutionary Algorithm, COVID-19, CT

arXiv:2005.08457 [pdf, other]

Simultaneous Differential Network Analysis and Classification for High-dimensional Matrix-variate Data, with application to Brain Connectivity Alteration Detection and fMRI-guided Medical Diagnoses of Alzheimer's Disease

Authors: Chen Hao, Guo Ying, He Yong, Ji Jiadong, Liu Lei, Shi Yufeng, Wang Yikai, Yu Long, Zhang Xinsheng

Abstract: Alzheimer's disease (AD) is the most common form of dementia, which causes problems with memory, thinking and behavior. Growing evidence has shown that the brain connectivity network experiences alterations for such a complex disease. Network comparison, also known as differential network analysis, is thus particularly powerful to reveal the disease pathologies and identify clinical biomarkers for… ▽ More Alzheimer's disease (AD) is the most common form of dementia, which causes problems with memory, thinking and behavior. Growing evidence has shown that the brain connectivity network experiences alterations for such a complex disease. Network comparison, also known as differential network analysis, is thus particularly powerful to reveal the disease pathologies and identify clinical biomarkers for medical diagnoses (classification). Data from neurophysiological measurements are multi-dimensional and in matrix-form, which poses major challenges in brain connectivity analysis and medical diagnoses. Naive vectorization method is not sufficient as it ignores the structural information within the matrix. In the article, we adopt the Kronecker product covariance matrix framework to capture both spatial and temporal correlations of the matrix-variate data while the temporal covariance matrix is treated as a nuisance parameter. By recognizing that the strengths of network connections may vary across subjects, we develop an ensemble-learning procedure, which identifies the differential interaction patterns of brain regions between the AD group and the control group and conducts medical diagnosis (classification) of AD simultaneously. We applied the proposed procedure to functional connectivity analysis of fMRI dataset related with Alzheimer's disease. The hub nodes and differential interaction patterns identified are consistent with existing experimental studies, and satisfactory out-of-sample classification performance is achieved for medical diagnosis of Alzheimer's disease. An R package \SDNCMV" for implementation is available at https://github.com/heyongstat/SDNCMV. △ Less

Submitted 27 May, 2020; v1 submitted 18 May, 2020; originally announced May 2020.

arXiv:1901.01649 [pdf, other]

Better Guider Predicts Future Better: Difference Guided Generative Adversarial Networks

Authors: Guohao Ying, Yingtian Zou, Lin Wan, Yiming Hu, Jiashi Feng

Abstract: Predicting the future is a fantasy but practicality work. It is the key component to intelligent agents, such as self-driving vehicles, medical monitoring devices and robotics. In this work, we consider generating unseen future frames from previous obeservations, which is notoriously hard due to the uncertainty in frame dynamics. While recent works based on generative adversarial networks (GANs) m… ▽ More Predicting the future is a fantasy but practicality work. It is the key component to intelligent agents, such as self-driving vehicles, medical monitoring devices and robotics. In this work, we consider generating unseen future frames from previous obeservations, which is notoriously hard due to the uncertainty in frame dynamics. While recent works based on generative adversarial networks (GANs) made remarkable progress, there is still an obstacle for making accurate and realistic predictions. In this paper, we propose a novel GAN based on inter-frame difference to circumvent the difficulties. More specifically, our model is a multi-stage generative network, which is named the Difference Guided Generative Adversarial Netwok (DGGAN). The DGGAN learns to explicitly enforce future-frame predictions that is guided by synthetic inter-frame difference. Given a sequence of frames, DGGAN first uses dual paths to generate meta information. One path, called Coarse Frame Generator, predicts the coarse details about future frames, and the other path, called Difference Guide Generator, generates the difference image which include complementary fine details. Then our coarse details will then be refined via guidance of difference image under the support of GANs. With this model and novel architecture, we achieve state-of-the-art performance for future video prediction on UCF-101, KITTI. △ Less

Submitted 6 January, 2019; originally announced January 2019.

Comments: To appear in ACCV 2018

arXiv:1509.01139 [pdf]

doi 10.1016/j.physb.2016.07.018

Eigenmodal Analysis of Anderson Localization: Applications to Photonic Lattices and Bose-Einstein Condensates

Authors: Guanwen Ying, Guennadi Kouzaev

Abstract: We present the eigenmodal analysis techniques enhanced towards calculations of optical and non-interacting Bose-Einstein condensate (BEC) modes formed by random potentials and localized by Anderson effect. The results are compared with the published measurements and verified additionally by the convergence criterion. In 2-D BECs captured in circular areas, the randomness shows edge localization of… ▽ More We present the eigenmodal analysis techniques enhanced towards calculations of optical and non-interacting Bose-Einstein condensate (BEC) modes formed by random potentials and localized by Anderson effect. The results are compared with the published measurements and verified additionally by the convergence criterion. In 2-D BECs captured in circular areas, the randomness shows edge localization of the high-order Tamm-modes. To avoid strong diffusive effect, which is typical for BECs trapped by speckle potentials, a 3-D-lattice potential with increased step magnitudes is proposed, and the BECs in these lattices are simulated and plotted. △ Less

Submitted 1 March, 2019; v1 submitted 3 September, 2015; originally announced September 2015.

Comments: 29 pages, 13 figures

Journal ref: Physica B Condensed Matter, vol. 499, pp. 87-96, 2016

Showing 1–8 of 8 results for author: Ying, G