-
Energy Management of Multi-mode Plug-in Hybrid Electric Vehicle using Multi-agent Deep Reinforcement Learning
Authors:
Min Hua,
Cetengfei Zhang,
Fanggang Zhang,
Zhi Li,
Xiaoli Yu,
Hongming Xu,
Quan Zhou
Abstract:
The recently emerging multi-mode plug-in hybrid electric vehicle (PHEV) technology is one of the pathways contributing to decarbonization, and its energy management requires multiple-input and multiple-output (MIMO) control. At present, existing methods usually decouple the MIMO control into multiple-input and single-output (MISO) control and can only achieve locally optimal performance. To optimize the multi-mode vehicle globally, this paper studies a MIMO control method for energy management of the multi-mode PHEV based on multi-agent deep reinforcement learning (MADRL). By introducing a relevance ratio, a hand-shaking strategy is proposed to enable two learning agents to work collaboratively under the MADRL framework using the deep deterministic policy gradient (DDPG) algorithm. Unified settings for the DDPG agents are obtained through a sensitivity analysis of the factors influencing learning performance. The optimal working mode for the hand-shaking strategy is attained through a parametric study on the relevance ratio. The advantage of the proposed energy management method is demonstrated on a software-in-the-loop testing platform. The results indicate that the learning rate of the DDPG agents has the greatest influence on learning performance. Using the unified DDPG settings and a relevance ratio of 0.2, the proposed MADRL system can save up to 4% energy compared to the single-agent learning system and up to 23.54% energy compared to the conventional rule-based system.
Submitted 27 August, 2023; v1 submitted 16 March, 2023;
originally announced March 2023.
-
Inter-brain substrates of role switching during mother-child interaction
Authors:
Yamin Li,
Saishuang Wu,
Jiayang Xu,
Haiwa Wang,
Qi Zhu,
Wen Shi,
Yue Fang,
Fan Jiang,
Shanbao Tong,
Yunting Zhang,
Xiaoli Guo
Abstract:
Mother-child interaction is highly dynamic and reciprocal. Switching roles in these back-and-forth interactions serves as a crucial feature of reciprocal behaviors while the underlying neural entrainment is still not well-studied. Here, we designed a role-controlled cooperative task with dual EEG recording to study how differently two brains interact when mothers and children hold different roles. When children were actors and mothers were observers, mother-child inter-brain synchrony emerged within the theta oscillations and the frontal lobe, which highly correlated with children's attachment to their mothers. When their roles were reversed, this synchrony was shifted to the alpha oscillations and the central area and associated with mothers' perception of their relationship with their children. The results suggested an observer-actor neural alignment within the actor's oscillations, which was modulated by the actor-toward-observer emotional bonding. Our findings contribute to the understanding of how inter-brain synchrony is established and dynamically changed during mother-child reciprocal interaction.
Submitted 8 March, 2023;
originally announced March 2023.
-
Stability and energy identity for Yang-Mills-Higgs pairs
Authors:
Xiaoli Han,
Xishen Jin,
Yang Wen
Abstract:
In this paper, we study the properties of the critical points of the Yang-Mills-Higgs functional, which are called Yang-Mills-Higgs pairs. We first consider the properties of weakly stable Yang-Mills-Higgs pairs on a vector bundle over S^n (n > 3). When n > 3, we prove that the norm of the Higgs field is 1 and the connection is actually Yang-Mills. More precisely, its curvature vanishes when n > 4. We also use the bubble-neck decomposition to prove the energy identity for a sequence of Yang-Mills-Higgs pairs over a 4-dimensional compact manifold with uniformly bounded energy. We show that there is a subsequence that converges smoothly to a Yang-Mills-Higgs pair up to gauge, modulo finitely many 4-dimensional spheres with Yang-Mills connections.
Submitted 1 March, 2023;
originally announced March 2023.
-
Efficient and accurate exponential SAV algorithms with relaxation for dissipative system
Authors:
Yanrong Zhang,
Xiaoli Li
Abstract:
In this paper, we construct two kinds of exponential SAV approaches with relaxation (R-ESAV) for dissipative systems. The constructed schemes are linear and unconditionally energy stable. Compared with the R-SAV and R-GSAV approaches, they guarantee the positivity of the SAV without any assumption, preserve all the advantages of the ESAV approach, and satisfy a dissipation law with respect to a modified energy that is directly related to the original free energy. Moreover, the second version of the R-ESAV approach makes it easy to construct high-order BDF$k$ schemes. In particular, for the Navier-Stokes equations, we construct two kinds of novel schemes based on the R-ESAV method. Finally, ample numerical examples are presented to show that the proposed approaches are accurate and effective.
Submitted 28 February, 2023;
originally announced March 2023.
-
Fe$_{1+y}$Te$_{x}$Se$_{1-x}$: a delicate and tunable Majorana material
Authors:
Fazhi Yang,
Giao Ngoc Phan,
Renjie Zhang,
Jin Zhao,
Jiajun Li,
Zouyouwei Lu,
John Schneeloch,
Ruidan Zhong,
Mingwei Ma,
Genda Gu,
Xiaoli Dong,
Tian Qian,
Hong Ding
Abstract:
We report the observation of the p$_{z}$ electron band and the band inversion in Fe$_{1+y}$Te$_{x}$Se$_{1-x}$ with angle-resolved photoemission spectroscopy. Furthermore, we found that excess Fe (y>0) inhibits the topological band inversion in Fe$_{1+y}$Te$_{x}$Se$_{1-x}$, which explains the absence of Majorana zero modes in previous reports for Fe$_{1+y}$Te$_{x}$Se$_{1-x}$ with excess Fe. Based on our analysis of different amounts of Te doping and excess Fe, we propose a delicate topological phase in this material. Thanks to this delicate phase, one may be able to tune the topological transition by applying lattice strain or carrier doping.
Submitted 20 February, 2023;
originally announced February 2023.
-
Detecting Stochastic Governing Laws with Observation on Stationary Distributions
Authors:
Xiaoli Chen,
Hui Wang,
Jinqiao Duan
Abstract:
Mathematical models for complex systems are often accompanied by uncertainties. The goal of this paper is to extract a governing stochastic differential equation model from observations of stationary probability distributions. We develop a neural network method to learn the drift and diffusion terms of the stochastic differential equation. We introduce a new loss function containing the Hellinger distance between the observation data and the learned stationary probability density function. We find that the learned stochastic differential equation provides a fair approximation of the data-driven dynamical system after this loss function is minimized during training. The effectiveness of our method is demonstrated in numerical experiments.
Submitted 15 February, 2023;
originally announced February 2023.
-
Label-efficient Time Series Representation Learning: A Review
Authors:
Emadeldeen Eldele,
Mohamed Ragab,
Zhenghua Chen,
Min Wu,
Chee-Keong Kwoh,
Xiaoli Li
Abstract:
The scarcity of labeled data is one of the main challenges of applying deep learning models to time series data in the real world. Therefore, several approaches, e.g., transfer learning, self-supervised learning, and semi-supervised learning, have recently been developed to improve the ability of deep learning models to learn from limited time series labels. In this survey, for the first time, we provide a novel taxonomy that categorizes existing approaches to the scarcity of labeled time series data based on their dependency on external data sources. Moreover, we review the recent advances in each approach, summarize the limitations of current work, and provide future directions that could yield better progress in the field.
Submitted 25 February, 2024; v1 submitted 13 February, 2023;
originally announced February 2023.
-
An inverse potential problem for the stochastic diffusion equation with a multiplicative white noise
Authors:
Xiaoli Feng,
Peijun Li,
Xu Wang
Abstract:
This work concerns the direct and inverse potential problems for the stochastic diffusion equation driven by a multiplicative time-dependent white noise. The direct problem is to examine the well-posedness of the stochastic diffusion equation for a given potential, while the inverse problem is to determine the potential from the expectation of the solution at a fixed observation point inside the spatial domain. The direct problem is shown to admit a unique and positive mild solution if the initial value is nonnegative. Moreover, an explicit formula is deduced to reconstruct the square of the potential, which leads to the uniqueness of the inverse problem for nonnegative potential functions. Two regularization methods are utilized to overcome the instability of the numerical differentiation in the reconstruction formula. Numerical results show that the methods are effective in reconstructing both smooth and nonsmooth potential functions.
Submitted 7 February, 2023;
originally announced February 2023.
-
CancerUniT: Towards a Single Unified Model for Effective Detection, Segmentation, and Diagnosis of Eight Major Cancers Using a Large Collection of CT Scans
Authors:
Jieneng Chen,
Yingda Xia,
Jiawen Yao,
Ke Yan,
Jianpeng Zhang,
Le Lu,
Fakai Wang,
Bo Zhou,
Mingyan Qiu,
Qihang Yu,
Mingze Yuan,
Wei Fang,
Yuxing Tang,
Minfeng Xu,
Jian Zhou,
Yuqian Zhao,
Qifeng Wang,
Xianghua Ye,
Xiaoli Yin,
Yu Shi,
Xin Chen,
Jingren Zhou,
Alan Yuille,
Zaiyi Liu,
Ling Zhang
Abstract:
Human readers or radiologists routinely perform full-body multi-organ multi-disease detection and diagnosis in clinical practice, while most medical AI systems are built to focus on single organs with a narrow list of a few diseases. This might severely limit AI's clinical adoption. A certain number of AI models need to be assembled non-trivially to match the diagnostic process of a human reading a CT scan. In this paper, we construct a Unified Tumor Transformer (CancerUniT) model to jointly detect tumor existence and location and diagnose tumor characteristics for eight major cancers in CT scans. CancerUniT is a query-based Mask Transformer model that outputs multi-tumor predictions. We decouple the object queries into organ queries, tumor detection queries and tumor diagnosis queries, and further establish hierarchical relationships among the three groups. This clinically inspired architecture effectively assists inter- and intra-organ representation learning of tumors and facilitates the resolution of these complex, anatomically related multi-organ cancer image reading tasks. CancerUniT is trained end-to-end using a curated large-scale set of CT images from 10,042 patients, including eight major types of cancers and occurring non-cancer tumors (all pathology-confirmed, with 3D tumor masks annotated by radiologists). On the test set of 631 patients, CancerUniT has demonstrated strong performance under a set of clinically relevant evaluation metrics, substantially outperforming both multi-disease methods and an assembly of eight single-organ expert models in tumor detection, segmentation, and diagnosis. This moves one step closer towards a universal high-performance cancer screening tool.
Submitted 6 October, 2023; v1 submitted 28 January, 2023;
originally announced January 2023.
-
Self-supervised Domain Adaptation for Breaking the Limits of Low-quality Fundus Image Quality Enhancement
Authors:
Qingshan Hou,
Peng Cao,
Jiaqi Wang,
Xiaoli Liu,
Jinzhu Yang,
Osmar R. Zaiane
Abstract:
Retinal fundus images have been applied for the diagnosis and screening of eye diseases, such as Diabetic Retinopathy (DR) or Diabetic Macular Edema (DME). However, both low-quality fundus images and style inconsistency potentially increase uncertainty in the diagnosis of fundus disease and even lead to misdiagnosis by ophthalmologists. Most existing image enhancement methods focus on improving image quality by leveraging the guidance of high-quality images, which are difficult to collect in medical applications. In this paper, we tackle image quality enhancement in a fully unsupervised setting, i.e., with neither paired images nor high-quality images. To this end, we explore the potential of the self-supervised task for improving the quality of fundus images without the requirement of high-quality reference images. Specifically, we construct multiple patch-wise domains via an auxiliary pre-trained quality assessment network and style clustering. To achieve robust low-quality image enhancement and address style inconsistency, we formulate two self-supervised domain adaptation tasks to disentangle the features of image content, low-quality factor and style information by exploring intrinsic supervision signals within the low-quality images. Extensive experiments are conducted on the EyeQ and Messidor datasets, and results show that our DASQE method achieves new state-of-the-art performance when only low-quality images are available.
Submitted 17 January, 2023;
originally announced January 2023.
-
Co-training with High-Confidence Pseudo Labels for Semi-supervised Medical Image Segmentation
Authors:
Zhiqiang Shen,
Peng Cao,
Hua Yang,
Xiaoli Liu,
Jinzhu Yang,
Osmar R. Zaiane
Abstract:
Consistency regularization and pseudo labeling-based semi-supervised methods perform co-training using the pseudo labels from multi-view inputs. However, such co-training models tend to converge early to a consensus, degenerating to the self-training ones, and produce low-confidence pseudo labels from the perturbed inputs during training. To address these issues, we propose an Uncertainty-guided Collaborative Mean-Teacher (UCMT) for semi-supervised semantic segmentation with the high-confidence pseudo labels. Concretely, UCMT consists of two main components: 1) collaborative mean-teacher (CMT) for encouraging model disagreement and performing co-training between the sub-networks, and 2) uncertainty-guided region mix (UMIX) for manipulating the input images according to the uncertainty maps of CMT and facilitating CMT to produce high-confidence pseudo labels. Combining the strengths of UMIX with CMT, UCMT can retain model disagreement and enhance the quality of pseudo labels for the co-training segmentation. Extensive experiments on four public medical image datasets including 2D and 3D modalities demonstrate the superiority of UCMT over the state-of-the-art. Code is available at: https://github.com/Senyh/UCMT.
Submitted 26 May, 2023; v1 submitted 11 January, 2023;
originally announced January 2023.
-
Error estimate of a consistent splitting GSAV scheme for the Navier-Stokes equations
Authors:
Xiaoli Li,
Jie Shen
Abstract:
We carry out a rigorous error analysis of the first-order semi-discrete (in time) consistent splitting scheme coupled with a generalized scalar auxiliary variable (GSAV) approach for the Navier-Stokes equations with no-slip boundary conditions. The scheme is linear, unconditionally stable, and only requires solving a sequence of Poisson-type equations at each time step. By using the built-in unconditional stability of the GSAV approach, we derive optimal global (resp. local) in time error estimates in the two (resp. three) dimensional case for the velocity and pressure approximations.
Submitted 3 January, 2023;
originally announced January 2023.
-
A Clustering-guided Contrastive Fusion for Multi-view Representation Learning
Authors:
Guanzhou Ke,
Guoqing Chao,
Xiaoli Wang,
Chenyang Xu,
Yongqi Zhu,
Yang Yu
Abstract:
The past two decades have seen increasingly rapid advances in the field of multi-view representation learning, because it extracts useful information from diverse domains to facilitate the development of multi-view applications. However, the community faces two challenges: i) how to learn representations from a large amount of unlabeled data that are robust to noise or incomplete-view settings, and ii) how to balance view consistency and complementarity for various downstream tasks. To this end, we utilize a deep fusion network to fuse view-specific representations into the view-common representation, extracting high-level semantics to obtain a robust representation. In addition, we employ a clustering task to guide the fusion network and prevent it from arriving at trivial solutions. To balance consistency and complementarity, we then design an asymmetrical contrastive strategy that aligns the view-common representation and each view-specific representation. These modules are incorporated into a unified method known as CLustering-guided cOntrastiVE fusioN (CLOVEN). We quantitatively and qualitatively evaluate the proposed method on five datasets, demonstrating that CLOVEN outperforms 11 competitive multi-view learning methods in clustering and classification. In the incomplete-view scenario, our proposed method resists noise interference better than competing methods. Furthermore, the visualization analysis shows that CLOVEN can preserve the intrinsic structure of the view-specific representation while also improving the compactness of the view-common representation. Our source code will be available soon at https://github.com/guanzhou-ke/cloven.
Submitted 4 August, 2023; v1 submitted 28 December, 2022;
originally announced December 2022.
-
XMAM:X-raying Models with A Matrix to Reveal Backdoor Attacks for Federated Learning
Authors:
Jianyi Zhang,
Fangjiao Zhang,
Qichao Jin,
Zhiqiang Wang,
Xiaodong Lin,
Xiali Hei
Abstract:
Federated Learning (FL) has received increasing attention due to its privacy protection capability. However, the base algorithm FedAvg is vulnerable to so-called backdoor attacks. Earlier researchers proposed several robust aggregation methods. Unfortunately, many of these aggregation methods are unable to defend against backdoor attacks. What's more, attackers have recently proposed some hiding methods that further improve the stealthiness of backdoor attacks, making all the existing robust aggregation methods fail.
To tackle the threat of backdoor attacks, we propose a new aggregation method, X-raying Models with A Matrix (XMAM), to reveal the malicious local model updates submitted by the backdoor attackers. Since we observe that the output of the Softmax layer exhibits distinguishable patterns between malicious and benign updates, we focus on the Softmax layer's output, where it is difficult for backdoor attackers to hide their malicious behavior. Specifically, like X-ray examinations, we investigate the local model updates by using a matrix as an input to get their Softmax layer's outputs. Then, we exclude updates whose outputs are abnormal by clustering. Without any training dataset on the server, the extensive evaluations show that our XMAM can effectively distinguish malicious local model updates from benign ones. For instance, when other methods fail to defend against the backdoor attacks at no more than 20% malicious clients, our method can tolerate 45% malicious clients in the black-box mode and about 30% in Projected Gradient Descent (PGD) mode. Besides, under adaptive attacks, the results demonstrate that XMAM can still complete the global model training task even when there are 40% malicious clients. Finally, we analyze our method's screening complexity, and the results show that XMAM is about 10-10000 times faster than the existing methods.
Submitted 27 December, 2022;
originally announced December 2022.
-
Non-trivial band topology and orbital-selective electronic nematicity in a new titanium-based kagome superconductor
Authors:
Yong Hu,
Congcong Le,
Zhen Zhao,
Junzhang Ma,
Nicholas C. Plumb,
Milan Radovic,
Andreas P. Schnyder,
Xianxin Wu,
Hui Chen,
Xiaoli Dong,
Jiangping Hu,
Haitao Yang,
Hong-Jun Gao,
Ming Shi
Abstract:
Electronic nematicity that spontaneously breaks rotational symmetry has been shown as a generic phenomenon in correlated quantum systems including high-temperature superconductors and the AV3Sb5 (A = K, Rb, Cs) family with a kagome network. Identifying the driving force has been a central challenge for understanding nematicity. In iron-based superconductors, the problem is complicated because the spin, orbital and lattice degrees of freedom are intimately coupled. In vanadium-based kagome superconductors AV3Sb5, the electronic nematicity exhibits an intriguing entanglement with the charge density wave order (CDW), making understanding its origin difficult. Recently, a new family of titanium-based kagome superconductors ATi3Bi5 has been synthesized. In sharp contrast to its vanadium-based counterpart, the electronic nematicity occurs in the absence of CDW. ATi3Bi5 provides a new window to explore the mechanism of electronic nematicity and its interplay with the orbital degree of freedom. Here, we combine polarization-dependent angle-resolved photoemission spectroscopy with density functional theory to directly reveal the band topology and orbital characters of the multi-orbital RbTi3Bi5. The promising coexistence of flat bands, type-II Dirac nodal line and nontrivial Z2 topological states is identified in RbTi3Bi5. Remarkably, our study clearly unveils the orbital character change along the G-M and G-K directions, implying a strong intrinsic inter-orbital coupling in the Ti-based kagome metals, reminiscent of iron-based superconductors. Furthermore, doping-dependent measurements directly uncover the orbital-selective features in the kagome bands, which can be well explained by the d-p hybridization. The suggested d-p hybridization, in collaboration with the inter-orbital coupling, could account for the electronic nematicity in ATi3Bi5.
Submitted 15 December, 2022;
originally announced December 2022.
-
Syntactic Multi-view Learning for Open Information Extraction
Authors:
Kuicai Dong,
Aixin Sun,
Jung-Jae Kim,
Xiaoli Li
Abstract:
Open Information Extraction (OpenIE) aims to extract relational tuples from open-domain sentences. Traditional rule-based or statistical models have been developed based on syntactic structures of sentences, identified by syntactic parsers. However, previous neural OpenIE models under-explore the useful syntactic information. In this paper, we model both constituency and dependency trees into word-level graphs, and enable neural OpenIE to learn from the syntactic structures. To better fuse heterogeneous information from both graphs, we adopt multi-view learning to capture multiple relationships from them. Finally, the finetuned constituency and dependency representations are aggregated with sentential semantic representations for tuple generation. Experiments show that both constituency and dependency information, and the multi-view learning are effective.
Submitted 5 December, 2022;
originally announced December 2022.
-
Quasi two-dimensional nature of high-Tc superconductivity in iron-based (Li,Fe)OHFeSe
Authors:
Dong Li,
Yue Liu,
Zouyouwei Lu,
Peiling Li,
Yuhang Zhang,
Sheng Ma,
Jiali Liu,
Jihu Lu,
Hua Zhang,
Guangtong Liu,
Fang Zhou,
Xiaoli Dong,
Zhongxian Zhao
Abstract:
The intercalated iron selenide (Li,Fe)OHFeSe has a strongly layered structure analogous to the quasi two-dimensional (2D) bismuth cuprate superconductors, and exhibits both high-temperature (high-Tc) and topological superconductivity. However, the dimensionality of its superconductivity has not yet been fully investigated. Here we report that the quasi-2D superconductivity features, including the high anisotropy γ = 151 and the associated quasi-2D vortices, are also revealed for (Li,Fe)OHFeSe, based on systematic experiments of the electrical transport and magnetization and model fittings. Thus, we establish a new vortex phase diagram for (Li,Fe)OHFeSe, which delineates an emergent quasi-2D vortex-liquid state, and a subsequent vortex-solid dimensional crossover from a pancake-like to a three-dimensional state with decreasing temperature and magnetic field. Furthermore, we find that all the quasi-2D characteristics revealed here for the high-Tc iron selenide superconductor are very similar to those reported for the high-Tc bismuth cuprate superconductors.
Submitted 4 December, 2022;
originally announced December 2022.
-
Contrastive Domain Adaptation for Time-Series via Temporal Mixup
Authors:
Emadeldeen Eldele,
Mohamed Ragab,
Zhenghua Chen,
Min Wu,
Chee-Keong Kwoh,
Xiaoli Li
Abstract:
Unsupervised Domain Adaptation (UDA) has emerged as a powerful solution for the domain shift problem via transferring the knowledge from a labeled source domain to a shifted unlabeled target domain. Despite the prevalence of UDA for visual applications, it remains relatively less explored for time-series applications. In this work, we propose a novel lightweight contrastive domain adaptation framework called CoTMix for time-series data. Unlike existing approaches that either use statistical distances or adversarial techniques, we leverage contrastive learning solely to mitigate the distribution shift across the different domains. Specifically, we propose a novel temporal mixup strategy to generate two intermediate augmented views for the source and target domains. Subsequently, we leverage contrastive learning to maximize the similarity between each domain and its corresponding augmented view. The generated views consider the temporal dynamics of time-series data during the adaptation process while inheriting the semantics among the two domains. Hence, we gradually push both domains towards a common intermediate space, mitigating the distribution shift across them. Extensive experiments conducted on five real-world time-series datasets show that our approach can significantly outperform all state-of-the-art UDA methods. The implementation code of CoTMix is available at https://github.com/emadeldeen24/CoTMix.
Submitted 27 July, 2023; v1 submitted 3 December, 2022;
originally announced December 2022.
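The temporal mixup idea in the CoTMix abstract can be illustrated with a minimal sketch. The abstract does not give the mixing formula, so everything below is an assumption: the function names `moving_average` and `temporal_mixup`, the mixing weight `lam`, the window size, and the choice to blend the dominant series with a temporally smoothed copy of the other domain are all hypothetical and are not taken from the authors' code.

```python
def moving_average(series, window):
    """Smooth a 1-D series with a centred window, clamping indices at the edges."""
    if window < 1 or window % 2 == 0:
        raise ValueError("window must be a positive odd integer")
    half = window // 2
    n = len(series)
    smoothed = []
    for t in range(n):
        # Collect the neighbours of step t; out-of-range indices reuse the edge value.
        neighbours = [series[min(max(t + k, 0), n - 1)] for k in range(-half, half + 1)]
        smoothed.append(sum(neighbours) / window)
    return smoothed


def temporal_mixup(dominant, other, lam=0.9, window=5):
    """Hypothetical mixup: keep mostly the dominant series, add a smoothed other series."""
    if len(dominant) != len(other):
        raise ValueError("both series must have the same length")
    smoothed_other = moving_average(other, window)
    return [lam * d + (1.0 - lam) * s for d, s in zip(dominant, smoothed_other)]


# Two views, one dominated by each domain (toy data, assumed values).
source = [1.0, 1.0, 1.0, 1.0, 1.0, 1.0]
target = [0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
source_dominant_view = temporal_mixup(source, target)
target_dominant_view = temporal_mixup(target, source)
```

In a setup like this, each view would then be paired with its dominant domain in a contrastive loss; that step is omitted here because the abstract does not specify it.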
-
Thermoelectric properties of cement composite analogues from first principles calculations
Authors:
Esther Orisakwe,
Conrad Johnston,
Ruchita Jani,
Xiaoli Liu,
Lorenzo Stella,
Jorge Kohanoff,
Niall Holmes,
Brian Norton,
Ming Qu,
Hongxi Yin,
Kazuaki Yazawa
Abstract:
Buildings are responsible for a considerable fraction of the energy wasted globally every year and, as a result, for excess carbon emissions. While heat is lost directly in colder months and climates, resulting in increased heating loads, in hot climates cooling and ventilation are required. One avenue towards improving the energy efficiency of buildings is to integrate thermoelectric devices and materials within the fabric of the building to exploit the temperature gradient between the inside and outside to do useful work. Cement-based materials are ubiquitous in modern buildings and present an interesting opportunity to be functionalised. We present a systematic investigation of the electronic transport coefficients relevant to the thermoelectric performance of the calcium silicate hydrate (C-S-H) gel analogue, tobermorite, using Density Functional Theory calculations with the Boltzmann transport method. The calculated values of the Seebeck coefficient are within the typical magnitude (200 - 600 $\mu\mathrm{V/K}$) indicative of a good thermoelectric material. The tobermorite models are predicted to be intrinsically $p$-type thermoelectric materials because of the presence of a large concentration of Si-O tetrahedra sites. The calculated electronic $ZT$ for the tobermorite models has optimal values of 0.983 (at 400 $\mathrm{K}$ and $10^{17}$ $\mathrm{cm^{-3}}$) for tobermorite 9 Å, 0.985 (at 400 $\mathrm{K}$ and $10^{17}$ $\mathrm{cm^{-3}}$) for tobermorite 11 Å and 1.20 (at 225 $\mathrm{K}$ and $10^{19}$ $\mathrm{cm^{-3}}$) for tobermorite 14 Å.
Submitted 30 November, 2022;
originally announced November 2022.
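For readers unfamiliar with the quantity quoted in the tobermorite abstract, the standard dimensionless thermoelectric figure of merit is written out below. Reading "electronic $ZT$" as the variant that keeps only the electronic part of the thermal conductivity is an assumption, since the abstract does not define it; the symbol $ZT_{e}$ is introduced here for illustration only.

```latex
ZT = \frac{S^{2}\,\sigma\,T}{\kappa},
\qquad
ZT_{e} = \frac{S^{2}\,\sigma\,T}{\kappa_{e}},
\qquad
\kappa = \kappa_{e} + \kappa_{\mathrm{lattice}}
```

Here $S$ is the Seebeck coefficient, $\sigma$ the electrical conductivity, $T$ the absolute temperature, and $\kappa$ the thermal conductivity. Because $\kappa_{e} \le \kappa$, a value computed with $\kappa_{e}$ alone would be an upper bound on the full $ZT$.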
-
Automated Generating Natural Language Requirements based on Domain Ontology
Authors:
Ziyan Zhao,
Li Zhang,
Xiaoyun Gao,
Xiaoli Lian,
Heyang Lv,
Lin Shi
Abstract:
Software requirements specification is undoubtedly critical for the whole software life-cycle. Nowadays, writing software requirements specifications primarily depends on human work. Although many studies have sought to speed up the process by proposing advanced elicitation and analysis techniques, it is still a time-consuming and error-prone task that needs to take domain knowledge and business information into consideration. In this paper, we propose an approach, named ReqGen, which can provide recommendations by automatically generating natural language requirements specifications based on certain given keywords. Specifically, ReqGen consists of three critical steps. First, keyword-oriented knowledge is selected from a domain ontology and injected into the basic Unified pre-trained Language Model (UniLM) for domain fine-tuning. Second, a copy mechanism is integrated to ensure the occurrence of keywords in the generated statements. Finally, a requirement-syntax-constrained decoding is designed to narrow the semantic and syntactic distance between the candidate and reference specifications. Experiments on two public datasets from different groups and domains show that ReqGen outperforms six popular natural language generation approaches with respect to the hard constraint of keyword (phrase) inclusion, BLEU, ROUGE and syntax compliance. We believe that ReqGen can promote the efficiency and intelligence of specifying software requirements.
Submitted 29 November, 2022;
originally announced November 2022.
-
Superconductivity and orbital-selective nematic order in a new titanium-based kagome metal CsTi3Bi5
Authors:
Haitao Yang,
Yuhan Ye,
Zhen Zhao,
Jiali Liu,
Xin-Wei Yi,
Yuhang Zhang,
Jinan Shi,
Jing-Yang You,
Zihao Huang,
Bingjie Wang,
Jing Wang,
Hui Guo,
Xiao Lin,
Chengmin Shen,
Wu Zhou,
Hui Chen,
Xiaoli Dong,
Gang Su,
Ziqiang Wang,
Hong-Jun Gao
Abstract:
Fabrication of new types of superconductors with novel physical properties has always been a major thread in the research of superconducting materials. An example is the enormous interest generated by the cascade of correlated topological quantum states in the newly discovered vanadium-based kagome superconductors AV3Sb5 (A=K, Rb, and Cs) with a Z2 topological band structure. Here we report the successful fabrication of single crystals of the titanium-based kagome metal CsTi3Bi5 and the observation of superconductivity and electronic nematicity. The onset of the superconducting transition is at a temperature Tc of around 4.8 K. In sharp contrast to the charge density wave superconductor AV3Sb5, we find that the kagome superconductor CsTi3Bi5 preserves translation symmetry, but breaks rotational symmetry and exhibits electronic nematicity. The angular-dependent magnetoresistivity shows a remarkable two-fold rotational symmetry as the magnetic field rotates in the kagome plane. Scanning tunneling microscopy and spectroscopic imaging detect rotational-symmetry-breaking C2 quasiparticle interference (QPI) patterns at low energies, providing further microscopic evidence for electronic nematicity. Combined with first-principles calculations, we find that the nematic QPI is orbital-selective and dominated by the Ti dxz and dyz orbitals, possibly originating from the intriguing orbital bond nematic order. Our findings in the new "135" material CsTi3Bi5 provide new directions for exploring the multi-orbital correlation effect and the role of orbital or bond order in the electron liquid crystal phases evidenced by the symmetry breaking states in kagome superconductors.
Submitted 22 November, 2022;
originally announced November 2022.
-
STILN: A Novel Spatial-Temporal Information Learning Network for EEG-based Emotion Recognition
Authors:
Yiheng Tang,
Yongxiong Wang,
Xiaoli Zhang,
Zhe Wang
Abstract:
Spatial correlations and temporal contexts are indispensable in Electroencephalogram (EEG)-based emotion recognition. However, learning the complex spatial correlations among several channels is a challenging problem. In addition, learning temporal contexts helps to emphasize the critical EEG frames, because the subjects only reach the prospective emotion during part of the stimuli. Hence, we propose a novel Spatial-Temporal Information Learning Network (STILN) to extract discriminative features by capturing the spatial correlations and temporal contexts. Specifically, the generated 2D power topographic maps capture the dependencies among electrodes, and they are fed to the CNN-based spatial feature extraction network. Furthermore, a Convolutional Block Attention Module (CBAM) recalibrates the weights of the power topographic maps to emphasize the crucial brain regions and frequency bands. Meanwhile, Batch Normalizations (BNs) and Instance Normalizations (INs) are appropriately combined to mitigate individual differences. For temporal context learning, we adopt a Bidirectional Long Short-Term Memory (Bi-LSTM) network to capture the dependencies among the EEG frames. To validate the effectiveness of the proposed method, subject-independent experiments are conducted on the public DEAP dataset. The proposed method achieves accuracies of 0.6831 and 0.6752 for arousal and valence classification, respectively.
Submitted 22 November, 2022;
originally announced November 2022.
-
Video Unsupervised Domain Adaptation with Deep Learning: A Comprehensive Survey
Authors:
Yuecong Xu,
Haozhi Cao,
Zhenghua Chen,
Xiaoli Li,
Lihua Xie,
Jianfei Yang
Abstract:
Video analysis tasks such as action recognition have received increasing research interest with growing applications in fields such as smart healthcare, thanks to the introduction of large-scale datasets and deep learning-based representations. However, video models trained on existing datasets suffer from significant performance degradation when deployed directly to real-world applications due to domain shifts between the public video datasets used for training (source video domains) and real-world videos (target video domains). Further, with the high cost of video annotation, it is more practical to use unlabeled videos for training. To tackle performance degradation and the high cost of video annotation together, video unsupervised domain adaptation (VUDA) is introduced to adapt video models from the labeled source domain to the unlabeled target domain by alleviating video domain shift, improving the generalizability and portability of video models. This paper surveys recent progress in VUDA with deep learning. We begin with the motivation of VUDA, followed by its definition, recent progress of methods for both closed-set VUDA and VUDA under different scenarios, and current benchmark datasets for VUDA research. Finally, future directions are provided to promote further VUDA research.
Submitted 20 November, 2022; v1 submitted 17 November, 2022;
originally announced November 2022.
-
Personal Privacy Protection Problems in the Digital Age
Authors:
Zhiheng Yi,
Xiaoli Chen
Abstract:
With the development of Internet technology, the issue of privacy leakage has attracted more and more attention from the public. In our daily life, the mobile phone applications and identity documents that we use may bring the risk of privacy leakage, which has increasingly aroused public concern. The path of privacy protection in the digital age remains to be explored. To explore the source of this risk and how it can be reduced, we conducted this study by drawing on personal experience, collecting data and applying theory.
Submitted 20 November, 2022; v1 submitted 17 November, 2022;
originally announced November 2022.
-
Orbital Fulde-Ferrell-Larkin-Ovchinnikov state in an Ising superconductor
Authors:
Puhua Wan,
Oleksandr Zheliuk,
Noah F. Q. Yuan,
Xiaoli Peng,
Le Zhang,
Minpeng Liang,
Uli Zeitler,
Steffen Wiedmann,
Nigel Hussey,
Thomas T. M. Palstra,
Jianting Ye
Abstract:
The conventional Fulde-Ferrell-Larkin-Ovchinnikov (FFLO) state relies on the Zeeman effect of an external magnetic field to break time-reversal symmetry, forming a state of finite-momentum Cooper pairing. In superconductors with broken inversion symmetries, the Rashba or Ising-type spin-orbit coupling (SOC) can interact with either the Zeeman or the orbital effect of magnetic fields, extending the range of possible FFLO states, though evidence for these more exotic forms of FFLO pairing has been lacking. Here we report the discovery of an unconventional FFLO state induced by coupling the Ising SOC and the orbital effect in multilayer 2H-NbSe2. Transport measurements show that the translational and rotational symmetries are broken in the orbital FFLO state, providing the hallmark signatures of finite-momentum Cooper pairing. We establish the entire orbital FFLO phase diagram, consisting of a normal metal, a uniform Ising superconducting phase, and a six-fold orbital FFLO state. This study highlights an alternative route to finite-momentum superconductivity and provides a universal mechanism to prepare orbital FFLO states in similar materials with broken inversion symmetries.
Submitted 30 August, 2023; v1 submitted 14 November, 2022;
originally announced November 2022.
-
WR-ONE2SET: Towards Well-Calibrated Keyphrase Generation
Authors:
Binbin Xie,
Xiangpeng Wei,
Baosong Yang,
Huan Lin,
Jun Xie,
Xiaoli Wang,
Min Zhang,
Jinsong Su
Abstract:
Keyphrase generation aims to automatically generate short phrases summarizing an input document. The recently emerged ONE2SET paradigm (Ye et al., 2021) generates keyphrases as a set and has achieved competitive performance. Nevertheless, we observe serious calibration errors in the output of ONE2SET, especially in the over-estimation of the $\varnothing$ token (meaning "no corresponding keyphrase"). In this paper, we analyze this limitation in depth and identify two main reasons behind it: 1) the parallel generation has to introduce excessive $\varnothing$ as padding tokens into training instances; and 2) the training mechanism assigning a target to each slot is unstable and further aggravates the $\varnothing$ token over-estimation. To make the model well-calibrated, we propose WR-ONE2SET, which extends ONE2SET with an adaptive instance-level cost Weighting strategy and a target Re-assignment mechanism. The former dynamically penalizes the over-estimated slots for different instances, thus smoothing the uneven training distribution. The latter refines the original inappropriate assignment and reduces the supervisory signals of over-estimated slots. Experimental results on commonly-used datasets demonstrate the effectiveness and generality of our proposed paradigm.
Submitted 16 February, 2023; v1 submitted 13 November, 2022;
originally announced November 2022.
-
Transferability-based Chain Motion Mapping from Humans to Humanoids for Teleoperation
Authors:
Matthew Stanley,
Yunsik Jung,
Michael Bowman,
Lingfeng Tao,
Xiaoli Zhang
Abstract:
Although data-driven motion mapping methods are promising to allow intuitive robot control and teleoperation that generate human-like robot movement, they normally require tedious pair-wise training for each specific human and robot pair. This paper proposes a transferability-based mapping scheme to allow new robot and human input systems to leverage the mapping of existing trained pairs to form a mapping transfer chain, which will reduce the number of new pair-specific mappings that need to be generated. The first part of the mapping scheme is the development of a Synergy Mapping via Dual-Autoencoder (SyDa) method. This method uses the latent features from two autoencoders to extract the common synergy of the two agents. Secondly, a transferability metric is created that approximates how well the mapping between a pair of agents will perform compared to another pair before creating the motion mapping models. Thus, it can guide the formation of an optimal mapping chain for the new human-robot pair. Experiments with human subjects and a Pepper robot demonstrated that 1) the SyDa method improves the accuracy and generalizability of the pair mappings, 2) the SyDa method allows for bidirectional mapping that does not prioritize the direction of mapping motion, and 3) the transferability metric measures how compatible two agents are for accurate teleoperation. The combination of the SyDa method and the transferability metric creates the generalizable and accurate mappings needed to create the transfer mapping chain.
Submitted 28 October, 2022;
originally announced October 2022.
-
Solar Ring Mission: Building a Panorama of the Sun and Inner-heliosphere
Authors:
Yuming Wang,
Xianyong Bai,
Changyong Chen,
Linjie Chen,
Xin Cheng,
Lei Deng,
Linhua Deng,
Yuanyong Deng,
Li Feng,
Tingyu Gou,
Jingnan Guo,
Yang Guo,
Xinjun Hao,
Jiansen He,
Junfeng Hou,
Huang Jiangjiang,
Zhenghua Huang,
Haisheng Ji,
Chaowei Jiang,
Jie Jiang,
Chunlan Jin,
Xiaolei Li,
Yiren Li,
Jiajia Liu,
Kai Liu
, et al. (29 additional authors not shown)
Abstract:
Solar Ring (SOR) is a proposed space science mission to monitor and study the Sun and inner heliosphere from a full 360° perspective in the ecliptic plane. It will deploy three 120°-separated spacecraft on the 1-AU orbit. The first spacecraft, S1, is located 30° upstream of the Earth, the second, S2, 90° downstream, and the third, S3, completes the configuration. This design with necessary science instruments, e.g., the Doppler-velocity and vector magnetic field imager, wide-angle coronagraph, and in-situ instruments, will allow us to establish many unprecedented capabilities: (1) provide simultaneous Doppler-velocity observations of the whole solar surface to understand the deep interior, (2) provide vector magnetograms of the whole photosphere - the inner boundary of the solar atmosphere and heliosphere, (3) provide information on the whole-lifetime evolution of solar featured structures, and (4) provide the whole view of solar transients and space weather in the inner heliosphere. With these capabilities, the Solar Ring mission aims to address outstanding questions about the origin of the solar cycle, the origin of solar eruptions and the origin of extreme space weather events. The successful accomplishment of the mission will construct a panorama of the Sun and inner heliosphere, and therefore advance our understanding of the star and the space environment that holds our life.
Submitted 23 October, 2022; v1 submitted 19 October, 2022;
originally announced October 2022.
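The spacecraft geometry in the Solar Ring abstract (S1 at 30° upstream, S2 at 90° downstream, three craft 120° apart) fixes where S3 must sit, and a short sketch makes that arithmetic explicit. The sign convention (upstream positive, downstream negative, angles measured from Earth along the 1-AU orbit) and the function names are assumptions for illustration; the abstract does not state a convention.

```python
def wrap_deg(angle):
    """Map an angle in degrees onto [0, 360)."""
    return angle % 360.0


def separation_deg(a, b):
    """Smallest angular separation between two orbital longitudes, in degrees."""
    diff = abs(wrap_deg(a) - wrap_deg(b))
    return min(diff, 360.0 - diff)


def third_spacecraft_deg(s1, s2, spacing=120.0):
    """Find the longitude that is `spacing` degrees from both s1 and s2."""
    for candidate in (wrap_deg(s1 + spacing), wrap_deg(s1 - spacing)):
        if abs(separation_deg(candidate, s2) - spacing) < 1e-9:
            return candidate
    raise ValueError("s1 and s2 are not compatible with the requested spacing")


# Assumed convention: Earth at 0 degrees, upstream positive, downstream negative.
S1 = 30.0
S2 = -90.0
S3 = third_spacecraft_deg(S1, S2)
print(S3)  # 150.0, i.e. 150 degrees upstream of Earth under this convention
```

Under this convention S3 ends up 150° from Earth, which is consistent with the stated goal of viewing the far side of the Sun.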
-
Towards Trustworthy AI-Empowered Real-Time Bidding for Online Advertisement Auctioning
Authors:
Xiaoli Tang,
Han Yu
Abstract:
Artificial intelligence-empowered Real-Time Bidding (AIRTB) is regarded as one of the most enabling technologies for online advertising. It has attracted significant research attention from diverse fields such as pattern recognition, game theory and mechanism design. Despite its remarkable development and deployment, the AIRTB system can sometimes harm the interest of its participants (e.g., depleting the advertisers' budget with various kinds of fraud). As such, building trustworthy AIRTB auctioning systems has emerged as an important direction of research in this field in recent years. Due to the highly interdisciplinary nature of this field and the lack of a comprehensive survey, it is a challenge for researchers to enter this field and contribute towards building trustworthy AIRTB technologies. This paper bridges this important gap in the trustworthy AIRTB literature. We start by analysing the key concerns of various AIRTB stakeholders and identify three main dimensions of trust building in AIRTB, namely security, robustness and fairness. For each of these dimensions, we propose a unique taxonomy of the state of the art, trace the root causes of possible breakdown of trust, and discuss the necessity of the given dimension. This is followed by a comprehensive review of existing strategies for fulfilling the requirements of each trust dimension. In addition, we discuss the promising future directions of research essential towards building trustworthy AIRTB systems to benefit the field of online advertising.
Submitted 21 September, 2022;
originally announced October 2022.
-
Self-supervised Learning for Label-Efficient Sleep Stage Classification: A Comprehensive Evaluation
Authors:
Emadeldeen Eldele,
Mohamed Ragab,
Zhenghua Chen,
Min Wu,
Chee-Keong Kwoh,
Xiaoli Li
Abstract:
The past few years have witnessed a remarkable advance in deep learning for EEG-based sleep stage classification (SSC). However, the success of these models is attributed to possessing a massive amount of labeled data for training, limiting their applicability in real-world scenarios. In such scenarios, sleep labs can generate a massive amount of data, but labeling these data can be expensive and time-consuming. Recently, the self-supervised learning (SSL) paradigm has shined as one of the most successful techniques to overcome the scarcity of labeled data. In this paper, we evaluate the efficacy of SSL to boost the performance of existing SSC models in the few-labels regime. We conduct a thorough study on three SSC datasets, and we find that fine-tuning the pretrained SSC models with only 5% of labeled data can achieve competitive performance to the supervised training with full labels. Moreover, self-supervised pretraining helps SSC models to be more robust to data imbalance and domain shift problems. The code is publicly available at https://github.com/emadeldeen24/eval_ssl_ssc.
Submitted 10 October, 2022;
originally announced October 2022.
-
A Multi-Agent Approach for Adaptive Finger Cooperation in Learning-based In-Hand Manipulation
Authors:
Lingfeng Tao,
Jiucai Zhang,
Michael Bowman,
Xiaoli Zhang
Abstract:
In-hand manipulation is challenging for a multi-finger robotic hand due to its high degrees of freedom and the complex interaction with the object. To enable in-hand manipulation, existing deep reinforcement learning based approaches mainly focus on training a single robot-structure-specific policy through the centralized learning mechanism, lacking adaptability to changes like robot malfunction. To solve this limitation, this work treats each finger as an individual agent and trains multiple agents to control their assigned fingers to complete the in-hand manipulation task cooperatively. We propose the Multi-Agent Global-Observation Critic and Local-Observation Actor (MAGCLA) method, where the critic can observe all agents' actions globally, and the actor only locally observes its neighbors' actions. Besides, conventional individual experience replay may cause unstable cooperation due to the asynchronous performance increment of each agent, which is critical for in-hand manipulation tasks. To solve this issue, we propose the Synchronized Hindsight Experience Replay (SHER) method to synchronize and efficiently reuse the replayed experience across all agents. The methods are evaluated in two in-hand manipulation tasks on the Shadow dexterous hand. The results show that SHER helps MAGCLA achieve comparable learning efficiency to a single policy, and the MAGCLA approach is more generalizable in different tasks. The trained policies have higher adaptability in the robot malfunction test compared to the baseline multi-agent and single-agent approaches.
Submitted 11 October, 2022;
originally announced October 2022.
-
GBSVM: Granular-ball Support Vector Machine
Authors:
Shuyin Xia,
Xiaoyu Lian,
Guoyin Wang,
Xinbo Gao,
Jiancu Chen,
Xiaoli Peng
Abstract:
GBSVM (Granular-ball Support Vector Machine) is a significant attempt to construct a classifier using the coarse-to-fine granularity of a granular-ball as input, rather than a single data point. It is the first classifier whose input contains no points. However, the existing model has some errors, and its dual model has not been derived. As a result, the current algorithm cannot be implemented or applied. To address these problems, this paper fixes the errors of the original model of the existing GBSVM and derives its dual model. Furthermore, a particle swarm optimization algorithm is designed to solve the dual model. A sequential minimal optimization algorithm is also carefully designed to solve the dual model; its solution is faster and more stable than the particle swarm optimization based version. The experimental results on the UCI benchmark datasets demonstrate that GBSVM has good robustness and efficiency. All code has been released in the open source library at http://www.cquptshuyinxia.com/GBSVM.html or https://github.com/syxiaa/GBSVM.
Submitted 11 February, 2024; v1 submitted 6 October, 2022;
originally announced October 2022.
-
A novel Lagrange Multiplier approach with relaxation for gradient flows
Authors:
Zhengguang Liu,
Xiaoli Li
Abstract:
In this paper, we propose a novel Lagrange Multiplier approach, named the zero-factor (ZF) approach, to solve a series of gradient flow problems. The numerical schemes based on the new algorithm are unconditionally energy stable with respect to the original energy and do not require any extra assumptions. We also prove that the ZF schemes with specific zero factors lead to the popular SAV-type method. To reduce the computation cost and improve the accuracy and consistency, we propose a zero-factor approach with relaxation, which we name the relaxed zero-factor (RZF) method, to design unconditionally energy stable schemes for gradient flows. The RZF schemes can be proved to be unconditionally energy stable with respect to a modified energy that is closer to the original energy, and provide a very simple calculation process. The variation of the introduced zero factor is highly consistent with the nonlinear free energy, which implies that the introduced ZF method is a very efficient way to capture the sharp dissipation of nonlinear free energy. Several numerical examples are provided to demonstrate the improved efficiency and accuracy of the proposed method.
Submitted 6 October, 2022;
originally announced October 2022.
-
Time- vs. frequency- domain inverse elastic scattering: Theory and experiment
Authors:
Xiaoli Liu,
Jian Song,
Fatemeh Pourahmadian,
Houssem Haddar
Abstract:
This study formally adapts the time-domain linear sampling method (TLSM) for ultrasonic imaging of stationary and evolving fractures in safety-critical components. The TLSM indicator is then applied to the laboratory test data of [22, 18] and the obtained reconstructions are compared to their frequency-domain counterparts. The results highlight the unique capability of the time-domain imaging functional for high-fidelity tracking of evolving damage, and its relative robustness to sparse and reduced aperture data at moderate noise levels. A comparative analysis of the TLSM images against the multifrequency LSM maps of [22] further reveals that thanks to the full-waveform inversion in time and space, the TLSM generates images of remarkably higher quality with the same dataset.
Submitted 14 September, 2022;
originally announced September 2022.
-
Graphon Mean-Field Control for Cooperative Multi-Agent Reinforcement Learning
Authors:
Yuanquan Hu,
Xiaoli Wei,
Junji Yan,
Hengxi Zhang
Abstract:
The marriage between mean-field theory and reinforcement learning has shown a great capacity to solve large-scale control problems with homogeneous agents. To break the homogeneity restriction of mean-field theory, there has been recent interest in introducing graphon theory to the mean-field paradigm. In this paper, we propose a graphon mean-field control (GMFC) framework to approximate cooperative multi-agent reinforcement learning (MARL) with nonuniform interactions and show that the approximation order is $\mathcal{O}(\frac{1}{\sqrt{N}})$, with $N$ the number of agents. By discretizing the graphon index of GMFC, we further introduce a smaller class of GMFC called block GMFC, which is shown to approximate cooperative MARL well. Our empirical studies on several examples demonstrate that our GMFC approach is comparable with state-of-the-art MARL algorithms while enjoying better scalability.
Submitted 11 September, 2022;
originally announced September 2022.
-
Privacy-Preserving Deep Learning Model for Covid-19 Disease Detection
Authors:
Vijay Srinivas Tida,
Sai Venkatesh Chilukoti,
Sonya Hsu,
Xiali Hei
Abstract:
Recent studies demonstrated that X-ray radiography showed higher accuracy than Polymerase Chain Reaction (PCR) testing for COVID-19 detection. Therefore, applying deep learning models to X-rays and radiography images increases the speed and accuracy of determining COVID-19 cases. However, because of Health Insurance Portability and Accountability Act (HIPAA) compliance, hospitals were unwilling to share patient data owing to privacy concerns. To maintain privacy, we propose differentially private deep learning models to secure the patients' private information. A dataset from the Kaggle website is used to evaluate the designed model for COVID-19 detection. The EfficientNet model version was selected according to its highest test accuracy. Differential privacy constraints were then injected into the best-obtained model to evaluate performance. The accuracy is recorded while varying the trainable layers, the privacy loss, and the limit on information from each sample. We obtained 84% accuracy with a privacy loss of 10 during the fine-tuning process.
Submitted 9 October, 2022; v1 submitted 7 September, 2022;
originally announced September 2022.
-
Titanium-based kagome superconductor CsTi_3Bi_5 and topological states
Authors:
Haitao Yang,
Zhen Zhao,
Xin-Wei Yi,
Jiali Liu,
Jing-Yang You,
Yuhang Zhang,
Hui Guo,
Xiao Lin,
Chengmin Shen,
Hui Chen,
Xiaoli Dong,
Gang Su,
Hong-Jun Gao
Abstract:
Since the discovery of a new family of vanadium-based kagome superconductors AV3Sb5 (A=K, Rb, and Cs) with topological band structures, extensive effort has been devoted to exploring the origin of superconducting states and the intertwined orders. Meanwhile, searching for new types of superconductors with novel physical properties and higher superconducting transition temperatures has always been a major thread in the history of superconductor research. Here we report the successful fabrication and the topological states of a titanium-based kagome metal CsTi3Bi5 (CT3B5) crystal. The as-grown CT3B5 crystal is of high quality and possesses a perfect two-dimensional kagome net of titanium. The CT3B5 crystal is superconducting with a critical temperature Tc of ~4.8 K. First-principles calculations predict that CT3B5 has robust topological surface states, implying that CT3B5 is a Z2 topological kagome superconductor. This finding provides a new type of superconductor and a basis for exploring the origin of superconductivity and topological states in kagome superconductors.
Submitted 8 September, 2022;
originally announced September 2022.
-
Kernel-Segregated Transpose Convolution Operation
Authors:
Vijay Srinivas Tida,
Sai Venkatesh Chilukoti,
Xiali Hei,
Sonya Hsu
Abstract:
Transpose convolution has shown prominence in many deep learning applications. However, transpose convolution layers are computationally intensive because the feature map size increases when zeros are added after each element in each row and column. Thus, the convolution operation on the expanded input feature map leads to poor utilization of hardware resources. The main reason for unnecessary multiplication operations is the zeros at predefined positions in the input feature map. We propose an algorithmic-level optimization technique for an effective transpose convolution implementation to solve these problems. Based on kernel activations, we segregate the original kernel into four sub-kernels. This scheme can reduce memory requirements and unnecessary multiplications. Our proposed method achieved $3.09\times$ ($3.02\times$) faster computation using the Titan X GPU (Intel Dual Core CPU) with a flower dataset from the Kaggle website. Furthermore, the proposed optimization method can be generalized to existing devices without additional hardware requirements. A simple deep learning model containing one transpose convolution layer was used to evaluate the optimization method. It showed $2.2 \times$ faster training using the MNIST dataset with an Intel Dual-core CPU than the conventional implementation.
Submitted 12 October, 2022; v1 submitted 8 September, 2022;
originally announced September 2022.
-
MSSPN: Automatic First Arrival Picking using Multi-Stage Segmentation Picking Network
Authors:
Hongtao Wang,
Jiangshe Zhang,
Xiaoli Wei,
Chunxia Zhang,
Zhenbo Guo,
Li Long,
Yicheng Wang
Abstract:
Picking the first arrival times of prestack gathers is called First Arrival Time (FAT) picking, which is an indispensable step in seismic data processing and was mainly done manually in the past. With the increasing density of seismic data collection, the efficiency of manual picking can no longer meet practical needs. Therefore, automatic picking methods have been greatly developed in recent decades, especially those based on deep learning. However, few of the current supervised deep learning-based methods can avoid the dependence on labeled samples. Besides, since gather data is a set of signals that differ greatly from natural images, it is difficult for current methods to solve the FAT picking problem in the case of a low Signal to Noise Ratio (SNR). In this paper, for hard rock seismic gather data, we propose a Multi-Stage Segmentation Pickup Network (MSSPN), which solves the generalization problem across worksites and the picking problem in the case of low SNR. In MSSPN, there are four sub-models to simulate the manual picking process, which is assumed to consist of four stages from coarse to fine. Experiments on seven field datasets of different qualities show that our MSSPN outperforms benchmarks by a large margin. In particular, our method can achieve more than 90% accurate picking across worksites in the case of medium and high SNRs, and even the fine-tuned model can achieve 88% accurate picking on the dataset with low SNR.
Submitted 7 September, 2022;
originally announced September 2022.
-
Asphaltene precipitation under controlled mixing conditions in a microchamber
Authors:
Jia Meng,
Chiranjeevi Kanike,
Somasekhara Goud Sontti,
Arnab Atta,
Xiaoli Tan,
Xuehua Zhang
Abstract:
Solvent exchange is a controlled process for dilution-induced phase separation. This work utilizes the solvent exchange method to reveal the effect of the mixing dynamics on the asphaltene precipitation process under 20 different mixing conditions using a model system of n-heptane and asphaltene in toluene. The external mixing between the asphaltene solution and the paraffinic solvent is strictly controlled. We employed a high-spatial-resolution total internal reflection fluorescence microscope to detect asphaltene precipitates with a resolution up to 200 nm. A multiphysics model is used to simulate the evolution of the oversaturation pulse in the solvent exchange process. Based on the simulation results, we predicted the effect of the flow rate, the dimension and orientation of the microfluidic chamber, and the temperature on the surface coverage and size distribution of asphaltene precipitates. The model predictions for all factors agree with the experimental observations. The local concentration of the solvent and shear forces are found to be the two main reasons for the change in asphaltene precipitation caused by mixing dynamics. However, the influence of thermodynamics is more critical than that of the mixing dynamics as temperature changes. Through a combination of experimental and simulation studies, this work illuminates the significance of the transport process for the final morphology of asphaltene precipitates and provides in-depth insight into the mechanism by which mixing dynamics affect asphaltene precipitation. Smart mixing may boost new phase formation without excessive solvent consumption.
Submitted 31 August, 2022;
originally announced September 2022.
-
A Preliminary Study on the Potential Usefulness of Open Domain Model for Missing Software Requirements Recommendation
Authors:
Ziyan Zhao,
Li Zhang,
Xiaoli Lian
Abstract:
Completeness is one of the most important attributes of software requirement specifications. Unfortunately, incompleteness is also one of the most difficult problems to detect. Some approaches have been proposed to detect missing requirements based on a requirement-oriented domain model. However, models of this kind are lacking for many domains. Fortunately, domain models constructed for different purposes can usually be found online. This raises a question: are these domain models helpful in finding the missing functional information in a requirement specification? To explore this question, we design and conduct a preliminary study by computing the overlapping rate between the entities in domain models and the concepts of natural language software requirements, and then digging into four regularities in the occurrence of these entities (concepts) based on two example domains. The usefulness of these regularities, especially the one based on our proposed metric AHME (with F2 gains of 146% and 223% on the two domains compared with using no regularity), has been shown in experiments.
Submitted 13 August, 2022;
originally announced August 2022.
-
Self-supervised Contrastive Representation Learning for Semi-supervised Time-Series Classification
Authors:
Emadeldeen Eldele,
Mohamed Ragab,
Zhenghua Chen,
Min Wu,
Chee-Keong Kwoh,
Xiaoli Li,
Cuntai Guan
Abstract:
Learning time-series representations when only unlabeled data or few labeled samples are available can be a challenging task. Recently, contrastive self-supervised learning has shown great improvement in extracting useful representations from unlabeled data via contrasting different augmented views of data. In this work, we propose a novel Time-Series representation learning framework via Temporal and Contextual Contrasting (TS-TCC) that learns representations from unlabeled data with contrastive learning. Specifically, we propose time-series-specific weak and strong augmentations and use their views to learn robust temporal relations in the proposed temporal contrasting module, besides learning discriminative representations by our proposed contextual contrasting module. Additionally, we conduct a systematic study of time-series data augmentation selection, which is a key part of contrastive learning. We also extend TS-TCC to the semi-supervised learning settings and propose a Class-Aware TS-TCC (CA-TCC) that benefits from the available few labeled data to further improve representations learned by TS-TCC. Specifically, we leverage the robust pseudo labels produced by TS-TCC to realize a class-aware contrastive loss. Extensive experiments show that the linear evaluation of the features learned by our proposed framework performs comparably with the fully supervised training. Additionally, our framework shows high efficiency in the few labeled data and transfer learning scenarios. The code is publicly available at https://github.com/emadeldeen24/CA-TCC.
Submitted 2 September, 2023; v1 submitted 13 August, 2022;
originally announced August 2022.
-
Leveraging Endo- and Exo-Temporal Regularization for Black-box Video Domain Adaptation
Authors:
Yuecong Xu,
Jianfei Yang,
Haozhi Cao,
Min Wu,
Xiaoli Li,
Lihua Xie,
Zhenghua Chen
Abstract:
To enable video models to be applied seamlessly across video tasks in different environments, various Video Unsupervised Domain Adaptation (VUDA) methods have been proposed to improve the robustness and transferability of video models. Despite improvements made in model robustness, these VUDA methods require access to both source data and source model parameters for adaptation, raising serious data privacy and model portability issues. To cope with the above concerns, this paper first formulates Black-box Video Domain Adaptation (BVDA) as a more realistic yet challenging scenario where the source video model is provided only as a black-box predictor. While a few methods for Black-box Domain Adaptation (BDA) have been proposed in the image domain, these methods cannot be applied to the video domain since the video modality has more complicated temporal features that are harder to align. To address BVDA, we propose a novel Endo and eXo-TEmporal Regularized Network (EXTERN) by applying mask-to-mix strategies and video-tailored regularizations: endo-temporal regularization and exo-temporal regularization, performed across both clip and temporal features, while distilling knowledge from the predictions obtained from the black-box predictor. Empirical results demonstrate the state-of-the-art performance of EXTERN across various cross-domain closed-set and partial-set action recognition benchmarks, where it even surpasses most existing video domain adaptation methods that have access to source data.
Submitted 9 November, 2022; v1 submitted 10 August, 2022;
originally announced August 2022.
-
Phase diagrams on composition-spread Fe$_y$Te$_{1-x}$Se$_x$ films
Authors:
Zefeng Lin,
Sijia Tu,
Juan Xu,
Yujun Shi,
Beiyi Zhu,
Chao Dong,
Jie Yuan,
Xiaoli Dong,
Qihong Chen,
Yangmu Li,
Kui Jin,
Zhongxian Zhao
Abstract:
Fe$_y$Te$_{1-x}$Se$_x$, an archetypical iron-based high-temperature superconductor with a simple structure but rich physical properties, has attracted lots of attention because the two end compositions, Se content $x = 0$ and 1, exhibit antiferromagnetism and nematicity, respectively, making it an ideal candidate for studying their interactions with superconductivity. However, what is clearly lacking to date is a complete phase diagram of Fe$_y$Te$_{1-x}$Se$_x$ as functions of its chemical compositions since phase separation usually occurs from $x\sim 0.6$ to 0.9 in bulk crystals. Moreover, fine control of its composition is experimentally challenging because both Te and Se are volatile elements. Here we establish a complete phase diagram of Fe$_y$Te$_{1-x}$Se$_x$, achieved by high-throughput film synthesis and characterization techniques. An advanced combinatorial synthesis process enables us to fabricate an epitaxial composition-spread Fe$_y$Te$_{1-x}$Se$_x$ film encompassing the entire Se content $x$ from 0 to 1 on a single piece of CaF$_2$ substrate. The micro-region composition analysis and X-ray diffraction show a successful continuous tuning of chemical compositions and lattice parameters, respectively. The micro-scale pattern technique allows the mapping of electrical transport properties as a function of relative Se content with an unprecedented resolution of 0.0074. Combining with the spin patterns in literature, we build a detailed phase diagram that can unify the electronic and magnetic properties of Fe$_y$Te$_{1-x}$Se$_x$. Our composition-spread Fe$_y$Te$_{1-x}$Se$_x$ films, overcoming the challenges of phase separation and precise control of chemical compositions, provide an ideal platform for studying the relationship between superconductivity and magnetism.
Submitted 2 August, 2022;
originally announced August 2022.
-
Robust Newsvendor Problem in Global Market: Stable Operation Strategy for a Two-Market Stochastic System
Authors:
Xiaoli Yan
Abstract:
The global markets provide enterprises with selling opportunities and challenges in stabilizing operational strategies. From the perspective of production management, it is important to improve the profitability of an enterprise by exploiting the different timing of the selling season in different markets to develop an operational strategy that is optimized and configured on a global scale. This paper examines the above issue with an insightful model of selling the product to two markets (a primary and a secondary market) with multiple risks of changes in the market environment and nonoverlapping selling seasons. We refer to this problem as the "global robust newsvendor" problem. We provide closed-form solutions of the optimal operation strategy for demand-independent and demand-related scenarios for the above two market stochastic systems. The closed-form solutions fully reflect the influence of the relationship between supply and demand on strategy selection. We find that the demand correlation and the lack of demand information will not substantially affect the operation strategy, and the enterprise's industrial chain and supply chain remain stable. However, the reduction of inter-market tariffs or logistics costs will cause changes, and the existence of the secondary market will lead to more capacity planning in the primary market. In addition, our model explicitly considers the impact of exchange rate uncertainty on operating strategies.
Submitted 26 July, 2022; v1 submitted 8 July, 2022;
originally announced July 2022.
-
High Order Compact Finite Difference Methods for Non-Fickian Flows in Porous Media
Authors:
Xuan Zhao,
Ziyan Li,
Xiaoli Li
Abstract:
In this work, fourth-order compact block-centered finite difference (CBCFD) schemes combined with the Crank-Nicolson discretization are constructed and analyzed for solving parabolic integro-differential type non-Fickian flows in one-dimensional and two-dimensional cases. Stability analyses of the constructed schemes are derived rigorously. We also obtain the optimal second-order convergence in temporal increment and the fourth-order convergence in spatial direction for both velocity and pressure. To verify the validity of the CBCFD schemes, we present some experiments to show that the numerical results are in agreement with our theoretical analysis.
Submitted 3 July, 2022;
originally announced July 2022.
-
A Deep Learning Approach to Nonconvex Energy Minimization for Martensitic Phase Transitions
Authors:
Xiaoli Chen,
Phoebus Rosakis,
Zhizhang Wu,
Zhiwen Zhang
Abstract:
We propose a mesh-free method to solve nonconvex energy minimization problems for martensitic phase transitions and twinning in crystals, using the deep learning approach. These problems pose multiple challenges to both analysis and computation, as they involve multiwell gradient energies with large numbers of local minima, each involving a topologically complex microstructure of free boundaries with gradient jumps. We use the Deep Ritz method, whereby candidates for minimizers are represented by parameter-dependent deep neural networks, and the energy is minimized with respect to network parameters. The new essential ingredient is a novel activation function proposed here, which is a smoothened rectified linear unit we call SmReLU; this captures the structure of minimizers where usual activation functions fail. The method is mesh-free and thus can approximate free boundaries essential to this problem without any special treatment, and is extremely simple to implement. We show the results of many numerical computations demonstrating the success of our method.
Submitted 14 October, 2022; v1 submitted 24 June, 2022;
originally announced June 2022.
-
Impact of Channel Memory on the Data Freshness
Authors:
Qixing Guan,
Xiaoli Xu
Abstract:
In this letter, we investigate the impact of channel memory on the average age of information (AoI) for networks with various packet arrival models under first-come-first-served (FCFS) and preemptive last-generated-first-served (pLGFS) policies over Gilbert-Elliott (GE) erasure channel. For networks with Bernoulli arrival model, we first derive the average AoI under the pLGFS queuing policy, and then characterize the AoI gap between the FCFS and pLGFS policies. For networks with Bernoulli arrival and generate-at-will arrival models, the AoI performances under the FCFS and pLGFS policies are derived explicitly. For networks with periodic arrival model, we derive the closed-form expression for the average AoI under pLGFS over a general GE channel and propose a numerical algorithm for calculating that under FCFS efficiently. It is revealed that for pLGFS policy, the average AoI increases monotonically with channel memory $\eta$ at $\frac{\eta}{1-\eta}$ over the symmetric GE channel. For FCFS, the average AoI increases even faster due to the queuing delay, with an additional term related to the packet arrival rate.
Submitted 8 October, 2022; v1 submitted 26 June, 2022;
originally announced June 2022.
-
Single Threshold Packet Scheduling Policy for AoI Minimization in Resource-Constrained Network
Authors:
Yonghao Ji,
Xiaoli Xu
Abstract:
This paper investigates the tradeoff between the average age of information (AoI) and the transmission cost for networks with stochastic packet arrival and random erasure channel. Specifically, we model the resource-constrained AoI minimization problem as a constrained Markov decision process (CMDP) and propose a low-complexity single threshold packet scheduling policy for it. The key advantage of the proposed policy is its tractability and convenience for implementation. The AoI distribution and long-term average transmission cost of the proposed policy are derived as closed-form functions of the selected threshold. Furthermore, we show that the proposed policy reduces to the optimal policies under special settings and achieves close-to-optimal performance under general settings.
Submitted 25 June, 2022;
originally announced June 2022.
-
LPCSE: Neural Speech Enhancement through Linear Predictive Coding
Authors:
Yang Liu,
Na Tang,
Xiaoli Chu,
Yang Yang,
Jun Wang
Abstract:
The increasingly stringent requirement on quality-of-experience in 5G/B5G communication systems has led to emerging neural speech enhancement techniques, which, however, have been developed in isolation from the existing expert-rule based models of speech pronunciation and distortion, such as the classic Linear Predictive Coding (LPC) speech model, because it is difficult to integrate these models with auto-differentiable machine learning frameworks. In this paper, to improve the efficiency of neural speech enhancement, we introduce an LPC-based speech enhancement (LPCSE) architecture, which leverages the strong inductive biases in the LPC speech model in conjunction with the expressive power of neural networks. Differentiable end-to-end learning is achieved in LPCSE via two novel blocks: a block that utilizes the expert rules to reduce the computational overhead when integrating the LPC speech model into neural networks, and a block that ensures the stability of the model and avoids exploding gradients in end-to-end training by mapping the linear prediction coefficients to the filter poles. The experimental results show that LPCSE successfully restores the formants of speech distorted by transmission loss, and outperforms two existing neural speech enhancement methods of comparable neural network sizes in terms of the Perceptual Evaluation of Speech Quality (PESQ) and Short-Time Objective Intelligibility (STOI) on the LJ Speech corpus.
Submitted 22 June, 2022; v1 submitted 14 June, 2022;
originally announced June 2022.