Search | arXiv e-print repository

doi 10.1038/s41467-022-35181-w

Finding shortest and nearly shortest path nodes in large substantially incomplete networks

Authors: Maksim Kitsak, Alexander Ganin, Ahmed Elmokashfi, Hongzhu Cui, Daniel A. Eisenberg, David L. Alderson, Dmitry Korkin, Igor Linkov

Abstract: Dynamic processes on networks, be it information transfer in the Internet, contagious spreading in a social network, or neural signaling, take place along shortest or nearly shortest paths. Unfortunately, our maps of most large networks are substantially incomplete due to either the highly dynamic nature of networks, or high cost of network measurements, or both, rendering traditional path finding… ▽ More Dynamic processes on networks, be it information transfer in the Internet, contagious spreading in a social network, or neural signaling, take place along shortest or nearly shortest paths. Unfortunately, our maps of most large networks are substantially incomplete due to either the highly dynamic nature of networks, or high cost of network measurements, or both, rendering traditional path finding methods inefficient. We find that shortest paths in large real networks, such as the network of protein-protein interactions (PPI) and the Internet at the autonomous system (AS) level, are not random but are organized according to latent-geometric rules. If nodes of these networks are mapped to points in latent hyperbolic spaces, shortest paths in them align along geodesic curves connecting endpoint nodes. We find that this alignment is sufficiently strong to allow for the identification of shortest path nodes even in the case of substantially incomplete networks. We demonstrate the utility of latent-geometric path-finding in problems of cellular pathway reconstruction and communication security. △ Less

Submitted 8 April, 2022; originally announced April 2022.

arXiv:2204.02864 [pdf, ps, other]

doi 10.1088/1361-6455/acbcb3

Optical Stern-Gerlach effect via a single traveling-wave light

Authors: Haihu Cui, Wenxi Lai

Abstract: In this paper, we propose a simplified model of optical Stern-Gerlach effect based on coherent coupling between clock transition of alkaline-earth single atoms and a traveling-wave light. It is demonstrated that spin-orbit coupling induced chiral motion in atom deflection appears under the strong atom-light interaction. The strong optical driving removes perturbation from the Doppler effect and ba… ▽ More In this paper, we propose a simplified model of optical Stern-Gerlach effect based on coherent coupling between clock transition of alkaline-earth single atoms and a traveling-wave light. It is demonstrated that spin-orbit coupling induced chiral motion in atom deflection appears under the strong atom-light interaction. The strong optical driving removes perturbation from the Doppler effect and back action effect to access the coherent system. In this process, superposition of distant matter waves connected to the arbitrary distribution of atom internal state could be predicted, which is important for the realization of atom interferometry and quantum state operation. The influence from atom relaxation and atom-atom interactions are discussed. Basic conditions of experimental design are given in the end of this work. △ Less

Submitted 12 February, 2023; v1 submitted 6 April, 2022; originally announced April 2022.

Comments: 11 pages, 8 figures

arXiv:2203.16020 [pdf]

Shell DFT-1/2 method towards engineering accuracy for semiconductors: GGA versus LDA

Authors: Hanli Cui, Shengxin Yang, Jun-Hui Yuan, Li-Heng Li, Fan Ye, **hai Huang, Kan-Hao Xue, Xiangshui Miao

Abstract: The Kohn-Sham gaps of density functional theory (DFT) obtained in terms of local density approximation (LDA) or generalized gradient approximation (GGA) cannot be directly linked to the fundamental gaps of semiconductors, but in engineering there is a strong demand to match them through certain rectification methods. Shell DFT-1/2 (shDFT-1/2), as a variant of DFT-1/2, is a potential candidate to y… ▽ More The Kohn-Sham gaps of density functional theory (DFT) obtained in terms of local density approximation (LDA) or generalized gradient approximation (GGA) cannot be directly linked to the fundamental gaps of semiconductors, but in engineering there is a strong demand to match them through certain rectification methods. Shell DFT-1/2 (shDFT-1/2), as a variant of DFT-1/2, is a potential candidate to yield much improved band gaps for covalent semiconductors, but its accuracy depends on the LDA/GGA ground state, including optimized lattice parameters, basic Kohn-Sham gap before self-energy correction and the amount of self-energy correction that is specific to the exchange-correlation (XC) functional. In this work, we test the LDA/GGA as well as shDFT-1/2 results of six technically important covalent semiconductors Si, Ge, GaN, GaP, GaAs and GaSb, with an additional ionic insulator LiF for comparison. The impact of XC flavor (LDA, PBEsol, PBE and RPBE), either directly on the gap value, or indirectly through the optimized lattice constant, is examined comprehensively. Moreover, we test the impact of XC flavor on LDA/GGA and shDFT-1/2 gaps under the condition of fixed experimental lattice constants. In-depth analysis reveals the rule of reaching the best accuracy in calculating the electronic band structures of typical covalent semiconductors. Relevant parameters like lattice constant, self-consistency in shDFT-1/2 runs, as well as the exchange enhancement factor of GGA, are discussed in details. △ Less

Submitted 29 March, 2022; originally announced March 2022.

Comments: 23 pages, 10 figures

arXiv:2203.14802 [pdf, other]

doi 10.1016/j.physa.2023.128724

"Born in Rome" or "Slee** Beauty": Emergence of hashtag popularity on the Chinese microblog Sina Weibo

Authors: Hao Cui, János Kertész

Abstract: To understand the emergence of hashtag popularity in online social networking complex systems, we study the largest Chinese microblogging site Sina Weibo, which has a Hot Search List (HSL) showing in real time the ranking of the 50 most popular hashtags based on search activity. We investigate the prehistory of successful hashtags from 17 July 2020 to 17 September 2020 by map** out the related i… ▽ More To understand the emergence of hashtag popularity in online social networking complex systems, we study the largest Chinese microblogging site Sina Weibo, which has a Hot Search List (HSL) showing in real time the ranking of the 50 most popular hashtags based on search activity. We investigate the prehistory of successful hashtags from 17 July 2020 to 17 September 2020 by map** out the related interaction network preceding the selection to HSL. We have found that the circadian activity pattern has an impact on the time needed to get to the HSL. When analyzing this time we distinguish two extreme categories: a) "Born in Rome", which means hashtags are mostly first created by super-hubs or reach super-hubs at an early stage during their propagation and thus gain immediate wide attention from the broad public, and b) "Slee** Beauty", meaning the hashtags gain little attention at the beginning and reach system-wide popularity after a considerable time lag. The evolution of the repost networks of successful hashtags before getting to the HSL show two types of growth patterns: "smooth" and "stepwise". The former is usually dominated by a super-hub and the latter results from consecutive waves of contributions of smaller hubs. The repost networks of unsuccessful hashtags exhibit a simple evolution pattern. △ Less

Submitted 9 November, 2022; v1 submitted 28 March, 2022; originally announced March 2022.

Comments: Main paper 12 pages, 7 figures. Supplementary information 5 pages, 3 figures

arXiv:2203.01469 [pdf, other]

doi 10.1007/JHEP11(2022)100

The $Higgs\to b\bar{b}, c\bar{c}, gg$ measurement at CEPC

Authors: Yongfeng Zhu, Hanhua Cui, Manqi Ruan

Abstract: Accurately measuring the properties of the Higgs boson is one of the core physics objectives of the Circular Electron Positron Collider (CEPC). As a Higgs factory, the CEPC is expected to operate at a centre-of-mass energy of $240\,GeV$, deliver an integrated luminosity of $5.6\,ab^{-1}$, and produce one million Higgs bosons according to the CEPC Conceptual Design Report (CDR). Combining measureme… ▽ More Accurately measuring the properties of the Higgs boson is one of the core physics objectives of the Circular Electron Positron Collider (CEPC). As a Higgs factory, the CEPC is expected to operate at a centre-of-mass energy of $240\,GeV$, deliver an integrated luminosity of $5.6\,ab^{-1}$, and produce one million Higgs bosons according to the CEPC Conceptual Design Report (CDR). Combining measurements of the $\ell^+\ell^-H$, $ν\barν H$, and $q\bar{q}H$ channels, we conclude that the signal strength of $H\to b\bar{b}/c\bar{c}/gg$ can be measured with a relative accuracy (statistic uncertainty only) of 0.27\%/4.03\%/1.56\%. Extrapolating to the recently released TDR operating parameters corresponding to the integrated luminosity of $20\,ab^{-1}$, the relative accuracy of $H\to b\bar{b}/c\bar{c}/gg$ signal strength is 0.14\%/2.13\%/0.82\%. We analyze the dependence of the expected accuracies on the critical detector performances: Color Singlet Identification (CSI) for the $q\bar{q}H$ channel and flavor tagging for both $ν\barν H$ and $q\bar{q}H$ channels. We observe that compared to the baseline CEPC detector performance, ideal flavor tagging increases the $H\to b\bar{b}/c\bar{c}/gg$ signal strength accuracy by 2\%/63\%/13\% in the $ν\barν H$ channel and 35\%/122\%/181\% in the $q\bar{q}H$ channel. In addition, better performance of CSI can significantly improve the anticipated accuracy of signal strength. The relevant systematics are also discussed in this paper. △ Less

Submitted 14 November, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

arXiv:2203.01292 [pdf, other]

Andes_gym: A Versatile Environment for Deep Reinforcement Learning in Power Systems

Authors: Hantao Cui, Yichen Zhang

Abstract: This paper presents Andes_gym, a versatile and high-performance reinforcement learning environment for power system studies. The environment leverages the modeling and simulation capability of ANDES and the reinforcement learning (RL) environment OpenAI Gym to enable the prototy** and demonstration of RL algorithms for power systems. The architecture of the proposed software tool is elaborated t… ▽ More This paper presents Andes_gym, a versatile and high-performance reinforcement learning environment for power system studies. The environment leverages the modeling and simulation capability of ANDES and the reinforcement learning (RL) environment OpenAI Gym to enable the prototy** and demonstration of RL algorithms for power systems. The architecture of the proposed software tool is elaborated to provide the observation and action interfaces for RL algorithms. An example is shown to rapidly prototype a load-frequency control algorithm based on RL trained by available algorithms. The proposed environment is highly generalized by supporting all the power system dynamic models available in ANDES and numerous RL algorithms available for OpenAI Gym. △ Less

Submitted 2 March, 2022; originally announced March 2022.

Comments: 5 pages, 7 figures, accepted by 2022 IEEE Power and Energy Society General Meeting

arXiv:2202.04044 [pdf]

Aging Scientists and Slowed Advance

Authors: Haochuan Cui, Lingfei Wu, James A. Evans

Abstract: What is the relationship between aging and the character of scientific advance? Prior research focuses on star scientists, their changing dates, and rates of breakthrough success through history. Analyzing more than 244 million scholars across 241 million articles over the last two centuries, we show that for all fields, periods, and impact levels, scientists research ideas and references age over… ▽ More What is the relationship between aging and the character of scientific advance? Prior research focuses on star scientists, their changing dates, and rates of breakthrough success through history. Analyzing more than 244 million scholars across 241 million articles over the last two centuries, we show that for all fields, periods, and impact levels, scientists research ideas and references age over time, their research is less likely to disrupt the state of science and more likely to criticize emerging work. Early success accelerates scientist aging; while changing institutions and fields and collaborating with young scientists slows it. These patterns aggregate within fields such that those with a higher proportion of older scientists experience a lower churn of ideas and more rapid individual aging, suggesting a universal link between aging, activity, and advance. △ Less

Submitted 8 February, 2022; originally announced February 2022.

Comments: 37 pages, 18 figures

arXiv:2202.00752 [pdf]

doi 10.1103/PhysRevLett.128.035703

Ultrahigh-Pressure Magnesium Hydrosilicates as Reservoirs of Water in Early Earth

Authors: Han-Fei Li, Artem R. Oganov, Haixu Cui, Xiang-Feng Zhou, Xiao Dong, Hui-Tian Wang

Abstract: The origin of water on the Earth is a long-standing mystery, requiring a comprehensive search for hydrous compounds, stable at conditions of the deep Earth and made of Earth-abundant elements. Previous studies usually focused on the current range of pressure-temperature conditions in the Earth's mantle and ignored a possible difference in the past, such as the stage of the core-mantle separation.… ▽ More The origin of water on the Earth is a long-standing mystery, requiring a comprehensive search for hydrous compounds, stable at conditions of the deep Earth and made of Earth-abundant elements. Previous studies usually focused on the current range of pressure-temperature conditions in the Earth's mantle and ignored a possible difference in the past, such as the stage of the core-mantle separation. Here, using ab initio evolutionary structure prediction, we find that only two magnesium hydrosilicate phases are stable at megabar pressures, $α$-Mg$_2$SiO$_5$H$_2$ and $β$-Mg$_2$SiO$_5$H$_2$, stable at 262-338 GPa and >338 GPa,respectively (all these pressures now lie within the Earth's iron core). Both are superionic conductors with quasi-one-dimensional proton diffusion at relevant conditions. In the first 30 million years of Earth's history, before the Earth's core was formed, these must have existed in the Earth, hosting much of Earth's water. As dense iron alloys segregated to form the Earth's core, Mg$_2$SiO$_5$H$_2$ phases decomposed and released water. Thus, now-extinct Mg$_2$SiO$_5$H$_2$ phases have likely contributed in a major way to the evolution of our planet. △ Less

Submitted 30 January, 2022; originally announced February 2022.

Journal ref: Phys. Rev. Lett. 128, 035703 (2022)

arXiv:2201.12655 [pdf, other]

doi 10.1088/2632-2153/acf041

Error Scaling Laws for Kernel Classification under Source and Capacity Conditions

Authors: Hugo Cui, Bruno Loureiro, Florent Krzakala, Lenka Zdeborová

Abstract: We consider the problem of kernel classification. While worst-case bounds on the decay rate of the prediction error with the number of samples are known for some classifiers, they often fail to accurately describe the learning curves of real data sets. In this work, we consider the important class of data sets satisfying the standard source and capacity conditions, comprising a number of real data… ▽ More We consider the problem of kernel classification. While worst-case bounds on the decay rate of the prediction error with the number of samples are known for some classifiers, they often fail to accurately describe the learning curves of real data sets. In this work, we consider the important class of data sets satisfying the standard source and capacity conditions, comprising a number of real data sets as we show numerically. Under the Gaussian design, we derive the decay rates for the misclassification (prediction) error as a function of the source and capacity coefficients. We do so for two standard kernel classification settings, namely margin-maximizing Support Vector Machines (SVM) and ridge classification, and contrast the two methods. We find that our rates tightly describe the learning curves for this class of data sets, and are also observed on real data. Our results can also be seen as an explicit prediction of the exponents of a scaling law for kernel classification that is accurate on some real datasets. △ Less

Submitted 6 September, 2023; v1 submitted 29 January, 2022; originally announced January 2022.

Journal ref: Mach. Learn.: Sci. Technol. (2023) 4 035033

arXiv:2201.07073 [pdf]

doi 10.3390/cryst12010102

A Discrepancy in Thermal Conductivity Measurement Data of Quantum Spin Liquid $β$'-EtMe$_3$Sb[Pd(dmit)$_2$]$_2$ (dmit = 1,3-Dithiol-2-thione-4,5-dithiolate)

Authors: Reizo Kato, Masashi Uebe, Shigeki Fujiyama, Hengbo Cui

Abstract: A molecular Mott insulator $β$'-EtMe$_3$Sb[Pd(dmit)$_2$]$_2$ is a quantum spin liquid candidate. In 2010, it was reported that thermal conductivity of $β$'-EtMe$_3$Sb[Pd(dmit)$_2$]$_2$ is characterized by its large value and gapless behavior (a finite temperature-linear term). In 2019, however, two other research groups reported opposite data (much smaller value and a vanishingly small temperature… ▽ More A molecular Mott insulator $β$'-EtMe$_3$Sb[Pd(dmit)$_2$]$_2$ is a quantum spin liquid candidate. In 2010, it was reported that thermal conductivity of $β$'-EtMe$_3$Sb[Pd(dmit)$_2$]$_2$ is characterized by its large value and gapless behavior (a finite temperature-linear term). In 2019, however, two other research groups reported opposite data (much smaller value and a vanishingly small temperature-linear term) and the discrepancy in the thermal conductivity measurement data emerges as a serious problem concerning the ground state of the quantum spin liquid. Recently, the cooling rate was proposed to be an origin of the discrepancy. We examined effects of the cooling rate on electrical resistivity, low-temperature crystal structure, and $^{13}$C-NMR measurements and could not find any significant cooling rate dependence. △ Less

Submitted 18 January, 2022; originally announced January 2022.

Comments: 8 pages, 5 figures

Journal ref: Crystals 12, 102 (2022)

arXiv:2201.04672 [pdf, other]

How Can Graph Neural Networks Help Document Retrieval: A Case Study on CORD19 with Concept Map Generation

Authors: Hejie Cui, Jiaying Lu, Yao Ge, Carl Yang

Abstract: Graph neural networks (GNNs), as a group of powerful tools for representation learning on irregular data, have manifested superiority in various downstream tasks. With unstructured texts represented as concept maps, GNNs can be exploited for tasks like document retrieval. Intrigued by how can GNNs help document retrieval, we conduct an empirical study on a large-scale multi-discipline dataset CORD… ▽ More Graph neural networks (GNNs), as a group of powerful tools for representation learning on irregular data, have manifested superiority in various downstream tasks. With unstructured texts represented as concept maps, GNNs can be exploited for tasks like document retrieval. Intrigued by how can GNNs help document retrieval, we conduct an empirical study on a large-scale multi-discipline dataset CORD-19. Results show that instead of the complex structure-oriented GNNs such as GINs and GATs, our proposed semantics-oriented graph functions achieve better and more stable performance based on the BM25 retrieved candidates. Our insights in this case study can serve as a guideline for future work to develop effective GNNs with appropriate semantics-oriented inductive biases for textual reasoning tasks like document retrieval and classification. All code for this case study is available at https://github.com/HennyJie/GNN-DocRetrieval. △ Less

Submitted 12 January, 2022; originally announced January 2022.

Comments: This paper has been accepted to the 44th European Conference on Information Retrieval (ECIR) 2022

MSC Class: 68T50; 68T37; 68T01; 68P20 ACM Class: H.3.3; I.7; I.2.7; I.2.6; I.2.4

arXiv:2112.12359 [pdf, other]

Dual Path Structural Contrastive Embeddings for Learning Novel Objects

Authors: Bingbin Li, Elvis Han Cui, Yanan Li, Donghui Wang, Weng Kee Wong

Abstract: Learning novel classes from a very few labeled samples has attracted increasing attention in machine learning areas. Recent research on either meta-learning based or transfer-learning based paradigm demonstrates that gaining information on a good feature space can be an effective solution to achieve favorable performance on few-shot tasks. In this paper, we propose a simple but effective paradigm… ▽ More Learning novel classes from a very few labeled samples has attracted increasing attention in machine learning areas. Recent research on either meta-learning based or transfer-learning based paradigm demonstrates that gaining information on a good feature space can be an effective solution to achieve favorable performance on few-shot tasks. In this paper, we propose a simple but effective paradigm that decouples the tasks of learning feature representations and classifiers and only learns the feature embedding architecture from base classes via the typical transfer-learning training strategy. To maintain both the generalization ability across base and novel classes and discrimination ability within each class, we propose a dual path feature learning scheme that effectively combines structural similarity with contrastive feature construction. In this way, both inner-class alignment and inter-class uniformity can be well balanced, and result in improved performance. Experiments on three popular benchmarks show that when incorporated with a simple prototype based classifier, our method can still achieve promising results for both standard and generalized few-shot problems in either an inductive or transductive inference setting. △ Less

Submitted 4 January, 2022; v1 submitted 22 December, 2021; originally announced December 2021.

arXiv:2112.04785 [pdf, other]

VMAgent: Scheduling Simulator for Reinforcement Learning

Authors: Junjie Sheng, Shengliang Cai, Haochuan Cui, Wenhao Li, Yun Hua, Bo **, Wenli Zhou, Yiqiu Hu, Lei Zhu, Qian Peng, Hongyuan Zha, Xiangfeng Wang

Abstract: A novel simulator called VMAgent is introduced to help RL researchers better explore new methods, especially for virtual machine scheduling. VMAgent is inspired by practical virtual machine (VM) scheduling tasks and provides an efficient simulation platform that can reflect the real situations of cloud computing. Three scenarios (fading, recovering, and expansion) are concluded from practical clou… ▽ More A novel simulator called VMAgent is introduced to help RL researchers better explore new methods, especially for virtual machine scheduling. VMAgent is inspired by practical virtual machine (VM) scheduling tasks and provides an efficient simulation platform that can reflect the real situations of cloud computing. Three scenarios (fading, recovering, and expansion) are concluded from practical cloud computing and corresponds to many reinforcement learning challenges (high dimensional state and action spaces, high non-stationarity, and life-long demand). VMAgent provides flexible configurations for RL researchers to design their customized scheduling environments considering different problem features. From the VM scheduling perspective, VMAgent also helps to explore better learning-based scheduling solutions. △ Less

Submitted 9 December, 2021; originally announced December 2021.

arXiv:2112.02767 [pdf, other]

A General Framework for Debiasing in CTR Prediction

Authors: Wenjie Chu, Shen Li, Chao Chen, Longfei Xu, Hengbin Cui, Kaikui Liu

Abstract: Most of the existing methods for debaising in click-through rate (CTR) prediction depend on an oversimplified assumption, i.e., the click probability is the product of observation probability and relevance probability. However, since there is a complicated interplay between these two probabilities, these methods cannot be applied to other scenarios, e.g. query auto completion (QAC) and route recom… ▽ More Most of the existing methods for debaising in click-through rate (CTR) prediction depend on an oversimplified assumption, i.e., the click probability is the product of observation probability and relevance probability. However, since there is a complicated interplay between these two probabilities, these methods cannot be applied to other scenarios, e.g. query auto completion (QAC) and route recommendation. We propose a general debiasing framework without simplifying the relationships between variables, which can handle all scenarios in CTR prediction. Simulation experiments show that: under the simplest scenario, our method maintains a similar AUC with the state-of-the-art methods; in other scenarios, our method achieves considerable improvements compared with existing methods. Meanwhile, in online experiments, the framework also gains significant improvements consistently. △ Less

Submitted 5 December, 2021; originally announced December 2021.

arXiv:2111.13970 [pdf, other]

doi 10.5445/KSP/1000138532

Label Assistant: A Workflow for Assisted Data Annotation in Image Segmentation Tasks

Authors: Marcel P. Schilling, Luca Rettenberger, Friedrich Münke, Haijun Cui, Anna A. Popova, Pavel A. Levkin, Ralf Mikut, Markus Reischl

Abstract: Recent research in the field of computer vision strongly focuses on deep learning architectures to tackle image processing problems. Deep neural networks are often considered in complex image processing scenarios since traditional computer vision approaches are expensive to develop or reach their limits due to complex relations. However, a common criticism is the need for large annotated datasets… ▽ More Recent research in the field of computer vision strongly focuses on deep learning architectures to tackle image processing problems. Deep neural networks are often considered in complex image processing scenarios since traditional computer vision approaches are expensive to develop or reach their limits due to complex relations. However, a common criticism is the need for large annotated datasets to determine robust parameters. Annotating images by human experts is time-consuming, burdensome, and expensive. Thus, support is needed to simplify annotation, increase user efficiency, and annotation quality. In this paper, we propose a generic workflow to assist the annotation process and discuss methods on an abstract level. Thereby, we review the possibilities of focusing on promising samples, image pre-processing, pre-labeling, label inspection, or post-processing of annotations. In addition, we present an implementation of the proposal by means of a developed flexible and extendable software prototype nested in hybrid touchscreen/laptop device. △ Less

Submitted 27 November, 2021; originally announced November 2021.

Journal ref: Proceedings - 31. Workshop Computational Intelligence, 2021

arXiv:2111.06717 [pdf, other]

doi 10.1073/pnas.2205463120

Device-Independent-Quantum-Randomness-Enhanced Zero-Knowledge Proof

Authors: Cheng-Long Li, Kai-Yi Zhang, Xingjian Zhang, Kui-Xing Yang, Yu Han, Su-Yi Cheng, Hongrui Cui, Wen-Zhao Liu, Ming-Han Li, Yang Liu, Bing Bai, Hai-Hao Dong, Jun Zhang, Xiongfeng Ma, Yu Yu, **gyun Fan, Qiang Zhang, Jian-Wei Pan

Abstract: Zero-knowledge proof (ZKP) is a fundamental cryptographic primitive that allows a prover to convince a verifier of the validity of a statement without leaking any further information. As an efficient variant of ZKP, non-interactive zero-knowledge proof (NIZKP) adopting the Fiat-Shamir heuristic is essential to a wide spectrum of applications, such as federated learning, blockchain and social netwo… ▽ More Zero-knowledge proof (ZKP) is a fundamental cryptographic primitive that allows a prover to convince a verifier of the validity of a statement without leaking any further information. As an efficient variant of ZKP, non-interactive zero-knowledge proof (NIZKP) adopting the Fiat-Shamir heuristic is essential to a wide spectrum of applications, such as federated learning, blockchain and social networks. However, the heuristic is typically built upon the random oracle model making ideal assumptions about hash functions, which does not hold in reality and thus undermines the security of the protocol. Here, we present a quantum resolution to the problem. Instead of resorting to a random oracle model, we implement a quantum randomness service. This service generates random numbers certified by the loophole-free Bell test and delivers them with postquantum cryptography (PQC) authentication. Employing this service, we conceive and implement a NIZKP of the three-colouring problem. By bridging together three prominent research themes, quantum non-locality, PQC and ZKP, we anticipate this work to open a new paradigm of quantum information science. △ Less

Submitted 12 November, 2021; originally announced November 2021.

Comments: 20 pages, 9 figures, 6 tables

Journal ref: PNAS 120, e2205463120 (2023)

arXiv:2111.06545 [pdf, ps, other]

doi 10.1126/science.abg5137

Peta-electron volt gamma-ray emission from the Crab Nebula

Authors: The LHAASO Collaboration, Zhen Cao, F. Aharonian, Q. An, Axikegu, L. X. Bai, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, H. Cai, J. T. Cai, Zhe Cao, J. Chang, J. F. Chang, B. M. Chen, E. S. Chen, J. Chen, Liang Chen, Liang Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen , et al. (250 additional authors not shown)

Abstract: The Crab pulsar and the surrounding nebula powered by the pulsar's rotational energy through the formation and termination of a relativistic electron-positron wind is a bright source of gamma-rays carrying crucial information about this complex conglomerate. We report the detection of $γ$-rays with a spectrum showing gradual steepening over three energy decades, from $5\times 10^{-4}$ to $1.1$ pet… ▽ More The Crab pulsar and the surrounding nebula powered by the pulsar's rotational energy through the formation and termination of a relativistic electron-positron wind is a bright source of gamma-rays carrying crucial information about this complex conglomerate. We report the detection of $γ$-rays with a spectrum showing gradual steepening over three energy decades, from $5\times 10^{-4}$ to $1.1$ petaelectronvolt (PeV). The ultra-high-energy photons exhibit the presence of a PeV electron accelerator (a pevatron) with an acceleration rate exceeding 15% of the absolute theoretical limit. Assuming that unpulsed $γ$-rays are produced at the termination of the pulsar's wind, we constrain the pevatron's size, between $0.025$ and $0.1$ pc, and the magnetic field $\approx 110 μ$G. The production rate of PeV electrons, $2.5 \times 10^{36}$ erg $\rm s^{-1}$, constitutes 0.5% of the pulsar's spin-down luminosity, although we do not exclude a non-negligible contribution of PeV protons to the production of the highest energy $γ$-rays. △ Less

Submitted 11 November, 2021; originally announced November 2021.

Comments: 43 pages, 13 figures, 2 tables; Published in Science

Journal ref: Science, 2021, Vol 373, Issue 6553, pp. 425-430

arXiv:2110.12632 [pdf, other]

Effects of shallow carbon and deep N++ layer on the radiation hardness of IHEP-IME LGAD sensors

Authors: Mengzhao Li, Yunyun Fan, Xuewei Jia, Han Cui, Zhijun Liang, Mei Zhao, Tao Yang, Kewei Wu, Shuqi Li, Chengjun Yu, Bo Liu, Wei Wang, Xuan Yang, Yuhang Tan, Xin Shi, J. G. da Costa, Yuekun Heng, Gaobo Xu, Qionghua Zhai, Gang** Yan, Mingzheng Ding, Jun Luo, Huaxiang Yin, Junfeng Li, Alissa Howard , et al. (1 additional authors not shown)

Abstract: Low Gain Avalanche Diode (LGAD) is applied for the High-Granularity Timing Detector (HGTD), and it will be used to upgrade the ATLAS experiment. The first batch IHEP-IME LGAD sensors were designed by the Institute of High Energy Physics (IHEP) and fabricated by the Institute of Microelectronics (IME). Three IHEP-IME sensors (W1, W7 and W8) were irradiated by the neutrons up to the fluence of 2.5 x… ▽ More Low Gain Avalanche Diode (LGAD) is applied for the High-Granularity Timing Detector (HGTD), and it will be used to upgrade the ATLAS experiment. The first batch IHEP-IME LGAD sensors were designed by the Institute of High Energy Physics (IHEP) and fabricated by the Institute of Microelectronics (IME). Three IHEP-IME sensors (W1, W7 and W8) were irradiated by the neutrons up to the fluence of 2.5 x 10^15 n_eq/cm^2 to study the effect of the shallow carbon and deep N++ layer on the irradiation hardness. Taking W7 as a reference, W1 has an extra shallow carbon applied, and W8 has a deeper N++ layer. △ Less

Submitted 25 October, 2021; originally announced October 2021.

Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

arXiv:2110.10474 [pdf, other]

R4: A Framework for Route Representation and Route Recommendation

Authors: Ran Cheng, Chao Chen, Longfei Xu, Shen Li, Lei Wang, Hengbin Cui, Kaikui Liu, Xiaolong Li

Abstract: Route recommendation is significant in navigation service. Two major challenges for route recommendation are route representation and user representation. Different from items that can be identified by unique IDs in traditional recommendation, routes are combinations of links (i.e., a road segment and its following action like turning left) and the number of combinations could be close to infinite… ▽ More Route recommendation is significant in navigation service. Two major challenges for route recommendation are route representation and user representation. Different from items that can be identified by unique IDs in traditional recommendation, routes are combinations of links (i.e., a road segment and its following action like turning left) and the number of combinations could be close to infinite. Besides, the representation of a route changes under different scenarios. These facts result in severe sparsity of routes, which increases the difficulty of route representation. Moreover, link attribute deficiencies and errors affect preciseness of route representation. Because of the sparsity of routes, the interaction data between users and routes are also sparse. This makes it not easy to acquire user representation from historical user-item interactions as traditional recommendations do. To address these issues, we propose a novel learning framework R4. In R4, we design a sparse & dense network to obtain representations of routes. The sparse unit learns link ID embeddings and aggregates them to represent a route, which captures implicit route characteristics and subsequently alleviates problems caused by link attribute deficiencies and errors. The dense unit extracts implicit local features of routes from link attributes. For user representation, we utilize a series of historical navigation to extract user preference. R4 achieves remarkable performance in both offline and online experiments. △ Less

Submitted 24 October, 2021; v1 submitted 20 October, 2021; originally announced October 2021.

arXiv:2110.09260 [pdf, other]

doi 10.1109/TMI.2020.3045775

A Unified Framework for Generalized Low-Shot Medical Image Segmentation with Scarce Data

Authors: Hengji Cui, Dong Wei, Kai Ma, Shi Gu, Yefeng Zheng

Abstract: Medical image segmentation has achieved remarkable advancements using deep neural networks (DNNs). However, DNNs often need big amounts of data and annotations for training, both of which can be difficult and costly to obtain. In this work, we propose a unified framework for generalized low-shot (one- and few-shot) medical image segmentation based on distance metric learning (DML). Unlike most exi… ▽ More Medical image segmentation has achieved remarkable advancements using deep neural networks (DNNs). However, DNNs often need big amounts of data and annotations for training, both of which can be difficult and costly to obtain. In this work, we propose a unified framework for generalized low-shot (one- and few-shot) medical image segmentation based on distance metric learning (DML). Unlike most existing methods which only deal with the lack of annotations while assuming abundance of data, our framework works with extreme scarcity of both, which is ideal for rare diseases. Via DML, the framework learns a multimodal mixture representation for each category, and performs dense predictions based on cosine distances between the pixels' deep embeddings and the category representations. The multimodal representations effectively utilize the inter-subject similarities and intraclass variations to overcome overfitting due to extremely limited data. In addition, we propose adaptive mixing coefficients for the multimodal mixture distributions to adaptively emphasize the modes better suited to the current input. The representations are implicitly embedded as weights of the fc layer, such that the cosine distances can be computed efficiently via forward propagation. In our experiments on brain MRI and abdominal CT datasets, the proposed framework achieves superior performances for low-shot segmentation towards standard DNN-based (3D U-Net) and classical registration-based (ANTs) methods, e.g., achieving mean Dice coefficients of 81%/69% for brain tissue/abdominal multiorgan segmentation using a single training sample, as compared to 52%/31% and 72%/35% by the U-Net and ANTs, respectively. △ Less

Submitted 18 October, 2021; originally announced October 2021.

Comments: Published in IEEE TRANSACTIONS ON MEDICAL IMAGING

arXiv:2110.03765 [pdf, other]

Spectroscopy Approaches for Food Safety Applications: Improving Data Efficiency Using Active Learning and Semi-Supervised Learning

Authors: Huanle Zhang, Nicharee Wisuthiphaet, Hemiao Cui, Nitin Nitin, Xin Liu, Qing Zhao

Abstract: The past decade witnesses a rapid development in the measurement and monitoring technologies for food science. Among these technologies, spectroscopy has been widely used for the analysis of food quality, safety, and nutritional properties. Due to the complexity of food systems and the lack of comprehensive predictive models, rapid and simple measurements to predict complex properties in food syst… ▽ More The past decade witnesses a rapid development in the measurement and monitoring technologies for food science. Among these technologies, spectroscopy has been widely used for the analysis of food quality, safety, and nutritional properties. Due to the complexity of food systems and the lack of comprehensive predictive models, rapid and simple measurements to predict complex properties in food systems are largely missing. Machine Learning (ML) has shown great potential to improve classification and prediction of these properties. However, the barriers to collect large datasets for ML applications still persists. In this paper, we explore different approaches of data annotation and model training to improve data efficiency for ML applications. Specifically, we leverage Active Learning (AL) and Semi-Supervised Learning (SSL) and investigate four approaches: baseline passive learning, AL, SSL, and a hybrid of AL and SSL. To evaluate these approaches, we collect two spectroscopy datasets: predicting plasma dosage and detecting foodborne pathogen. Our experimental results show that, compared to the de facto passive learning approach, AL and SSL methods reduce the number of labeled samples by 50% and 25% for each ML application, respectively. △ Less

Submitted 4 February, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

arXiv:2109.02982 [pdf]

doi 10.1103/PhysRevB.104.054409

Intermediate anomalous Hall states induced by noncollinear spin structure in magnetic topological insulator MnBi2Te4

Authors: **g-Zhi Fang, Shuo Wang, Xing-Guo Ye, Ben-Chuan Lin, An-Qi Wang, Hao-Nan Cui, Jian-Kun Wang, Guang-Yu Zhu, Song Liu, Yongkai Li, Zhiwei Wang, Yugui Yao, Zhongming Wei, Dapeng Yu, Zhi-Min Liao

Abstract: The combination of topology and magnetism is attractive to produce exotic quantum matters, such as the quantum anomalous Hall state, axion insulators and the magnetic Weyl semimetals. MnBi2Te4, as an intrinsic magnetic topological insulator, provides a platform for the realization of various topological phases. Here we report the intermediate Hall steps in the magnetic hysteresis of MnBi2Te4, wher… ▽ More The combination of topology and magnetism is attractive to produce exotic quantum matters, such as the quantum anomalous Hall state, axion insulators and the magnetic Weyl semimetals. MnBi2Te4, as an intrinsic magnetic topological insulator, provides a platform for the realization of various topological phases. Here we report the intermediate Hall steps in the magnetic hysteresis of MnBi2Te4, where four distinguishable magnetic memory states at zero magnetic field are revealed. The gate and temperature dependence of the magnetic intermediate states indicates the noncollinear spin structure in MnBi2Te4, which can be attributed to the Dzyaloshinskii-Moriya interaction as the coexistence of strong spin-orbit coupling and local inversion symmetry breaking on the surface. Moreover, these multiple magnetic memory states can be programmatically switched among each other through applying designed pulses of magnetic field. Our results provide new insights of the influence of bulk topology on the magnetic states, and the multiple memory states should be promising for spintronic devices. △ Less

Submitted 7 September, 2021; originally announced September 2021.

Journal ref: Physical Review B 104, 054409 (2021)

arXiv:2108.13886 [pdf, other]

Structure-Aware Hard Negative Mining for Heterogeneous Graph Contrastive Learning

Authors: Yanqiao Zhu, Yichen Xu, Hejie Cui, Carl Yang, Qiang Liu, Shu Wu

Abstract: Recently, heterogeneous Graph Neural Networks (GNNs) have become a de facto model for analyzing HGs, while most of them rely on a relative large number of labeled data. In this work, we investigate Contrastive Learning (CL), a key component in self-supervised approaches, on HGs to alleviate the label scarcity problem. We first generate multiple semantic views according to metapaths and network sch… ▽ More Recently, heterogeneous Graph Neural Networks (GNNs) have become a de facto model for analyzing HGs, while most of them rely on a relative large number of labeled data. In this work, we investigate Contrastive Learning (CL), a key component in self-supervised approaches, on HGs to alleviate the label scarcity problem. We first generate multiple semantic views according to metapaths and network schemas. Then, by pushing node embeddings corresponding to different semantic views close to each other (positives) and pulling other embeddings apart (negatives), one can obtain informative representations without human annotations. However, this CL approach ignores the relative hardness of negative samples, which may lead to suboptimal performance. Considering the complex graph structure and the smoothing nature of GNNs, we propose a structure-aware hard negative mining scheme that measures hardness by structural characteristics for HGs. By synthesizing more negative nodes, we give larger weights to harder negatives with limited computational overhead to further boost the performance. Empirical studies on three real-world datasets show the effectiveness of our proposed method. The proposed method consistently outperforms existing state-of-the-art methods and notably, even surpasses several supervised counterparts. △ Less

Submitted 31 August, 2021; originally announced August 2021.

Comments: KDD Workshop on Deep Learning on Graphs: Method and Applications (DLG@KDD 2021)

arXiv:2108.05096 [pdf]

doi 10.1364/OL.440660

Omnidirectional ghost imaging system and unwrap**-free panoramic ghost imaging

Authors: Huan Cui, Jie Cao, Qun Hao, Dong Zhou, Mingyuan Tang, Kaiyu Zhang, Yingqiang Zhang

Abstract: Ghost imaging (GI) is a novel imaging method, which can reconstruct the object information by the light intensity correlation measurements. However, at present, the field of view (FOV) is limited to the illuminating range of the light patterns. To enlarge FOV of GI efficiently, here we proposed the omnidirectional ghost imaging system (OGIS), which can achieve a 360° omnidirectional FOV at one sho… ▽ More Ghost imaging (GI) is a novel imaging method, which can reconstruct the object information by the light intensity correlation measurements. However, at present, the field of view (FOV) is limited to the illuminating range of the light patterns. To enlarge FOV of GI efficiently, here we proposed the omnidirectional ghost imaging system (OGIS), which can achieve a 360° omnidirectional FOV at one shot only by adding a curved mirror. Moreover, by designing the retina-like annular patterns with log-polar patterns, OGIS can obtain unwrap**-free undistorted panoramic images with uniform resolution, which opens up a new way for the application of GI. △ Less

Submitted 11 August, 2021; originally announced August 2021.

arXiv:2108.03914 [pdf, other]

Two-pronged Strategy: Lightweight Augmented Graph Network Hashing for Scalable Image Retrieval

Authors: Hui Cui, Lei Zhu, **g**g Li, Zhiyong Cheng, Zheng Zhang

Abstract: Hashing learns compact binary codes to store and retrieve massive data efficiently. Particularly, unsupervised deep hashing is supported by powerful deep neural networks and has the desirable advantage of label independence. It is a promising technique for scalable image retrieval. However, deep models introduce a large number of parameters, which is hard to optimize due to the lack of explicit se… ▽ More Hashing learns compact binary codes to store and retrieve massive data efficiently. Particularly, unsupervised deep hashing is supported by powerful deep neural networks and has the desirable advantage of label independence. It is a promising technique for scalable image retrieval. However, deep models introduce a large number of parameters, which is hard to optimize due to the lack of explicit semantic labels and brings considerable training cost. As a result, the retrieval accuracy and training efficiency of existing unsupervised deep hashing are still limited. To tackle the problems, in this paper, we propose a simple and efficient \emph{Lightweight Augmented Graph Network Hashing} (LAGNH) method with a two-pronged strategy. For one thing, we extract the inner structure of the image as the auxiliary semantics to enhance the semantic supervision of the unsupervised hash learning process. For another, we design a lightweight network structure with the assistance of the auxiliary semantics, which greatly reduces the number of network parameters that needs to be optimized and thus greatly accelerates the training process. Specifically, we design a cross-modal attention module based on the auxiliary semantic information to adaptively mitigate the adverse effects in the deep image features. Besides, the hash codes are learned by multi-layer message passing within an adversarial regularized graph convolutional network. Simultaneously, the semantic representation capability of hash codes is further enhanced by reconstructing the similarity graph. △ Less

Submitted 9 August, 2021; originally announced August 2021.

arXiv:2108.01667 [pdf]

doi 10.1364/OE.439704

Optimization of retina-like illumination patterns in ghost imaging

Authors: Jie Cao, Dong Zhou, Ying-Qiang Zhang, Huan Cui, Fang-Hua Zhang, Qun Hao

Abstract: Ghost imaging (GI) reconstructs images using a single-pixel or bucket detector, which has the advantages of scattering robustness, wide spectrum and beyond-visual-field imaging. However, this technique needs large amount of measurements to obtain a sharp image. There have been a lot of methods proposed to overcome this disadvantage. Retina-like patterns, as one of the compressive sensing approache… ▽ More Ghost imaging (GI) reconstructs images using a single-pixel or bucket detector, which has the advantages of scattering robustness, wide spectrum and beyond-visual-field imaging. However, this technique needs large amount of measurements to obtain a sharp image. There have been a lot of methods proposed to overcome this disadvantage. Retina-like patterns, as one of the compressive sensing approaches, enhance the imaging quality of region of interest (ROI) while not increase measurements. The design of the retina-like patterns determines the performance of the ROI in the reconstructed image. Unlike the conventional method to fill in ROI with random patterns, we propose to optimize retina-like patterns by filling in the ROI with the patterns containing the sparsity prior of objects. This proposed method is verified by simulations and experiments compared with conventional GI, retina-like GI and GI using patterns optimized by principal component analysis. The method using optimized retina-like patterns obtain the best imaging quality in ROI than other methods. Meanwhile, the good generalization ability of the optimized retina-like pattern is also verified. While designing the size and position of the ROI of retina-like pattern, the feature information of the target can be obtained to optimize the pattern of ROI. This proposed method paves the way for realizing high-quality GI. △ Less

Submitted 2 August, 2021; originally announced August 2021.

arXiv:2108.01666 [pdf]

Complementary Fourier single-pixel imaging

Authors: Dong Zhou, Jie Cao, Huan Cui, Qun Hao, Bing-Kun Chen, Kai Lin

Abstract: Single-pixel imaging, with the advantages of a wide spectrum, beyond-visual-field imaging, and robustness to light scattering, has attracted increasing attention in recent years. Fourier single-pixel imaging (FSI) can reconstruct sharp images under sub-Nyquist sampling. However, the conventional FSI has difficulty with balancing the imaging quality and efficiency. To overcome this issue, we propos… ▽ More Single-pixel imaging, with the advantages of a wide spectrum, beyond-visual-field imaging, and robustness to light scattering, has attracted increasing attention in recent years. Fourier single-pixel imaging (FSI) can reconstruct sharp images under sub-Nyquist sampling. However, the conventional FSI has difficulty with balancing the imaging quality and efficiency. To overcome this issue, we proposed a novel approach called complementary Fourier single-pixel imaging (CFSI) to reduce measurements while retaining its robustness. The complementary nature of Fourier patterns based on a four-step phase-shift algorithm is combined with the complementary nature of a digital micromirror device. CFSI only requires two phase-shifted patterns to obtain one Fourier spectral value. Four light intensity values are obtained by load the two patterns, and the spectral value is calculated through differential measurement, which has good robustness to noise. The proposed method is verified by simulations and experiments compared with FSI based on two-, three-, and four-step phase shift algorithms. CFSI performed better than the other methods under the condition that the best imaging quality of CFSI is not reached. The reported technique provides an alternative approach to realize real-time and high-quality imaging. △ Less

Submitted 2 August, 2021; originally announced August 2021.

arXiv:2108.00847 [pdf, other]

doi 10.1103/PhysRevE.105.034108

Large Deviations of Semi-supervised Learning in the Stochastic Block Model

Authors: Hugo Cui, Luca Saglietti, Lenka Zdeborová

Abstract: In community detection on graphs, the semi-supervised learning problem entails inferring the ground-truth membership of each node in a graph, given the connectivity structure and a limited number of revealed node labels. Different subsets of revealed labels can in principle lead to higher or lower information gains and induce different reconstruction accuracies. In the framework of the dense stoch… ▽ More In community detection on graphs, the semi-supervised learning problem entails inferring the ground-truth membership of each node in a graph, given the connectivity structure and a limited number of revealed node labels. Different subsets of revealed labels can in principle lead to higher or lower information gains and induce different reconstruction accuracies. In the framework of the dense stochastic block model, we employ statistical physics methods to derive a large deviation analysis for this problem, in the high-dimensional limit. This analysis allows the characterization of the fluctuations around the typical behaviour, capturing the effect of correlated label choices and yielding an estimate of their informativeness and their rareness among subsets of the same size. We find theoretical evidence of a non-monotonic relationship between reconstruction accuracy and the free energy associated to the posterior measure of the inference problem. We further discuss possible implications for active learning applications in community detection. △ Less

Submitted 2 August, 2021; originally announced August 2021.

Journal ref: Phys. Rev. E 105, 034108 (2022)

arXiv:2108.00265 [pdf, other]

doi 10.1016/j.physleta.2022.128314

Localization-enhanced dissipation in a generalized Aubry-André-Harper model coupled with Ohmic baths

Authors: H. T. Cui, M. Qin, L. Tang, H. Z. Shen, X. X. Yi

Abstract: In this work, the exact dynamics of excitation in the generalized Aubry-André-Harper model coupled with an Ohmic-type environment is discussed by evaluating the survival probability and inverse participation ratio of the state of system. In contrast to the common belief that localization will preserve the information of the initial state in the system against dissipation into the environment, our… ▽ More In this work, the exact dynamics of excitation in the generalized Aubry-André-Harper model coupled with an Ohmic-type environment is discussed by evaluating the survival probability and inverse participation ratio of the state of system. In contrast to the common belief that localization will preserve the information of the initial state in the system against dissipation into the environment, our study found that strong localization can enhance the dissipation of quantum information instead. By a thorough examination of the dynamics, we show that the coherent transition between the energy state of system is crucial for understanding this unusual behavior. Under this circumstance, the coupling induced energy exchange between the system and its environment can induce the periodic population of excitation on the states of system. As a result, the stable or localization-enhanced decaying of excitation can be observed, dependent on the energy difference between the states of system. This point is verified in further by checking the varying of dynamics of excitation in the system when the coupling between the system and environment is more strong. △ Less

Submitted 27 July, 2022; v1 submitted 31 July, 2021; originally announced August 2021.

Comments: 9 pages, 5 figures

Journal ref: Physics Letters A 448(2022)128314

arXiv:2107.12882 [pdf, ps, other]

doi 10.1088/1361-648X/ac216f

Photovoltaic transistor of atoms due to spin-orbit coupling in three optical traps

Authors: Haihu Cui, Mingzhu Zhang, Wenxi Lai

Abstract: In this paper, spin-orbit coupling induced photovoltaic effect of cold atoms has been studied in a three-trap system which is an two-dimensional extension of a two-trap system reported previously. It is proposed here that atom coherent length is one of the important influence to the resistance of this photovoltaic battery. Current properties of the system for different geometrical structures of th… ▽ More In this paper, spin-orbit coupling induced photovoltaic effect of cold atoms has been studied in a three-trap system which is an two-dimensional extension of a two-trap system reported previously. It is proposed here that atom coherent length is one of the important influence to the resistance of this photovoltaic battery. Current properties of the system for different geometrical structures of the trap** potentials are discussed. Numerical results show extension in the number of traps could cause current increase directly. Quantum master equation at finite temperature is used to treat this opened system. This work may give a theoretical basis for further development of the photovoltaic effect of neutral atoms. △ Less

Submitted 27 July, 2021; originally announced July 2021.

Comments: 6 pages, 5 figures

arXiv:2107.11247 [pdf, other]

Effective and Interpretable fMRI Analysis via Functional Brain Network Generation

Authors: Xuan Kan, Hejie Cui, Ying Guo, Carl Yang

Abstract: Recent studies in neuroscience show great potential of functional brain networks constructed from fMRI data for popularity modeling and clinical predictions. However, existing functional brain networks are noisy and unaware of downstream prediction tasks, while also incompatible with recent powerful machine learning models of GNNs. In this work, we develop an end-to-end trainable pipeline to extra… ▽ More Recent studies in neuroscience show great potential of functional brain networks constructed from fMRI data for popularity modeling and clinical predictions. However, existing functional brain networks are noisy and unaware of downstream prediction tasks, while also incompatible with recent powerful machine learning models of GNNs. In this work, we develop an end-to-end trainable pipeline to extract prominent fMRI features, generate brain networks, and make predictions with GNNs, all under the guidance of downstream prediction tasks. Preliminary experiments on the PNC fMRI data show the superior effectiveness and unique interpretability of our framework. △ Less

Submitted 23 July, 2021; originally announced July 2021.

Comments: This paper has been accepted for ICML 2021 Workshop for Interpretable Machine Learning in Healthcare

MSC Class: 68T07; 68T45; 68T20 ACM Class: I.2.6; I.2.10; J.3

arXiv:2107.10481 [pdf, ps, other]

doi 10.1103/PhysRevB.104.024509

Evolution of transport properties in FeSe thin flakes with thickness approaching the two-dimensional limit

Authors: C. S. Zhu, B. Lei, Z. L. Sun, J. H. Cui, M. Z. Shi, W. Z. Zhuo, X. G. Luo, X. H. Chen

Abstract: Electronic properties of FeSe can be tuned by various routes. Here, we present a comprehensive study on the evolution of the superconductivity and nematicity in FeSe with thickness from bulk single crystal down to bilayer ($\sim$ 1.1 nm) through exfoliation. With decreasing flake thickness, both the structural transition temperature $T_{\rm s}$ and the superconducting transition temperature… ▽ More Electronic properties of FeSe can be tuned by various routes. Here, we present a comprehensive study on the evolution of the superconductivity and nematicity in FeSe with thickness from bulk single crystal down to bilayer ($\sim$ 1.1 nm) through exfoliation. With decreasing flake thickness, both the structural transition temperature $T_{\rm s}$ and the superconducting transition temperature $T_{\rm c}^{\rm zero}$ are greatly suppressed. The magnetic field ($B$) dependence of Hall resistance $R_{xy}$ at 15 K changes from $B$-nonlinear to $B$-linear behavior up to 9 T, as the thickness ($d$) is reduced to 13 nm. $T_{\rm c}$ is linearly dependent on the inverse of flake thickness (1/$d$) when $d\le$ 13 nm, and a clear drop of $T_{\rm c}$ appears with thickness smaller than 27 nm. The $I$-$V$ characteristic curves in ultrathin flakes reveal the signature of Berezinskii-Kosterlitz-Thouless (BKT) transition, indicating the presence of two-dimensional superconductivity. Anisotropic magnetoresistance measurements further support 2D superconductivity in few-layer FeSe. Increase of disorder scattering, anisotropic strains and dimensionality effect with reducing the thickness of FeSe flakes, might be taken into account for understanding these behaviors. Our study provides systematic insights into the evolution of the superconducting properties, structural transition and Hall resistance of a superconductor FeSe with flakes thickness and provides an effective way to find two-dimensional superconductivity as well as other 2D novel phenomena. △ Less

Submitted 22 July, 2021; originally announced July 2021.

Comments: 8 pages, 6 figures

arXiv:2107.05097 [pdf, other]

BrainNNExplainer: An Interpretable Graph Neural Network Framework for Brain Network based Disease Analysis

Authors: Hejie Cui, Wei Dai, Yanqiao Zhu, Xiaoxiao Li, Lifang He, Carl Yang

Abstract: Interpretable brain network models for disease prediction are of great value for the advancement of neuroscience. GNNs are promising to model complicated network data, but they are prone to overfitting and suffer from poor interpretability, which prevents their usage in decision-critical scenarios like healthcare. To bridge this gap, we propose BrainNNExplainer, an interpretable GNN framework for… ▽ More Interpretable brain network models for disease prediction are of great value for the advancement of neuroscience. GNNs are promising to model complicated network data, but they are prone to overfitting and suffer from poor interpretability, which prevents their usage in decision-critical scenarios like healthcare. To bridge this gap, we propose BrainNNExplainer, an interpretable GNN framework for brain network analysis. It is mainly composed of two jointly learned modules: a backbone prediction model that is specifically designed for brain networks and an explanation generator that highlights disease-specific prominent brain network connections. Extensive experimental results with visualizations on two challenging disease prediction datasets demonstrate the unique interpretability and outstanding performance of BrainNNExplainer. △ Less

Submitted 11 July, 2021; originally announced July 2021.

Comments: This paper has been accepted to ICML 2021 Workshop on Interpretable Machine Learning in Healthcare

MSC Class: 68T07; 68T45; 68T20 ACM Class: I.2.6; I.2.10; J.3

arXiv:2107.05080 [pdf, other]

Zero-Shot Scene Graph Relation Prediction through Commonsense Knowledge Integration

Authors: Xuan Kan, Hejie Cui, Carl Yang

Abstract: Relation prediction among entities in images is an important step in scene graph generation (SGG), which further impacts various visual understanding and reasoning tasks. Existing SGG frameworks, however, require heavy training yet are incapable of modeling unseen (i.e.,zero-shot) triplets. In this work, we stress that such incapability is due to the lack of commonsense reasoning,i.e., the ability… ▽ More Relation prediction among entities in images is an important step in scene graph generation (SGG), which further impacts various visual understanding and reasoning tasks. Existing SGG frameworks, however, require heavy training yet are incapable of modeling unseen (i.e.,zero-shot) triplets. In this work, we stress that such incapability is due to the lack of commonsense reasoning,i.e., the ability to associate similar entities and infer similar relations based on general understanding of the world. To fill this gap, we propose CommOnsense-integrAted sCenegrapHrElation pRediction (COACHER), a framework to integrate commonsense knowledge for SGG, especially for zero-shot relation prediction. Specifically, we develop novel graph mining pipelines to model the neighborhoods and paths around entities in an external commonsense knowledge graph, and integrate them on top of state-of-the-art SGG frameworks. Extensive quantitative evaluations and qualitative case studies on both original and manipulated datasets from Visual Genome demonstrate the effectiveness of our proposed approach. △ Less

Submitted 11 July, 2021; originally announced July 2021.

Comments: This paper has been accepted for presentation in the Research Track of ECML-PKDD 2021

ACM Class: I.4.8; I.2.4; I.2.6

arXiv:2107.03871 [pdf]

Spatial beam self-cleaning in bi-tapered multimode fibers

Authors: Xiao-Jun Lin, Yu-Xin Gao, **-Gan Long, Jia-Wen Wu, Xiang-Yue Li, Wei-Yi Hong, Hu Cui, Zhi-Chao Luo, Wen-Cheng Xu, Ai-** Luo

Abstract: We report the spatial beam self-cleaning in bi-tapered conventional multimode fibers (MMFs) with different tapered lengths. Through the introduction of the bi-tapered structure in MMFs, the input beam with poor beam quality from a high-power fiber laser can be converted to a centered, bell-shaped beam in a short length, due to the strengthened nonlinear modes coupling. It is found that the bi-tape… ▽ More We report the spatial beam self-cleaning in bi-tapered conventional multimode fibers (MMFs) with different tapered lengths. Through the introduction of the bi-tapered structure in MMFs, the input beam with poor beam quality from a high-power fiber laser can be converted to a centered, bell-shaped beam in a short length, due to the strengthened nonlinear modes coupling. It is found that the bi-tapered MMF with longer tapered length at the same waist diameter shows better beam self-cleaning effect and larger spectral broadening. The obtained results offer a new method to improve the beam quality of high-power laser at low cost. Besides, it may be interesting for manufacturing bi-tapered MMF-based devices to obtain the quasi-fundamental mode beam in spatiotemporal mode-locked fiber lasers. △ Less

Submitted 8 July, 2021; originally announced July 2021.

arXiv:2107.03563 [pdf, other]

doi 10.1088/1748-0221/16/08/P08053

The performance of IHEP-NDL LGAD sensors after neutron irradiation

Authors: Mengzhao Li, Yunyun Fan, Bo Liu, Han Cui, Xuewei Jia, Shuqi Li, Chengjun Yu, Xuan Yang, Wei Wang, Mingjie Zhai, Tao Yang, Kewei Wu, Yuhang Tan, Suyu Xiao, Mei Zhao, Xin Shi, Zhijun Liang, Yuekun Heng, Joao Guimaraes da Costa, Xingan Zhang, Dejun Han, Alissa Howard, Gregor Kramberger

Abstract: The performances of Low Gain Avalanche diode (LGAD) sensors from a neutron irradiation campaign with fluences of 0.8 x 10^15, 15 x 10^15 and 2.5 x 10^15 neq/cm2 are reported in this article. These LGAD sensors are developed by the Institute of High Energy Physics, Chinese Academy of Sciences and the Novel Device Laboratory for the High Granularity Timing Detector of the High Luminosity Large Hadro… ▽ More The performances of Low Gain Avalanche diode (LGAD) sensors from a neutron irradiation campaign with fluences of 0.8 x 10^15, 15 x 10^15 and 2.5 x 10^15 neq/cm2 are reported in this article. These LGAD sensors are developed by the Institute of High Energy Physics, Chinese Academy of Sciences and the Novel Device Laboratory for the High Granularity Timing Detector of the High Luminosity Large Hadron Collider. The timing resolution and collected charge of the LGAD sensors were measured with electrons from a beta source. After irradiation with a fluence of 2.5 x 10^15 neq/cm2, the collected charge decreases from 40 fC to 7 fC, the signal-to-noise ratio deteriorates from 48 to 12, and the timing resolution increases from 29 ps to 39 ps. △ Less

Submitted 7 July, 2021; originally announced July 2021.

arXiv:2107.03340 [pdf, other]

Pseudo-Model-Free Hedging for Variable Annuities via Deep Reinforcement Learning

Authors: Wing Fung Chong, Haoen Cui, Yuxuan Li

Abstract: This paper proposes a two-phase deep reinforcement learning approach, for hedging variable annuity contracts with both GMMB and GMDB riders, which can address model miscalibration in Black-Scholes financial and constant force of mortality actuarial market environments. In the training phase, an infant reinforcement learning agent interacts with a pre-designed training environment, collects sequent… ▽ More This paper proposes a two-phase deep reinforcement learning approach, for hedging variable annuity contracts with both GMMB and GMDB riders, which can address model miscalibration in Black-Scholes financial and constant force of mortality actuarial market environments. In the training phase, an infant reinforcement learning agent interacts with a pre-designed training environment, collects sequential anchor-hedging reward signals, and gradually learns how to hedge the contracts. As expected, after a sufficient number of training steps, the trained reinforcement learning agent hedges, in the training environment, equally well as the correct Delta while outperforms misspecified Deltas. In the online learning phase, the trained reinforcement learning agent interacts with the market environment in real time, collects single terminal reward signals, and self-revises its hedging strategy. The hedging performance of the further trained reinforcement learning agent is demonstrated via an illustrative example on a rolling basis to reveal the self-revision capability on the hedging strategy by online learning. △ Less

Submitted 1 October, 2022; v1 submitted 7 July, 2021; originally announced July 2021.

arXiv:2107.03220 [pdf, other]

Joint Embedding of Structural and Functional Brain Networks with Graph Neural Networks for Mental Illness Diagnosis

Authors: Yanqiao Zhu, Hejie Cui, Lifang He, Lichao Sun, Carl Yang

Abstract: Multimodal brain networks characterize complex connectivities among different brain regions from both structural and functional aspects and provide a new means for mental disease analysis. Recently, Graph Neural Networks (GNNs) have become a de facto model for analyzing graph-structured data. However, how to employ GNNs to extract effective representations from brain networks in multiple modalitie… ▽ More Multimodal brain networks characterize complex connectivities among different brain regions from both structural and functional aspects and provide a new means for mental disease analysis. Recently, Graph Neural Networks (GNNs) have become a de facto model for analyzing graph-structured data. However, how to employ GNNs to extract effective representations from brain networks in multiple modalities remains rarely explored. Moreover, as brain networks provide no initial node features, how to design informative node attributes and leverage edge weights for GNNs to learn is left unsolved. To this end, we develop a novel multiview GNN for multimodal brain networks. In particular, we regard each modality as a view for brain networks and employ contrastive learning for multimodal fusion. Then, we propose a GNN model which takes advantage of the message passing scheme by propagating messages based on degree statistics and brain region connectivities. Extensive experiments on two real-world disease datasets (HIV and Bipolar) demonstrate the effectiveness of our proposed method over state-of-the-art baselines. △ Less

Submitted 24 May, 2022; v1 submitted 7 July, 2021; originally announced July 2021.

Comments: Formal version accepted to IEEE EMBC 2022; previously presented at ICML 2021 Workshop on Computational Approaches to Mental Health (no proceedings)

arXiv:2107.01502 [pdf, other]

doi 10.1007/978-3-030-32226-7_33

Pulmonary Vessel Segmentation based on Orthogonal Fused U-Net++ of Chest CT Images

Authors: Hejie Cui, Xinglong Liu, Ning Huang

Abstract: Pulmonary vessel segmentation is important for clinical diagnosis of pulmonary diseases, while is also challenging due to the complicated structure. In this work, we present an effective framework and refinement process of pulmonary vessel segmentation from chest computed tomographic (CT) images. The key to our approach is a 2.5D segmentation network applied from three orthogonal axes, which prese… ▽ More Pulmonary vessel segmentation is important for clinical diagnosis of pulmonary diseases, while is also challenging due to the complicated structure. In this work, we present an effective framework and refinement process of pulmonary vessel segmentation from chest computed tomographic (CT) images. The key to our approach is a 2.5D segmentation network applied from three orthogonal axes, which presents a robust and fully automated pulmonary vessel segmentation result with lower network complexity and memory usage compared to 3D networks. The slice radius is introduced to convolve the adjacent information of the center slice and the multi-planar fusion optimizes the presentation of intra- and inter- slice features. Besides, the tree-like structure of the pulmonary vessel is extracted in the post-processing process, which is used for segmentation refining and pruning. In the evaluation experiments, three fusion methods are tested and the most promising one is compared with the state-of-the-art 2D and 3D structures on 300 cases of lung images randomly selected from LIDC dataset. Our method outperforms other network structures by a large margin and achieves by far the highest average DICE score of 0.9272 and precision of 0.9310, as per our knowledge from the pulmonary vessel segmentation models available in the literature. △ Less

Submitted 3 July, 2021; originally announced July 2021.

Comments: Published in Medical Image Computing and Computer Assisted Intervention (MICCAI 2019)

MSC Class: 68T45; 68T07 ACM Class: I.2.10; J.3

arXiv:2107.01495 [pdf, other]

On Positional and Structural Node Features for Graph Neural Networks on Non-attributed Graphs

Authors: Hejie Cui, Zijie Lu, Pan Li, Carl Yang

Abstract: Graph neural networks (GNNs) have been widely used in various graph-related problems such as node classification and graph classification, where superior performance is mainly established when natural node features are available. However, it is not well understood how GNNs work without natural node features, especially regarding the various ways to construct artificial ones. In this paper, we poin… ▽ More Graph neural networks (GNNs) have been widely used in various graph-related problems such as node classification and graph classification, where superior performance is mainly established when natural node features are available. However, it is not well understood how GNNs work without natural node features, especially regarding the various ways to construct artificial ones. In this paper, we point out the two types of artificial node features, i.e., positional and structural node features, and provide insights on why each of them is more appropriate for certain tasks, i.e., positional node classification, structural node classification, and graph classification. Extensive experimental results on 10 benchmark datasets validate our insights, thus leading to a practical guideline on the choices between different artificial node features for GNNs on non-attributed graphs. The code is available at https://github.com/zjzijielu/gnn-positional-structural-node-features. △ Less

Submitted 8 September, 2022; v1 submitted 3 July, 2021; originally announced July 2021.

Comments: Accepted to CIKM 2022. The previous version is accepted for KDD-DLG Workshop 2021 (spotlight, no proceedings)

MSC Class: 68T01; 68T07; 68T30 ACM Class: I.2.6; I.2.4

arXiv:2106.15421 [pdf, other]

doi 10.1016/j.nima.2022.167111

Leakage current simulations of Low Gain Avalanche Diode with improved Radiation Damage Modeling

Authors: Tao Yang, Kewei Wu, Mei Zhao, Xuewei Jia, Yuhang Tan, Suyu Xiao, Kai Liu, Xiyuan Zhang, Congcong Wang, Mengzhao Li, Yunyun Fan, Shuqi Li, Chengjun Yu, Han Cui, Hao Zeng, Mingjie Zhai, Shuiting Xin, Maoqiang **g, Gang** Yan, Qionghua Zhai, Mingzheng Ding, Gaobo Xu, Huaxiang Yin, Gregor Kramberger, Zhijun Liang , et al. (2 additional authors not shown)

Abstract: We report precise TCAD simulations of IHEP-IME-v1 Low Gain Avalanche Diode (LGAD) calibrated by secondary ion mass spectroscopy (SIMS). Our setup allows us to evaluate the leakage current, capacitance, and breakdown voltage of LGAD, which agree with measurements' results before irradiation. And we propose an improved LGAD Radiation Damage Model (LRDM) which combines local acceptor removal with glo… ▽ More We report precise TCAD simulations of IHEP-IME-v1 Low Gain Avalanche Diode (LGAD) calibrated by secondary ion mass spectroscopy (SIMS). Our setup allows us to evaluate the leakage current, capacitance, and breakdown voltage of LGAD, which agree with measurements' results before irradiation. And we propose an improved LGAD Radiation Damage Model (LRDM) which combines local acceptor removal with global deep energy levels. The LRDM is applied to the IHEP-IME-v1 LGAD and able to predict the leakage current well at -30 $^{\circ}$C after an irradiation fluence of $ Φ_{eq}=2.5 \times 10^{15} ~n_{eq}/cm^{2}$. The charge collection efficiency (CCE) is under development. △ Less

Submitted 30 September, 2022; v1 submitted 29 June, 2021; originally announced June 2021.

arXiv:2106.12721 [pdf]

Hourly Warning for Strong Earthquakes

Authors: T. Chen, L. Li, X. -X. Zhang, C. Wang, X. -B. **, Q. -M. Ma, J. -Y. Xu, Z. -H. He, H. Li, S. -G. Xiao, X. -Z. Wang, X. -H. Shen, X. -M. Zhang, H. -B. Li, Z. -M. Zeren, J. -P. Huang, F. -Q. Huang, S. Che, Z. -M. Zou, P. Xiong, J. Liu, L. -Q. Zhang, Q. Guo, I. Roth, V. S. Makhmutov , et al. (32 additional authors not shown)

Abstract: A promising perspective is presented that humans can provide hourly warning for strong land earthquakes (EQs, Ms6). Two important atmospheric electrostatic signal features are described. A table that lists 9 strong land EQs with shock time, epicenter, magnitude, weather in the region near the epicenter, precursor beginning time, and precursor duration demonstrates that at approximately several hou… ▽ More A promising perspective is presented that humans can provide hourly warning for strong land earthquakes (EQs, Ms6). Two important atmospheric electrostatic signal features are described. A table that lists 9 strong land EQs with shock time, epicenter, magnitude, weather in the region near the epicenter, precursor beginning time, and precursor duration demonstrates that at approximately several hours to one day before a strong land EQ, the weather conditions are fair near the epicenter, and an abnormal negative atmospheric electrostatic signal is very obvious. Moreover, the mechanism is explained. A method by which someone could determine the epicenter and the magnitude of a forthcoming strong EQ is suggested. Finally, the possibility of realizing hourly warning for strong land EQs in the near future is pointed out. △ Less

Submitted 23 June, 2021; originally announced June 2021.

arXiv:2106.05407 [pdf, other]

OVRseen: Auditing Network Traffic and Privacy Policies in Oculus VR

Authors: Rahmadi Trimananda, Hieu Le, Hao Cui, Janice Tran Ho, Anastasia Shuba, Athina Markopoulou

Abstract: Virtual reality (VR) is an emerging technology that enables new applications but also introduces privacy risks. In this paper, we focus on Oculus VR (OVR), the leading platform in the VR space and we provide the first comprehensive analysis of personal data exposed by OVR apps and the platform itself, from a combined networking and privacy policy perspective. We experimented with the Quest 2 heads… ▽ More Virtual reality (VR) is an emerging technology that enables new applications but also introduces privacy risks. In this paper, we focus on Oculus VR (OVR), the leading platform in the VR space and we provide the first comprehensive analysis of personal data exposed by OVR apps and the platform itself, from a combined networking and privacy policy perspective. We experimented with the Quest 2 headset and tested the most popular VR apps available on the official Oculus and the SideQuest app stores. We developed OVRseen, a methodology and system for collecting, analyzing, and comparing network traffic and privacy policies on OVR. On the networking side, we captured and decrypted network traffic of VR apps, which was previously not possible on OVR, and we extracted data flows, defined as <app, data type, destination>. Compared to the mobile and other app ecosystems, we found OVR to be more centralized and driven by tracking and analytics, rather than by third-party advertising. We show that the data types exposed by VR apps include personally identifiable information (PII), device information that can be used for fingerprinting, and VR-specific data types. By comparing the data flows found in the network traffic with statements made in the apps' privacy policies, we found that approximately 70% of OVR data flows were not properly disclosed. Furthermore, we extracted additional context from the privacy policies, and we observed that 69% of the data flows were used for purposes unrelated to the core functionality of apps. △ Less

Submitted 19 November, 2021; v1 submitted 9 June, 2021; originally announced June 2021.

Comments: This is the extended version of the paper with the same title published at USENIX Security Symposium 2022

arXiv:2106.03949 [pdf]

Mechanical metamaterials: does toughness characterize fracture?

Authors: Angkur Jyoti Dipanka Shaikeea, Huachen Cui, Mark R. O'Masta, Xiaoyu, Zheng, Vikram S. Deshpande

Abstract: Rapid progress in additive manufacturing methods has created a new class of ultralight and strong architected metamaterials that resemble periodic truss structures. The mechanical performance of these metamaterials with a very large number of unit cells is ultimately limited by their tolerance to damage and defects, but an understanding of this sensitivity has remained elusive. Using a stretching-… ▽ More Rapid progress in additive manufacturing methods has created a new class of ultralight and strong architected metamaterials that resemble periodic truss structures. The mechanical performance of these metamaterials with a very large number of unit cells is ultimately limited by their tolerance to damage and defects, but an understanding of this sensitivity has remained elusive. Using a stretching-dominated micro-architecture and metamaterial specimens comprising millions of unit-cells we show that not only is the stress intensity factor, as used in conventional elastic fracture mechanics, insufficient to characterize fracture but also that conventional fracture testing protocols are inadequate. Via a combination of numerical calculations and asymptotic analyses, we extend the ideas of fracture mechanics and develop a general test and design protocol for the failure of metamaterials. △ Less

Submitted 7 June, 2021; originally announced June 2021.

Comments: 58 pages, 5 figures, Supplementary Information, 4 Supplementary Movies

arXiv:2105.15004 [pdf, other]

doi 10.1088/1742-5468/ac9829

Generalization Error Rates in Kernel Regression: The Crossover from the Noiseless to Noisy Regime

Authors: Hugo Cui, Bruno Loureiro, Florent Krzakala, Lenka Zdeborová

Abstract: In this manuscript we consider Kernel Ridge Regression (KRR) under the Gaussian design. Exponents for the decay of the excess generalization error of KRR have been reported in various works under the assumption of power-law decay of eigenvalues of the features co-variance. These decays were, however, provided for sizeably different setups, namely in the noiseless case with constant regularization… ▽ More In this manuscript we consider Kernel Ridge Regression (KRR) under the Gaussian design. Exponents for the decay of the excess generalization error of KRR have been reported in various works under the assumption of power-law decay of eigenvalues of the features co-variance. These decays were, however, provided for sizeably different setups, namely in the noiseless case with constant regularization and in the noisy optimally regularized case. Intermediary settings have been left substantially uncharted. In this work, we unify and extend this line of work, providing characterization of all regimes and excess error decay rates that can be observed in terms of the interplay of noise and regularization. In particular, we show the existence of a transition in the noisy setting between the noiseless exponents to its noisy values as the sample complexity is increased. Finally, we illustrate how this crossover can also be observed on real data sets. △ Less

Submitted 15 December, 2021; v1 submitted 31 May, 2021; originally announced May 2021.

Comments: 22 pages, 10 figures, 2 tables

Journal ref: 35th Conference on Neural Information Processing Systems (NeurIPS 2021) vol 34 p10131--10143. J. Stat. Mech. (2022) 114004

arXiv:2105.08213 [pdf, other]

doi 10.1016/j.neunet.2022.04.019

Distantly Supervised Relation Extraction via Recursive Hierarchy-Interactive Attention and Entity-Order Perception

Authors: Ridong Han, Tao Peng, Jiayu Han, Hai Cui, Lu Liu

Abstract: Wrong-labeling problem and long-tail relations severely affect the performance of distantly supervised relation extraction task. Many studies mitigate the effect of wrong-labeling through selective attention mechanism and handle long-tail relations by introducing relation hierarchies to share knowledge. However, almost all existing studies ignore the fact that, in a sentence, the appearance order… ▽ More Wrong-labeling problem and long-tail relations severely affect the performance of distantly supervised relation extraction task. Many studies mitigate the effect of wrong-labeling through selective attention mechanism and handle long-tail relations by introducing relation hierarchies to share knowledge. However, almost all existing studies ignore the fact that, in a sentence, the appearance order of two entities contributes to the understanding of its semantics. Furthermore, they only utilize each relation level of relation hierarchies separately, but do not exploit the heuristic effect between relation levels, i.e., higher-level relations can give useful information to the lower ones. Based on the above, in this paper, we design a novel Recursive Hierarchy-Interactive Attention network (RHIA) to further handle long-tail relations, which models the heuristic effect between relation levels. From the top down, it passes relation-related information layer by layer, which is the most significant difference from existing models, and generates relation-augmented sentence representations for each relation level in a recursive structure. Besides, we introduce a newfangled training objective, called Entity-Order Perception (EOP), to make the sentence encoder retain more entity appearance information. Substantial experiments on the popular (NYT) dataset are conducted. Compared to prior baselines, our RHIA-EOP achieves state-of-the-art performance in terms of precision-recall (P-R) curves, AUC, Top-N precision and other evaluation metrics. Insightful analysis also demonstrates the necessity and effectiveness of each component of RHIA-EOP. △ Less

Submitted 25 April, 2022; v1 submitted 17 May, 2021; originally announced May 2021.

Comments: 31 pages, Accepted by "Neural Networks"

arXiv:2105.07637 [pdf, other]

Class-Incremental Few-Shot Object Detection

Authors: Pengyang Li, Yanan Li, Han Cui, Donghui Wang

Abstract: Conventional detection networks usually need abundant labeled training samples, while humans can learn new concepts incrementally with just a few examples. This paper focuses on a more challenging but realistic class-incremental few-shot object detection problem (iFSD). It aims to incrementally transfer the model for novel objects from only a few annotated samples without catastrophically forgetti… ▽ More Conventional detection networks usually need abundant labeled training samples, while humans can learn new concepts incrementally with just a few examples. This paper focuses on a more challenging but realistic class-incremental few-shot object detection problem (iFSD). It aims to incrementally transfer the model for novel objects from only a few annotated samples without catastrophically forgetting the previously learned ones. To tackle this problem, we propose a novel method LEAST, which can transfer with Less forgetting, fEwer training resources, And Stronger Transfer capability. Specifically, we first present the transfer strategy to reduce unnecessary weight adaptation and improve the transfer capability for iFSD. On this basis, we then integrate the knowledge distillation technique using a less resource-consuming approach to alleviate forgetting and propose a novel clustering-based exemplar selection process to preserve more discriminative features previously learned. Being a generic and effective method, LEAST can largely improve the iFSD performance on various benchmarks. △ Less

Submitted 28 December, 2021; v1 submitted 17 May, 2021; originally announced May 2021.

arXiv:2104.13547 [pdf, ps, other]

doi 10.1103/PhysRevLett.128.027201

Large Diamagnetism and Electromagnetic Duality in Two-dimensional Dirac Electron System

Authors: S. Fujiyama, H. Maebashi, N. Tajima, T. Tsumuraya, H-B. Cui, M. Ogata, R. Kato

Abstract: A Dirac electron system in solids mimics a relativistic quantum physics that is compatible with Maxwell's equations, by which we anticipate unified electromagnetic responses. We find a large orbital diamagnetism only along the interplane direction and the nearly temperature-independent conductance of the order of e2/h for the new 2D Dirac organic conductor, a-(BETS)2I3. Distinct from conventional… ▽ More A Dirac electron system in solids mimics a relativistic quantum physics that is compatible with Maxwell's equations, by which we anticipate unified electromagnetic responses. We find a large orbital diamagnetism only along the interplane direction and the nearly temperature-independent conductance of the order of e2/h for the new 2D Dirac organic conductor, a-(BETS)2I3. Distinct from conventional electrons in solids whose nonrelativistic effects bifurcate electric and magnetic responses, the observed orbital diamagnetism scales the electrical conductivity for a wide temperature range. This demonstrates that an electromagnetic duality that is valid only within the relativistic framework is revived in solids. △ Less

Submitted 20 December, 2021; v1 submitted 27 April, 2021; originally announced April 2021.

arXiv:2104.10832 [pdf, other]

Building Bilingual and Code-Switched Voice Conversion with Limited Training Data Using Embedding Consistency Loss

Authors: Yaogen Yang, Haozhe Zhang, Xiaoyi Qin, Shanshan Liang, Huahua Cui, Mingyang Xu, Ming Li

Abstract: Building cross-lingual voice conversion (VC) systems for multiple speakers and multiple languages has been a challenging task for a long time. This paper describes a parallel non-autoregressive network to achieve bilingual and code-switched voice conversion for multiple speakers when there are only mono-lingual corpora for each language. We achieve cross-lingual VC between Mandarin speech with mul… ▽ More Building cross-lingual voice conversion (VC) systems for multiple speakers and multiple languages has been a challenging task for a long time. This paper describes a parallel non-autoregressive network to achieve bilingual and code-switched voice conversion for multiple speakers when there are only mono-lingual corpora for each language. We achieve cross-lingual VC between Mandarin speech with multiple speakers and English speech with multiple speakers by applying bilingual bottleneck features. To boost voice cloning performance, we use an adversarial speaker classifier with a gradient reversal layer to reduce the source speaker's information from the output of encoder. Furthermore, in order to improve speaker similarity between reference speech and converted speech, we adopt an embedding consistency loss between the synthesized speech and its natural reference speech in our network. Experimental results show that our proposed method can achieve high quality converted speech with mean opinion score (MOS) around 4. The conversion system performs well in terms of speaker similarity for both in-set speaker conversion and out-set-of one-shot conversion. △ Less

Submitted 21 April, 2021; originally announced April 2021.

Comments: Submitted to Interspeech 2021

arXiv:2104.09701 [pdf, ps, other]

doi 10.1016/j.knosys.2021.106753

Free-form tumor synthesis in computed tomography images via richer generative adversarial network

Authors: Qiangguo **, Hui Cui, Changming Sun, Zhaopeng Meng, Ran Su

Abstract: The insufficiency of annotated medical imaging scans for cancer makes it challenging to train and validate data-hungry deep learning models in precision oncology. We propose a new richer generative adversarial network for free-form 3D tumor/lesion synthesis in computed tomography (CT) images. The network is composed of a new richer convolutional feature enhanced dilated-gated generator (RicherDG)… ▽ More The insufficiency of annotated medical imaging scans for cancer makes it challenging to train and validate data-hungry deep learning models in precision oncology. We propose a new richer generative adversarial network for free-form 3D tumor/lesion synthesis in computed tomography (CT) images. The network is composed of a new richer convolutional feature enhanced dilated-gated generator (RicherDG) and a hybrid loss function. The RicherDG has dilated-gated convolution layers to enable tumor-painting and to enlarge perceptive fields; and it has a novel richer convolutional feature association branch to recover multi-scale convolutional features especially from uncertain boundaries between tumor and surrounding healthy tissues. The hybrid loss function, which consists of a diverse range of losses, is designed to aggregate complementary information to improve optimization. We perform a comprehensive evaluation of the synthesis results on a wide range of public CT image datasets covering the liver, kidney tumors, and lung nodules. The qualitative and quantitative evaluations and ablation study demonstrated improved synthesizing results over advanced tumor synthesis methods. △ Less

Submitted 19 April, 2021; originally announced April 2021.

Showing 151–200 of 379 results for author: Cui, H