-
Neural KEM: A Kernel Method with Deep Coefficient Prior for PET Image Reconstruction
Authors:
Siqi Li,
Kuang Gong,
Ramsey D. Badawi,
Edward J. Kim,
**yi Qi,
Guobao Wang
Abstract:
Image reconstruction of low-count positron emission tomography (PET) data is challenging. Kernel methods address the challenge by incorporating image prior information in the forward model of iterative PET image reconstruction. The kernelized expectation-maximization (KEM) algorithm has been developed and demonstrated to be effective and easy to implement. A common approach for a further improveme…
▽ More
Image reconstruction of low-count positron emission tomography (PET) data is challenging. Kernel methods address the challenge by incorporating image prior information in the forward model of iterative PET image reconstruction. The kernelized expectation-maximization (KEM) algorithm has been developed and demonstrated to be effective and easy to implement. A common approach for a further improvement of the kernel method would be adding an explicit regularization, which however leads to a complex optimization problem. In this paper, we propose an implicit regularization for the kernel method by using a deep coefficient prior, which represents the kernel coefficient image in the PET forward model using a convolutional neural-network. To solve the maximum-likelihood neural network-based reconstruction problem, we apply the principle of optimization transfer to derive a neural KEM algorithm. Each iteration of the algorithm consists of two separate steps: a KEM step for image update from the projection data and a deep-learning step in the image domain for updating the kernel coefficient image using the neural network. This optimization algorithm is guaranteed to monotonically increase the data likelihood. The results from computer simulations and real patient data have demonstrated that the neural KEM can outperform existing KEM and deep image prior methods.
△ Less
Submitted 24 October, 2022; v1 submitted 4 January, 2022;
originally announced January 2022.
-
Deconfounded Visual Grounding
Authors:
Jianqiang Huang,
Yu Qin,
Jiaxin Qi,
Qianru Sun,
Hanwang Zhang
Abstract:
We focus on the confounding bias between language and location in the visual grounding pipeline, where we find that the bias is the major visual reasoning bottleneck. For example, the grounding process is usually a trivial language-location association without visual reasoning, e.g., grounding any language query containing sheep to the nearly central regions, due to that most queries about sheep h…
▽ More
We focus on the confounding bias between language and location in the visual grounding pipeline, where we find that the bias is the major visual reasoning bottleneck. For example, the grounding process is usually a trivial language-location association without visual reasoning, e.g., grounding any language query containing sheep to the nearly central regions, due to that most queries about sheep have ground-truth locations at the image center. First, we frame the visual grounding pipeline into a causal graph, which shows the causalities among image, query, target location and underlying confounder. Through the causal graph, we know how to break the grounding bottleneck: deconfounded visual grounding. Second, to tackle the challenge that the confounder is unobserved in general, we propose a confounder-agnostic approach called: Referring Expression Deconfounder (RED), to remove the confounding bias. Third, we implement RED as a simple language attention, which can be applied in any grounding method. On popular benchmarks, RED improves various state-of-the-art grounding methods by a significant margin. Code will soon be available at: https://github.com/JianqiangH/Deconfounded_VG.
△ Less
Submitted 31 December, 2021;
originally announced December 2021.
-
A new way to explore cosmological tensions using gravitational waves and strong gravitational lensing
Authors:
Meng-Di Cao,
Jie Zheng,
**g-Zhao Qi,
Xin Zhang,
Zong-Hong Zhu
Abstract:
In recent years, a crisis in the standard cosmology has been caused by inconsistencies in the measurements of some key cosmological parameters, Hubble constant $H_0$ and cosmic curvature parameter $Ω_K$ for example. It is necessary to remeasure them with the cosmological model-independent methods. In this paper, based on the distance sum rule, we present such a way to constrain $H_0$ and $Ω_K$ sim…
▽ More
In recent years, a crisis in the standard cosmology has been caused by inconsistencies in the measurements of some key cosmological parameters, Hubble constant $H_0$ and cosmic curvature parameter $Ω_K$ for example. It is necessary to remeasure them with the cosmological model-independent methods. In this paper, based on the distance sum rule, we present such a way to constrain $H_0$ and $Ω_K$ simultaneously in the late universe from strong gravitational lensing time delay (SGLTD) data and gravitational wave (GW) standard siren data simulated from the future observation of the Einstein Telescope (ET). Based on the currently 6 observed SGLTD data, we find that the constraint precision of $H_0$ from the combined 100 GW events can be comparable with the measurement from SH0ES collaboration. As the number of GW events increases to 700, the constraint precision of $H_0$ will exceed that of the \textit{Planck} 2018 results. Considering 1000 GW events as the conservative estimation of ET in ten-year observation, we obtain $H_0=73.69\pm 0.36 \mathrm{~km~s^{-1}~Mpc^{-1}}$ with a 0.5\% uncertainty and $Ω_K=0.076^{+0.068}_{-0.087}$. In addition, we simulate 55 SGL systems with 6.6\% uncertainty for the measurement of time-delay distance. By combining with 1000 GWs, we infer that $H_0=73.65\pm0.35 \mathrm{~km~s^{-1}~Mpc^{-1}}$ and $Ω_K=0.008\pm0.048$. Our results suggest that this approach can play an important role in exploring cosmological tensions.
△ Less
Submitted 27 June, 2022; v1 submitted 29 December, 2021;
originally announced December 2021.
-
Extrinsic ferroelectricity originated from oxygen vacancy drift in HfO2-based films
Authors:
Yong Cheng,
Maoyuan Zheng,
Xingwang Zhang,
Hao Dong,
Yitian Jiang,
**liang Wu,
**g Qi,
Zhigang Yin
Abstract:
It is generally accepted that oxygen vacancies play a central role in the emergence of ferroelectricity for HfO2-based materials, but the underlying mechanism still remains elusive. Herein, starting from the basic characterization circuit, we propose that the observed ferroelectricity is extrinsic. A key finding is that charged oxygen vacancies oscillate within the sample under repeated electric p…
▽ More
It is generally accepted that oxygen vacancies play a central role in the emergence of ferroelectricity for HfO2-based materials, but the underlying mechanism still remains elusive. Herein, starting from the basic characterization circuit, we propose that the observed ferroelectricity is extrinsic. A key finding is that charged oxygen vacancies oscillate within the sample under repeated electric pulses, yielding a nonlinear current which behaves similarly to the polarization current for a normal ferroelectric. This unwanted current signal results in a ferroelectric-like hysteresis loop with both remnant polarization and coercive field in good agreements with experimental values, given a charged oxygen vacancy concentration in the vicinity of 1*10^20/cm^3. Moreover, it is possible to exploit this mechanism to reproduce the effects of wake-up, split-up and limited endurance that are of crucial relevance for the device applications.
△ Less
Submitted 26 December, 2021;
originally announced December 2021.
-
Application and modeling of an online distillation method to reduce krypton and argon in XENON1T
Authors:
E. Aprile,
K. Abe,
F. Agostini,
S. Ahmed Maouloud,
M. Alfonsi,
L. Althueser,
E. Angelino,
J. R. Angevaare,
V. C. Antochi,
D. Antón Martin,
F. Arneodo,
L. Baudis,
A. L. Baxter,
L. Bellagamba,
A. Bernard,
R. Biondi,
A. Bismark,
A. Brown,
S. Bruenner,
G. Bruno,
R. Budnik,
C. Capelli,
J. M. R. Cardoso,
D. Cichon,
B. Cimmino
, et al. (129 additional authors not shown)
Abstract:
A novel online distillation technique was developed for the XENON1T dark matter experiment to reduce intrinsic background components more volatile than xenon, such as krypton or argon, while the detector was operating. The method is based on a continuous purification of the gaseous volume of the detector system using the XENON1T cryogenic distillation column. A krypton-in-xenon concentration of…
▽ More
A novel online distillation technique was developed for the XENON1T dark matter experiment to reduce intrinsic background components more volatile than xenon, such as krypton or argon, while the detector was operating. The method is based on a continuous purification of the gaseous volume of the detector system using the XENON1T cryogenic distillation column. A krypton-in-xenon concentration of $(360 \pm 60)$ ppq was achieved. It is the lowest concentration measured in the fiducial volume of an operating dark matter detector to date. A model was developed and fit to the data to describe the krypton evolution in the liquid and gas volumes of the detector system for several operation modes over the time span of 550 days, including the commissioning and science runs of XENON1T. The online distillation was also successfully applied to remove Ar-37 after its injection for a low energy calibration in XENON1T. This makes the usage of Ar-37 as a regular calibration source possible in the future. The online distillation can be applied to next-generation experiments to remove krypton prior to, or during, any science run. The model developed here allows further optimization of the distillation strategy for future large scale detectors.
△ Less
Submitted 14 June, 2022; v1 submitted 22 December, 2021;
originally announced December 2021.
-
Emission of Single and Few Electrons in XENON1T and Limits on Light Dark Matter
Authors:
E. Aprile,
K. Abe,
F. Agostini,
S. Ahmed Maouloud,
M. Alfonsi,
L. Althueser,
E. Angelino,
J. R. Angevaare,
V. C. Antochi,
D. Antón Martin,
F. Arneodo,
L. Baudis,
A. L. Baxter,
L. Bellagamba,
A. Bernard,
R. Biondi,
A. Bismark,
A. Brown,
S. Bruenner,
G. Bruno,
R. Budnik,
C. Capelli,
J. M. R. Cardoso,
D. Cichon,
B. Cimmino
, et al. (130 additional authors not shown)
Abstract:
Delayed single- and few-electron emissions plague dual-phase time projection chambers, limiting their potential to search for light-mass dark matter. This paper examines the origins of these events in the XENON1T experiment. Characterization of the intensity of delayed electron backgrounds shows that the resulting emissions are correlated, in time and position, with high-energy events and can effe…
▽ More
Delayed single- and few-electron emissions plague dual-phase time projection chambers, limiting their potential to search for light-mass dark matter. This paper examines the origins of these events in the XENON1T experiment. Characterization of the intensity of delayed electron backgrounds shows that the resulting emissions are correlated, in time and position, with high-energy events and can effectively be vetoed. In this work we extend previous S2-only analyses down to a single electron. From this analysis, after removing the correlated backgrounds, we observe rates < 30 events/(electron*kg*day) in the region of interest spanning 1 to 5 electrons. We derive 90% confidence upper limits for dark matter-electron scattering, first direct limits on the electric dipole, magnetic dipole, and anapole interactions, and bosonic dark matter models, where we exclude new parameter space for dark photons and solar dark photons.
△ Less
Submitted 28 June, 2022; v1 submitted 22 December, 2021;
originally announced December 2021.
-
A Deep-Learning Framework for Improving COVID-19 CT Image Quality and Diagnostic Accuracy
Authors:
Garvit Goel,
**gyuan Qi,
Wu-chun Feng,
Guohua Cao
Abstract:
We present a deep-learning based computing framework for fast-and-accurate CT (DL-FACT) testing of COVID-19. Our CT-based DL framework was developed to improve the testing speed and accuracy of COVID-19 (plus its variants) via a DL-based approach for CT image enhancement and classification. The image enhancement network is adapted from DDnet, short for DenseNet and Deconvolution based network. To…
▽ More
We present a deep-learning based computing framework for fast-and-accurate CT (DL-FACT) testing of COVID-19. Our CT-based DL framework was developed to improve the testing speed and accuracy of COVID-19 (plus its variants) via a DL-based approach for CT image enhancement and classification. The image enhancement network is adapted from DDnet, short for DenseNet and Deconvolution based network. To demonstrate its speed and accuracy, we evaluated DL-FACT across several sources of COVID-19 CT images. Our results show that DL-FACT can significantly shorten the turnaround time from days to minutes and improve the COVID-19 testing accuracy up to 91%. DL-FACT could be used as a software tool for medical professionals in diagnosing and monitoring COVID-19.
△ Less
Submitted 16 December, 2021;
originally announced December 2021.
-
R2: A Distributed Remote Function Execution Mechanism With Built-in Metadata
Authors:
Jianpeng Qi,
Rui Wang
Abstract:
Named data networking (NDN) constructs a network by names, providing a flexible and decentralized way to manage resources within the edge computing continuum. This paper aims to solve the question, "Given a function with its parameters and metadata, how to select the executor in a distributed manner and obtain the result in NDN?" To answer it, we design R2 that involves the following stages. First…
▽ More
Named data networking (NDN) constructs a network by names, providing a flexible and decentralized way to manage resources within the edge computing continuum. This paper aims to solve the question, "Given a function with its parameters and metadata, how to select the executor in a distributed manner and obtain the result in NDN?" To answer it, we design R2 that involves the following stages. First, we design a name structure including data, function names, and other function parameters. Second, we develop a 2-phase mechanism, where in the first phase, the function request from a client-first reaches the data source and retrieves the metadata, then the best node is selected while the metadata is responding to the client. In the second phase, the chosen node directly retrieves the data, executes the function, and provides the result to the client. Furthermore, we propose a stop condition to intelligently reduce the processing time of the first phase and provide a simple proof and range analysis. Simulations confirm that R2 outperforms the current solutions in terms of resource allocation, especially when the data volume and the function complexity are high. In the experiments, when the data size is 100 KiB and the function complexity is $\mathcal{O}(n^2)$, the speedup ratio is 4.61. To further evaluate R2, we also implement a general intermediate data processing logic named ``Bolt'' implemented on an app-level in ndnSIM. We believe that R2 shall help the researchers and developers to verify their ideas smoothly.
△ Less
Submitted 20 August, 2022; v1 submitted 5 December, 2021;
originally announced December 2021.
-
Material radiopurity control in the XENONnT experiment
Authors:
E. Aprile,
K. Abe,
F. Agostini,
S. Ahmed Maouloud,
M. Alfonsi,
L. Althueser,
E. Angelino,
J. R. Angevaare,
V. C. Antochi,
D. Antón Martin,
F. Arneodo,
L. Baudis,
A. L. Baxter,
L. Bellagamba,
R. Biondi,
A. Bismark,
A. Brown,
S. Bruenner,
G. Bruno,
R. Budnik,
C. Capelli,
J. M. R. Cardoso,
D. Cichon,
B. Cimmino,
M. Clark
, et al. (128 additional authors not shown)
Abstract:
The selection of low-radioactive construction materials is of the utmost importance for rare-event searches and thus critical to the XENONnT experiment. Results of an extensive radioassay program are reported, in which material samples have been screened with gamma-ray spectroscopy, mass spectrometry, and $^{222}$Rn emanation measurements. Furthermore, the cleanliness procedures applied to remove…
▽ More
The selection of low-radioactive construction materials is of the utmost importance for rare-event searches and thus critical to the XENONnT experiment. Results of an extensive radioassay program are reported, in which material samples have been screened with gamma-ray spectroscopy, mass spectrometry, and $^{222}$Rn emanation measurements. Furthermore, the cleanliness procedures applied to remove or mitigate surface contamination of detector materials are described. Screening results, used as inputs for a XENONnT Monte Carlo simulation, predict a reduction of materials background ($\sim$17%) with respect to its predecessor XENON1T. Through radon emanation measurements, the expected $^{222}$Rn activity concentration in XENONnT is determined to be 4.2$\,(^{+0.5}_{-0.7})\,μ$Bq/kg, a factor three lower with respect to XENON1T. This radon concentration will be further suppressed by means of the novel radon distillation system.
△ Less
Submitted 26 January, 2023; v1 submitted 10 December, 2021;
originally announced December 2021.
-
Findings on Conversation Disentanglement
Authors:
Rongxin Zhu,
Jey Han Lau,
Jianzhong Qi
Abstract:
Conversation disentanglement, the task to identify separate threads in conversations, is an important pre-processing step in multi-party conversational NLP applications such as conversational question answering and conversation summarization. Framing it as a utterance-to-utterance classification problem -- i.e. given an utterance of interest (UOI), find which past utterance it replies to -- we exp…
▽ More
Conversation disentanglement, the task to identify separate threads in conversations, is an important pre-processing step in multi-party conversational NLP applications such as conversational question answering and conversation summarization. Framing it as a utterance-to-utterance classification problem -- i.e. given an utterance of interest (UOI), find which past utterance it replies to -- we explore a number of transformer-based models and found that BERT in combination with handcrafted features remains a strong baseline. We then build a multi-task learning model that jointly learns utterance-to-utterance and utterance-to-thread classification. Observing that the ground truth label (past utterance) is in the top candidates when our model makes an error, we experiment with using bipartite graphs as a post-processing step to learn how to best match a set of UOIs to past utterances. Experiments on the Ubuntu IRC dataset show that this approach has the potential to outperform the conventional greedy approach of simply selecting the highest probability candidate for each UOI independently, indicating a promising future research direction.
△ Less
Submitted 10 December, 2021;
originally announced December 2021.
-
The In-Flight Realtime Trigger and Localization Software of GECAM
Authors:
Xiao-Yun Zhao,
Shao-Lin Xiong,
Xiang-Yang Wen,
Xin-Qiao Li,
Ce Cai,
Shuo Xiao,
Qi Luo,
Wen-Xi Peng,
Dong-Ya Guo,
Zheng-Hua An,
Ke Gong,
**-Yuan Liao,
Yan-Qiu Zhang,
Yue Huang,
Lu Li,
Xing Wen,
Fei Zhang,
**g Duan,
Chen-Wei Wang,
Dong-Li Shi,
Peng Zhang,
Qi-Bin Yi,
Chao-Yang Li,
Yan-Bing Xu,
Xiao-Hua Liang
, et al. (64 additional authors not shown)
Abstract:
Realtime trigger and localization of bursts are the key functions of GECAM, which is an all-sky gamma-ray monitor launched in Dec 10, 2020. We developed a multifunctional trigger and localization software operating on the CPU of the GECAM electronic box (EBOX). This onboard software has the following features: high trigger efficiency for real celestial bursts with a suppression of false triggers c…
▽ More
Realtime trigger and localization of bursts are the key functions of GECAM, which is an all-sky gamma-ray monitor launched in Dec 10, 2020. We developed a multifunctional trigger and localization software operating on the CPU of the GECAM electronic box (EBOX). This onboard software has the following features: high trigger efficiency for real celestial bursts with a suppression of false triggers caused by charged particle bursts and background fluctuation, dedicated localization algorithm optimized for short and long bursts respetively, short time latency of the trigger information which is downlinked throught the BeiDou satellite navigation System (BDS). This paper presents the detailed design and deveopment of this trigger and localization software system of GECAM, including the main functions, general design, workflow and algorithms, as well as the verification and demonstration of this software, including the on-ground trigger tests with simulated gamma-ray bursts made by a dedicated X-ray tube and the in-flight performance to real gamma-ray bursts and magnetar bursts.
△ Less
Submitted 9 December, 2021;
originally announced December 2021.
-
Spintronic Sources of Ultrashort Terahertz Electromagnetic Pulses
Authors:
Tom S. Seifert,
Liang Cheng,
Zhengxing Wei,
Tobias Kampfrath,
**gbo Qi
Abstract:
Spintronic terahertz emitters are novel, broadband and efficient sources of terahertz radiation, which emerged at the intersection of ultrafast spintronics and terahertz photonics. They are based on efficient spin-current generation, spin-to-charge-current and current-to-field conversion at terahertz rates. In this review, we address the recent developments and applications, the current understand…
▽ More
Spintronic terahertz emitters are novel, broadband and efficient sources of terahertz radiation, which emerged at the intersection of ultrafast spintronics and terahertz photonics. They are based on efficient spin-current generation, spin-to-charge-current and current-to-field conversion at terahertz rates. In this review, we address the recent developments and applications, the current understanding of the physical processes as well as the future challenges and perspectives of broadband spintronic terahertz emitters.
△ Less
Submitted 3 May, 2022; v1 submitted 6 December, 2021;
originally announced December 2021.
-
REMR: A Reliability Evaluation Method for Dynamic Edge Computing Network under Time Constraints
Authors:
Liang Chen,
Jianpeng Qi,
Xiao Su,
Rui Wang
Abstract:
While the concept of Artificial Intelligent Internet of Things\ (AIoT) is booming, computation and/or communication-intensive tasks accompanied by several sub-tasks are slowly moving from centralized deployment to edge-side deployment. The idea of edge computing also makes intelligent services sink locally. But in actual scenarios like dynamic edge computing networks (DECN), due to fluctuations in…
▽ More
While the concept of Artificial Intelligent Internet of Things\ (AIoT) is booming, computation and/or communication-intensive tasks accompanied by several sub-tasks are slowly moving from centralized deployment to edge-side deployment. The idea of edge computing also makes intelligent services sink locally. But in actual scenarios like dynamic edge computing networks (DECN), due to fluctuations in available computing resources of intermediate servers and changes in bandwidth during data transmission, service reliability becomes difficult to guarantee. Coupled with changes in the amount of data in a service, the above three problems all make the existing reliability evaluation methods no longer accurate. To study the effect of distributed service deployment strategies under such a background, this paper proposes a reliability evaluation method (REMR) based on lower boundary rule under time constraint to study the degree of the rationality of a service deployment plan combined with DECN. In this scenario, time delay is the main concern which would be affected by three quantitative factors: data packet storing and sending time, data transmission time and the calculation time of executing sub-tasks on the node devices, specially while the last two are in dynamic scenarios. In actual calculation, based on the idea of the minimal paths, the solution set would to be found that can meet requirements in the current deployment. Then the reliability of the service supported by the solution sets would be found out based on the principle of inclusion-exclusion combined with the distribution of available data transmission bandwidth and the distribution of node available computing resources. Besides a illustrative example was provided, to verify the calculated reliability of the designed service deployment plan, the NS3 is utilized along with Google cluster data set for simulation.
△ Less
Submitted 3 December, 2021;
originally announced December 2021.
-
LMR-CBT: Learning Modality-fused Representations with CB-Transformer for Multimodal Emotion Recognition from Unaligned Multimodal Sequences
Authors:
Ziwang Fu,
Feng Liu,
Hanyang Wang,
Siyuan Shen,
Jiahao Zhang,
Jiayin Qi,
Xiangling Fu,
Aimin Zhou
Abstract:
Learning modality-fused representations and processing unaligned multimodal sequences are meaningful and challenging in multimodal emotion recognition. Existing approaches use directional pairwise attention or a message hub to fuse language, visual, and audio modalities. However, those approaches introduce information redundancy when fusing features and are inefficient without considering the comp…
▽ More
Learning modality-fused representations and processing unaligned multimodal sequences are meaningful and challenging in multimodal emotion recognition. Existing approaches use directional pairwise attention or a message hub to fuse language, visual, and audio modalities. However, those approaches introduce information redundancy when fusing features and are inefficient without considering the complementarity of modalities. In this paper, we propose an efficient neural network to learn modality-fused representations with CB-Transformer (LMR-CBT) for multimodal emotion recognition from unaligned multimodal sequences. Specifically, we first perform feature extraction for the three modalities respectively to obtain the local structure of the sequences. Then, we design a novel transformer with cross-modal blocks (CB-Transformer) that enables complementary learning of different modalities, mainly divided into local temporal learning,cross-modal feature fusion and global self-attention representations. In addition, we splice the fused features with the original features to classify the emotions of the sequences. Finally, we conduct word-aligned and unaligned experiments on three challenging datasets, IEMOCAP, CMU-MOSI, and CMU-MOSEI. The experimental results show the superiority and efficiency of our proposed method in both settings. Compared with the mainstream methods, our approach reaches the state-of-the-art with a minimum number of parameters.
△ Less
Submitted 2 December, 2021;
originally announced December 2021.
-
The physics of Lyman-alpha escape from disc-like galaxies
Authors:
Aaron Smith,
Rahul Kannan,
Sandro Tacchella,
Mark Vogelsberger,
Lars Hernquist,
Federico Marinacci,
Laura V. Sales,
Paul Torrey,
Hui Li,
Yuan-Chen Yeh,
Jia Qi
Abstract:
Hydrogen emission lines can provide extensive information about star-forming galaxies in both the local and high-redshift Universe. We present a detailed Lyman continuum (LyC), Lyman-alpha (Lyα), and Balmer line (Hα and H\b{eta}) radiative transfer study of a high-resolution isolated Milky-Way simulation using the Arepo-RT radiation hydrodynamics code with the SMUGGLE galaxy formation model. The r…
▽ More
Hydrogen emission lines can provide extensive information about star-forming galaxies in both the local and high-redshift Universe. We present a detailed Lyman continuum (LyC), Lyman-alpha (Lyα), and Balmer line (Hα and H\b{eta}) radiative transfer study of a high-resolution isolated Milky-Way simulation using the Arepo-RT radiation hydrodynamics code with the SMUGGLE galaxy formation model. The realistic framework includes stellar feedback, non-equilibrium thermochemistry, and dust grain evolution in the interstellar medium (ISM). We extend our Cosmic Lyα Transfer (COLT) code with photoionization equilibrium Monte Carlo radiative transfer for self-consistent end-to-end (non-)resonant line predictions. Accurate LyC reprocessing to recombination emission requires modelling pre-absorption by dust (27.5%), helium ionization (8.7%), and anisotropic escape fractions (7.9%), as these reduce the available budget for hydrogen line emission (55.9%). We investigate the role of the multiphase dusty ISM, disc geometry, gas kinematics, and star formation activity in governing the physics of emission and escape, focusing on the time variability, gas phase structure, and spatial, spectral, and viewing angle dependence of the emergent photons. Isolated disc simulations are well-suited for comprehensive observational comparisons with local Hα surveys, but would require a proper cosmological circumgalactic medium (CGM) environment as well as less dust absorption and rotational broadening to serve as analogs for high-redshift Lyα emitting galaxies. Future applications of our framework to next-generation cosmological simulations of galaxy formation including radiation-hydrodynamics that resolve <10 pc multiphase ISM and <1 kpc CGM structures will provide crucial insights and predictions for current and upcoming Lyα observations.
△ Less
Submitted 26 September, 2022; v1 submitted 26 November, 2021;
originally announced November 2021.
-
Performance of a Radial Time Projection Chamber with Electroluminescence in Liquid Xenon
Authors:
Yuehuan Wei,
Jianyang Qi,
Evan Shockley,
Haiwen Xu,
Kaixuan Ni
Abstract:
The dual-phase xenon time projection chamber (TPC) is a leading detector technology in rare event searches for dark matter and neutrino physics. The success of this type of detector technology relies on its capability to detect both primary scintillation and ionization signals from particle interactions in liquid xenon (LXe). The ionization electrons are converted into electroluminescence in the g…
▽ More
The dual-phase xenon time projection chamber (TPC) is a leading detector technology in rare event searches for dark matter and neutrino physics. The success of this type of detector technology relies on its capability to detect both primary scintillation and ionization signals from particle interactions in liquid xenon (LXe). The ionization electrons are converted into electroluminescence in the gas xenon (GXe), where a single electron can be amplified by more than 100 times in number of photons in a strong electric field. Maintaining a strong and uniform electric field in the small gas gap in large diameter TPCs is challenging. One alternative solution is to produce the electroluminescence in the LXe directly to overcome the gas gap uniformity problem. Here we report on the design and performance of a single-phase Radial TPC (RTPC) which can create and detect the electroluminescence directly in LXe. It simplifies the design and operation of the LXe TPC by using a single wire in the axial center to create the strong electric field. We present the performance of such an RTPC and discuss its limitations for potential applications.
△ Less
Submitted 9 February, 2022; v1 submitted 14 November, 2021;
originally announced November 2021.
-
Occluded Video Instance Segmentation: Dataset and ICCV 2021 Challenge
Authors:
Jiyang Qi,
Yan Gao,
Yao Hu,
Xinggang Wang,
Xiaoyu Liu,
Xiang Bai,
Serge Belongie,
Alan Yuille,
Philip H. S. Torr,
Song Bai
Abstract:
Although deep learning methods have achieved advanced video object recognition performance in recent years, perceiving heavily occluded objects in a video is still a very challenging task. To promote the development of occlusion understanding, we collect a large-scale dataset called OVIS for video instance segmentation in the occluded scenario. OVIS consists of 296k high-quality instance masks and…
▽ More
Although deep learning methods have achieved advanced video object recognition performance in recent years, perceiving heavily occluded objects in a video is still a very challenging task. To promote the development of occlusion understanding, we collect a large-scale dataset called OVIS for video instance segmentation in the occluded scenario. OVIS consists of 296k high-quality instance masks and 901 occluded scenes. While our human vision systems can perceive those occluded objects by contextual reasoning and association, our experiments suggest that current video understanding systems cannot. On the OVIS dataset, all baseline methods encounter a significant performance degradation of about 80% in the heavily occluded object group, which demonstrates that there is still a long way to go in understanding obscured objects and videos in a complex real-world scenario. To facilitate the research on new paradigms for video understanding systems, we launched a challenge based on the OVIS dataset. The submitted top-performing algorithms have achieved much higher performance than our baselines. In this paper, we will introduce the OVIS dataset and further dissect it by analyzing the results of baselines and submitted methods. The OVIS dataset and challenge information can be found at http://songbai.site/ovis .
△ Less
Submitted 15 November, 2021;
originally announced November 2021.
-
A cross-modal fusion network based on self-attention and residual structure for multimodal emotion recognition
Authors:
Ziwang Fu,
Feng Liu,
Hanyang Wang,
Jiayin Qi,
Xiangling Fu,
Aimin Zhou,
Zhibin Li
Abstract:
The audio-video based multimodal emotion recognition has attracted a lot of attention due to its robust performance. Most of the existing methods focus on proposing different cross-modal fusion strategies. However, these strategies introduce redundancy in the features of different modalities without fully considering the complementary properties between modal information, and these approaches do n…
▽ More
The audio-video based multimodal emotion recognition has attracted a lot of attention due to its robust performance. Most of the existing methods focus on proposing different cross-modal fusion strategies. However, these strategies introduce redundancy in the features of different modalities without fully considering the complementary properties between modal information, and these approaches do not guarantee the non-loss of original semantic information during intra- and inter-modal interactions. In this paper, we propose a novel cross-modal fusion network based on self-attention and residual structure (CFN-SR) for multimodal emotion recognition. Firstly, we perform representation learning for audio and video modalities to obtain the semantic features of the two modalities by efficient ResNeXt and 1D CNN, respectively. Secondly, we feed the features of the two modalities into the cross-modal blocks separately to ensure efficient complementarity and completeness of information through the self-attention mechanism and residual structure. Finally, we obtain the output of emotions by splicing the obtained fused representation with the original representation. To verify the effectiveness of the proposed method, we conduct experiments on the RAVDESS dataset. The experimental results show that the proposed CFN-SR achieves the state-of-the-art and obtains 75.76% accuracy with 26.30M parameters. Our code is available at https://github.com/skeletonNN/CFN-SR.
△ Less
Submitted 3 November, 2021;
originally announced November 2021.
-
EvoGAN: An Evolutionary Computation Assisted GAN
Authors:
Feng Liu,
HanYang Wang,
Jiahao Zhang,
Ziwang Fu,
Aimin Zhou,
Jiayin Qi,
Zhibin Li
Abstract:
The image synthesis technique is relatively well established which can generate facial images that are indistinguishable even by human beings. However, all of these approaches uses gradients to condition the output, resulting in the outputting the same image with the same input. Also, they can only generate images with basic expression or mimic an expression instead of generating compound expressi…
▽ More
The image synthesis technique is relatively well established which can generate facial images that are indistinguishable even by human beings. However, all of these approaches uses gradients to condition the output, resulting in the outputting the same image with the same input. Also, they can only generate images with basic expression or mimic an expression instead of generating compound expression. In real life, however, human expressions are of great diversity and complexity. In this paper, we propose an evolutionary algorithm (EA) assisted GAN, named EvoGAN, to generate various compound expressions with any accurate target compound expression. EvoGAN uses an EA to search target results in the data distribution learned by GAN. Specifically, we use the Facial Action Coding System (FACS) as the encoding of an EA and use a pre-trained GAN to generate human facial images, and then use a pre-trained classifier to recognize the expression composition of the synthesized images as the fitness function to guide the search of the EA. Combined random searching algorithm, various images with the target expression can be easily sythesized. Quantitative and Qualitative results are presented on several compound expressions, and the experimental results demonstrate the feasibility and the potential of EvoGAN.
△ Less
Submitted 22 October, 2021;
originally announced October 2021.
-
Ultrasensitive barocaloric material for room-temperature solid-state refrigeration
Authors:
Qingyong Ren,
Ji Qi,
Dehong Yu,
Wenli Song,
Bao Yuan,
Tianhao Wan,
Weijun Ren,
Zhidong Zhang,
Xin Tong,
Bing Li
Abstract:
Solid-state refrigeration based on caloric effects is an energetically efficient and environmentally friendly technology, which is deemed as a potential alternative to the conventional vapor-compression technology. One of the greatest obstacles to the real application is the huge driving fields. Here, we report a giant barocaloric effect in inorganic NH4I with maximum entropy changes of ΔS_BCE^max…
▽ More
Solid-state refrigeration based on caloric effects is an energetically efficient and environmentally friendly technology, which is deemed as a potential alternative to the conventional vapor-compression technology. One of the greatest obstacles to the real application is the huge driving fields. Here, we report a giant barocaloric effect in inorganic NH4I with maximum entropy changes of ΔS_BCE^max ~89 J K-1 kg-1 around room temperature, associated with the orientationally order-disorder phase transition. The phase transition temperature, Tt, varies dramatically with pressure in a rate of dTt/dP ~0.81 K MPa-1, which leads to a very much small saturation driving pressure of ΔP ~20 MPa, an unprecedentedly large caloric strength of |ΔS_BCE^max/ΔP| ~4.45 J K-1 kg-1 MPa-1, as well as a broad temperature window of ~68 K under an 80 MPa driving pressure. Comprehensive characterization of the crystal structure and dynamics by neutron scattering measurements reveals a strong reorientation-vibration coupling that is responsible for the large pressure sensitivity of Tt. This work is expected to advance the practical application of barocaloric refrigeration.
△ Less
Submitted 21 October, 2021;
originally announced October 2021.
-
Classical-to-Quantum Transfer Learning for Spoken Command Recognition Based on Quantum Neural Networks
Authors:
Jun Qi,
Javier Tejedor
Abstract:
This work investigates an extension of transfer learning applied in machine learning algorithms to the emerging hybrid end-to-end quantum neural network (QNN) for spoken command recognition (SCR). Our QNN-based SCR system is composed of classical and quantum components: (1) the classical part mainly relies on a 1D convolutional neural network (CNN) to extract speech features; (2) the quantum part…
▽ More
This work investigates an extension of transfer learning applied in machine learning algorithms to the emerging hybrid end-to-end quantum neural network (QNN) for spoken command recognition (SCR). Our QNN-based SCR system is composed of classical and quantum components: (1) the classical part mainly relies on a 1D convolutional neural network (CNN) to extract speech features; (2) the quantum part is built upon the variational quantum circuit with a few learnable parameters. Since it is inefficient to train the hybrid end-to-end QNN from scratch on a noisy intermediate-scale quantum (NISQ) device, we put forth a hybrid transfer learning algorithm that allows a pre-trained classical network to be transferred to the classical part of the hybrid QNN model. The pre-trained classical network is further modified and augmented through jointly fine-tuning with a variational quantum circuit (VQC). The hybrid transfer learning methodology is particularly attractive for the task of QNN-based SCR because low-dimensional classical features are expected to be encoded into quantum states. We assess the hybrid transfer learning algorithm applied to the hybrid classical-quantum QNN for SCR on the Google speech command dataset, and our classical simulation results suggest that the hybrid transfer learning can boost our baseline performance on the SCR task.
△ Less
Submitted 16 October, 2021;
originally announced October 2021.
-
QTN-VQC: An End-to-End Learning framework for Quantum Neural Networks
Authors:
Jun Qi,
Chao-Han Huck Yang,
Pin-Yu Chen
Abstract:
The advent of noisy intermediate-scale quantum (NISQ) computers raises a crucial challenge to design quantum neural networks for fully quantum learning tasks. To bridge the gap, this work proposes an end-to-end learning framework named QTN-VQC, by introducing a trainable quantum tensor network (QTN) for quantum embedding on a variational quantum circuit (VQC). The architecture of QTN is composed o…
▽ More
The advent of noisy intermediate-scale quantum (NISQ) computers raises a crucial challenge to design quantum neural networks for fully quantum learning tasks. To bridge the gap, this work proposes an end-to-end learning framework named QTN-VQC, by introducing a trainable quantum tensor network (QTN) for quantum embedding on a variational quantum circuit (VQC). The architecture of QTN is composed of a parametric tensor-train network for feature extraction and a tensor product encoding for quantum embedding. We highlight the QTN for quantum embedding in terms of two perspectives: (1) we theoretically characterize QTN by analyzing its representation power of input features; (2) QTN enables an end-to-end parametric model pipeline, namely QTN-VQC, from the generation of quantum embedding to the output measurement. Our experiments on the MNIST dataset demonstrate the advantages of QTN for quantum embedding over other quantum embedding approaches.
△ Less
Submitted 22 November, 2021; v1 submitted 6 October, 2021;
originally announced October 2021.
-
Reverse strain-induced snake states in graphene nanoribbons
Authors:
Cheng-Yi Zuo,
Junjie Qi,
Tian-Lun Lu,
Zhi-qiang Bao,
Yan Li
Abstract:
Strain can tailor the band structures and properties of graphene nanoribbons (GNRs) with the well-known emergent pseudo-magnetic fields and the corresponding pseudo-Landau levels (pLLs). We design one type of the zigzag GNR (ZGNR) with reverse strains, producing pseudo-magnetic fields with opposite signs in the lower and upper half planes. Therefore, electrons propagate along the interface as "sna…
▽ More
Strain can tailor the band structures and properties of graphene nanoribbons (GNRs) with the well-known emergent pseudo-magnetic fields and the corresponding pseudo-Landau levels (pLLs). We design one type of the zigzag GNR (ZGNR) with reverse strains, producing pseudo-magnetic fields with opposite signs in the lower and upper half planes. Therefore, electrons propagate along the interface as "snake states", experiencing opposite Lorentz forces as they cross the zero field border line. By using the Landauer-Buttiker formalism combined with the nonequilibrium Green's function method, the existence and robustness of the reverse strain-induced snake states are further studied. Furthermore, the realization of long-thought pure valley currents in monolayer graphene systems is also proposed in our device.
△ Less
Submitted 17 April, 2023; v1 submitted 3 October, 2021;
originally announced October 2021.
-
Real and counterfeit cores: how feedback expands halos and disrupts tracers of inner gravitational potential in dwarf galaxies
Authors:
Ethan D. Jahn,
Laura V. Sales,
Federico Marinacci,
Mark Vogelsberger,
Paul Torrey,
Jia Qi,
Aaron Smith,
Hui Li,
Rahul Kannan,
Jan D. Burger,
Jesús Zavala
Abstract:
The tension between the diverging density profiles in Lambda Cold Dark Matter ($Λ$CDM) simulations and the constant-density inner regions of observed galaxies is a long-standing challenge known as the `core-cusp' problem. We demonstrate that the \texttt{SMUGGLE} galaxy formation model implemented in the \textsc{Arepo} moving mesh code forms constant-density cores in idealized dwarf galaxies of…
▽ More
The tension between the diverging density profiles in Lambda Cold Dark Matter ($Λ$CDM) simulations and the constant-density inner regions of observed galaxies is a long-standing challenge known as the `core-cusp' problem. We demonstrate that the \texttt{SMUGGLE} galaxy formation model implemented in the \textsc{Arepo} moving mesh code forms constant-density cores in idealized dwarf galaxies of $M_\star \approx 8 \times 10^7$ M$_{\odot}$ with initially cuspy dark matter halos of $M_{200} \approx 10^{10}$ M$_{\odot}$. Identical initial conditions run with the Springel and Hernquist (2003; SH03) feedback model preserve cuspiness. Literature on the subject has pointed to the low density threshold for star formation, $ρ_\text{th}$, in SH03-like models as an obstacle to baryon-induced core formation. Using a \texttt{SMUGGLE} run with equal $ρ_\text{th}$ to SH03, we demonstrate that core formation can proceed at low density thresholds, indicating that $ρ_\text{th}$ is insufficient on its own to determine whether a galaxy develops a core. We suggest that the ability to resolve a multiphase interstellar medium at sufficiently high densities is a more reliable indicator of core formation than any individual model parameter. In \texttt{SMUGGLE}, core formation is accompanied by large degrees of non-circular motion, with gas rotational velocity profiles that consistently fall below the circular velocity $v_\text{circ} = \sqrt{GM/R}$ out to $\sim 2$ kpc. This may artificially mimic larger core sizes when derived from observable quantities compared to the size measured from the dark matter distribution ($\sim 0.5$ kpc), highlighting the need for careful modeling in the inner regions of dwarfs to infer the true distribution of dark matter.
△ Less
Submitted 30 September, 2021;
originally announced October 2021.
-
Galaxy-Scale Test of General Relativity with Strong Gravitational Lensing
Authors:
Xiao-Hui Liu,
Zhen-Hua Li,
**g-Zhao Qi,
Xin Zhang
Abstract:
Although general relativity (GR) has been precisely tested at the solar system scale, precise tests at a galactic or cosmological scale are still relatively insufficient. Here, in order to test GR at the galactic scale, we use the newly compiled galaxy-scale strong gravitational lensing (SGL) sample to constrain the parameter $γ_{PPN}$ in the parametrized post-Newtonian (PPN) formalism. We employ…
▽ More
Although general relativity (GR) has been precisely tested at the solar system scale, precise tests at a galactic or cosmological scale are still relatively insufficient. Here, in order to test GR at the galactic scale, we use the newly compiled galaxy-scale strong gravitational lensing (SGL) sample to constrain the parameter $γ_{PPN}$ in the parametrized post-Newtonian (PPN) formalism. We employ the Pantheon sample of type Ia supernovae observation to calibrate the distances in the SGL systems using the Gaussian Process method, which avoids the logical problem caused by assuming a cosmological model within GR to determine the distances in the SGL sample. Furthermore, we consider three typical lens models in this work to investigate the influences of the lens mass distributions on the fitting results. We find that the choice of the lens models has a significant impact on the constraints on the PPN parameter $γ_{PPN}$. We use the Bayesian information criterion as an evaluation tool to make a comparison for the fitting results of the three lens models, and we find that the most reliable lens model gives the result of $γ_{PPN}=1.065^{+0.064}_{-0.074}$, which is in good agreement with the prediction of $γ_{PPN}=1$ by GR. As far as we know, our 6.4% constraint result is the best result so far among the recent works using the SGL method.
△ Less
Submitted 16 January, 2022; v1 submitted 6 September, 2021;
originally announced September 2021.
-
Federated Reinforcement Learning: Techniques, Applications, and Open Challenges
Authors:
Jiaju Qi,
Qihao Zhou,
Lei Lei,
Kan Zheng
Abstract:
This paper presents a comprehensive survey of Federated Reinforcement Learning (FRL), an emerging and promising field in Reinforcement Learning (RL). Starting with a tutorial of Federated Learning (FL) and RL, we then focus on the introduction of FRL as a new method with great potential by leveraging the basic idea of FL to improve the performance of RL while preserving data-privacy. According to…
▽ More
This paper presents a comprehensive survey of Federated Reinforcement Learning (FRL), an emerging and promising field in Reinforcement Learning (RL). Starting with a tutorial of Federated Learning (FL) and RL, we then focus on the introduction of FRL as a new method with great potential by leveraging the basic idea of FL to improve the performance of RL while preserving data-privacy. According to the distribution characteristics of the agents in the framework, FRL algorithms can be divided into two categories, i.e. Horizontal Federated Reinforcement Learning (HFRL) and Vertical Federated Reinforcement Learning (VFRL). We provide the detailed definitions of each category by formulas, investigate the evolution of FRL from a technical perspective, and highlight its advantages over previous RL algorithms. In addition, the existing works on FRL are summarized by application fields, including edge computing, communication, control optimization, and attack detection. Finally, we describe and discuss several key research directions that are crucial to solving the open problems within FRL.
△ Less
Submitted 24 October, 2021; v1 submitted 26 August, 2021;
originally announced August 2021.
-
Stable and scalable multistage terahertz-driven particle accelerator
Authors:
Heng Tang,
Lingrong Zhao,
Pengfei Zhu,
Xiao Zou,
Jia Qi,
Ya Cheng,
Jiaqi Qiu,
Xianggang Hu,
Wei Song,
Dao Xiang,
Jie Zhang
Abstract:
Particle accelerators that use electromagnetic fields to increase a charged particle's energy have greatly advanced the development of science and industry since invention. However, the enormous cost and size of conventional radio-frequency accelerators have limited their accessibility. Here we demonstrate a mini-accelerator powered by terahertz pulses with wavelengths 100 times shorter than radio…
▽ More
Particle accelerators that use electromagnetic fields to increase a charged particle's energy have greatly advanced the development of science and industry since invention. However, the enormous cost and size of conventional radio-frequency accelerators have limited their accessibility. Here we demonstrate a mini-accelerator powered by terahertz pulses with wavelengths 100 times shorter than radio-frequency pulses. By injecting a short relativistic electron bunch to a 30-mm-long dielectric-lined waveguide and tuning the frequency of a 20-period terahertz pulse to the phase-velocity-matched value, precise and sustained acceleration for nearly 100% of the electrons is achieved with the beam energy spread essentially unchanged. Furthermore, by accurately controlling the phase of two terahertz pulses, the beam is stably accelerated successively in two dielectric waveguides with close to 100% charge coupling efficiency. Our results demonstrate stable and scalable beam acceleration in a multistage mini-accelerator and pave the way for functioning terahertz-driven high-energy accelerators.
△ Less
Submitted 22 August, 2021;
originally announced August 2021.
-
Anisotropic heat conduction of coherently transported phonons in InGaO3(ZnO)m single crystal films with superlattice structures
Authors:
Hai Jun Cho,
Yuzhang Wu,
Youngha Kwon,
Jiajun Qi,
Yuna Kim,
Keiji Saito,
Hiromichi Ohta
Abstract:
Superlattices provide a great platform for studying coherent transportation of low-frequency phonons, which are the main issues in mastering the manipulation of heat conduction. Studies have shown that the dominating characteristics in the thermal conductivity of superlattice can be adjusted between wave-like and particle-like phonon properties depending on the superlattice period. However, the ph…
▽ More
Superlattices provide a great platform for studying coherent transportation of low-frequency phonons, which are the main issues in mastering the manipulation of heat conduction. Studies have shown that the dominating characteristics in the thermal conductivity of superlattice can be adjusted between wave-like and particle-like phonon properties depending on the superlattice period. However, the phonon coherence length and the phonon mean free path from Umklapp processes have not been defined in one superlattice system, and the transition from wave-like and particle-like behavior is not clear to date despite the extensive research efforts. In this study, we use InGaO3(ZnO)m (m = integer) single crystal films with superlattice structure to experimentally characterize the phonon coherence length as well as the Umklapp mean free path in one system. According to the results, the nature of heat conduction in superlattice can change in three different ways depending on the ratio between the phonon coherence length and the superlattice period. We also discuss the role of the phonon characteristic lengths in the heat conduction of superlattices and its anisotropy.
△ Less
Submitted 10 August, 2021;
originally announced August 2021.
-
Extremely low-energy collective modes in a quasi-one-dimensional system
Authors:
Z. X. Wei,
S. Zhang,
Y. L. Su,
L. Cheng,
H. D. Zhou,
Z. Jiang,
H. Weng,
J. Qi
Abstract:
We have investigated the quasiparticle dynamics and collective excitations in the quasi-one-dimensional material ZrTe$_5$ using ultrafast optical pump-probe spectroscopy. Our time-domain results reveal two coherent oscillations having extremely low energies of $\hbarω_1\sim$0.33 meV (0.08 THz) and $\hbarω_2\sim$1.9 meV (0.45 THz), which are softened as the temperature approaches two different crit…
▽ More
We have investigated the quasiparticle dynamics and collective excitations in the quasi-one-dimensional material ZrTe$_5$ using ultrafast optical pump-probe spectroscopy. Our time-domain results reveal two coherent oscillations having extremely low energies of $\hbarω_1\sim$0.33 meV (0.08 THz) and $\hbarω_2\sim$1.9 meV (0.45 THz), which are softened as the temperature approaches two different critical temperatures ($\sim$54 K and $\sim$135 K). We attribute these two collective excitations to the amplitude mode of charge density wave instabilities in ZrTe$_5$ with tremendously small nesting wave vectors. Furthermore, scattering with the $\hbarω_2$ mode may result in a peculiar quasiparticle decay process with a timescale of $\sim$1-2 ps below the transition temperature $T^*$ ($\sim$135 K). Our findings provide pivotal information for studying the fluctuating order parameters and their associated quasiparticle dynamics in various low-dimensional topological systems and other materials.
△ Less
Submitted 27 July, 2021;
originally announced July 2021.
-
SkyCell: A Space-Pruning Based Parallel Skyline Algorithm
Authors:
Chuanwen Li,
Yu Gu,
Jianzhong Qi,
Ge Yu
Abstract:
Skyline computation is an essential database operation that has many applications in multi-criteria decision making scenarios such as recommender systems. Existing algorithms have focused on checking point domination, which lack efficiency over large datasets. We propose a grid-based structure that enables grid cell domination checks. We show that only a small constant number of cells need to be c…
▽ More
Skyline computation is an essential database operation that has many applications in multi-criteria decision making scenarios such as recommender systems. Existing algorithms have focused on checking point domination, which lack efficiency over large datasets. We propose a grid-based structure that enables grid cell domination checks. We show that only a small constant number of cells need to be checked which is independent from the number of data points. Our structure also enables parallel processing. We thus obtain a highly efficient parallel skyline algorithm named SkyCell, taking advantage of the parallelization power of graphics processing units. Experimental results confirm the effectiveness and efficiency of SkyCell -- it outperforms state-of-the-art algorithms consistently and by up to over two orders of magnitude in the computation time.
△ Less
Submitted 21 July, 2021;
originally announced July 2021.
-
Revealing complex optical phenomena through vectorial metrics
Authors:
Chao He,
**tao Chang,
Patrick S. Salter,
Yuanxing Shen,
Ben Dai,
Pengcheng Li,
Yihan **,
Samlan Chandran Thodika,
Mengmeng Li,
Aziz Tariq,
**gyu Wang,
Jacopo Antonello,
Yang Dong,
Ji Qi,
Jianyu Lin,
Honghui He,
Daniel S. Elson,
Min Zhang,
Hui Ma,
Martin J. Booth
Abstract:
Advances in vectorial polarisation-resolved imaging are bringing new capabilities to applications ranging from fundamental physics through to clinical diagnosis. Imaging polarimetry requires determination of the Mueller matrix (MM) at every point, providing a complete description of an object's vectorial properties. Despite forming a comprehensive representation, the MM does not usually provide ea…
▽ More
Advances in vectorial polarisation-resolved imaging are bringing new capabilities to applications ranging from fundamental physics through to clinical diagnosis. Imaging polarimetry requires determination of the Mueller matrix (MM) at every point, providing a complete description of an object's vectorial properties. Despite forming a comprehensive representation, the MM does not usually provide easily-interpretable information about the object's internal structure. Certain simpler vectorial metrics are derived from subsets of the MM elements. These metrics permit extraction of signatures that provide direct indicators of hidden optical properties of complex systems, while featuring an intriguing asymmetry about what information can or cannot be inferred via these metrics. We harness such characteristics to reveal the spin-Hall effect of light, infer microscopic structure within laser-written photonic waveguides, and conduct rapid pathological diagnosis through analysis of healthy and cancerous tissue. This provides new insight for the broader usage of such asymmetric inferred vectorial information.
△ Less
Submitted 21 July, 2021; v1 submitted 20 July, 2021;
originally announced July 2021.
-
Delay-Compensated Distributed PDE Control of Traffic with Connected/Automated Vehicles
Authors:
Jie Qi,
Shurong Mo,
Miroslav Krstic
Abstract:
We develop an input delay-compensating design for stabilization of an Aw-Rascle-Zhang (ARZ) traffic model in congested regime which is governed by a $2\times 2$ first-order hyperbolic nonlinear PDE. The traffic flow consists of both adaptive cruise control-equipped (ACC-equipped) and manually-driven vehicles. The control input is the time gap of ACC-equipped and connected vehicles, which is subjec…
▽ More
We develop an input delay-compensating design for stabilization of an Aw-Rascle-Zhang (ARZ) traffic model in congested regime which is governed by a $2\times 2$ first-order hyperbolic nonlinear PDE. The traffic flow consists of both adaptive cruise control-equipped (ACC-equipped) and manually-driven vehicles. The control input is the time gap of ACC-equipped and connected vehicles, which is subject to delays resulting from communication lag. For the linearized system, a novel three-branch bakcstep** transformation with explicit kernel functions is introduced to compensate the input delay. The transformation is proved æto be bounded, continuous and invertible, with explicit inverse transformation derived. Based on the transformation, we obtain the explicit predictor-feedback controller. We prove exponential stability of the closed-loop system with the delay compensator in $L_2$ norm. The performance improvement of the closed-loop system under the proposed controller is illustrated in simulation.
△ Less
Submitted 2 September, 2022; v1 submitted 19 July, 2021;
originally announced July 2021.
-
HAT: Hierarchical Aggregation Transformers for Person Re-identification
Authors:
Guowen Zhang,
**** Zhang,
**qing Qi,
Huchuan Lu
Abstract:
Recently, with the advance of deep Convolutional Neural Networks (CNNs), person Re-Identification (Re-ID) has witnessed great success in various applications. However, with limited receptive fields of CNNs, it is still challenging to extract discriminative representations in a global view for persons under non-overlapped cameras. Meanwhile, Transformers demonstrate strong abilities of modeling lon…
▽ More
Recently, with the advance of deep Convolutional Neural Networks (CNNs), person Re-Identification (Re-ID) has witnessed great success in various applications. However, with limited receptive fields of CNNs, it is still challenging to extract discriminative representations in a global view for persons under non-overlapped cameras. Meanwhile, Transformers demonstrate strong abilities of modeling long-range dependencies for spatial and sequential data. In this work, we take advantages of both CNNs and Transformers, and propose a novel learning framework named Hierarchical Aggregation Transformer (HAT) for image-based person Re-ID with high performance. To achieve this goal, we first propose a Deeply Supervised Aggregation (DSA) to recurrently aggregate hierarchical features from CNN backbones. With multi-granularity supervisions, the DSA can enhance multi-scale features for person retrieval, which is very different from previous methods. Then, we introduce a Transformer-based Feature Calibration (TFC) to integrate low-level detail information as the global prior for high-level semantic information. The proposed TFC is inserted to each level of hierarchical features, resulting in great performance improvements. To our best knowledge, this work is the first to take advantages of both CNNs and Transformers for image-based person Re-ID. Comprehensive experiments on four large-scale Re-ID benchmarks demonstrate that our method shows better results than several state-of-the-art methods. The code is released at https://github.com/AI-Zhpp/HAT.
△ Less
Submitted 13 July, 2021; v1 submitted 13 July, 2021;
originally announced July 2021.
-
A Flexible Multi-Task Model for BERT Serving
Authors:
Tianwen Wei,
Jianwei Qi,
Shenghuan He
Abstract:
In this demonstration, we present an efficient BERT-based multi-task (MT) framework that is particularly suitable for iterative and incremental development of the tasks. The proposed framework is based on the idea of partial fine-tuning, i.e. only fine-tune some top layers of BERT while keep the other layers frozen. For each task, we train independently a single-task (ST) model using partial fine-…
▽ More
In this demonstration, we present an efficient BERT-based multi-task (MT) framework that is particularly suitable for iterative and incremental development of the tasks. The proposed framework is based on the idea of partial fine-tuning, i.e. only fine-tune some top layers of BERT while keep the other layers frozen. For each task, we train independently a single-task (ST) model using partial fine-tuning. Then we compress the task-specific layers in each ST model using knowledge distillation. Those compressed ST models are finally merged into one MT model so that the frozen layers of the former are shared across the tasks. We exemplify our approach on eight GLUE tasks, demonstrating that it is able to achieve both strong performance and efficiency. We have implemented our method in the utterance understanding system of XiaoAI, a commercial AI assistant developed by Xiaomi. We estimate that our model reduces the overall serving cost by 86%.
△ Less
Submitted 8 March, 2022; v1 submitted 12 July, 2021;
originally announced July 2021.
-
A Multi-task Mean Teacher for Semi-supervised Facial Affective Behavior Analysis
Authors:
Lingfeng Wang,
Shisen Wang,
** Qi,
Kenji Suzuki
Abstract:
Affective Behavior Analysis is an important part in human-computer interaction. Existing multi-task affective behavior recognition methods suffer from the problem of incomplete labeled datasets. To tackle this problem, this paper presents a semi-supervised model with a mean teacher framework to leverage additional unlabeled data. To be specific, a multi-task model is proposed to learn three differ…
▽ More
Affective Behavior Analysis is an important part in human-computer interaction. Existing multi-task affective behavior recognition methods suffer from the problem of incomplete labeled datasets. To tackle this problem, this paper presents a semi-supervised model with a mean teacher framework to leverage additional unlabeled data. To be specific, a multi-task model is proposed to learn three different kinds of facial affective representations simultaneously. After that, the proposed model is assigned to be student and teacher networks. When training with unlabeled data, the teacher network is employed to predict pseudo labels for student network training, which allows it to learn from unlabeled data. Experimental results showed that our proposed method achieved much better performance than baseline model and ranked 4th in both competition track 1 and track 2, and 6th in track 3, which verifies that the proposed network can effectively learn from incomplete datasets.
△ Less
Submitted 13 August, 2021; v1 submitted 9 July, 2021;
originally announced July 2021.
-
Atomic-scale imaging of CH3NH3PbI3 structure and its decomposition pathway
Authors:
Shulin Chen,
Changwei Wu,
Bo Han,
Zhetong Liu,
Zhou Mi,
Weizhong Hao,
**** Zhao,
Xiao Wang,
Qing Zhang,
Kaihui Liu,
Junlei Qi,
Jian Cao,
Jicai Feng,
Dapeng Yu,
Jiangyu Li,
Peng Gao
Abstract:
Understanding the atomic structure and structural instability of organic-inorganic hybrid perovskites is the key to appreciate their remarkable photoelectric properties and failure mechanism. Here, using low-dose imaging technique by direct-detection electron-counting camera in transmission electron microscope, we investigate the atomic structure and decomposition pathway of CH3NH3PbI3 (MAPbI3) at…
▽ More
Understanding the atomic structure and structural instability of organic-inorganic hybrid perovskites is the key to appreciate their remarkable photoelectric properties and failure mechanism. Here, using low-dose imaging technique by direct-detection electron-counting camera in transmission electron microscope, we investigate the atomic structure and decomposition pathway of CH3NH3PbI3 (MAPbI3) at the atomic scale. We successfully image the atomic structure of perovskite in real space under ultra-low electron dose condition, and observe a two-step decomposition process, i.e. initial loss of MA followed by the collapse of perovskite structure into 6H-PbI2 with their critical threshold dose also determined. Interestingly, an intermediate phase (MA0.5PbI3) with locally ordered vacancies can robustly exist before perovskite collapses, enlightening strategies for prevention and recovery of perovskite structure during degradation. Associated with structure evolution, the bandgap gradually increases from ~1.6 eV to ~2.1 eV, and it is found that both C-N and N-H bonds can be destroyed under irradiation, releasing NH3 and leaving hydrocarbons. These findings enhance our understanding of the photoelectric properties and failure mechanism of MAPbI3, providing potential strategy into material optimization.
△ Less
Submitted 22 June, 2021;
originally announced June 2021.
-
Direct Reconstruction of Linear Parametric Images from Dynamic PET Using Nonlocal Deep Image Prior
Authors:
Kuang Gong,
Ciprian Catana,
**yi Qi,
Quanzheng Li
Abstract:
Direct reconstruction methods have been developed to estimate parametric images directly from the measured PET sinograms by combining the PET imaging model and tracer kinetics in an integrated framework. Due to limited counts received, signal-to-noise-ratio (SNR) and resolution of parametric images produced by direct reconstruction frameworks are still limited. Recently supervised deep learning me…
▽ More
Direct reconstruction methods have been developed to estimate parametric images directly from the measured PET sinograms by combining the PET imaging model and tracer kinetics in an integrated framework. Due to limited counts received, signal-to-noise-ratio (SNR) and resolution of parametric images produced by direct reconstruction frameworks are still limited. Recently supervised deep learning methods have been successfully applied to medical imaging denoising/reconstruction when large number of high-quality training labels are available. For static PET imaging, high-quality training labels can be acquired by extending the scanning time. However, this is not feasible for dynamic PET imaging, where the scanning time is already long enough. In this work, we proposed an unsupervised deep learning framework for direct parametric reconstruction from dynamic PET, which was tested on the Patlak model and the relative equilibrium Logan model. The patient's anatomical prior image, which is readily available from PET/CT or PET/MR scans, was supplied as the network input to provide a manifold constraint, and also utilized to construct a kernel layer to perform non-local feature denoising. The linear kinetic model was embedded in the network structure as a 1x1 convolution layer. The training objective function was based on the PET statistical model. Evaluations based on dynamic datasets of 18F-FDG and 11C-PiB tracers show that the proposed framework can outperform the traditional and the kernel method-based direct reconstruction methods.
△ Less
Submitted 18 June, 2021;
originally announced June 2021.
-
Speech Disorder Classification Using Extended Factorized Hierarchical Variational Auto-encoders
Authors:
**zi Qi,
Hugo Van hamme
Abstract:
Objective speech disorder classification for speakers with communication difficulty is desirable for diagnosis and administering therapy. With the current state of speech technology, it is evident to propose neural networks for this application. But neural network model training is hampered by a lack of labeled disordered speech data. In this research, we apply an extended version of Factorized Hi…
▽ More
Objective speech disorder classification for speakers with communication difficulty is desirable for diagnosis and administering therapy. With the current state of speech technology, it is evident to propose neural networks for this application. But neural network model training is hampered by a lack of labeled disordered speech data. In this research, we apply an extended version of Factorized Hierarchical Variational Auto-encoders (FHVAE) for representation learning on disordered speech. The FHVAE model extracts both content-related and sequence-related latent variables from speech data, and we utilize the extracted variables to explore how disorder type information is represented in the latent variables. For better classification performance, the latent variables are aggregated at the word and sentence level. We show that an extension of the FHVAE model succeeds in the better disentanglement of the content-related and sequence-related related representations, but both representations are still required for best results on disorder type classification.
△ Less
Submitted 14 June, 2021;
originally announced June 2021.
-
Sub-trajectory Similarity Join with Obfuscation
Authors:
Yanchuan Chang,
Jianzhong Qi,
Egemen Tanin,
Xingjun Ma,
Hanan Samet
Abstract:
User trajectory data is becoming increasingly accessible due to the prevalence of GPS-equipped devices such as smartphones. Many existing studies focus on querying trajectories that are similar to each other in their entirety. We observe that trajectories partially similar to each other contain useful information about users' travel patterns which should not be ignored. Such partially similar traj…
▽ More
User trajectory data is becoming increasingly accessible due to the prevalence of GPS-equipped devices such as smartphones. Many existing studies focus on querying trajectories that are similar to each other in their entirety. We observe that trajectories partially similar to each other contain useful information about users' travel patterns which should not be ignored. Such partially similar trajectories are critical in applications such as epidemic contact tracing. We thus propose to query trajectories that are within a given distance range from each other for a given period of time. We formulate this problem as a sub-trajectory similarity join query named as the STS-Join. We further propose a distributed index structure and a query algorithm for STS-Join, where users retain their raw location data and only send obfuscated trajectories to a server for query processing. This helps preserve user location privacy which is vital when dealing with such data. Theoretical analysis and experiments on real data confirm the effectiveness and the efficiency of our proposed index structure and query algorithm.
△ Less
Submitted 7 June, 2021;
originally announced June 2021.
-
Contour Moments Based Manipulation of Composite Rigid-Deformable Objects with Finite Time Model Estimation and Shape/Position Control
Authors:
Jiaming Qi,
Guangfu Ma,
Jihong Zhu,
Peng Zhou,
Yueyong Lyu,
Haibo Zhang,
David Navarro-Alarcon
Abstract:
The robotic manipulation of composite rigid-deformable objects (i.e. those with mixed non-homogeneous stiffness properties) is a challenging problem with clear practical applications that, despite the recent progress in the field, it has not been sufficiently studied in the literature. To deal with this issue, in this paper we propose a new visual servoing method that has the capability to manipul…
▽ More
The robotic manipulation of composite rigid-deformable objects (i.e. those with mixed non-homogeneous stiffness properties) is a challenging problem with clear practical applications that, despite the recent progress in the field, it has not been sufficiently studied in the literature. To deal with this issue, in this paper we propose a new visual servoing method that has the capability to manipulate this broad class of objects (which varies from soft to rigid) with the same adaptive strategy. To quantify the object's infinite-dimensional configuration, our new approach computes a compact feedback vector of 2D contour moments features. A sliding mode control scheme is then designed to simultaneously ensure the finite-time convergence of both the feedback shape error and the model estimation error. The stability of the proposed framework (including the boundedness of all the signals) is rigorously proved with Lyapunov theory. Detailed simulations and experiments are presented to validate the effectiveness of the proposed approach. To the best of the author's knowledge, this is the first time that contour moments along with finite-time control have been used to solve this difficult manipulation problem.
△ Less
Submitted 4 June, 2021;
originally announced June 2021.
-
You Only Look at One Sequence: Rethinking Transformer in Vision through Object Detection
Authors:
Yuxin Fang,
Bencheng Liao,
Xinggang Wang,
Jiemin Fang,
Jiyang Qi,
Rui Wu,
Jianwei Niu,
Wenyu Liu
Abstract:
Can Transformer perform 2D object- and region-level recognition from a pure sequence-to-sequence perspective with minimal knowledge about the 2D spatial structure? To answer this question, we present You Only Look at One Sequence (YOLOS), a series of object detection models based on the vanilla Vision Transformer with the fewest possible modifications, region priors, as well as inductive biases of…
▽ More
Can Transformer perform 2D object- and region-level recognition from a pure sequence-to-sequence perspective with minimal knowledge about the 2D spatial structure? To answer this question, we present You Only Look at One Sequence (YOLOS), a series of object detection models based on the vanilla Vision Transformer with the fewest possible modifications, region priors, as well as inductive biases of the target task. We find that YOLOS pre-trained on the mid-sized ImageNet-1k dataset only can already achieve quite competitive performance on the challenging COCO object detection benchmark, e.g., YOLOS-Base directly adopted from BERT-Base architecture can obtain 42.0 box AP on COCO val. We also discuss the impacts as well as limitations of current pre-train schemes and model scaling strategies for Transformer in vision through YOLOS. Code and pre-trained models are available at https://github.com/hustvl/YOLOS.
△ Less
Submitted 26 October, 2021; v1 submitted 1 June, 2021;
originally announced June 2021.
-
DiaKG: an Annotated Diabetes Dataset for Medical Knowledge Graph Construction
Authors:
Dejie Chang,
Mosha Chen,
Chaozhen Liu,
Li** Liu,
Dongdong Li,
Wei Li,
Fei Kong,
Bangchang Liu,
Xiaobin Luo,
Ji Qi,
Qiao **,
Bin Xu
Abstract:
Knowledge Graph has been proven effective in modeling structured information and conceptual knowledge, especially in the medical domain. However, the lack of high-quality annotated corpora remains a crucial problem for advancing the research and applications on this task. In order to accelerate the research for domain-specific knowledge graphs in the medical domain, we introduce DiaKG, a high-qual…
▽ More
Knowledge Graph has been proven effective in modeling structured information and conceptual knowledge, especially in the medical domain. However, the lack of high-quality annotated corpora remains a crucial problem for advancing the research and applications on this task. In order to accelerate the research for domain-specific knowledge graphs in the medical domain, we introduce DiaKG, a high-quality Chinese dataset for Diabetes knowledge graph, which contains 22,050 entities and 6,890 relations in total. We implement recent typical methods for Named Entity Recognition and Relation Extraction as a benchmark to evaluate the proposed dataset thoroughly. Empirical results show that the DiaKG is challenging for most existing methods and further analysis is conducted to discuss future research direction for improvements. We hope the release of this dataset can assist the construction of diabetes knowledge graphs and facilitate AI-based applications.
△ Less
Submitted 21 September, 2021; v1 submitted 31 May, 2021;
originally announced May 2021.
-
Fast, Accurate and Interpretable Time Series Classification Through Randomization
Authors:
Nestor Cabello,
Elham Naghizade,
Jianzhong Qi,
Lars Kulik
Abstract:
Time series classification (TSC) aims to predict the class label of a given time series, which is critical to a rich set of application areas such as economics and medicine. State-of-the-art TSC methods have mostly focused on classification accuracy and efficiency, without considering the interpretability of their classifications, which is an important property required by modern applications such…
▽ More
Time series classification (TSC) aims to predict the class label of a given time series, which is critical to a rich set of application areas such as economics and medicine. State-of-the-art TSC methods have mostly focused on classification accuracy and efficiency, without considering the interpretability of their classifications, which is an important property required by modern applications such as appliance modeling and legislation such as the European General Data Protection Regulation. To address this gap, we propose a novel TSC method - the Randomized-Supervised Time Series Forest (r-STSF). r-STSF is highly efficient, achieves state-of-the-art classification accuracy and enables interpretability. r-STSF takes an efficient interval-based approach to classify time series according to aggregate values of discriminatory sub-series (intervals). To achieve state-of-the-art accuracy, r-STSF builds an ensemble of randomized trees using the discriminatory sub-series. It uses four time series representations, nine aggregation functions and a supervised binary-inspired search combined with a feature ranking metric to identify highly discriminatory sub-series. The discriminatory sub-series enable interpretable classifications. Experiments on extensive datasets show that r-STSF achieves state-of-the-art accuracy while being orders of magnitude faster than most existing TSC methods. It is the only classifier from the state-of-the-art group that enables interpretability. Our findings also highlight that r-STSF is the best TSC method when classifying complex time series datasets.
△ Less
Submitted 31 May, 2021;
originally announced May 2021.
-
Evolution of Berry curvature and reentrant quantum anomalous Hall effect in an intrinsic magnetic topological insulator
Authors:
Chui-Zhen Chen,
Junjie Qi,
Dong-Hui Xu,
X. C. Xie
Abstract:
Recently, the magnetic topological insulator MnBi$_2$Te$_4$ emerged as a competitive platform to realize quantum anomalous Hall (QAH) states. We report a Berry-curvature splitting mechanism to realize the QAH effect in the disordered magnetic TI multilayers when switching from an antiferromagnetic order to a ferromagnetic order. We reveal that the splitting of spin-resolved Berry curvature, origin…
▽ More
Recently, the magnetic topological insulator MnBi$_2$Te$_4$ emerged as a competitive platform to realize quantum anomalous Hall (QAH) states. We report a Berry-curvature splitting mechanism to realize the QAH effect in the disordered magnetic TI multilayers when switching from an antiferromagnetic order to a ferromagnetic order. We reveal that the splitting of spin-resolved Berry curvature, originating from the separation of the mobility edge during the magnetic switching, can give rise to a QAH insulator even \emph{without} closing the band gap. We present a global phase diagram, and also provide a phenomenological picture to elucidate the Berry curvature splitting mechanism by the evolution of topological charges. At last, we predict that the Berry curvature splitting mechanism will lead to a reentrant QAH effect, which can be detected by tuning gate voltage. Our theory will be instructive for the studies of the QAH effect in MnBi$_2$Te$_4$ in future experiments.
△ Less
Submitted 31 October, 2021; v1 submitted 12 May, 2021;
originally announced May 2021.
-
Federated Learning with Fair Averaging
Authors:
Zheng Wang,
Xiaoliang Fan,
Jianzhong Qi,
Chenglu Wen,
Cheng Wang,
Rongshan Yu
Abstract:
Fairness has emerged as a critical problem in federated learning (FL). In this work, we identify a cause of unfairness in FL -- conflicting gradients with large differences in the magnitudes. To address this issue, we propose the federated fair averaging (FedFV) algorithm to mitigate potential conflicts among clients before averaging their gradients. We first use the cosine similarity to detect gr…
▽ More
Fairness has emerged as a critical problem in federated learning (FL). In this work, we identify a cause of unfairness in FL -- conflicting gradients with large differences in the magnitudes. To address this issue, we propose the federated fair averaging (FedFV) algorithm to mitigate potential conflicts among clients before averaging their gradients. We first use the cosine similarity to detect gradient conflicts, and then iteratively eliminate such conflicts by modifying both the direction and the magnitude of the gradients. We further show the theoretical foundation of FedFV to mitigate the issue conflicting gradients and converge to Pareto stationary solutions. Extensive experiments on a suite of federated datasets confirm that FedFV compares favorably against state-of-the-art methods in terms of fairness, accuracy and efficiency. The source code is available at https://github.com/WwZzz/easyFL.
△ Less
Submitted 16 June, 2021; v1 submitted 30 April, 2021;
originally announced April 2021.
-
WGCN: Graph Convolutional Networks with Weighted Structural Features
Authors:
Yunxiang Zhao,
Jianzhong Qi,
Qingwei Liu,
Rui Zhang
Abstract:
Graph structural information such as topologies or connectivities provides valuable guidance for graph convolutional networks (GCNs) to learn nodes' representations. Existing GCN models that capture nodes' structural information weight in- and out-neighbors equally or differentiate in- and out-neighbors globally without considering nodes' local topologies. We observe that in- and out-neighbors con…
▽ More
Graph structural information such as topologies or connectivities provides valuable guidance for graph convolutional networks (GCNs) to learn nodes' representations. Existing GCN models that capture nodes' structural information weight in- and out-neighbors equally or differentiate in- and out-neighbors globally without considering nodes' local topologies. We observe that in- and out-neighbors contribute differently for nodes with different local topologies. To explore the directional structural information for different nodes, we propose a GCN model with weighted structural features, named WGCN. WGCN first captures nodes' structural fingerprints via a direction and degree aware Random Walk with Restart algorithm, where the walk is guided by both edge direction and nodes' in- and out-degrees. Then, the interactions between nodes' structural fingerprints are used as the weighted node structural features. To further capture nodes' high-order dependencies and graph geometry, WGCN embeds graphs into a latent space to obtain nodes' latent neighbors and geometrical relationships. Based on nodes' geometrical relationships in the latent space, WGCN differentiates latent, in-, and out-neighbors with an attention-based geometrical aggregation. Experiments on transductive node classification tasks show that WGCN outperforms the baseline models consistently by up to 17.07% in terms of accuracy on five benchmark datasets.
△ Less
Submitted 21 July, 2021; v1 submitted 28 April, 2021;
originally announced April 2021.
-
Thermonuclear fusion and generalized Gamow penetrability factor in intense laser fields
Authors:
**tao Qi
Abstract:
A theoretical study on thermonuclear fusion including deuterium-tritium (D-T) fusion and D-$^{3}$He fusion in intense laser fields has been shown in this article. With the laser fields expected to be available in the near future, some quantitative results for the laser-induced modifications to the cross-sections are given. It is reported that the cross-sections are more sensitive to the external l…
▽ More
A theoretical study on thermonuclear fusion including deuterium-tritium (D-T) fusion and D-$^{3}$He fusion in intense laser fields has been shown in this article. With the laser fields expected to be available in the near future, some quantitative results for the laser-induced modifications to the cross-sections are given. It is reported that the cross-sections are more sensitive to the external laser fields at the lower energies. An explicit generalized form of the Gamow penetrability factor is given for the predictions of the laser-induced effects for some other similar nuclear processes.
△ Less
Submitted 4 March, 2021;
originally announced April 2021.
-
Bidirectional Multiscale Feature Aggregation for Speaker Verification
Authors:
Jiajun Qi,
Wu Guo,
Bin Gu
Abstract:
In this paper, we propose a novel bidirectional multiscale feature aggregation (BMFA) network with attentional fusion modules for text-independent speaker verification. The feature maps from different stages of the backbone network are iteratively combined and refined in both a bottom-up and top-down manner. Furthermore, instead of simple concatenation or element-wise addition of feature maps from…
▽ More
In this paper, we propose a novel bidirectional multiscale feature aggregation (BMFA) network with attentional fusion modules for text-independent speaker verification. The feature maps from different stages of the backbone network are iteratively combined and refined in both a bottom-up and top-down manner. Furthermore, instead of simple concatenation or element-wise addition of feature maps from different stages, an attentional fusion module is designed to compute the fusion weights. Experiments are conducted on the NIST SRE16 and VoxCeleb1 datasets. The experimental results demonstrate the effectiveness of the bidirectional aggregation strategy and show that the proposed attentional fusion module can further improve the performance.
△ Less
Submitted 31 March, 2021;
originally announced April 2021.
-
A Benchmark and Comprehensive Survey on Knowledge Graph Entity Alignment via Representation Learning
Authors:
Rui Zhang,
Bayu Distiawan Trisedy,
Miao Li,
Yong Jiang,
Jianzhong Qi
Abstract:
In the last few years, the interest in knowledge bases has grown exponentially in both the research community and the industry due to their essential role in AI applications. Entity alignment is an important task for enriching knowledge bases. This paper provides a comprehensive tutorial-type survey on representative entity alignment techniques that use the new approach of representation learning.…
▽ More
In the last few years, the interest in knowledge bases has grown exponentially in both the research community and the industry due to their essential role in AI applications. Entity alignment is an important task for enriching knowledge bases. This paper provides a comprehensive tutorial-type survey on representative entity alignment techniques that use the new approach of representation learning. We present a framework for capturing the key characteristics of these techniques, propose two datasets to address the limitation of existing benchmark datasets, and conduct extensive experiments using the proposed datasets. The framework gives a clear picture of how the techniques work. The experiments yield important results about the empirical performance of the techniques and how various factors affect the performance. One important observation not stressed by previous work is that techniques making good use of attribute triples and relation predicates as features stand out as winners.
△ Less
Submitted 5 May, 2022; v1 submitted 28 March, 2021;
originally announced March 2021.
-
Accurate Assessment via Process Data
Authors:
Susu Zhang,
Zhi Wang,
Jitong Qi,
**gchen Liu,
Zhiliang Ying
Abstract:
Accurate assessment of students' ability is the key task of a test. Assessments based on final responses are the standard. As the infrastructure advances, substantially more information is observed. One of such instances is the process data that is collected by computer-based interactive items, which contain a student's detailed interactive processes. In this paper, we show both theoretically and…
▽ More
Accurate assessment of students' ability is the key task of a test. Assessments based on final responses are the standard. As the infrastructure advances, substantially more information is observed. One of such instances is the process data that is collected by computer-based interactive items, which contain a student's detailed interactive processes. In this paper, we show both theoretically and empirically that appropriately including such information in the assessment will substantially improve relevant assessment precision. The precision is measured empirically by out-of-sample test reliability.
△ Less
Submitted 4 October, 2021; v1 submitted 27 March, 2021;
originally announced March 2021.