-
Responsible Urban Intelligence: Towards a Research Agenda
Authors:
Rui Cao,
Qi-Li Gao,
Guo** Qiu
Abstract:
Acceleration of urbanisation is posing great challenges to sustainable development. Growing accessibility to big data and artificial intelligence (AI) technologies have revolutionised many fields and offered great potential for addressing pressing urban problems. However, using these technologies without explicitly considering responsibilities would bring new societal and environmental issues. To…
▽ More
Acceleration of urbanisation is posing great challenges to sustainable development. Growing accessibility to big data and artificial intelligence (AI) technologies have revolutionised many fields and offered great potential for addressing pressing urban problems. However, using these technologies without explicitly considering responsibilities would bring new societal and environmental issues. To maximise the benefits of big data and AI while minimising potential issues, we envisage a conceptual framework of Responsible Urban Intelligence (RUI) and advocate an agenda for action. We first define RUI as consisting of three major components including urban problems, enabling technologies, and responsibilities; then introduce transparency, fairness, and eco-friendliness as the three dimensions of responsibilities which naturally link with the human, space, and time dimensions of cities; and further develop a four-stage implementation framework for responsibilities as consisting of solution design, data preparation, model building, and practical application; and finally present a research agenda for RUI addressing challenging issues including data and model transparency, tension between performance and fairness, and solving urban problems in an eco-friendly manner.
△ Less
Submitted 4 September, 2023; v1 submitted 7 August, 2022;
originally announced August 2022.
-
LRIP-Net: Low-Resolution Image Prior based Network for Limited-Angle CT Reconstruction
Authors:
Qifeng Gao,
Rui Ding,
Linyuan Wang,
Bin Xue,
Yu** Duan
Abstract:
In the practical applications of computed tomography imaging, the projection data may be acquired within a limited-angle range and corrupted by noises due to the limitation of scanning conditions. The noisy incomplete projection data results in the ill-posedness of the inverse problems. In this work, we theoretically verify that the low-resolution reconstruction problem has better numerical stabil…
▽ More
In the practical applications of computed tomography imaging, the projection data may be acquired within a limited-angle range and corrupted by noises due to the limitation of scanning conditions. The noisy incomplete projection data results in the ill-posedness of the inverse problems. In this work, we theoretically verify that the low-resolution reconstruction problem has better numerical stability than the high-resolution problem. In what follows, a novel low-resolution image prior based CT reconstruction model is proposed to make use of the low-resolution image to improve the reconstruction quality. More specifically, we build up a low-resolution reconstruction problem on the down-sampled projection data, and use the reconstructed low-resolution image as prior knowledge for the original limited-angle CT problem. We solve the constrained minimization problem by the alternating direction method with all subproblems approximated by the convolutional neural networks. Numerical experiments demonstrate that our double-resolution network outperforms both the variational method and popular learning-based reconstruction methods on noisy limited-angle reconstruction problems.
△ Less
Submitted 30 July, 2022;
originally announced August 2022.
-
Compton scattering for photon and gluon in fixed-target collisions at AFTER@LHC
Authors:
Gongming Yu,
Runlong Liu,
Yanbing Cai,
Quangui Gao,
Qiang Hu
Abstract:
We calculate the Compton scattering for photon and gluon with the Klein-Nishina formula in fixed-target collisions by using the proton and lead beams at AFTER@LHC. In these collisions, we can investigate the particular case of Compton scattering at the partonic level, such as $γq\rightarrow qγ$, $γq\rightarrow qg$, $gq\rightarrow qγ$, and $gq\rightarrow qg$, that can help to check of the equivalen…
▽ More
We calculate the Compton scattering for photon and gluon with the Klein-Nishina formula in fixed-target collisions by using the proton and lead beams at AFTER@LHC. In these collisions, we can investigate the particular case of Compton scattering at the partonic level, such as $γq\rightarrow qγ$, $γq\rightarrow qg$, $gq\rightarrow qγ$, and $gq\rightarrow qg$, that can help to check of the equivalent-photon approximation and understand the dynamics of hadron collisions at high energies, as well as probe the inner hadron structure.
△ Less
Submitted 26 July, 2022;
originally announced July 2022.
-
Flux Variations of Cosmic Ray Air Showers Detected by LHAASO-KM2A During a Thunderstorm on 10 June 2021
Authors:
LHAASO Collaboration,
F. Aharonian,
Q. An,
Axikegu,
L. X. Bai,
Y. X. Bai,
Y. W. Bao,
D. Bastieri,
X. J. Bi,
Y. J. Bi,
J. T. Cai,
Zhe Cao,
Zhen Cao,
J. Chang,
J. F. Chang,
E. S. Chen,
Liang Chen,
Liang Chen,
Long Chen,
M. J. Chen,
M. L. Chen,
S. H. Chen,
S. Z. Chen,
T. L. Chen,
X. J. Chen
, et al. (248 additional authors not shown)
Abstract:
The Large High Altitude Air Shower Observatory (LHAASO) has three sub-arrays, KM2A, WCDA and WFCTA. The flux variations of cosmic ray air showers were studied by analyzing the KM2A data during the thunderstorm on 10 June 2021. The number of shower events that meet the trigger conditions increases significantly in atmospheric electric fields, with maximum fractional increase of 20%. The variations…
▽ More
The Large High Altitude Air Shower Observatory (LHAASO) has three sub-arrays, KM2A, WCDA and WFCTA. The flux variations of cosmic ray air showers were studied by analyzing the KM2A data during the thunderstorm on 10 June 2021. The number of shower events that meet the trigger conditions increases significantly in atmospheric electric fields, with maximum fractional increase of 20%. The variations of trigger rates (increases or decreases) are found to be strongly dependent on the primary zenith angle. The flux of secondary particles increases significantly, following a similar trend with that of the shower events. To better understand the observed behavior, Monte Carlo simulations are performed with CORSIKA and G4KM2A (a code based on GEANT4). We find that the experimental data (in saturated negative fields) are in good agreement with simulations, assuming the presence of a uniform upward electric field of 700 V/cm with a thickness of 1500 m in the atmosphere above the observation level. Due to the acceleration/deceleration and deflection by the atmospheric electric field, the number of secondary particles with energy above the detector threshold is modified, resulting in the changes in shower detection rate.
△ Less
Submitted 6 December, 2022; v1 submitted 25 July, 2022;
originally announced July 2022.
-
Flowsheet synthesis through hierarchical reinforcement learning and graph neural networks
Authors:
Laura Stops,
Roel Leenhouts,
Qinghe Gao,
Artur M. Schweidtmann
Abstract:
Process synthesis experiences a disruptive transformation accelerated by digitization and artificial intelligence. We propose a reinforcement learning algorithm for chemical process design based on a state-of-the-art actor-critic logic. Our proposed algorithm represents chemical processes as graphs and uses graph convolutional neural networks to learn from process graphs. In particular, the graph…
▽ More
Process synthesis experiences a disruptive transformation accelerated by digitization and artificial intelligence. We propose a reinforcement learning algorithm for chemical process design based on a state-of-the-art actor-critic logic. Our proposed algorithm represents chemical processes as graphs and uses graph convolutional neural networks to learn from process graphs. In particular, the graph neural networks are implemented within the agent architecture to process the states and make decisions. Moreover, we implement a hierarchical and hybrid decision-making process to generate flowsheets, where unit operations are placed iteratively as discrete decisions and corresponding design variables are selected as continuous decisions. We demonstrate the potential of our method to design economically viable flowsheets in an illustrative case study comprising equilibrium reactions, azeotropic separation, and recycles. The results show quick learning in discrete, continuous, and hybrid action spaces. Due to the flexible architecture of the proposed reinforcement learning agent, the method is predestined to include large action-state spaces and an interface to process simulators in future research.
△ Less
Submitted 25 July, 2022;
originally announced July 2022.
-
The helical vortex filaments of Ginzburg-Landau system in ${\mathbb R}^3$
Authors:
Lipeng Duan,
Qi Gao,
Jun Yang
Abstract:
We consider the following coupled Ginzburg-Landau system in ${\mathbb R}^3$ \begin{align*} \begin{cases} -ε^2 Δw^+ +\Big[A_+\big(|w^+|^2-{t^+}^2\big)+B\big(|w^-|^2-{t^-}^2\big)\Big]w^+=0, \\[3mm] -ε^2 Δw^- +\Big[A_-\big(|w^-|^2-{t^-}^2\big)+B\big(|w^+|^2-{t^+}^2\big)\Big]w^-=0, \end{cases} \end{align*} where $w=(w^+, w^-)\in \mathbb{C}^2$ and the constant coefficients satisfy…
▽ More
We consider the following coupled Ginzburg-Landau system in ${\mathbb R}^3$ \begin{align*} \begin{cases} -ε^2 Δw^+ +\Big[A_+\big(|w^+|^2-{t^+}^2\big)+B\big(|w^-|^2-{t^-}^2\big)\Big]w^+=0, \\[3mm] -ε^2 Δw^- +\Big[A_-\big(|w^-|^2-{t^-}^2\big)+B\big(|w^+|^2-{t^+}^2\big)\Big]w^-=0, \end{cases} \end{align*} where $w=(w^+, w^-)\in \mathbb{C}^2$ and the constant coefficients satisfy $$ A_+, A_->0,\quad B^2<A_+A_-, \quad t^\pm >0, \quad {t^+}^2+{ t^-}^2=1. $$ If $B<0$, then for every $ε$ small enough, we construct a family of entire solutions $w_ε(\tilde{z}, t)\in \mathbb{C}^2$ in the cylindrical coordinates $(\tilde{z}, t)\in \mathbb{R}^2 \times \mathbb{R}$ for this system via the approach introduced by J. Dávila, M. del Pino, M. Medina and R. Rodiac in {\tt arXiv:1901.02807}. These solutions are $2π$-periodic in $t$ and have multiple interacting vortex helices. The main results are the extensions of the phenomena of interacting helical vortex filaments for the classical (single) Ginzburg-Landau equation in $\mathbb{R}^3$ which has been studied in {\tt arXiv:1901.02807}. Our results negatively answer the Gibbons conjecture \cite{Gibbons conjecture} for the Allen-Cahn equation in Ginzburg-Landau system version, which is an extension of the question originally proposed by H. Brezis.
△ Less
Submitted 25 July, 2022;
originally announced July 2022.
-
PMUSpill: The Counters in Performance Monitor Unit that Leak SGX-Protected Secrets
Authors:
Pengfei Qiu,
Yongqiang Lyu,
Haixia Wang,
Dongsheng Wang,
Chang Liu,
Qiang Gao,
Chunlu Wang,
Rihui Sun,
Gang Qu
Abstract:
Performance Monitor Unit (PMU) is a significant hardware module on the current processors, which counts the events launched by processor into a set of PMU counters. Ideally, the events triggered by instructions that are executed but the results are not successfully committed (transient execution) should not be recorded. However, in this study, we discover that some PMU events triggered by the tran…
▽ More
Performance Monitor Unit (PMU) is a significant hardware module on the current processors, which counts the events launched by processor into a set of PMU counters. Ideally, the events triggered by instructions that are executed but the results are not successfully committed (transient execution) should not be recorded. However, in this study, we discover that some PMU events triggered by the transient execution instructions will actually be recorded by PMU. Based on this, we propose the PMUSpill attack, which enables attackers to maliciously leak the secret data that are loaded during transient executions. The biggest challenge is how to encode the secret data into PMU events. We construct an instruction gadget to solve this challenge, whose execution path that can be identified by PMU counters represents what values the secret data are. We successfully implement the PMUSpill attack to leak the secret data stored in Intel Software Guard Extensions (SGX) (a Trusted Execution Environment (TEE) in the Intel's processors) through real experiments. Besides, we locate the vulnerable PMU counters and their trigger instructions by iterating all the valid PMU counters and instructions. The experiment results demonstrate that there are up to 20 PMU counters available to implement the PMUSpill attack. We also provide some possible hardware and software-based countermeasures for addressing the PMUSpill attack, which can be utilized to enhance the security of processors in future.
△ Less
Submitted 24 July, 2022;
originally announced July 2022.
-
HSH-carbon: A novel sp2-sp3 carbon allotrope with an ultrawide energy gap
Authors:
Jia-Qi Liu,
Qian Gao,
Zhen-Peng Hu
Abstract:
A sp2-sp3 hybrid carbon allotrope named HSH-carbon is proposed by the first-principles calculations. The structure of HSH-carbon can be regarded as a template polymerization of [1.1.1]propellane molecules in a hexagonal lattice, as well as, an AA stacking of recently reported HSH-C10 consisting of carbon trigonal bipyramids. Based on calculations, the stability of this structure is demonstrated in…
▽ More
A sp2-sp3 hybrid carbon allotrope named HSH-carbon is proposed by the first-principles calculations. The structure of HSH-carbon can be regarded as a template polymerization of [1.1.1]propellane molecules in a hexagonal lattice, as well as, an AA stacking of recently reported HSH-C10 consisting of carbon trigonal bipyramids. Based on calculations, the stability of this structure is demonstrated in terms of the cohesive energy, phonon dispersion, Born-Huang stability criteria, and ab initio molecular dynamics. HSH-carbon is predicted to be a semiconductor with an indirect energy gap of 3.56 eV at the PBE level or 4.80 eV at the HSE06 level. It is larger than the gap of Si and close to the gap of c-diamond, which indicates HSH-carbon is potentially an ultrawide bandgap semiconductor. The effective masses of carriers in the VB and CB edge are comparable with wide bandgap semiconductors such as GaN and ZnO. The elastic behavior of HSH-carbon such as bulk modulus, Young's modulus and shear modulus is comparable with that of T-carbon and much smaller than that of c-diamond, which suggests that HSH-carbon would be much easier to be processed than c-diamond in practice.
△ Less
Submitted 24 July, 2022;
originally announced July 2022.
-
Quad-Net: Quad-domain Network for CT Metal Artifact Reduction
Authors:
Zilong Li,
Qi Gao,
Ya** Wu,
Chuang Niu,
Jun** Zhang,
Meiyun Wang,
Ge Wang,
Hongming Shan
Abstract:
Metal implants and other high-density objects in patients introduce severe streaking artifacts in CT images, compromising image quality and diagnostic performance. Although various methods were developed for CT metal artifact reduction over the past decades, including the latest dual-domain deep networks, remaining metal artifacts are still clinically challenging in many cases. Here we extend the…
▽ More
Metal implants and other high-density objects in patients introduce severe streaking artifacts in CT images, compromising image quality and diagnostic performance. Although various methods were developed for CT metal artifact reduction over the past decades, including the latest dual-domain deep networks, remaining metal artifacts are still clinically challenging in many cases. Here we extend the state-of-the-art dual-domain deep network approach into a quad-domain counterpart so that all the features in the sinogram, image, and their corresponding Fourier domains are synergized to eliminate metal artifacts optimally without compromising structural subtleties. Our proposed quad-domain network for MAR, referred to as Quad-Net, takes little additional computational cost since the Fourier transform is highly efficient, and works across the four receptive fields to learn both global and local features as well as their relations. Specifically, we first design a Sinogram-Fourier Restoration Network (SFR-Net) in the sinogram domain and its Fourier space to faithfully inpaint metal-corrupted traces. Then, we couple SFR-Net with an Image-Fourier Refinement Network (IFR-Net) which takes both an image and its Fourier spectrum to improve a CT image reconstructed from the SFR-Net output using cross-domain contextual information. Quad-Net is trained on clinical datasets to minimize a composite loss function. Quad-Net does not require precise metal masks, which is of great importance in clinical practice. Our experimental results demonstrate the superiority of Quad-Net over the state-of-the-art MAR methods quantitatively, visually, and statistically. The Quad-Net code is publicly available at https://github.com/longzilicart/Quad-Net.
△ Less
Submitted 31 May, 2023; v1 submitted 24 July, 2022;
originally announced July 2022.
-
Conservation of the particle-hole symmetry in the pseudogap state in optimally-doped Bi2Sr2CuO6+δ superconductor
Authors:
Hongtao Yan,
Qiang Gao,
Chunyao Song,
Chaohui Yin,
Yiwen Chen,
Fengfeng Zhang,
Feng Yang,
Shen** Zhang,
Qinjun Peng,
Guodong Liu,
Lin Zhao,
Zuyan Xu,
Xingjiang Zhou
Abstract:
The pseudogap state is one of the most enigmatic characteristics in the anomalous normal state properties of the high temperature cuprate superconductors. A central issue is to reveal whether there is a symmetry breaking and which symmetries are broken across the pseudogap transition. By performing high resolution laser-based angle-resolved photoemission measurements on the optimally-doped Bi2Sr1.…
▽ More
The pseudogap state is one of the most enigmatic characteristics in the anomalous normal state properties of the high temperature cuprate superconductors. A central issue is to reveal whether there is a symmetry breaking and which symmetries are broken across the pseudogap transition. By performing high resolution laser-based angle-resolved photoemission measurements on the optimally-doped Bi2Sr1.6La0.4CuO6+δ superconductor, we report the observations of the particle-hole symmetry conservation in both the superconducting state and the pseudogap state along the entire Fermi surface. These results provide key insights in understanding the nature of the pseudogap and its relation with high temperature superconductivity.
△ Less
Submitted 16 July, 2022;
originally announced July 2022.
-
Robust optimization for quantum reinforcement learning control using partial observations
Authors:
Chen Jiang,
Yu Pan,
Zheng-Guang Wu,
Qing Gao,
Daoyi Dong
Abstract:
The current quantum reinforcement learning control models often assume that the quantum states are known a priori for control optimization. However, full observation of quantum state is experimentally infeasible due to the exponential scaling of the number of required quantum measurements on the number of qubits. In this paper, we investigate a robust reinforcement learning method using partial ob…
▽ More
The current quantum reinforcement learning control models often assume that the quantum states are known a priori for control optimization. However, full observation of quantum state is experimentally infeasible due to the exponential scaling of the number of required quantum measurements on the number of qubits. In this paper, we investigate a robust reinforcement learning method using partial observations to overcome this difficulty. This control scheme is compatible with near-term quantum devices, where the noise is prevalent and predetermining the dynamics of quantum state is practically impossible. We show that this simplified control scheme can achieve similar or even better performance when compared to the conventional methods relying on full observation. We demonstrate the effectiveness of this scheme on examples of quantum state control and quantum approximate optimization algorithm. It has been shown that high-fidelity state control can be achieved even if the noise amplitude is at the same level as the control amplitude. Besides, an acceptable level of optimization accuracy can be achieved for QAOA with noisy control Hamiltonian. This robust control optimization model can be trained to compensate the uncertainties in practical quantum computing.
△ Less
Submitted 29 June, 2022;
originally announced June 2022.
-
Joint Location and Beamforming Design for STAR-RIS Assisted NOMA Systems
Authors:
Qiling Gao,
Yuanwei Liu,
Xidong Mu,
Min Jia,
Dongbo Li,
Lajos Hanzo
Abstract:
Simultaneously transmitting and reflecting reconfigurable intelligent surface (STAR-RIS) assisted non-orthogonal multiple access (NOMA) communication systems are investigated in its vicinity, where a STAR-RIS is deployed within a predefined region for establishing communication links for users. Both beamformer-based NOMA and cluster-based NOMA schemes are employed at the multi-antenna base station…
▽ More
Simultaneously transmitting and reflecting reconfigurable intelligent surface (STAR-RIS) assisted non-orthogonal multiple access (NOMA) communication systems are investigated in its vicinity, where a STAR-RIS is deployed within a predefined region for establishing communication links for users. Both beamformer-based NOMA and cluster-based NOMA schemes are employed at the multi-antenna base station (BS). For each scheme, the STAR-RIS deployment location, the passive transmitting and reflecting beamforming (BF) of the STAR-RIS, and the active BF at the BS are jointly optimized for maximizing the weighted sum-rate (WSR) of users. To solve the resultant non-convex problems, an alternating optimization (AO) algorithm is proposed, where successive convex approximation (SCA) and semi-definite programming (SDP) methods are invoked for iteratively addressing the non-convexity of each sub-problem. Numerical results reveal that 1) the WSR performance can be significantly enhanced by optimizing the specific deployment location of the STAR-RIS; 2) both beamformer-based and cluster-based NOMA prefer asymmetric STAR-RIS deployment.
△ Less
Submitted 26 June, 2022;
originally announced June 2022.
-
PSP: Million-level Protein Sequence Dataset for Protein Structure Prediction
Authors:
Sirui Liu,
Jun Zhang,
Haotian Chu,
Min Wang,
Boxin Xue,
Ningxi Ni,
Jialiang Yu,
Yuhao Xie,
Zhenyu Chen,
Mengyun Chen,
Yuan Liu,
Piya Patra,
Fan Xu,
Jie Chen,
Zidong Wang,
Lijiang Yang,
Fan Yu,
Lei Chen,
Yi Qin Gao
Abstract:
Proteins are essential component of human life and their structures are important for function and mechanism analysis. Recent work has shown the potential of AI-driven methods for protein structure prediction. However, the development of new models is restricted by the lack of dataset and benchmark training procedure. To the best of our knowledge, the existing open source datasets are far less to…
▽ More
Proteins are essential component of human life and their structures are important for function and mechanism analysis. Recent work has shown the potential of AI-driven methods for protein structure prediction. However, the development of new models is restricted by the lack of dataset and benchmark training procedure. To the best of our knowledge, the existing open source datasets are far less to satisfy the needs of modern protein sequence-structure related research. To solve this problem, we present the first million-level protein structure prediction dataset with high coverage and diversity, named as PSP. This dataset consists of 570k true structure sequences (10TB) and 745k complementary distillation sequences (15TB). We provide in addition the benchmark training procedure for SOTA protein structure prediction model on this dataset. We validate the utility of this dataset for training by participating CAMEO contest in which our model won the first place. We hope our PSP dataset together with the training benchmark can enable a broader community of AI/biology researchers for AI-driven protein related research.
△ Less
Submitted 24 June, 2022;
originally announced June 2022.
-
Constraint on the mass of graviton with gravitational waves
Authors:
Qing Gao
Abstract:
We consider the effects of the mass of graviton on both the waveform of gravitational waves and the antenna response to gravitational waves. We find the effect on the response function is negligible for small mass. By using the Fisher matrix method, we make parameter estimations with space-based gravitational wave detectors for massive black hole binaries in massive gravity theory. The wavelength…
▽ More
We consider the effects of the mass of graviton on both the waveform of gravitational waves and the antenna response to gravitational waves. We find the effect on the response function is negligible for small mass. By using the Fisher matrix method, we make parameter estimations with space-based gravitational wave detectors for massive black hole binaries in massive gravity theory. The wavelength of massive graviton is constrained to be $λ_g\gtrsim 10^{17}$ m and the mass is constrained to be $m_g\lesssim 10^{-60}$ kg by one year's observation of massive black hole binaries with space-based gravitational wave detectors.
△ Less
Submitted 25 November, 2022; v1 submitted 5 June, 2022;
originally announced June 2022.
-
Symmetry Origin of Lattice Vibration Modes in Twisted Multilayer Graphene: Phasons vs Moiré Phonons
Authors:
Qiang Gao,
Eslam Khalaf
Abstract:
Lattice dynamics play a crucial role in the physics of Moiré systems. In twisted bilayer graphene (TBG), it was shown that, in addition to the graphene phonons, there is another set of gapless excitations termed Moiré Phonons [Phys. Rev. B, 075416, 2019] reflecting the lattice dynamics at the Moire superlattice level. These modes were later suggested to be phasons due to the incommensurate stackin…
▽ More
Lattice dynamics play a crucial role in the physics of Moiré systems. In twisted bilayer graphene (TBG), it was shown that, in addition to the graphene phonons, there is another set of gapless excitations termed Moiré Phonons [Phys. Rev. B, 075416, 2019] reflecting the lattice dynamics at the Moire superlattice level. These modes were later suggested to be phasons due to the incommensurate stacking of the two graphene layers [Phys. Rev. B, 155426, 2019]. In this work, we elucidate the equivalence of these two seemingly distinct perspectives by identifying an underlying symmetry, which we dub mismatch symmetry, that exists for any twist angle. For commensurate angles, this is a discrete symmetry whereas for incommensurate angles, it is equivalent to a continuous phase symmetry giving rise to phason modes. In the small angle limit, such symmetry becomes a continuous local symmetry whose spontaneous breaking gives rise to Moiré phonons as its Goldstone mode. We derive an effective field theory for these collective modes in TBG in precise agreement with the full model and discuss their different properties. Our analysis is then generalized to twisted multilayer graphene (TMG) where we identify higher order mismatch and deduce the count of gapless modes including graphene phonons, Moiré phonons and phasons. Especially, we study twisted mirror-symmetric trilayer graphene with an alternating twist angle $θ$ and find that it can be mapped to a TBG with the re-scaled twist angle $\sqrt{2/3}θ$, hosting the same Moiré phonon modes in the even mirror sector with an additional set of gapped modes in the odd sector. Our work presents a systematic study of lattice symmetries in TMG providing insights into its unique lattice dynamics.
△ Less
Submitted 10 August, 2022; v1 submitted 13 May, 2022;
originally announced May 2022.
-
Deep Depth Completion from Extremely Sparse Data: A Survey
Authors:
Junjie Hu,
Chenyu Bao,
Mete Ozay,
Chenyou Fan,
Qing Gao,
Honghai Liu,
Tin Lun Lam
Abstract:
Depth completion aims at predicting dense pixel-wise depth from an extremely sparse map captured from a depth sensor, e.g., LiDARs. It plays an essential role in various applications such as autonomous driving, 3D reconstruction, augmented reality, and robot navigation. Recent successes on the task have been demonstrated and dominated by deep learning based solutions. In this article, for the firs…
▽ More
Depth completion aims at predicting dense pixel-wise depth from an extremely sparse map captured from a depth sensor, e.g., LiDARs. It plays an essential role in various applications such as autonomous driving, 3D reconstruction, augmented reality, and robot navigation. Recent successes on the task have been demonstrated and dominated by deep learning based solutions. In this article, for the first time, we provide a comprehensive literature review that helps readers better grasp the research trends and clearly understand the current advances. We investigate the related studies from the design aspects of network architectures, loss functions, benchmark datasets, and learning strategies with a proposal of a novel taxonomy that categorizes existing methods. Besides, we present a quantitative comparison of model performance on three widely used benchmarks, including indoor and outdoor datasets. Finally, we discuss the challenges of prior works and provide readers with some insights for future research directions.
△ Less
Submitted 29 August, 2022; v1 submitted 11 May, 2022;
originally announced May 2022.
-
Forgetting Prevention for Cross-regional Fraud Detection with Heterogeneous Trade Graph
Authors:
Yujie Li,
Yuxuan Yang,
Xin Yang,
Qiang Gao,
Fan Zhou
Abstract:
With the booming growth of e-commerce, detecting financial fraud has become an urgent task to avoid transaction risks. Despite the successful applications of Graph Neural Networks (GNNs) in fraud detection, the existing solutions are only suitable for a narrow scope due to the limitation in data collection. Especially when expanding a business into new territory, e.g., new cities or new countries,…
▽ More
With the booming growth of e-commerce, detecting financial fraud has become an urgent task to avoid transaction risks. Despite the successful applications of Graph Neural Networks (GNNs) in fraud detection, the existing solutions are only suitable for a narrow scope due to the limitation in data collection. Especially when expanding a business into new territory, e.g., new cities or new countries, develo** a totally new model will bring the cost issue and result in forgetting previous knowledge. Moreover, recent works strive to devise GNNs to expose the implicit interactions behind financial transactions. However, most existing GNNs-based solutions concentrate on either homogeneous graphs or decomposing heterogeneous interactions into several homogeneous connections for convenience. To this end, this study proposes a novel solution based on heterogeneous trade graphs, namely HTG-CFD, to prevent knowledge forgetting of cross-regional fraud detection. In particular, the heterogeneous trade graph (HTG) is meticulously constructed from original transaction records to explore the complex semantics among different types of entities and relationships. And motivated by recent continual learning, we present a practical and task-oriented forgetting prevention method to alleviate knowledge forgetting in the context of cross-regional detection. Extensive experiments demonstrate that the proposed HTG-CFD not only promotes the performance in cross-regional scenarios but also significantly contributes to single-regional fraud detection.
△ Less
Submitted 22 May, 2022; v1 submitted 21 April, 2022;
originally announced April 2022.
-
Stellar Atmospheric Parameters of M-type Stars from LAMOST DR8
Authors:
Ming-Yi Ding,
Jian-Rong Shi,
Yue Wu,
Hugh R. A. Jones,
Hong-liang Yan,
Chun-Qian Li,
Qi Gao,
Tian-Yi Chen,
**g-Hua Zhang,
Shuai Liu,
Tai-Sheng Yan,
Xiao-** Xie
Abstract:
The Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) Low Resolution Spectroscopic Survey (LRS) provides massive spectroscopic data of M-type stars, and the derived stellar parameters could bring vital help to various studies. We adopt the ULySS package to perform $χ^2$ minimization with model spectra generated from the MILES interpolator, and determine the stellar atmospheric par…
▽ More
The Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) Low Resolution Spectroscopic Survey (LRS) provides massive spectroscopic data of M-type stars, and the derived stellar parameters could bring vital help to various studies. We adopt the ULySS package to perform $χ^2$ minimization with model spectra generated from the MILES interpolator, and determine the stellar atmospheric parameters for the M-type stars from LAMOST LRS Data Release (DR) 8. Comparison with the stellar parameters from APOGEE Stellar Parameter and Chemical Abundance Pipeline (ASPCAP) suggests that most of our results have good consistency. For M dwarfs, we achieve dispersions better than 74 K, 0.19 dex and 0.16 dex for $T_{\rm eff}$, $\log{g}$ and [Fe/H], while for M giants, the internal uncertainties are 58 K, 0.32 dex and 0.26 dex, respectively. Compared to ASPCAP we also find a systematic underestimation of $Δ{T_{\rm eff}} =$ $-$176 K for M dwarfs, and a systematic overestimation of $Δ{\log{g}} =$ 0.30 dex for M giants. However, such differences are less significant when we make comparison with common stars from other literature, which indicates that systematic biases exist in the difference of ASPCAP and other measurements. A catalog of 763,136 spectra corresponding to 616,314 M-type stars with derived stellar parameters is presented. We determine the stellar parameters for stars with $T_{\rm eff}$ higher than 2,900 K, with $\log{g}$ from -0.24 dex to 5.9 dex. The typical precisions are 45 K, 0.25 dex and 0.22 dex, for $T_{\rm eff}$, $\log{g}$ and [Fe/H], respectively, which are estimated from the duplicate observations of the same stars.
△ Less
Submitted 19 April, 2022;
originally announced April 2022.
-
Curriculum: A Broad-Coverage Benchmark for Linguistic Phenomena in Natural Language Understanding
Authors:
Zeming Chen,
Qiyue Gao
Abstract:
In the age of large transformer language models, linguistic evaluation play an important role in diagnosing models' abilities and limitations on natural language understanding. However, current evaluation methods show some significant shortcomings. In particular, they do not provide insight into how well a language model captures distinct linguistic skills essential for language understanding and…
▽ More
In the age of large transformer language models, linguistic evaluation play an important role in diagnosing models' abilities and limitations on natural language understanding. However, current evaluation methods show some significant shortcomings. In particular, they do not provide insight into how well a language model captures distinct linguistic skills essential for language understanding and reasoning. Thus they fail to effectively map out the aspects of language understanding that remain challenging to existing models, which makes it hard to discover potential limitations in models and datasets. In this paper, we introduce Curriculum as a new format of NLI benchmark for evaluation of broad-coverage linguistic phenomena. Curriculum contains a collection of datasets that covers 36 types of major linguistic phenomena and an evaluation procedure for diagnosing how well a language model captures reasoning skills for distinct types of linguistic phenomena. We show that this linguistic-phenomena-driven benchmark can serve as an effective tool for diagnosing model behavior and verifying model learning quality. In addition, our experiments provide insight into the limitation of existing benchmark datasets and state-of-the-art models that may encourage future research on re-designing datasets, model architectures, and learning objectives.
△ Less
Submitted 4 May, 2022; v1 submitted 13 April, 2022;
originally announced April 2022.
-
Turbulence-free computational ghost imaging
Authors:
Qiang Gao,
Yuge Li,
Yunjie Xia,
Deyang Duan
Abstract:
Turbulence-free images cannot be produced by conventional computational ghost imaging because calculated light is not affected by the same atmospheric turbulence as real light. In this article, we first addressed this issue by measuring the photon number fluctuation autocorrelation of the signals generated by a conventional computational ghost imaging device. Our results illustrate how conventiona…
▽ More
Turbulence-free images cannot be produced by conventional computational ghost imaging because calculated light is not affected by the same atmospheric turbulence as real light. In this article, we first addressed this issue by measuring the photon number fluctuation autocorrelation of the signals generated by a conventional computational ghost imaging device. Our results illustrate how conventional computational ghost imaging without structural changes can be used to produce turbulence-free images.
△ Less
Submitted 1 April, 2022;
originally announced April 2022.
-
R-DFCIL: Relation-Guided Representation Learning for Data-Free Class Incremental Learning
Authors:
Qiankun Gao,
Chen Zhao,
Bernard Ghanem,
Jian Zhang
Abstract:
Class-Incremental Learning (CIL) struggles with catastrophic forgetting when learning new knowledge, and Data-Free CIL (DFCIL) is even more challenging without access to the training data of previously learned classes. Though recent DFCIL works introduce techniques such as model inversion to synthesize data for previous classes, they fail to overcome forgetting due to the severe domain gap between…
▽ More
Class-Incremental Learning (CIL) struggles with catastrophic forgetting when learning new knowledge, and Data-Free CIL (DFCIL) is even more challenging without access to the training data of previously learned classes. Though recent DFCIL works introduce techniques such as model inversion to synthesize data for previous classes, they fail to overcome forgetting due to the severe domain gap between the synthetic and real data. To address this issue, this paper proposes relation-guided representation learning (RRL) for DFCIL, dubbed R-DFCIL. In RRL, we introduce relational knowledge distillation to flexibly transfer the structural relation of new data from the old model to the current model. Our RRL-boosted DFCIL can guide the current model to learn representations of new classes better compatible with representations of previous classes, which greatly reduces forgetting while improving plasticity. To avoid the mutual interference between representation and classifier learning, we employ local rather than global classification loss during RRL. After RRL, the classification head is refined with global class-balanced classification loss to address the data imbalance issue as well as learn the decision boundaries between new and previous classes. Extensive experiments on CIFAR100, Tiny-ImageNet200, and ImageNet100 demonstrate that our R-DFCIL significantly surpasses previous approaches and achieves a new state-of-the-art performance for DFCIL. Code is available at https://github.com/jianzhangcs/R-DFCIL
△ Less
Submitted 20 July, 2022; v1 submitted 24 March, 2022;
originally announced March 2022.
-
Towards Large-Scale Interpretable Knowledge Graph Reasoning for Dialogue Systems
Authors:
Yi-Lin Tuan,
Sajjad Beygi,
Maryam Fazel-Zarandi,
Qiaozi Gao,
Alessandra Cervone,
William Yang Wang
Abstract:
Users interacting with voice assistants today need to phrase their requests in a very specific manner to elicit an appropriate response. This limits the user experience, and is partly due to the lack of reasoning capabilities of dialogue platforms and the hand-crafted rules that require extensive labor. One possible way to improve user experience and relieve the manual efforts of designers is to b…
▽ More
Users interacting with voice assistants today need to phrase their requests in a very specific manner to elicit an appropriate response. This limits the user experience, and is partly due to the lack of reasoning capabilities of dialogue platforms and the hand-crafted rules that require extensive labor. One possible way to improve user experience and relieve the manual efforts of designers is to build an end-to-end dialogue system that can do reasoning itself while perceiving user's utterances. In this work, we propose a novel method to incorporate the knowledge reasoning capability into dialogue systems in a more scalable and generalizable manner. Our proposed method allows a single transformer model to directly walk on a large-scale knowledge graph to generate responses. To the best of our knowledge, this is the first work to have transformer models generate responses by reasoning over differentiable knowledge graphs. We investigate the reasoning abilities of the proposed method on both task-oriented and domain-specific chit-chat dialogues. Empirical results show that this method can effectively and efficiently incorporate a knowledge graph into a dialogue system with fully-interpretable reasoning paths.
△ Less
Submitted 20 March, 2022;
originally announced March 2022.
-
Application limit of the photocentre displacement to fundamental stellar parameters of fast rotators -- Illustration on the edge-on fast rotator Regulus
Authors:
M. Hadjara,
R. G. Petrov,
S. Jankov,
P. Cruzalèbes,
A. Boskri,
A. Spang,
S. Lagarde,
J. He,
X. Chen,
C. Nitschelm,
E. S. G. de Almeida,
G. Pereira,
E. A. Michael,
Q. Gao,
W. Wang,
I. Reyes,
C. Arcos,
I. Araya,
M. Curé
Abstract:
Differential Interferometry allows to obtain the differential visibility and phase, in addition to the spectrum. The differential phase contains important information about the structure and motion of stellar photosphere such as stellar spots and non-radial pulsations, and particularly the rotation. Thus, this interferometric observable strongly helps to constrain the stellar fundamental parameter…
▽ More
Differential Interferometry allows to obtain the differential visibility and phase, in addition to the spectrum. The differential phase contains important information about the structure and motion of stellar photosphere such as stellar spots and non-radial pulsations, and particularly the rotation. Thus, this interferometric observable strongly helps to constrain the stellar fundamental parameters of fast rotators. The spectro-astrometry mainly uses the photocentre displacements, which is a first approximation of the differential phase, and is applicable only for unresolved or marginally objects. We study here the sensitivity of relevant stellar parameters to the simulated photocentres using the SCIROCCO code: a semi-analytical algorithm dedicated to fast rotators, applied to two theoretical modeling stars based on Achernar and Regulus, in order to classify the importance of these parameters and their impact on the modeling. We compare our simulations with published VLTI/AMBER data. This current work sets the limits of application of photocentre displacements to fast rotators, and under which conditions we can use the photocentres and/or the differential phase, through a pre-established physical criterion. To validate our theoretical study, we apply our method of analysis on observed data of the edge-on fast rotator Regulus. For unresolved targets, with a visibility $V\sim 1$, the photocentre can constrain the main stellar fundamental parameters of fast rotators, whereas from marginally resolved objects ($0.8 \leq V < 1$), mainly the rotation axis position angle ($\rm PA_{\rm rot}$) can be directly deduced from the vectorial photocentre displacement, which is very important for young cluster studies.
△ Less
Submitted 19 March, 2022;
originally announced March 2022.
-
Multi-CPR: A Multi Domain Chinese Dataset for Passage Retrieval
Authors:
Dingkun Long,
Qiong Gao,
Kuan Zou,
Guangwei Xu,
Pengjun Xie,
Ruijie Guo,
Jian Xu,
Guanjun Jiang,
Luxi Xing,
** Yang
Abstract:
Passage retrieval is a fundamental task in information retrieval (IR) research, which has drawn much attention recently. In the English field, the availability of large-scale annotated dataset (e.g, MS MARCO) and the emergence of deep pre-trained language models (e.g, BERT) has resulted in a substantial improvement of existing passage retrieval systems. However, in the Chinese field, especially fo…
▽ More
Passage retrieval is a fundamental task in information retrieval (IR) research, which has drawn much attention recently. In the English field, the availability of large-scale annotated dataset (e.g, MS MARCO) and the emergence of deep pre-trained language models (e.g, BERT) has resulted in a substantial improvement of existing passage retrieval systems. However, in the Chinese field, especially for specific domains, passage retrieval systems are still immature due to quality-annotated dataset being limited by scale. Therefore, in this paper, we present a novel multi-domain Chinese dataset for passage retrieval (Multi-CPR). The dataset is collected from three different domains, including E-commerce, Entertainment video and Medical. Each dataset contains millions of passages and a certain amount of human annotated query-passage related pairs. We implement various representative passage retrieval methods as baselines. We find that the performance of retrieval models trained on dataset from general domain will inevitably decrease on specific domain. Nevertheless, a passage retrieval system built on in-domain annotated dataset can achieve significant improvement, which indeed demonstrates the necessity of domain labeled data for further optimization. We hope the release of the Multi-CPR dataset could benchmark Chinese passage retrieval task in specific domain and also make advances for future studies.
△ Less
Submitted 24 April, 2022; v1 submitted 7 March, 2022;
originally announced March 2022.
-
DialFRED: Dialogue-Enabled Agents for Embodied Instruction Following
Authors:
Xiaofeng Gao,
Qiaozi Gao,
Ran Gong,
Kaixiang Lin,
Govind Thattai,
Gaurav S. Sukhatme
Abstract:
Language-guided Embodied AI benchmarks requiring an agent to navigate an environment and manipulate objects typically allow one-way communication: the human user gives a natural language command to the agent, and the agent can only follow the command passively. We present DialFRED, a dialogue-enabled embodied instruction following benchmark based on the ALFRED benchmark. DialFRED allows an agent t…
▽ More
Language-guided Embodied AI benchmarks requiring an agent to navigate an environment and manipulate objects typically allow one-way communication: the human user gives a natural language command to the agent, and the agent can only follow the command passively. We present DialFRED, a dialogue-enabled embodied instruction following benchmark based on the ALFRED benchmark. DialFRED allows an agent to actively ask questions to the human user; the additional information in the user's response is used by the agent to better complete its task. We release a human-annotated dataset with 53K task-relevant questions and answers and an oracle to answer questions. To solve DialFRED, we propose a questioner-performer framework wherein the questioner is pre-trained with the human-annotated data and fine-tuned with reinforcement learning. We make DialFRED publicly available and encourage researchers to propose and evaluate their solutions to building dialog-enabled embodied agents.
△ Less
Submitted 15 August, 2022; v1 submitted 27 February, 2022;
originally announced February 2022.
-
Metamorphic dynamical quantum phase transition in double-quench processes at finite temperatures
Authors:
Xu-Yang Hou,
Qu-Cheng Gao,
Hao Guo,
Chih-Chun Chien
Abstract:
By deriving a general framework and analyzing concrete examples, we demonstrate a class of dynamical quantum phase transitions (DQPTs) in one-dimensional two-band systems going through double-quench processes. When this type of DQPT occurs, the Loschmidt amplitude vanishes and the rate function remains singular after the second quench, meaning the final state continually has no overlap with the in…
▽ More
By deriving a general framework and analyzing concrete examples, we demonstrate a class of dynamical quantum phase transitions (DQPTs) in one-dimensional two-band systems going through double-quench processes. When this type of DQPT occurs, the Loschmidt amplitude vanishes and the rate function remains singular after the second quench, meaning the final state continually has no overlap with the initial state. This type of DQPT is named metamorphic DQPT to differentiate it from ordinary DQPTs that only exhibit zero Loschmidt amplitude and singular rate function at discrete time points. The metamorphic DQPTs occur at zero as well as finite temperatures. Our examples of the Su-Schrieffer-Heeger (SSH) model and Kitaev chain illustrate the conditions and behavior of the metamorphic DQPT. Since ordinary DQPTs have been experimentally realized in many systems, similar setups with double quenches will demonstrate the metamorphic DQPT. Our findings thus provide additional controls of dynamical evolution of quantum systems.
△ Less
Submitted 29 May, 2022; v1 submitted 21 February, 2022;
originally announced February 2022.
-
Approximation of Images via Generalized Higher Order Singular Value Decomposition over Finite-dimensional Commutative Semisimple Algebra
Authors:
Liang Liao,
Sen Lin,
Lun Li,
Xiuwei Zhang,
Song Zhao,
Yan Wang,
Xinqiang Wang,
Qi Gao,
**gyu Wang
Abstract:
Low-rank approximation of images via singular value decomposition is well-received in the era of big data. However, singular value decomposition (SVD) is only for order-two data, i.e., matrices. It is necessary to flatten a higher order input into a matrix or break it into a series of order-two slices to tackle higher order data such as multispectral images and videos with the SVD. Higher order si…
▽ More
Low-rank approximation of images via singular value decomposition is well-received in the era of big data. However, singular value decomposition (SVD) is only for order-two data, i.e., matrices. It is necessary to flatten a higher order input into a matrix or break it into a series of order-two slices to tackle higher order data such as multispectral images and videos with the SVD. Higher order singular value decomposition (HOSVD) extends the SVD and can approximate higher order data using sums of a few rank-one components. We consider the problem of generalizing HOSVD over a finite dimensional commutative algebra. This algebra, referred to as a t-algebra, generalizes the field of complex numbers. The elements of the algebra, called t-scalars, are fix-sized arrays of complex numbers. One can generalize matrices and tensors over t-scalars and then extend many canonical matrix and tensor algorithms, including HOSVD, to obtain higher-performance versions. The generalization of HOSVD is called THOSVD. Its performance of approximating multi-way data can be further improved by an alternating algorithm. THOSVD also unifies a wide range of principal component analysis algorithms. To exploit the potential of generalized algorithms using t-scalars for approximating images, we use a pixel neighborhood strategy to convert each pixel to "deeper-order" t-scalar. Experiments on publicly available images show that the generalized algorithm over t-scalars, namely THOSVD, compares favorably with its canonical counterparts.
△ Less
Submitted 25 August, 2022; v1 submitted 1 February, 2022;
originally announced February 2022.
-
Learning to Act with Affordance-Aware Multimodal Neural SLAM
Authors:
Zhiwei Jia,
Kaixiang Lin,
Yizhou Zhao,
Qiaozi Gao,
Govind Thattai,
Gaurav Sukhatme
Abstract:
Recent years have witnessed an emerging paradigm shift toward embodied artificial intelligence, in which an agent must learn to solve challenging tasks by interacting with its environment. There are several challenges in solving embodied multimodal tasks, including long-horizon planning, vision-and-language grounding, and efficient exploration. We focus on a critical bottleneck, namely the perform…
▽ More
Recent years have witnessed an emerging paradigm shift toward embodied artificial intelligence, in which an agent must learn to solve challenging tasks by interacting with its environment. There are several challenges in solving embodied multimodal tasks, including long-horizon planning, vision-and-language grounding, and efficient exploration. We focus on a critical bottleneck, namely the performance of planning and navigation. To tackle this challenge, we propose a Neural SLAM approach that, for the first time, utilizes several modalities for exploration, predicts an affordance-aware semantic map, and plans over it at the same time. This significantly improves exploration efficiency, leads to robust long-horizon planning, and enables effective vision-and-language grounding. With the proposed Affordance-aware Multimodal Neural SLAM (AMSLAM) approach, we obtain more than 40% improvement over prior published work on the ALFRED benchmark and set a new state-of-the-art generalization performance at a success rate of 23.48% on the test unseen scenes.
△ Less
Submitted 24 October, 2022; v1 submitted 24 January, 2022;
originally announced January 2022.
-
Ubiquitous Coexisting Electron-Mode Couplings in High Temperature Cuprate Superconductors
Authors:
Hongtao Yan,
** Mo Bok,
Junfeng He,
Wentao Zhang,
Qiang Gao,
Xiangyu Luo,
Yongqing Cai,
Yingying Peng,
Jianqiao Meng,
Cong Li,
Hao Chen,
Chunyao Song,
Chaohui Yin,
Taimin Miao,
Genda Gu,
Chengtian Lin,
Fengfeng Zhang,
Feng Yang,
Shen** Zhang,
Qinjun Peng,
Guodong Liu,
Lin Zhao,
Han-Yong Choi,
Zuyan Xu,
X. J. Zhou
Abstract:
In conventional superconductors, the electron-phonon coupling plays a dominant role in pairing the electrons and generating superconductivity. In high temperature cuprate superconductors, the existence of the electron coupling with phonons and other boson modes and its role in producing high temperature superconductivity remain unclear. The evidence of the electron-boson coupling mainly comes from…
▽ More
In conventional superconductors, the electron-phonon coupling plays a dominant role in pairing the electrons and generating superconductivity. In high temperature cuprate superconductors, the existence of the electron coupling with phonons and other boson modes and its role in producing high temperature superconductivity remain unclear. The evidence of the electron-boson coupling mainly comes from the angle-resolved photoemission (ARPES) observations of the ~70meV nodal dispersion kink and the ~40meV antinodal kink. However, the reported results are sporadic and the nature of the involved bosons are still under debate. Here we report new findings of ubiquitous two coexisting electron-mode couplings in cuprate superconductors. By taking ultra-high resolution laser-based ARPES measurements, combined with the improved second derivative analysis method, we discovered that the electrons are coupled simultaneously with two sharp phonon modes with energies of ~70meV and ~40meV in different superconductors with different do** levels, over the entire momentum space and at different temperatures above and below the superconducting transition temperature. The observed electron-phonon couplings are unusual because the associated energy scales do not exhibit an obvious change across the superconducting transition. We further find that the well-known "peak-dip-hump" structure, which has long been considered as a hallmark of superconductivity, is also omnipresent and consists of finer structures that originates from electron coupling with two sharp phonon modes. These comprehensive results provide a unified picture to reconcile all the reported observations and pinpoint the origin of the electron-mode couplings in cuprate superconductors. They provide key information to understand the role of the electron-phonon coupling in generating high temperature superconductivity.
△ Less
Submitted 20 January, 2022;
originally announced January 2022.
-
Best of Both Worlds: A Hybrid Approach for Multi-Hop Explanation with Declarative Facts
Authors:
Shane Storks,
Qiaozi Gao,
Aishwarya Reganti,
Govind Thattai
Abstract:
Language-enabled AI systems can answer complex, multi-hop questions to high accuracy, but supporting answers with evidence is a more challenging task which is important for the transparency and trustworthiness to users. Prior work in this area typically makes a trade-off between efficiency and accuracy; state-of-the-art deep neural network systems are too cumbersome to be useful in large-scale app…
▽ More
Language-enabled AI systems can answer complex, multi-hop questions to high accuracy, but supporting answers with evidence is a more challenging task which is important for the transparency and trustworthiness to users. Prior work in this area typically makes a trade-off between efficiency and accuracy; state-of-the-art deep neural network systems are too cumbersome to be useful in large-scale applications, while the fastest systems lack reliability. In this work, we integrate fast syntactic methods with powerful semantic methods for multi-hop explanation generation based on declarative facts. Our best system, which learns a lightweight operation to simulate multi-hop reasoning over pieces of evidence and fine-tunes language models to re-rank generated explanation chains, outperforms a purely syntactic baseline from prior work by up to 7% in gold explanation retrieval rate.
△ Less
Submitted 17 December, 2021;
originally announced January 2022.
-
Population inversion and Dirac fermion cooling in 3D Dirac semimetal Cd$_3$As$_2$
Authors:
Changhua Bao,
Qian Li,
Sheng Xu,
Shaohua Zhou,
Xiang-Yu Zeng,
Haoyuan Zhong,
Qixuan Gao,
Laipeng Luo,
Dong Sun,
Tian-Long Xia,
Shuyun Zhou
Abstract:
Revealing the ultrafast dynamics of three-dimensional (3D) Dirac fermions upon photoexcitation is critical for both fundamental science and device applications. So far, how the cooling of 3D Dirac fermions differs from that of two-dimensional (2D) Dirac fermions and whether there is population inversion are fundamental questions that remain to be answered. Here we reveal the ultrafast dynamics of…
▽ More
Revealing the ultrafast dynamics of three-dimensional (3D) Dirac fermions upon photoexcitation is critical for both fundamental science and device applications. So far, how the cooling of 3D Dirac fermions differs from that of two-dimensional (2D) Dirac fermions and whether there is population inversion are fundamental questions that remain to be answered. Here we reveal the ultrafast dynamics of Dirac fermions in a model 3D Dirac semimetal Cd$_3$As$_2$ by ultrafast time- and angle-resolved photoemission spectroscopy (TrARPES) with a tunable probe photon energy from 5.3 - 6.9 eV. The energy- and momentum-resolved relaxation rate shows a linear dependence on the energy, suggesting Dirac fermion cooling through intraband relaxation. Moreover, a population inversion is reported based on the observation of accumulated photoexcited carriers in the conduction band with a lifetime of $τ_n$ = 3.0 ps. Our work provides direct experimental evidence for a long-lived population inversion in a 3D Dirac semimetal, which is in contrast to 2D graphene where the interband relaxation occurs on a much faster timescale.
△ Less
Submitted 17 December, 2021;
originally announced December 2021.
-
Thermodynamic constraints on the nonequilibrium response of one-dimensional diffusions
Authors:
Qi Gao,
Hyun-Myung Chun,
Jordan M. Horowitz
Abstract:
We analyze the static response to perturbations of nonequilibrium steady states that can be modeled as one-dimensional diffusions on the circle. We demonstrate that an arbitrary perturbation can be broken up into a combination of three specific classes of perturbations that can be fruitfully addressed individually. For each class, we derive a simple formula that quantitatively characterizes the re…
▽ More
We analyze the static response to perturbations of nonequilibrium steady states that can be modeled as one-dimensional diffusions on the circle. We demonstrate that an arbitrary perturbation can be broken up into a combination of three specific classes of perturbations that can be fruitfully addressed individually. For each class, we derive a simple formula that quantitatively characterizes the response in terms of the strength of nonequilibrium driving valid arbitrarily far from equilibrium.
△ Less
Submitted 5 January, 2022; v1 submitted 10 December, 2021;
originally announced December 2021.
-
Probing Linguistic Information For Logical Inference In Pre-trained Language Models
Authors:
Zeming Chen,
Qiyue Gao
Abstract:
Progress in pre-trained language models has led to a surge of impressive results on downstream tasks for natural language understanding. Recent work on probing pre-trained language models uncovered a wide range of linguistic properties encoded in their contextualized representations. However, it is unclear whether they encode semantic knowledge that is crucial to symbolic inference methods. We pro…
▽ More
Progress in pre-trained language models has led to a surge of impressive results on downstream tasks for natural language understanding. Recent work on probing pre-trained language models uncovered a wide range of linguistic properties encoded in their contextualized representations. However, it is unclear whether they encode semantic knowledge that is crucial to symbolic inference methods. We propose a methodology for probing linguistic information for logical inference in pre-trained language model representations. Our probing datasets cover a list of linguistic phenomena required by major symbolic inference systems. We find that (i) pre-trained language models do encode several types of linguistic information for inference, but there are also some types of information that are weakly encoded, (ii) language models can effectively learn missing linguistic information through fine-tuning. Overall, our findings provide insights into which aspects of linguistic information for logical inference do language models and their pre-training procedures capture. Moreover, we have demonstrated language models' potential as semantic and background knowledge bases for supporting symbolic inference methods.
△ Less
Submitted 21 March, 2022; v1 submitted 3 December, 2021;
originally announced December 2021.
-
Atomistic View of Homogeneous Nucleation of Water into Polymorphic Ices
Authors:
Maodong Li,
Jun Zhang,
Niu Haiyang,
Yao Kun Lei,
Xu Han,
Lijiang Yang,
Zhiqiang Ye,
Yi Isaac Yang,
Yi Qin Gao
Abstract:
Water is one of the most abundant substances on Earth, and ice, i.e., solid water, has more than 18 known phases. Normally ice in nature exists only as Ice Ih, Ice Ic, or a stacking disordered mixture of both. Although many theoretical efforts have been devoted to understanding the thermodynamics of different ice phases at ambient temperature and pressure, there still remains many puzzles. We simu…
▽ More
Water is one of the most abundant substances on Earth, and ice, i.e., solid water, has more than 18 known phases. Normally ice in nature exists only as Ice Ih, Ice Ic, or a stacking disordered mixture of both. Although many theoretical efforts have been devoted to understanding the thermodynamics of different ice phases at ambient temperature and pressure, there still remains many puzzles. We simulated the reversible transitions between water and different ice phases by performing full atom molecular dynamics simulations. Using the enhanced sampling method MetaITS with the two selected X-ray diffraction peak intensities as collective variables, the ternary phase diagrams of liquid water, ice Ih, ice Ic at multiple were obtained. We also present a simple physical model which successfully explains the thermodynamic stability of ice. Our results agree with experiments and leads to a deeper understanding of the ice nucleation mechanism.
△ Less
Submitted 23 November, 2021;
originally announced November 2021.
-
Structural Origin of Boson Peak in Glasses
Authors:
Yuan Tian,
Xiaozhe Shen,
Qingyang Gao,
Zhen Lu,
Jie Yang,
Qiang Zheng,
Christopher Florencio Aleman,
Duan Luo,
Alexander Hume Reid,
Bin Xu,
Michael Falk,
Howard Sheng,
Jianming Cao,
Xijie Wang,
Mingwei Chen
Abstract:
Boson peak, the excess low energy excitations in the terahertz regime, is one of the most unique features of disordered systems and has been linked to many anomalous properties of glass materials. The nature and structural origin of the boson peak remain elusive and have been debated for more than a half century mainly due to the lack of real-time and real-space experimental insights of the dynami…
▽ More
Boson peak, the excess low energy excitations in the terahertz regime, is one of the most unique features of disordered systems and has been linked to many anomalous properties of glass materials. The nature and structural origin of the boson peak remain elusive and have been debated for more than a half century mainly due to the lack of real-time and real-space experimental insights of the dynamic phenomenon. In this work we employed femtosecond MeV ultrafast electron diffraction to characterize the atomic dynamics of metallic glasses in real time. The experiment reveals collective atomic oscillations, presented in elastic electron scattering and atomic pair distribution functions, within the boson peak frequency range of 1.0-1.8 THz in both reciprocal and real space. It was found that the oscillation frequency has reciprocal dependence on interatomic pair distances and the corresponding wave velocity experimentally affirms the transverse acoustic wave nature of the boson peak. The observed strong correlation between THz acoustic vibrations and coherent electron scattering provides compelling evidence that the boson peak originates from the collective transverse vibrational modes of structurally ordered atoms in the disordered system.
△ Less
Submitted 19 November, 2021;
originally announced November 2021.
-
Peta-electron volt gamma-ray emission from the Crab Nebula
Authors:
The LHAASO Collaboration,
Zhen Cao,
F. Aharonian,
Q. An,
Axikegu,
L. X. Bai,
Y. X. Bai,
Y. W. Bao,
D. Bastieri,
X. J. Bi,
Y. J. Bi,
H. Cai,
J. T. Cai,
Zhe Cao,
J. Chang,
J. F. Chang,
B. M. Chen,
E. S. Chen,
J. Chen,
Liang Chen,
Liang Chen,
Long Chen,
M. J. Chen,
M. L. Chen,
Q. H. Chen
, et al. (250 additional authors not shown)
Abstract:
The Crab pulsar and the surrounding nebula powered by the pulsar's rotational energy through the formation and termination of a relativistic electron-positron wind is a bright source of gamma-rays carrying crucial information about this complex conglomerate. We report the detection of $γ$-rays with a spectrum showing gradual steepening over three energy decades, from $5\times 10^{-4}$ to $1.1$ pet…
▽ More
The Crab pulsar and the surrounding nebula powered by the pulsar's rotational energy through the formation and termination of a relativistic electron-positron wind is a bright source of gamma-rays carrying crucial information about this complex conglomerate. We report the detection of $γ$-rays with a spectrum showing gradual steepening over three energy decades, from $5\times 10^{-4}$ to $1.1$ petaelectronvolt (PeV). The ultra-high-energy photons exhibit the presence of a PeV electron accelerator (a pevatron) with an acceleration rate exceeding 15% of the absolute theoretical limit. Assuming that unpulsed $γ$-rays are produced at the termination of the pulsar's wind, we constrain the pevatron's size, between $0.025$ and $0.1$ pc, and the magnetic field $\approx 110 μ$G. The production rate of PeV electrons, $2.5 \times 10^{36}$ erg $\rm s^{-1}$, constitutes 0.5% of the pulsar's spin-down luminosity, although we do not exclude a non-negligible contribution of PeV protons to the production of the highest energy $γ$-rays.
△ Less
Submitted 11 November, 2021;
originally announced November 2021.
-
Do**-dependence of the electron-phonon coupling in two families of bilayer superconducting cuprates
Authors:
Yingying Peng,
Leonardo Martinelli,
Qizhi Li,
Matteo Rossi,
Matteo Mitrano,
Riccardo Arpaia,
Marco Moretti Sala,
Qiang Gao,
Xuefei Guo,
Gabriella Maria De Luca,
Andrew Walters,
Abhishek Nag,
Andi Barbour,
Genda Gu,
Jonathan Pelliciari,
Nicholas B. Brookes,
Peter Abbamonte,
Marco Salluzzo,
Xingjiang Zhou,
Ke-** Zhou,
Valentina Bisogni,
Lucio Braicovich,
Steven Johnston,
Giacomo Ghiringhelli
Abstract:
While electron-phonon coupling (EPC) is crucial for Cooper pairing in conventional superconductors, its role in high-$T_c$ superconducting cuprates is debated. Using resonant inelastic x-ray scattering at the oxygen $K$-edge, we studied the EPC in Bi$_2$Sr$_2$CaCu$_2$O$_{8+δ}$ (Bi2212) and Nd$_{1+x}$Ba$_{2-x}$Cu$_3$O$_{7-δ}$ (NBCO) at different do** levels ranging from heavily underdoped (…
▽ More
While electron-phonon coupling (EPC) is crucial for Cooper pairing in conventional superconductors, its role in high-$T_c$ superconducting cuprates is debated. Using resonant inelastic x-ray scattering at the oxygen $K$-edge, we studied the EPC in Bi$_2$Sr$_2$CaCu$_2$O$_{8+δ}$ (Bi2212) and Nd$_{1+x}$Ba$_{2-x}$Cu$_3$O$_{7-δ}$ (NBCO) at different do** levels ranging from heavily underdoped ($p =0.07$) to overdoped ($p=0.21$). We analyze the data with a localized Lang-Firsov model that allows for the coherent excitations of two phonon modes. While electronic band dispersion effects are non-negligible, we are able to perform a study of the relative values of EPC matrix elements in these cuprate families. In the case of NBCO, the choice of the excitation energy allows us to disentangle modes related to the CuO$_3$ chains and the CuO$_2$ planes. Combining the results from the two families, we find the EPC strength decreases with do** at $\mathbf{q_\parallel}=(-0.25, 0)$ r.l.u., but has a non-monotonic trend as a function of do** at smaller momenta. This behavior is attributed to the screening effect of charge carriers. We also find that the phonon intensity is enhanced in the vicinity of the charge-density-wave (CDW) excitations while the extracted EPC strength appears to be less sensitive to their proximity. By performing a comparative study of two cuprate families, we are able to identify general trends in the EPC for the cuprates and provide experimental input to theories invoking a synergistic role for this interaction in $d$-wave pairing.
△ Less
Submitted 10 November, 2021;
originally announced November 2021.
-
LUMINOUS: Indoor Scene Generation for Embodied AI Challenges
Authors:
Yizhou Zhao,
Kaixiang Lin,
Zhiwei Jia,
Qiaozi Gao,
Govind Thattai,
Jesse Thomason,
Gaurav S. Sukhatme
Abstract:
Learning-based methods for training embodied agents typically require a large number of high-quality scenes that contain realistic layouts and support meaningful interactions. However, current simulators for Embodied AI (EAI) challenges only provide simulated indoor scenes with a limited number of layouts. This paper presents Luminous, the first research framework that employs state-of-the-art ind…
▽ More
Learning-based methods for training embodied agents typically require a large number of high-quality scenes that contain realistic layouts and support meaningful interactions. However, current simulators for Embodied AI (EAI) challenges only provide simulated indoor scenes with a limited number of layouts. This paper presents Luminous, the first research framework that employs state-of-the-art indoor scene synthesis algorithms to generate large-scale simulated scenes for Embodied AI challenges. Further, we automatically and quantitatively evaluate the quality of generated indoor scenes via their ability to support complex household tasks. Luminous incorporates a novel scene generation algorithm (Constrained Stochastic Scene Generation (CSSG)), which achieves competitive performance with human-designed scenes. Within Luminous, the EAI task executor, task instruction generation module, and video rendering toolkit can collectively generate a massive multimodal dataset of new scenes for the training and evaluation of Embodied AI agents. Extensive experimental results demonstrate the effectiveness of the data generated by Luminous, enabling the comprehensive assessment of embodied agents on generalization and robustness.
△ Less
Submitted 9 November, 2021;
originally announced November 2021.
-
Crystal-like Order Stabilizing Glasses: Structural Origin of Ultra-stable Metallic Glasses
Authors:
Zhen Lu,
Anh Khoa Augustin Lu,
Fan Zhang,
Yuan Tian,
**g Jiang,
Daixiu Wei,
Jiuhui Han,
Qingyang Gao,
Koji Ohara,
Hidemi Kato,
Akihiko Hirata,
Mingwei Chen
Abstract:
Glasses are featured with a disordered amorphous structure, being opposite to crystals that are constituted by periodic lattices. In this study we report that the exceptional thermodynamic and kinetic stability of an ultra-stable binary ZrCu metallic glass, fabricated by high-temperature physical vapor deposition, originates from ubiquitous crystal-like medium range order (MRO) constituted by Voro…
▽ More
Glasses are featured with a disordered amorphous structure, being opposite to crystals that are constituted by periodic lattices. In this study we report that the exceptional thermodynamic and kinetic stability of an ultra-stable binary ZrCu metallic glass, fabricated by high-temperature physical vapor deposition, originates from ubiquitous crystal-like medium range order (MRO) constituted by Voronoi polyhedron ordering with well-defined local translational symmetry beyond nearest atomic neighbors. The crystal-like MRO significantly improves the thermodynamic and kinetic stability of the glass, which is in opposition to the conventional wisdom that crystal-like order deteriorates the stability and forming ability of metallic glasses. This study unveils the structural origin of ultra-stable metallic glasses and shines a light on the intrinsic correlation of local atomic structure ordering with glass transition of metallic glasses.
△ Less
Submitted 3 November, 2021;
originally announced November 2021.
-
A robust single-pixel particle image velocimetry based on fully convolutional networks with cross-correlation embedded
Authors:
Qi Gao,
Hongtao Lin,
Han Tu,
Haoran Zhu,
Runjie Wei,
Guo** Zhang,
Xueming Shao
Abstract:
Particle image velocimetry (PIV) is essential in experimental fluid dynamics. In the current work, we propose a new velocity field estimation paradigm, which achieves a synergetic combination of the deep learning method and the traditional cross-correlation method. Specifically, the deep learning method is used to optimize and correct a coarse velocity guess to achieve a super-resolution calculati…
▽ More
Particle image velocimetry (PIV) is essential in experimental fluid dynamics. In the current work, we propose a new velocity field estimation paradigm, which achieves a synergetic combination of the deep learning method and the traditional cross-correlation method. Specifically, the deep learning method is used to optimize and correct a coarse velocity guess to achieve a super-resolution calculation. And the cross-correlation method provides the initial velocity field based on a coarse correlation with a large interrogation window. As a reference, the coarse velocity guess helps with improving the robustness of the proposed algorithm. This fully convolutional network with embedded cross-correlation is named as CC-FCN. CC-FCN has two types of input layers, one is for the particle images, and the other is for the initial velocity field calculated using cross-correlation with a coarse resolution. Firstly, two pyramidal modules extract features of particle images and initial velocity field respectively. Then the fusion module appropriately fuses these features. Finally, CC-FCN achieves the super-resolution calculation through a series of deconvolution layers to obtain the single-pixel velocity field. As the supervised learning strategy is considered, synthetic data sets including ground-truth fluid motions are generated to train the network parameters. Synthetic and real experimental PIV data sets are used to test the trained neural network in terms of accuracy, precision, spatial resolution and robustness. The test results show that these attributes of CC-FCN are further improved compared with those of other tested PIV algorithms. The proposed model could therefore provide competitive and robust estimations for PIV experiments.
△ Less
Submitted 30 October, 2021;
originally announced November 2021.
-
Quantum-Classical Computational Molecular Design of Deuterated High-Efficiency OLED Emitters
Authors:
Qi Gao,
Gavin O. Jones,
Michihiko Sugawara,
Takao Kobayashi,
Hiroki Yamashita,
Hideaki Kawaguchi,
Shu Tanaka,
Naoki Yamamoto
Abstract:
This study describes a hybrid quantum-classical computational approach for designing synthesizable deuterated $Alq_3$ emitters possessing desirable emission quantum efficiencies (QEs). This design process has been performed on the tris(8-hydroxyquinolinato) ligands typically bound to aluminum in $Alq_3$. It involves a multi-pronged approach which first utilizes classical quantum chemistry to predi…
▽ More
This study describes a hybrid quantum-classical computational approach for designing synthesizable deuterated $Alq_3$ emitters possessing desirable emission quantum efficiencies (QEs). This design process has been performed on the tris(8-hydroxyquinolinato) ligands typically bound to aluminum in $Alq_3$. It involves a multi-pronged approach which first utilizes classical quantum chemistry to predict the emission QEs of the $Alq_3$ ligands. These initial results were then used as a machine learning dataset for a factorization machine-based model which was applied to construct an Ising Hamiltonian to predict emission quantum efficiencies on a classical computer. We show that such a factorization machine-based approach can yield accurate property predictions for all 64 deuterated $Alq_3$ emitters with 13 training values. Moreover, another Ising Hamiltonian could be constructed by including synthetic constraints which could be used to perform optimizations on a quantum simulator and device using the variational quantum eigensolver (VQE) and quantum approximate optimization algorithm (QAOA) to discover a molecule possessing the optimal QE and synthetic cost. We observe that both VQE and QAOA calculations can predict the optimal molecule with greater than 0.95 probability on quantum simulators. These probabilities decrease to 0.83 and 0.075 for simulations with VQE and QAOA, respectively, on a quantum device, but these can be improved to 0.90 and 0.084 by mitigating readout error. Application of a binary search routine on quantum devices improves these results to a probability of 0.97 for simulations involving VQE and QAOA.
△ Less
Submitted 27 October, 2021;
originally announced October 2021.
-
Excited state calculations using variational quantum eigensolver with spin-restricted ansätze and automatically-adjusted constraints
Authors:
Shigeki Gocho,
Hajime Nakamura,
Shu Kanno,
Qi Gao,
Takao Kobayashi,
Taichi Inagaki,
Miho Hatanaka
Abstract:
The ground and excited state calculations at key geometries, such as the Frank-Condon (FC) and the conical intersection (CI) geometries, are essential for understanding photophysical properties. To compute these geometries on noisy intermediate-scale quantum devices, we proposed a strategy that combined a chemistry-inspired spin-restricted ansatz and a new excited state calculation method called t…
▽ More
The ground and excited state calculations at key geometries, such as the Frank-Condon (FC) and the conical intersection (CI) geometries, are essential for understanding photophysical properties. To compute these geometries on noisy intermediate-scale quantum devices, we proposed a strategy that combined a chemistry-inspired spin-restricted ansatz and a new excited state calculation method called the variational quantum eigensolver under automatically-adjusted constraints (VQE/AC). Unlike the conventional excited state calculation method, called the variational quantum deflation, the VQE/AC does not require the pre-determination of constraint weights and has the potential to describe smooth potential energy surfaces. To validate this strategy, we performed the excited state calculations at the FC and CI geometries of ethylene and phenol blue at the complete active space self-consistent field (CASSCF) level of theory, and found that the energy errors were at most 2 kcal mol$^{-1}$ even on the ibm_kawasaki device.
△ Less
Submitted 29 December, 2022; v1 submitted 27 October, 2021;
originally announced October 2021.
-
Modeling Category-Selective Cortical Regions with Topographic Variational Autoencoders
Authors:
T. Anderson Keller,
Qinghe Gao,
Max Welling
Abstract:
Category-selectivity in the brain describes the observation that certain spatially localized areas of the cerebral cortex tend to respond robustly and selectively to stimuli from specific limited categories. One of the most well known examples of category-selectivity is the Fusiform Face Area (FFA), an area of the inferior temporal cortex in primates which responds preferentially to images of face…
▽ More
Category-selectivity in the brain describes the observation that certain spatially localized areas of the cerebral cortex tend to respond robustly and selectively to stimuli from specific limited categories. One of the most well known examples of category-selectivity is the Fusiform Face Area (FFA), an area of the inferior temporal cortex in primates which responds preferentially to images of faces when compared with objects or other generic stimuli. In this work, we leverage the newly introduced Topographic Variational Autoencoder to model the emergence of such localized category-selectivity in an unsupervised manner. Experimentally, we demonstrate our model yields spatially dense neural clusters selective to faces, bodies, and places through visualized maps of Cohen's d metric. We compare our model with related supervised approaches, namely the Topographic Deep Artificial Neural Network (TDANN) of Lee et al., and discuss both theoretical and empirical similarities. Finally, we show preliminary results suggesting that our model yields a nested spatial hierarchy of increasingly abstract categories, analogous to observations from the human ventral temporal cortex.
△ Less
Submitted 18 December, 2021; v1 submitted 25 October, 2021;
originally announced October 2021.
-
ESOD:Edge-based Task Scheduling for Object Detection
Authors:
Yihao Wang,
Ling Gao,
Jie Ren,
Rui Cao,
Hai Wang,
Jie Zheng,
Quanli Gao
Abstract:
Object Detection on the mobile system is a challenge in terms of everything. Nowadays, many object detection models have been designed, and most of them concentrate on precision. However, the computation burden of those models on mobile systems is unacceptable. Researchers have designed some lightweight networks for mobiles by sacrificing precision. We present a novel edge-based task scheduling fr…
▽ More
Object Detection on the mobile system is a challenge in terms of everything. Nowadays, many object detection models have been designed, and most of them concentrate on precision. However, the computation burden of those models on mobile systems is unacceptable. Researchers have designed some lightweight networks for mobiles by sacrificing precision. We present a novel edge-based task scheduling framework for object detection (termed as ESOD). In detail, we train a DNN model (termed as pre-model) to predict which object detection model to use for the coming task and offloads to which edge servers by physical characteristics of the image task (e.g., brightness, saturation). The results show that ESOD can reduce latency and energy consumption by an average of 22.13% and 29.60% and improve the mAP to 45.8(with 0.9 mAP better), respectively, compared with the SOTA DETR model.
△ Less
Submitted 20 October, 2021;
originally announced October 2021.
-
Self-supervised Contrastive Attributed Graph Clustering
Authors:
Wei Xia,
Quanxue Gao,
Ming Yang,
Xinbo Gao
Abstract:
Attributed graph clustering, which learns node representation from node attribute and topological graph for clustering, is a fundamental but challenging task for graph analysis. Recently, methods based on graph contrastive learning (GCL) have obtained impressive clustering performance on this task. Yet, we observe that existing GCL-based methods 1) fail to benefit from imprecise clustering labels;…
▽ More
Attributed graph clustering, which learns node representation from node attribute and topological graph for clustering, is a fundamental but challenging task for graph analysis. Recently, methods based on graph contrastive learning (GCL) have obtained impressive clustering performance on this task. Yet, we observe that existing GCL-based methods 1) fail to benefit from imprecise clustering labels; 2) require a post-processing operation to get clustering labels; 3) cannot solve out-of-sample (OOS) problem. To address these issues, we propose a novel attributed graph clustering network, namely Self-supervised Contrastive Attributed Graph Clustering (SCAGC). In SCAGC, by leveraging inaccurate clustering labels, a self-supervised contrastive loss, which aims to maximize the similarities of intra-cluster nodes while minimizing the similarities of inter-cluster nodes, are designed for node representation learning. Meanwhile, a clustering module is built to directly output clustering labels by contrasting the representation of different clusters. Thus, for the OOS nodes, SCAGC can directly calculate their clustering labels. Extensive experimental results on four benchmark datasets have shown that SCAGC consistently outperforms 11 competitive clustering methods.
△ Less
Submitted 14 October, 2021;
originally announced October 2021.
-
DC Current Generation and Power Feature in Strongly Driven Floquet-Bloch Systems
Authors:
Qiang Gao,
Yafei Ren,
Qian Niu
Abstract:
We study the DC current generation in a periodically driven Bloch system connected to a heat bath. Under a relaxation time approximation, the density matrix for such a system is obtained, which is related to two equilibria: a Floquet quasi-equilibrium where the density matrix is diagonal under the Floquet-Bloch eigenbasis and an instantaneous Bloch thermal equilibrium. Then, the current responses…
▽ More
We study the DC current generation in a periodically driven Bloch system connected to a heat bath. Under a relaxation time approximation, the density matrix for such a system is obtained, which is related to two equilibria: a Floquet quasi-equilibrium where the density matrix is diagonal under the Floquet-Bloch eigenbasis and an instantaneous Bloch thermal equilibrium. Then, the current responses and their power features, i.e. the power input behavior, are discussed in a unified manner, which reveals that there exist an intrinsic current and an extrinsic correction. Remarkably, the intrinsic part consumes no energy and corresponds to the Floquet quasi-equilibrium, while the extrinsic part needs a sustained energy input and originates from a shift between two equilibrium ensembles. We further investigate the role of the external driving field strength finding that large DC currents can be generated under a relatively strong but not too strong driving field.
△ Less
Submitted 25 January, 2022; v1 submitted 15 October, 2021;
originally announced October 2021.
-
The complete control of scattering waves in multi-channel structures
Authors:
Qi Gao,
Yun-Song Zhou,
Li-Ming Zhao
Abstract:
The issue of photon spin Hall effect was generalized as a universal question of how to control all the scattering waves in a multi-channel structure (complete control). A general theory was proposed, which provides a simple way to achieve the complete control. This theory shows also that the necessary condition for complete control is that the structure must contain a complete set of sources. To d…
▽ More
The issue of photon spin Hall effect was generalized as a universal question of how to control all the scattering waves in a multi-channel structure (complete control). A general theory was proposed, which provides a simple way to achieve the complete control. This theory shows also that the necessary condition for complete control is that the structure must contain a complete set of sources. To demonstrate the application of the theory, the typical scattering patterns in the two-channel and four-channel structures are achieved theoretically. Previous this research, one could only artificially control the scattering waves in two channels out of a four-channel structure.
△ Less
Submitted 13 October, 2021;
originally announced October 2021.
-
On the linear transformation between inertial frames
Authors:
Qing Gao,
Yungui Gong
Abstract:
In the derivation of Lorentz transformation, linear transformation between inertial frames is one of the most important steps. In teaching special relativity, we usually use the homogeneity and isotropy of spacetime to argue that the transformation must be linear transformation without providing any rigorous detail. Here in the first time we provide a solid mathematical proof of the argument that…
▽ More
In the derivation of Lorentz transformation, linear transformation between inertial frames is one of the most important steps. In teaching special relativity, we usually use the homogeneity and isotropy of spacetime to argue that the transformation must be linear transformation without providing any rigorous detail. Here in the first time we provide a solid mathematical proof of the argument that the transformation between two inertial frames must be linear because of the homogeneity and isotropy of spacetime.
△ Less
Submitted 12 December, 2021; v1 submitted 8 October, 2021;
originally announced October 2021.
-
Exploring More When It Needs in Deep Reinforcement Learning
Authors:
Youtian Guo,
Qi Gao
Abstract:
We propose a exploration mechanism of policy in Deep Reinforcement Learning, which is exploring more when agent needs, called Add Noise to Noise (AN2N). The core idea is: when the Deep Reinforcement Learning agent is in a state of poor performance in history, it needs to explore more. So we use cumulative rewards to evaluate which past states the agents have not performed well, and use cosine dist…
▽ More
We propose a exploration mechanism of policy in Deep Reinforcement Learning, which is exploring more when agent needs, called Add Noise to Noise (AN2N). The core idea is: when the Deep Reinforcement Learning agent is in a state of poor performance in history, it needs to explore more. So we use cumulative rewards to evaluate which past states the agents have not performed well, and use cosine distance to measure whether the current state needs to be explored more. This method shows that the exploration mechanism of the agent's policy is conducive to efficient exploration. We combining the proposed exploration mechanism AN2N with Deep Deterministic Policy Gradient (DDPG), Soft Actor-Critic (SAC) algorithms, and apply it to the field of continuous control tasks, such as halfCheetah, Hopper, and Swimmer, achieving considerable improvement in performance and convergence speed.
△ Less
Submitted 28 September, 2021;
originally announced September 2021.
-
Strain-induced enhancement of $T_c$ in infinite-layer Pr$_{0.8}$Sr$_{0.2}$NiO$_2$ films
Authors:
Xiaolin Ren,
Jiarui Li,
Wei-Chih Chen,
Qiang Gao,
Joshua J. Sanchez,
Jordyn Hales,
Hailan Luo,
Fanny Rodolakis,
Jessica L. McChesney,
Tao Xiang,
Jiang** Hu,
Fu-Chun Zhang,
Riccardo Comin,
Yao Wang,
X. J. Zhou,
Zhihai Zhu
Abstract:
The mechanism of unconventional superconductivity in correlated materials remains a great challenge in condensed matter physics. The recent discovery of superconductivity in infinite-layer nickelates, as analog to high-Tc cuprates, has opened a new route to tackle this challenge. By growing 8 nm Pr0.8Sr0.2NiO2 films on the (LaAlO3)0.3(Sr2AlTaO6)0.7 substrate, we successfully raise the transition t…
▽ More
The mechanism of unconventional superconductivity in correlated materials remains a great challenge in condensed matter physics. The recent discovery of superconductivity in infinite-layer nickelates, as analog to high-Tc cuprates, has opened a new route to tackle this challenge. By growing 8 nm Pr0.8Sr0.2NiO2 films on the (LaAlO3)0.3(Sr2AlTaO6)0.7 substrate, we successfully raise the transition temperature Tc from 9 K in the widely studied SrTiO3-substrated nickelates into 15 K. By combining x-ray absorption spectroscopy with the first-principles and many-body simulations, we find a positive correlation between Tc and the pre-edge peak intensity, which can be attributed to the hybridization between Ni and O orbitals induced by the strain. Our result suggests that structural engineering can further enhance unconventional superconductivity, and the charge-transfer property plays a crucial role in the pairing strength.
△ Less
Submitted 20 March, 2022; v1 submitted 13 September, 2021;
originally announced September 2021.