Search | arXiv e-print repository

Masked Conditional Random Fields for Sequence Labeling

Authors: Tianwen Wei, Jianwei Qi, Shenghuan He, Songtao Sun

Abstract: Conditional Random Field (CRF) based neural models are among the most performant methods for solving sequence labeling problems. Despite its great success, CRF has the shortcoming of occasionally generating illegal sequences of tags, e.g. sequences containing an "I-" tag immediately after an "O" tag, which is forbidden by the underlying BIO tagging scheme. In this work, we propose Masked Condition… ▽ More Conditional Random Field (CRF) based neural models are among the most performant methods for solving sequence labeling problems. Despite its great success, CRF has the shortcoming of occasionally generating illegal sequences of tags, e.g. sequences containing an "I-" tag immediately after an "O" tag, which is forbidden by the underlying BIO tagging scheme. In this work, we propose Masked Conditional Random Field (MCRF), an easy to implement variant of CRF that impose restrictions on candidate paths during both training and decoding phases. We show that the proposed method thoroughly resolves this issue and brings consistent improvement over existing CRF-based models with near zero additional cost. △ Less

Submitted 19 March, 2021; originally announced March 2021.

Comments: accepted by NAACL 2021

arXiv:2103.10133 [pdf, ps, other]

Evaluating Document Coherence Modelling

Authors: Aili Shen, Meladel Mistica, Bahar Salehi, Hang Li, Timothy Baldwin, Jianzhong Qi

Abstract: While pretrained language models ("LM") have driven impressive gains over morpho-syntactic and semantic tasks, their ability to model discourse and pragmatic phenomena is less clear. As a step towards a better understanding of their discourse modelling capabilities, we propose a sentence intrusion detection task. We examine the performance of a broad range of pretrained LMs on this detection task… ▽ More While pretrained language models ("LM") have driven impressive gains over morpho-syntactic and semantic tasks, their ability to model discourse and pragmatic phenomena is less clear. As a step towards a better understanding of their discourse modelling capabilities, we propose a sentence intrusion detection task. We examine the performance of a broad range of pretrained LMs on this detection task for English. Lacking a dataset for the task, we introduce INSteD, a novel intruder sentence detection dataset, containing 170,000+ documents constructed from English Wikipedia and CNN news articles. Our experiments show that pretrained LMs perform impressively in in-domain evaluation, but experience a substantial drop in the cross-domain setting, indicating limited generalisation capacity. Further results over a novel linguistic probe dataset show that there is substantial room for improvement, especially in the cross-domain setting. △ Less

Submitted 18 March, 2021; originally announced March 2021.

Comments: accepted to TACL 2021

arXiv:2103.06714 [pdf, ps, other]

Ordered Semiautomatic Rings with Applications to Geometry

Authors: Ziyuan Gao, Sanjay Jain, Ji Qi, Philipp Schlicht, Frank Stephan, Jacob Tarr

Abstract: The present work looks at semiautomatic rings with automatic addition and comparisons which are dense subrings of the real numbers and asks how these can be used to represent geometric objects such that certain operations and transformations are automatic. The underlying ring has always to be a countable dense subring of the real numbers and additions and comparisons and multiplications with const… ▽ More The present work looks at semiautomatic rings with automatic addition and comparisons which are dense subrings of the real numbers and asks how these can be used to represent geometric objects such that certain operations and transformations are automatic. The underlying ring has always to be a countable dense subring of the real numbers and additions and comparisons and multiplications with constants need to be automatic. It is shown that the ring can be selected such that equilateral triangles can be represented and rotations by 30 degrees are possible, while the standard representation of the b-adic rationals does not allow this. △ Less

Submitted 11 March, 2021; originally announced March 2021.

Journal ref: Extended version of the article in the Proceedings of the International Conference on Language and Automata Theory and Applications 2020 Mar 4 (pp. 141-153). Springer, Cham

arXiv:2103.05212 [pdf, ps, other]

doi 10.3847/1538-4357/abebe0

Testing f(R) gravity with the simulated data of gravitational waves from the Einstein Telescope

Authors: Yu Pan, Yuan He, **gZhao Qi, ** Li, Shuo Cao, Tonghua Liu, Jun Wang

Abstract: In this paper we analyze the implications of gravitational waves (GWs) as standard sirens on the modified gravity models by using the third-generation gravitational wave detector, i.e., the Einstein Telescope. Two viable models in $f(R)$ theories within the Palatini formalism are considered in our analysis ($f_{1}(\mathcal{R})=\mathcal{R}-\fracβ{\mathcal{R}^{n}}$ and… ▽ More In this paper we analyze the implications of gravitational waves (GWs) as standard sirens on the modified gravity models by using the third-generation gravitational wave detector, i.e., the Einstein Telescope. Two viable models in $f(R)$ theories within the Palatini formalism are considered in our analysis ($f_{1}(\mathcal{R})=\mathcal{R}-\fracβ{\mathcal{R}^{n}}$ and $f_{2}(\mathcal{R})=\mathcal{R}+α\ln{\mathcal{R}}-β$), with the combination of simulated GW data and the latest electromagnetic (EM) observational data (including the recently released Pantheon type Ia supernovae sample, the cosmic chronometer data, and baryon acoustic oscillation distance measurements). Our analysis reveals that the standard sirens GWs, which provide an independent and complementary alternative to current experiments, could effectively eliminate the degeneracies among parameters in the two modified gravity models. In addition, we thoroughly investigate the nature of geometrical dark energy in the modified gravity theories with the assistance of $Om(z)$ and statefinder diagnostic analysis. The present analysis makes it clear-cut that the simplest cosmological constant model is still the most preferred by the current data. However, the combination of future naturally improved GW data most recent EM observations will reveal the consistency or acknowledge the tension between the $Λ$CDM model and modified gravity theories. △ Less

Submitted 8 March, 2021; originally announced March 2021.

Comments: 10 pages, 11 figures, accepted for publication in ApJ

arXiv:2102.08572 [pdf, other]

doi 10.1103/PhysRevB.103.L180409

Unveiling the Hybridization Process in a Quantum Critical Ferromagnet by Ultrafast Optical Spectroscopy

Authors: Y. H. Pei, Y. J. Zhang, Z. X. Wei, Y. X. Chen, K. Hu, Y. -F Yang, H. Q. Yuan, J. Qi

Abstract: We report the ultrafast optical pump-probe spectroscopy measurements on the recently discovered quantum critical ferromagnet CeRh$_6$Ge$_4$. Our experimental results reveal the two-stage development of the hybridization between localized $f$ moments and conduction electrons with lowering temperature, as evidenced by (1) the presence of hybridization fluctuation for temperatures from $\sim$85 K (… ▽ More We report the ultrafast optical pump-probe spectroscopy measurements on the recently discovered quantum critical ferromagnet CeRh$_6$Ge$_4$. Our experimental results reveal the two-stage development of the hybridization between localized $f$ moments and conduction electrons with lowering temperature, as evidenced by (1) the presence of hybridization fluctuation for temperatures from $\sim$85 K ($T^*$) to $\sim$140 K ($T^\dagger$), and (2) the emergence of collective hybridization below the coherence temperature, $T^*$, marked by the opening of an indirect gap of 2$Δ$ $\approx$12 meV. We also observe three coherent phonon modes being softened anomalously below $T^*$, reflecting directly their coupling with the emergent coherent heavy electrons. Our findings establish the universal nature of the hybridization process in different heavy fermion systems. △ Less

Submitted 16 February, 2021; originally announced February 2021.

Journal ref: Phys. Rev. B 103, 180409 (2021)

arXiv:2102.08413 [pdf, other]

Bridging the Gap Between Simulated and Experimental Ionic Conductivities in Lithium Superionic Conductors

Authors: Ji Qi, Swastika Banerjee, Yunxing Zuo, Chi Chen, Zhuoying Zhu, H. C. Manas Likhit, Xiangguo Li, Shyue ** Ong

Abstract: Lithium superionic conductors (LSCs) are of major importance as solid electrolytes for next-generation all-solid-state lithium-ion batteries. While $ab$ $initio$ molecular dynamics have been extensively applied to study these materials, there are often large discrepancies between predicted and experimentally measured ionic conductivities and activation energies due to the high temperatures and sho… ▽ More Lithium superionic conductors (LSCs) are of major importance as solid electrolytes for next-generation all-solid-state lithium-ion batteries. While $ab$ $initio$ molecular dynamics have been extensively applied to study these materials, there are often large discrepancies between predicted and experimentally measured ionic conductivities and activation energies due to the high temperatures and short time scales of such simulations. Here, we present a strategy to bridge this gap using moment tensor potentials (MTPs). We show that MTPs trained on energies and forces computed using the van der Waals optB88 functional yield much more accurate lattice parameters, which in turn leads to accurate prediction of ionic conductivities and activation energies for the Li$_{0.33}$La$_{0.56}$TiO$_3$, Li$_3$YCl$_6$ and Li$_7$P$_3$S$_{11}$ LSCs. NPT MD simulations using the optB88 MTPs also reveal that all three LSCs undergo a transition between two quasi-linear Arrhenius regimes at relatively low temperatures. This transition can be traced to an expansion in the number and diversity of diffusion pathways, in some cases with a change in the dimensionality of diffusion. This work presents not only an approach to develop high accuracy MTPs, but also outlines the diffusion characteristics for LSCs which is otherwise inaccessible through $ab$ $initio$ computation. △ Less

Submitted 20 June, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

Comments: Main text: 22 pages excluding references, 6 figures; Supporting information: 13 pages, 12 figures. Submitted to Materials Today Physics

arXiv:2102.08081 [pdf, ps, other]

A Lazy Approach for Efficient Index Learning

Authors: Guanli Liu, Lars Kulik, Xingjun Ma, Jianzhong Qi

Abstract: Learned indices using neural networks have been shown to outperform traditional indices such as B-trees in both query time and memory. However, learning the distribution of a large dataset can be expensive, and updating learned indices is difficult, thus hindering their usage in practical applications. In this paper, we address the efficiency and update issues of learned indices through agile mode… ▽ More Learned indices using neural networks have been shown to outperform traditional indices such as B-trees in both query time and memory. However, learning the distribution of a large dataset can be expensive, and updating learned indices is difficult, thus hindering their usage in practical applications. In this paper, we address the efficiency and update issues of learned indices through agile model reuse. We pre-train learned indices over a set of synthetic (rather than real) datasets and propose a novel approach to reuse these pre-trained models for a new (real) dataset. The synthetic datasets are created to cover a large range of different distributions. Given a new dataset DT, we select the learned index of a synthetic dataset similar to DT, to index DT. We show a bound over the indexing error when a pre-trained index is selected. We further show how our techniques can handle data updates and bound the resultant indexing errors. Experimental results on synthetic and real datasets confirm the effectiveness and efficiency of our proposed lazy (model reuse) approach. △ Less

Submitted 17 February, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

arXiv:2102.03979 [pdf, other]

doi 10.3847/1538-4357/ac0ef5

Prospects for Constraining Interacting Dark Energy Models with 21 cm Intensity Map** Experiments

Authors: Ming Zhang, Bo Wang, Peng-Ju Wu, **g-Zhao Qi, Yidong Xu, **g-Fei Zhang, Xin Zhang

Abstract: We forecast constraints on cosmological parameters in the interacting dark energy models using the mock data generated for neutral hydrogen intensity map** (IM) experiments. In this work, we only consider the interacting dark energy models with energy transfer rate $Q=βHρ_{\rm c}$, and take BINGO, FAST, SKA1-MID, and Tianlai as typical examples of the 21 cm IM experiments. We find that the Tianl… ▽ More We forecast constraints on cosmological parameters in the interacting dark energy models using the mock data generated for neutral hydrogen intensity map** (IM) experiments. In this work, we only consider the interacting dark energy models with energy transfer rate $Q=βHρ_{\rm c}$, and take BINGO, FAST, SKA1-MID, and Tianlai as typical examples of the 21 cm IM experiments. We find that the Tianlai cylinder array will play an important role in constraining the interacting dark energy model. Assuming perfect foreground removal and calibration, and using the Tianlai-alone data, we obtain $σ(H_0)=0.19$ km s$^{-1}$ Mpc$^{-1}$, $σ(Ω_{\rm m})=0.0033$ and $σ(σ_8)=0.0033$ in the I$Λ$CDM model, which are much better than the results of Planck+optical BAO (i.e. optical galaxy surveys). However, the Tianlai-alone data cannot provide a very tight constraint on the coupling parameter $β$ compared with Planck+optical BAO, while the Planck+Tianlai data can give a rather tight constraint of $σ(β)=0.00023$ due to the parameter degeneracies being well broken by the data combination. In the I$w$CDM model, we obtain $σ(β)=0.00079$ and $σ(w)=0.013$ from Planck+Tianlai. In addition, we also make a detailed comparison among BINGO, FAST, SKA1-MID, and Tianlai in constraining the interacting dark energy models. We show that future 21 cm IM experiments will provide a useful tool for exploring the nature of dark energy and play a significant role in measuring the coupling between dark energy and dark matter. △ Less

Submitted 10 September, 2021; v1 submitted 7 February, 2021; originally announced February 2021.

Comments: 13 pages, 5 figures

Journal ref: Astrophys. J. 918 (2021) 56

arXiv:2102.03575 [pdf, other]

A graphical algorithm for the integration of monomials in the Chow ring of the moduli space of stable marked curves of genus zero

Authors: Jiayue Qi

Abstract: The Chow group of zero cycles in the moduli space of stable pointed curves of genus zero is isomorphic to the integer additive group. Let $M$ be monomial in this Chow group. If no two factors of $M$ fulfill a particular quadratic relation, then the monomial can be represented equivalently by a specific tree; otherwise, $M$ is mapped to zero under the stated isomorphism. Starting from this tree rep… ▽ More The Chow group of zero cycles in the moduli space of stable pointed curves of genus zero is isomorphic to the integer additive group. Let $M$ be monomial in this Chow group. If no two factors of $M$ fulfill a particular quadratic relation, then the monomial can be represented equivalently by a specific tree; otherwise, $M$ is mapped to zero under the stated isomorphism. Starting from this tree representation, we introduce a graphical algorithm for computing the corresponding integer for $M$ under the aforementioned isomorphism. The algorithm is linear with respect to the size of the tree. △ Less

Submitted 6 February, 2021; originally announced February 2021.

MSC Class: 14C17; 14N10; 14H10; 14H45; 14Q15; 05C85; 05C05; 05C22; 05C62

arXiv:2102.02538 [pdf, ps, other]

doi 10.1088/1674-1137/ac6d4e

The impact of $a_0^0(980)-f_0(980)$ mixing on the localized $CP$ violations of the $B^-\rightarrow K^- π^+π^-$ decay

Authors: **g-Juan Qi, Zhen-Yang Wang, Chao Wang, Zhen-Hua Zhang, Xin-Heng Guo

Abstract: In the framework of the QCD factorization approach, we study the localized $CP$ violations of the $B^-\rightarrow K^- π^+π^-$ decay with and without $a_0^0(980)-f_0(980)$ mixing mechanism, respectively, and find that the localized $CP$ violation can be enhanced by this mixing effect when the mass of the $π^+π^-$ pair is in the vicinity of the $f_0(980)$ resonance. The corresponding theoretical pre… ▽ More In the framework of the QCD factorization approach, we study the localized $CP$ violations of the $B^-\rightarrow K^- π^+π^-$ decay with and without $a_0^0(980)-f_0(980)$ mixing mechanism, respectively, and find that the localized $CP$ violation can be enhanced by this mixing effect when the mass of the $π^+π^-$ pair is in the vicinity of the $f_0(980)$ resonance. The corresponding theoretical prediction results are $\mathcal{A}_{CP}(B^-\rightarrow K f_0 \rightarrow K^-π^+π^-)=[0.24, 0.36]$ and $\mathcal{A}_{CP}(B^-\rightarrow K^- f_0(a_0) \rightarrow K^-π^+π^-)=[0.33, 0.52]$, respectively. Meanwhile, we also calculate the branching fraction of the $B^-\rightarrow K^-f_0(980)\rightarrow K^-π^+π^-$ decay, which is consistent with the experimental results. We suggest that $a_0^0(980)-f_0(980)$ mixing mechanism should be considered when studying the $CP$ violation of the $B$ or $D$ mesons decays theoretically and experimentally. △ Less

Submitted 4 February, 2021; originally announced February 2021.

Comments: 11 pages, 3 figures

arXiv:2102.01558 [pdf, other]

Occluded Video Instance Segmentation: A Benchmark

Authors: Jiyang Qi, Yan Gao, Yao Hu, Xinggang Wang, Xiaoyu Liu, Xiang Bai, Serge Belongie, Alan Yuille, Philip H. S. Torr, Song Bai

Abstract: Can our video understanding systems perceive objects when a heavy occlusion exists in a scene? To answer this question, we collect a large-scale dataset called OVIS for occluded video instance segmentation, that is, to simultaneously detect, segment, and track instances in occluded scenes. OVIS consists of 296k high-quality instance masks from 25 semantic categories, where object occlusions usua… ▽ More Can our video understanding systems perceive objects when a heavy occlusion exists in a scene? To answer this question, we collect a large-scale dataset called OVIS for occluded video instance segmentation, that is, to simultaneously detect, segment, and track instances in occluded scenes. OVIS consists of 296k high-quality instance masks from 25 semantic categories, where object occlusions usually occur. While our human vision systems can understand those occluded instances by contextual reasoning and association, our experiments suggest that current video understanding systems cannot. On the OVIS dataset, the highest AP achieved by state-of-the-art algorithms is only 16.3, which reveals that we are still at a nascent stage for understanding objects, instances, and videos in a real-world scenario. We also present a simple plug-and-play module that performs temporal feature calibration to complement missing object cues caused by occlusion. Built upon MaskTrack R-CNN and SipMask, we obtain a remarkable AP improvement on the OVIS dataset. The OVIS dataset and project code are available at http://songbai.site/ovis . △ Less

Submitted 17 May, 2022; v1 submitted 2 February, 2021; originally announced February 2021.

Comments: IJCV 2022. Project page at https://songbai.site/ovis

MSC Class: 68T07; 68T45

arXiv:2102.01292 [pdf, other]

doi 10.1088/1475-7516/2021/12/042

Using a multi-messenger and multi-wavelength observational strategy to probe the nature of dark energy through direct measurements of cosmic expansion history

Authors: **g-Zhao Qi, Shang-Jie **, Xi-Long Fan, **g-Fei Zhang, Xin Zhang

Abstract: In the near future, the redshift drift observations in optical and radio bands will provide precise measurements on $H(z)$ covering the redshift ranges of $2<z<5$ and $0<z<0.3$. In addition, gravitational wave (GW) standard siren observations could make measurements on the dipole anisotropy of luminosity distance, which will also provide the $H(z)$ measurements in the redshift range of $0<z<3$. In… ▽ More In the near future, the redshift drift observations in optical and radio bands will provide precise measurements on $H(z)$ covering the redshift ranges of $2<z<5$ and $0<z<0.3$. In addition, gravitational wave (GW) standard siren observations could make measurements on the dipole anisotropy of luminosity distance, which will also provide the $H(z)$ measurements in the redshift range of $0<z<3$. In this work, we propose a multi-messenger and multi-wavelength observational strategy to measure $H(z)$ based on the three next-generation projects, E-ELT, SKA, and DECIGO, and we wish to see whether the future $H(z)$ measurements could provide tight constraints on dark-energy parameters. The dark energy models we consider include $Λ$CDM, $w$CDM, CPL, HDE, and I$Λ$CDM models. It is found that E-ELT, SKA1, and DECIGO are highly complementary in constraining dark energy models. Although any one of these three data sets can only give rather weak constraints on each model we consider, the combination of them could significantly break the parameter degeneracies and give much tighter constraints on almost all the cosmological parameters. Moreover, we find that the combination of E-ELT, SKA1, DECIGO, and CMB could further improve the constraints on dark energy parameters, e.g., $σ(w_0)=0.024$ and $σ(w_a)=0.17$ in the CPL model, which means that these three promising probes will play a key role in hel** reveal the nature of dark energy. △ Less

Submitted 20 December, 2021; v1 submitted 1 February, 2021; originally announced February 2021.

Comments: 18 pages, 7 figures

Journal ref: JCAP 12 (2021) 042

arXiv:2101.10897 [pdf, other]

HexCNN: A Framework for Native Hexagonal Convolutional Neural Networks

Authors: Yunxiang Zhao, Qiuhong Ke, Flip Korn, Jianzhong Qi, Rui Zhang

Abstract: Hexagonal CNN models have shown superior performance in applications such as IACT data analysis and aerial scene classification due to their better rotation symmetry and reduced anisotropy. In order to realize hexagonal processing, existing studies mainly use the ZeroOut method to imitate hexagonal processing, which causes substantial memory and computation overheads. We address this deficiency wi… ▽ More Hexagonal CNN models have shown superior performance in applications such as IACT data analysis and aerial scene classification due to their better rotation symmetry and reduced anisotropy. In order to realize hexagonal processing, existing studies mainly use the ZeroOut method to imitate hexagonal processing, which causes substantial memory and computation overheads. We address this deficiency with a novel native hexagonal CNN framework named HexCNN. HexCNN takes hexagon-shaped input and performs forward and backward propagation on the original form of the input based on hexagon-shaped filters, hence avoiding computation and memory overheads caused by imitation. For applications with rectangle-shaped input but require hexagonal processing, HexCNN can be applied by padding the input into hexagon-shape as preprocessing. In this case, we show that the time and space efficiency of HexCNN still outperforms existing hexagonal CNN methods substantially. Experimental results show that compared with the state-of-the-art models, which imitate hexagonal processing but using rectangle-shaped filters, HexCNN reduces the training time by up to 42.2%. Meanwhile, HexCNN saves the memory space cost by up to 25% and 41.7% for loading the input and performing convolution, respectively. △ Less

Submitted 25 January, 2021; originally announced January 2021.

arXiv:2101.09372 [pdf]

Full Poincaré polarimetry enabled through physical inference

Authors: Chao He, Jianyu Lin, **tao Chang, Jacopo Antonello, Ben Dai, **gyu Wang, Jiahe Cui, Ji Qi, Min Wu, Daniel S. Elson, Peng Xi, Andrew Forbes, Martin J. Booth

Abstract: While polarisation sensing is vital in many areas of research, with applications spanning from microscopy to aerospace, traditional approaches are limited by method-related error amplification or accumulation, placing fundamental limitations on precision and accuracy in single-shot polarimetry. Here, we put forward a new measurement paradigm to circumvent this, introducing the notion of a universa… ▽ More While polarisation sensing is vital in many areas of research, with applications spanning from microscopy to aerospace, traditional approaches are limited by method-related error amplification or accumulation, placing fundamental limitations on precision and accuracy in single-shot polarimetry. Here, we put forward a new measurement paradigm to circumvent this, introducing the notion of a universal full Poincaré generator to map all polarisation analyser states into a single vectorially structured light field, allowing all vector components to be analysed in a single-shot with theoretically user-defined precision. To demonstrate the advantage of our approach, we use a common GRIN optic as our map** device and show mean errors of <1% for each vector component, enhancing the sensitivity by around three times, allowing us to sense weak polarisation aberrations not measurable by traditional single-shot techniques. Our work paves the way for next-generation polarimetry, impacting a wide variety of applications relying on weak vector measurement. △ Less

Submitted 15 September, 2022; v1 submitted 22 January, 2021; originally announced January 2021.

arXiv:2101.07513 [pdf, other]

Towards Latent Space Based Manipulation of Elastic Rods using Autoencoder Models and Robust Centerline Extractions

Authors: Jiaming Qi, Guangfu Ma, Peng Zhou, Haibo Zhang, Yueyong Lyu, David Navarro-Alarcon

Abstract: The automatic shape control of deformable objects is a challenging (and currently hot) manipulation problem due to their high-dimensional geometric features and complex physical properties. In this study, a new methodology to manipulate elastic rods automatically into 2D desired shapes is presented. An efficient vision-based controller that uses a deep autoencoder network is designed to compute a… ▽ More The automatic shape control of deformable objects is a challenging (and currently hot) manipulation problem due to their high-dimensional geometric features and complex physical properties. In this study, a new methodology to manipulate elastic rods automatically into 2D desired shapes is presented. An efficient vision-based controller that uses a deep autoencoder network is designed to compute a compact representation of the object's infinite-dimensional shape. An online algorithm that approximates the sensorimotor map** between the robot's configuration and the object's shape features is used to deal with the latter's (typically unknown) mechanical properties. The proposed approach computes the rod's centerline from raw visual data in real-time by introducing an adaptive algorithm on the basis of a self-organizing network. Its effectiveness is thoroughly validated with simulations and experiments. △ Less

Submitted 9 April, 2021; v1 submitted 19 January, 2021; originally announced January 2021.

arXiv:2101.06688 [pdf, ps, other]

doi 10.1140/epjc/s10052-021-09422-z

Studying the $\bar{D}_1K$ molecule in the Bethe-Salpeter equation approach

Authors: **g-Juan Qi, Zhen-Yang Wang, Zhu-Feng Zhang, Xin-Heng Guo

Abstract: We interpret the $X_1(2900)$ as an $S$-wave $\bar{D}_1K$ molecular state in the Bethe-Salpeter equation approach with the ladder and instantaneous approximations for the kernel. By solving the Bethe-Salpeter equation numerically with the kernel containing one-particle-exchange diagrams and introducing three different form factors (monopole, dipole, and exponential form factors) in the verties, we… ▽ More We interpret the $X_1(2900)$ as an $S$-wave $\bar{D}_1K$ molecular state in the Bethe-Salpeter equation approach with the ladder and instantaneous approximations for the kernel. By solving the Bethe-Salpeter equation numerically with the kernel containing one-particle-exchange diagrams and introducing three different form factors (monopole, dipole, and exponential form factors) in the verties, we find the bound state exists. We also study the decay width of the decay $X_1(2900)$ to $D^-K^+$. △ Less

Submitted 10 May, 2021; v1 submitted 17 January, 2021; originally announced January 2021.

Comments: 13 pages, 4 figures. arXiv admin note: text overlap with arXiv:2004.14085, arXiv:1906.09002

arXiv:2101.03789 [pdf, other]

A tree-based algorithm for the integration of monomials in the Chow ring of the moduli space of stable marked curves of genus zero

Authors: Jiayue Qi

Abstract: The Chow ring of the moduli space of marked rational curves is generated by Keel's divisor classes. The top graded part of this Chow ring is isomorphic to the integers, generated by the class of a single point. In this paper, we give an algorithm for computing the intersection degree of tuples of Keel's divisor classes. This computation is a concrete but complicated algorithmic question in the fie… ▽ More The Chow ring of the moduli space of marked rational curves is generated by Keel's divisor classes. The top graded part of this Chow ring is isomorphic to the integers, generated by the class of a single point. In this paper, we give an algorithm for computing the intersection degree of tuples of Keel's divisor classes. This computation is a concrete but complicated algorithmic question in the field. Also, we give a simple complexity argument for the algorithm. Additionally, we introduce three identities on multinomial coefficients, as well as proofs for them. △ Less

Submitted 26 October, 2022; v1 submitted 11 January, 2021; originally announced January 2021.

Report number: DK-Report No. 2021-01 MSC Class: 14N10; 14C17; 14H10; 14H45; 14Q15; 05C05; 05C22; 05C62; 05C85; 05A19

arXiv:2101.00908 [pdf, other]

A Stochastic Multi-Agent Optimization Framework for Interdependent Transportation and Power System Analyses

Authors: Zhaomiao Guo, Fatima Afifah, Junjian Qi, Sina Baghali

Abstract: We study the interdependence between transportation and power systems considering decentralized renewable generators and electric vehicles (EVs). We formulate the problem in a stochastic multi-agent optimization framework considering the complex interactions between EV/conventional vehicle drivers, \revi{renewable}/conventional generators, and independent system operators, with locational electric… ▽ More We study the interdependence between transportation and power systems considering decentralized renewable generators and electric vehicles (EVs). We formulate the problem in a stochastic multi-agent optimization framework considering the complex interactions between EV/conventional vehicle drivers, \revi{renewable}/conventional generators, and independent system operators, with locational electricity and charging prices endogenously determined by markets. We show that the multi-agent optimization problems can be reformulated as a single convex optimization problem and prove the existence and uniqueness of the equilibrium. To cope with the curse of dimensionality, we propose ADMM-based decomposition algorithm to facilitate parallel computing. Numerical insights are generated using standard test systems in transportation and power system literature. △ Less

Submitted 4 January, 2021; originally announced January 2021.

arXiv:2012.12462 [pdf, ps, other]

doi 10.1093/mnrasl/slaa205

Measuring the viscosity of dark matter with strongly lensed gravitational waves

Authors: Shuo Cao, **gzhao Qi, Marek Biesiada, Tonghua Liu, ** Li, Zong-Hong Zhu

Abstract: Based on the strongly lensed gravitational waves (GWs) from compact binary coalescence, we propose a new strategy to examine the fluid shear viscosity of dark matter (DM) in the gravitational wave domain, i.e., whether a GW experiences the dam** effect when it propagates in DM fluid with nonzero shear viscosity. By assuming that the dark matter self-scatterings are efficient enough for the hydro… ▽ More Based on the strongly lensed gravitational waves (GWs) from compact binary coalescence, we propose a new strategy to examine the fluid shear viscosity of dark matter (DM) in the gravitational wave domain, i.e., whether a GW experiences the dam** effect when it propagates in DM fluid with nonzero shear viscosity. By assuming that the dark matter self-scatterings are efficient enough for the hydrodynamic description to be valid, our results demonstrate that future ground-based Einstein Telescope (ET) and satellite GW observatory (Big Bang Observer; BBO) may succeed in detecting any dark matter self-interactions at the scales of galaxies and clusters. △ Less

Submitted 22 December, 2020; originally announced December 2020.

Comments: 1 figure, 1 table, accepted for publication in MNRAS Letters

arXiv:2012.10107 [pdf, ps, other]

doi 10.1002/mma.7547

The number of Dirac-weighted eigenvalues of Sturm-Liouville equations with integrable potentials and an application to inverse problems

Authors: Xiao Chen, Jiangang Qi

Abstract: In this paper, we further Meirong Zhang, et al.'s work by computing the number of weighted eigenvalues for Sturm-Liouville equations, equipped with general integrable potentials and Dirac weights, under Dirichlet boundary condition. We show that, for a Sturm-Liouville equation with a general integrable potential, if its weight is a positive linear combination of $n$ Dirac Delta functions, then it… ▽ More In this paper, we further Meirong Zhang, et al.'s work by computing the number of weighted eigenvalues for Sturm-Liouville equations, equipped with general integrable potentials and Dirac weights, under Dirichlet boundary condition. We show that, for a Sturm-Liouville equation with a general integrable potential, if its weight is a positive linear combination of $n$ Dirac Delta functions, then it has at most $n$ (may be less than $n$, or even be $0$) distinct real Dirichlet eigenvalues, or every complex number is a Dirichlet eigenvalue; in particular, under some sharp condition, the number of Dirichlet eigenvalues is exactly $n$. Our main method is to introduce the concepts of characteristics matrix and characteristics polynomial for Sturm-Liouville problem with Dirac weights, and put forward a general and direct algorithm used for computing eigenvalues. As an application, a class of inverse Dirichelt problems for Sturm-Liouville equations involving single Dirac distribution weights is studied. △ Less

Submitted 18 December, 2020; originally announced December 2020.

Comments: 23 pages

MSC Class: 34A06; 34A55; 34B09

arXiv:2012.02846 [pdf, other]

doi 10.1103/PhysRevLett.126.091301

Search for Coherent Elastic Scattering of Solar $^8$B Neutrinos in the XENON1T Dark Matter Experiment

Authors: E. Aprile, J. Aalbers, F. Agostini, S. Ahmed Maouloud, M. Alfonsi, L. Althueser, F. D. Amaro, S. Andaloro, V. C. Antochi, E. Angelino, J. R. Angevaare, F. Arneodo, L. Baudis, B. Bauermeister, L. Bellagamba, M. L. Benabderrahmane, A. Brown, E. Brown, S. Bruenner, G. Bruno, R. Budnik, C. Capelli, J. M. R. Cardoso, D. Cichon, B. Cimmino , et al. (113 additional authors not shown)

Abstract: We report on a search for nuclear recoil signals from solar $^8$B neutrinos elastically scattering off xenon nuclei in XENON1T data, lowering the energy threshold from 2.6 keV to 1.6 keV. We develop a variety of novel techniques to limit the resulting increase in backgrounds near the threshold. No significant $^8$B neutrino-like excess is found in an exposure of 0.6 t $\times$ y. For the first tim… ▽ More We report on a search for nuclear recoil signals from solar $^8$B neutrinos elastically scattering off xenon nuclei in XENON1T data, lowering the energy threshold from 2.6 keV to 1.6 keV. We develop a variety of novel techniques to limit the resulting increase in backgrounds near the threshold. No significant $^8$B neutrino-like excess is found in an exposure of 0.6 t $\times$ y. For the first time, we use the non-detection of solar neutrinos to constrain the light yield from 1-2 keV nuclear recoils in liquid xenon, as well as non-standard neutrino-quark interactions. Finally, we improve upon world-leading constraints on dark matter-nucleus interactions for dark matter masses between 3 GeV/c$^2$ and 11 GeV/c$^2$ by as much as an order of magnitude. △ Less

Submitted 15 March, 2021; v1 submitted 4 December, 2020; originally announced December 2020.

Journal ref: Phys. Rev. Lett. 126, 091301 (2021)

arXiv:2012.00803 [pdf, other]

Generator Parameter Estimation by Q-Learning Based on PMU Measurements

Authors: Seyyed Rashid Khazeiynasab, Junjian Qi, Issa Batarseh

Abstract: In this paper, a novel Q-learning based approach is proposed for estimating the parameters of synchronous generators using PMU measurements. Event playback is used to generate model outputs under different parameters for training the agent in Q-learning. We assume that the exact values of some parameters in the model are not known by the agent in Q-learning. Then, an optimal history-dependent poli… ▽ More In this paper, a novel Q-learning based approach is proposed for estimating the parameters of synchronous generators using PMU measurements. Event playback is used to generate model outputs under different parameters for training the agent in Q-learning. We assume that the exact values of some parameters in the model are not known by the agent in Q-learning. Then, an optimal history-dependent policy for the exploration-exploitation trade-off is planned. With given prior knowledge, the parameter vector can be viewed as states with a specific reward, which is a function of the fitting error compared with the measurements. The agent takes an action (either increasing or decreasing the parameter) and the estimated parameter will move to a new state. Based on the reward function, the optimal action policy will move the parameter set to a state with the highest reward. If multiple events are available, they will be used sequentially so that the updated $\mathbfcal{Q}$-value can be utilized to improve the computational efficiency. The effectiveness of the proposed approach is validated by estimating the parameters of the dynamic model of a synchronous generator. △ Less

Submitted 1 December, 2020; originally announced December 2020.

arXiv:2011.14403 [pdf]

Navigating the Complex Compositional Landscape of High-Entropy Alloys

Authors: Jie Qi, Andrew M. Cheung, S. Joseph Poon

Abstract: High-entropy alloys, which exist in the high-dimensional composition space, provide enormous unique opportunities for realizing unprecedented structural and functional properties. A fundamental challenge, however, lies in how to predict the specific alloy phases and desirable properties accurately. This review article provides an overview of the data-driven methods published to date to tackle this… ▽ More High-entropy alloys, which exist in the high-dimensional composition space, provide enormous unique opportunities for realizing unprecedented structural and functional properties. A fundamental challenge, however, lies in how to predict the specific alloy phases and desirable properties accurately. This review article provides an overview of the data-driven methods published to date to tackle this exponentially hard problem of designing high-entropy alloys. Various utilizations of empirical parameters, first-principles and thermodynamic calculations, statistical methods, and machine learning are described. In an alternative method, the effectiveness of using phenomenological features and data-inspired adaptive features in the prediction of the high-entropy solid solution phases and intermetallic alloy composites is demonstrated. The prospect of high-entropy alloys as a new class of functional materials with improved properties is featured in light of entropic effects. The successes, challenges, and limitations of the current high-entropy alloys design are discussed, and some plausible future directions are presented. △ Less

Submitted 18 May, 2021; v1 submitted 29 November, 2020; originally announced November 2020.

Comments: To appear as a book chapter

arXiv:2011.11774 [pdf, ps, other]

Five Equivalent Representations of a Phylogenetic Tree

Authors: Jiayue Qi, Josef Schicho

Abstract: A phylogenetic tree is a tree with a fixed set of leaves that has no vertices of degree two. In this paper, we axiomatically define four other discrete structures on the set of leaves. We prove that each of these structures is an equivalent representation of a phylogenetic tree. A phylogenetic tree is a tree with a fixed set of leaves that has no vertices of degree two. In this paper, we axiomatically define four other discrete structures on the set of leaves. We prove that each of these structures is an equivalent representation of a phylogenetic tree. △ Less

Submitted 26 March, 2021; v1 submitted 23 November, 2020; originally announced November 2020.

MSC Class: 05C99

arXiv:2011.10431 [pdf, other]

doi 10.1103/PhysRevD.103.063028

Search for inelastic scattering of WIMP dark matter in XENON1T

Authors: XENON Collaboration, E. Aprile, J. Aalbers, F. Agostini, M. Alfonsi, L. Althueser, F. D. Amaro, S. Andaloro, E. Angelino, J. R. Angevaare, V. C. Antochi, F. Arneodo, L. Baudis, B. Bauermeister, L. Bellagamba, M. L. Benabderrahmane, A. Brown, E. Brown, S. Bruenner, G. Bruno, R. Budnik, C. Capelli, J. M. R. Cardoso, D. Cichon, B. Cimmino , et al. (116 additional authors not shown)

Abstract: We report the results of a search for the inelastic scattering of weakly interacting massive particles (WIMPs) in the XENON1T dark matter experiment. Scattering off $^{129}$Xe is the most sensitive probe of inelastic WIMP interactions, with a signature of a 39.6 keV de-excitation photon detected simultaneously with the nuclear recoil. Using an exposure of 0.89 tonne-years, we find no evidence of i… ▽ More We report the results of a search for the inelastic scattering of weakly interacting massive particles (WIMPs) in the XENON1T dark matter experiment. Scattering off $^{129}$Xe is the most sensitive probe of inelastic WIMP interactions, with a signature of a 39.6 keV de-excitation photon detected simultaneously with the nuclear recoil. Using an exposure of 0.89 tonne-years, we find no evidence of inelastic WIMP scattering with a significance of more than 2$σ$. A profile-likelihood ratio analysis is used to set upper limits on the cross-section of WIMP-nucleus interactions. We exclude new parameter space for WIMPs heavier than 100 GeV/c${}^2$, with the strongest upper limit of $3.3 \times 10^{-39}$ cm${}^2$ for 130 GeV/c${}^2$ WIMPs at 90\% confidence level. △ Less

Submitted 26 February, 2021; v1 submitted 20 November, 2020; originally announced November 2020.

Comments: 9 pages, 4 figures

Journal ref: Phys. Rev. D 103, 063028 (2021)

arXiv:2011.07736 [pdf]

Magnetic Dynamic Polymers for Modular Assembling and Reconfigurable Morphing Architectures

Authors: Xiao Kuang, Shuai Wu, Yi **, Qiji Ze, S. Macrae Montgomery, Liang Yue, H. Jerry Qi, Ruike Zhao

Abstract: Shape morphing magnetic soft materials, composed of magnetic particles in a soft polymer matrix, can transform shapes reversibly, remotely, and rapidly, finding diverse applications in actuators, soft robotics, and biomedical devices. To achieve on-demand and sophisticated shape morphing, the manufacturing of structures with complex geometry and magnetization distribution is highly desired. Here,… ▽ More Shape morphing magnetic soft materials, composed of magnetic particles in a soft polymer matrix, can transform shapes reversibly, remotely, and rapidly, finding diverse applications in actuators, soft robotics, and biomedical devices. To achieve on-demand and sophisticated shape morphing, the manufacturing of structures with complex geometry and magnetization distribution is highly desired. Here, we report a magnetic dynamic polymer composite composed of hard-magnetic microparticles in a dynamic polymer network with thermal-responsive reversible linkages, which permit functionalities including targeted welding, magnetization reprogramming, and structural reconfiguration. These functions not only provide highly desirable structural and material programmability and reprogrammability but also enable the manufacturing of structures with complex geometry and magnetization distribution. The targeted welding is exploited for modular assembling of fundamental building modules with specific logics for complex actuation. The magnetization reprogramming enables altering the morphing mode of the manufactured structures. The shape reconfiguration under magnetic actuation is coupled with network plasticity to remotely transform two-dimensional tessellations into complex three-dimensional architectures, providing a new strategy of manufacturing functional soft architected materials such as three-dimensional kirigami. We anticipate that the reported magnetic dynamic polymer provides a new paradigm for the design and manufacturing of future multifunctional assemblies and reconfigurable morphing architectures and devices. △ Less

Submitted 16 November, 2020; originally announced November 2020.

arXiv:2011.00713 [pdf, other]

doi 10.1093/mnras/stab638

Measurements of the Hubble constant and cosmic curvature with quasars: ultra-compact radio structure and strong gravitational lensing

Authors: **g-Zhao Qi, Jia-Wei Zhao, Shuo Cao, Marek Biesiada, Yuting Liu

Abstract: Although the Hubble constant $H_0$ and spatial curvature $Ω_{K}$ have been measured with very high precision, they still suffer from some tensions. In this paper, we propose an improved method to combine the observations of ultra-compact structure in radio quasars and strong gravitational lensing with quasars acting as background sources to determine $H_0$ and $Ω_{K}$ simultaneously. By applying t… ▽ More Although the Hubble constant $H_0$ and spatial curvature $Ω_{K}$ have been measured with very high precision, they still suffer from some tensions. In this paper, we propose an improved method to combine the observations of ultra-compact structure in radio quasars and strong gravitational lensing with quasars acting as background sources to determine $H_0$ and $Ω_{K}$ simultaneously. By applying the distance sum rule to the time-delay measurements of 7 strong lensing systems and 120 intermediate-luminosity quasars calibrated as standard rulers, we obtain stringent constraints on the Hubble constant ($H_0=78.3\pm2.9 \mathrm{~km~s^{-1}~Mpc^{-1}}$) and the cosmic curvature ($Ω_K=0.49\pm0.24$). On the one hand, in the framework of a flat universe, the measured Hubble constant ($H_0=73.6^{+1.8}_{-1.6} \mathrm{~km~s^{-1}~Mpc^{-1}}$) is strongly consistent with that derived from the local distance ladder, with a precision of 2\%. On the other hand, if we use the local $H_0$ measurement as a prior, our results are marginally compatible with zero spatial curvature ($Ω_K=0.23^{+0.15}_{-0.17}$) and there is no significant deviation from a flat universe. Finally, we also evaluate whether strongly lensed quasars would produce robust constraints on $H_0$ and $Ω_{K}$ in the non-flat and flat $Λ$CDM model if the compact radio structure measurements are available from VLBI observations. △ Less

Submitted 1 November, 2020; originally announced November 2020.

Comments: 7 pages, 4 figures

arXiv:2011.00268 [pdf]

doi 10.1103/PhysRevLett.127.137001

The observation of in-plane quantum Griffiths singularity in two-dimensional crystalline superconductors

Authors: Yi Liu, Shichao Qi, **gchao Fang, Jian Sun, Chong Liu, Yanzhao Liu, Junjie Qi, Ying Xing, Haiwen Liu, Xi Lin, Lili Wang, Qi-Kun Xue, X. C. Xie, Jian Wang

Abstract: Quantum Griffiths singularity (QGS) reveals the profound influence of quenched disorder on the quantum phase transitions, characterized by the divergence of the dynamical critical exponent at the boundary of the vortex glass-like phase, named as quantum Griffiths phase. However, in the absence of vortices, whether the QGS can exist under parallel magnetic field remains a puzzle. Here we study the… ▽ More Quantum Griffiths singularity (QGS) reveals the profound influence of quenched disorder on the quantum phase transitions, characterized by the divergence of the dynamical critical exponent at the boundary of the vortex glass-like phase, named as quantum Griffiths phase. However, in the absence of vortices, whether the QGS can exist under parallel magnetic field remains a puzzle. Here we study the magnetic field induced superconductor-metal transition in ultrathin crystalline PdTe2 films grown by molecular beam epitaxy. Remarkably, the QGS emerges under both perpendicular and parallel magnetic field in 4-monolayer PdTe2 films. The direct activated scaling analysis with a new irrelevant correction has been proposed, providing important evidence of QGS. With increasing film thickness to 6 monolayers, the QGS disappears under perpendicular field but persists under parallel field, and this discordance may originate from the differences in microscopic processes. Our work demonstrates the universality of parallel field induced QGS and can stimulate further investigations on novel quantum phase transitions under parallel magnetic field. △ Less

Submitted 11 August, 2021; v1 submitted 31 October, 2020; originally announced November 2020.

Journal ref: Phys. Rev. Lett. 127, 137001 (2021)

arXiv:2010.13309 [pdf, other]

doi 10.1109/ICASSP39728.2021.9413453

Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition

Authors: Chao-Han Huck Yang, Jun Qi, Samuel Yen-Chi Chen, Pin-Yu Chen, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee

Abstract: We propose a novel decentralized feature extraction approach in federated learning to address privacy-preservation issues for speech recognition. It is built upon a quantum convolutional neural network (QCNN) composed of a quantum circuit encoder for feature extraction, and a recurrent neural network (RNN) based end-to-end acoustic model (AM). To enhance model parameter protection in a decentraliz… ▽ More We propose a novel decentralized feature extraction approach in federated learning to address privacy-preservation issues for speech recognition. It is built upon a quantum convolutional neural network (QCNN) composed of a quantum circuit encoder for feature extraction, and a recurrent neural network (RNN) based end-to-end acoustic model (AM). To enhance model parameter protection in a decentralized architecture, an input speech is first up-streamed to a quantum computing server to extract Mel-spectrogram, and the corresponding convolutional features are encoded using a quantum circuit algorithm with random parameters. The encoded features are then down-streamed to the local RNN model for the final recognition. The proposed decentralized framework takes advantage of the quantum learning progress to secure models and to avoid privacy leakage attacks. Testing on the Google Speech Commands Dataset, the proposed QCNN encoder attains a competitive accuracy of 95.12% in a decentralized model, which is better than the previous architectures using centralized RNN models with convolutional features. We also conduct an in-depth study of different quantum circuit encoder architectures to provide insights into designing QCNN-based feature extractors. Neural saliency analyses demonstrate a correlation between the proposed QCNN features, class activation maps, and input spectrograms. We provide an implementation for future studies. △ Less

Submitted 12 February, 2021; v1 submitted 25 October, 2020; originally announced October 2020.

Comments: Accepted to IEEE ICASSP 2021. Code is available: https://github.com/huckiyang/QuantumSpeech-QCNN

Journal ref: 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

arXiv:2010.13085 [pdf, other]

Coherent Loss: A Generic Framework for Stable Video Segmentation

Authors: Mingyang Qian, Yi Fu, Xiao Tan, Yingying Li, **qing Qi, Huchuan Lu, Shilei Wen, Errui Ding

Abstract: Video segmentation approaches are of great importance for numerous vision tasks especially in video manipulation for entertainment. Due to the challenges associated with acquiring high-quality per-frame segmentation annotations and large video datasets with different environments at scale, learning approaches shows overall higher accuracy on test dataset but lack strict temporal constraints to sel… ▽ More Video segmentation approaches are of great importance for numerous vision tasks especially in video manipulation for entertainment. Due to the challenges associated with acquiring high-quality per-frame segmentation annotations and large video datasets with different environments at scale, learning approaches shows overall higher accuracy on test dataset but lack strict temporal constraints to self-correct jittering artifacts in most practical applications. We investigate how this jittering artifact degrades the visual quality of video segmentation results and proposed a metric of temporal stability to numerically evaluate it. In particular, we propose a Coherent Loss with a generic framework to enhance the performance of a neural network against jittering artifacts, which combines with high accuracy and high consistency. Equipped with our method, existing video object/semantic segmentation approaches achieve a significant improvement in term of more satisfactory visual quality on video human dataset, which we provide for further research in this field, and also on DAVIS and Cityscape. △ Less

Submitted 25 October, 2020; originally announced October 2020.

Comments: 10 pages, 6 figures, 4 tables

arXiv:2010.12775 [pdf]

The Discovery of Tunable Universality Class in Superconducting $β$-W Thin Films

Authors: Ce Huang, Enze Zhang, Yong Zhang, **glei Zhang, Faxian Xiu, Haiwen Liu, Xiaoyi Xie, Linfeng Ai, Yunkun Yang, Minhao Zhao, Junjie Qi, Lun Li, Shanshan Liu, Zihan Li, Runze Zhan, Ya-Qing Bie, Xufeng Kou, Shaozhi Deng, X. C. Xie

Abstract: The interplay between quenched disorder and critical behavior in quantum phase transitions is conceptually fascinating and of fundamental importance for understanding phase transitions. However, it is still unclear whether or not the quenched disorder influences the universality class of quantum phase transitions. More crucially, the absence of superconducting-metal transitions under in-plane magn… ▽ More The interplay between quenched disorder and critical behavior in quantum phase transitions is conceptually fascinating and of fundamental importance for understanding phase transitions. However, it is still unclear whether or not the quenched disorder influences the universality class of quantum phase transitions. More crucially, the absence of superconducting-metal transitions under in-plane magnetic fields in 2D superconductors imposes constraints on the universality of quantum criticality. Here, we discover the tunable universality class of superconductor-metal transition by changing the disorder strength in $β$-W films with varying thickness. The finite-size scaling uncovers the switch of universality class: quantum Griffiths singularity to multiple quantum criticality at a critical thickness of $t_{c \perp 1}\sim 8 nm$ and then from multiple quantum criticality to single criticality at $t_{c\perp 2}\sim 16 nm$. Moreover, the superconducting-metal transition is observed for the first time under in-plane magnetic fields and the universality class is changed at $t_{c \parallel }\sim 8 nm$. The discovery of tunable universality class under both out-of-plane and in-plane magnetic fields provides broad information for the disorder effect on superconducting-metal transitions and quantum criticality. △ Less

Submitted 24 October, 2020; originally announced October 2020.

arXiv:2010.11498 [pdf, other]

doi 10.1103/PhysRevA.103.022607

Randomized benchmarking in the presence of time-correlated dephasing noise

Authors: Jiaan Qi, Hui Khoon Ng

Abstract: Randomized benchmarking has emerged as a popular and easy-to-implement experimental technique for gauging the quality of gate operations in quantum computing devices. A typical randomized benchmarking procedure identifies the exponential decay in the fidelity as the benchmarking sequence of gates increases in length, and the decay rate is used to estimate the fidelity of the gate. That the fidelit… ▽ More Randomized benchmarking has emerged as a popular and easy-to-implement experimental technique for gauging the quality of gate operations in quantum computing devices. A typical randomized benchmarking procedure identifies the exponential decay in the fidelity as the benchmarking sequence of gates increases in length, and the decay rate is used to estimate the fidelity of the gate. That the fidelity decays exponentially, however, relies on the assumption of time-independent or static noise in the gates, with no correlations or significant drift in the noise over the gate sequence, a well-satisfied condition in many situations. Deviations from the standard exponential decay, however, have been observed, usually attributed to some amount of time correlations in the noise, though the precise mechanisms for deviation have yet to be fully explored. In this work, we examine this question of randomized benchmarking for time-correlated noise---specifically for time-correlated dephasing noise for exact solvability---and elucidate the circumstances in which a deviation from exponential decay can be expected. △ Less

Submitted 22 October, 2020; originally announced October 2020.

Journal ref: Phys. Rev. A 103, 022607 (2021)

arXiv:2010.10919

Multi-task Metric Learning for Text-independent Speaker Verification

Authors: Yafeng Chen, Wu Guo, **g**g Shi, Jiajun Qi, Tan Liu

Abstract: In this work, we introduce metric learning (ML) to enhance the deep embedding learning for text-independent speaker verification (SV). Specifically, the deep speaker embedding network is trained with conventional cross entropy loss and auxiliary pair-based ML loss function. For the auxiliary ML task, training samples of a mini-batch are first arranged into pairs, then positive and negative pairs a… ▽ More In this work, we introduce metric learning (ML) to enhance the deep embedding learning for text-independent speaker verification (SV). Specifically, the deep speaker embedding network is trained with conventional cross entropy loss and auxiliary pair-based ML loss function. For the auxiliary ML task, training samples of a mini-batch are first arranged into pairs, then positive and negative pairs are selected and weighted through their own and relative similarities, and finally the auxiliary ML loss is calculated by the similarity of the selected pairs. To evaluate the proposed method, we conduct experiments on the Speaker in the Wild (SITW) dataset. The results demonstrate the effectiveness of the proposed method. △ Less

Submitted 22 March, 2023; v1 submitted 21 October, 2020; originally announced October 2020.

Comments: Not a particularly high-quality work, so we request withdrawal

arXiv:2010.07540 [pdf, other]

Multi-Objective PMU Allocation for Resilient Power System Monitoring

Authors: Hamed Haggi, Wei Sun, Junjian Qi

Abstract: Phasor measurement units (PMUs) enable better system monitoring and security enhancement in smart grids. In order to enhance power system resilience against outages and blackouts caused by extreme weather events or man-made attacks, it remains a major challenge to determine the optimal number and location of PMUs. In this paper, a multi-objective resilient PMU placement (MORPP) problem is formulat… ▽ More Phasor measurement units (PMUs) enable better system monitoring and security enhancement in smart grids. In order to enhance power system resilience against outages and blackouts caused by extreme weather events or man-made attacks, it remains a major challenge to determine the optimal number and location of PMUs. In this paper, a multi-objective resilient PMU placement (MORPP) problem is formulated, and solved by a modified Teaching-Learning-Based Optimization (MO-TLBO) algorithm. Three objectives are considered in the MORPP problem, minimizing the number of PMUs, maximizing the system observability, and minimizing the voltage stability index. The effectiveness of the proposed method is validated through testing on IEEE 14-bus, 30-bus, and 118-bus test systems. The advantage of the MO-TLBO-based MORPP is demonstrated through the comparison with other methods in the literature, in terms of iteration number, optimality and time of convergence. △ Less

Submitted 15 October, 2020; originally announced October 2020.

Comments: IEEE PES General Meeting 2020

arXiv:2010.06248

Exploring Universal Speech Attributes for Speaker Verification with an Improved Cross-stitch Network

Authors: Jiajun Qi, Wu Guo, **g**g Shi, Yafeng Chen, Tan Liu

Abstract: The universal speech attributes for x-vector based speaker verification (SV) are addressed in this paper. The manner and place of articulation form the fundamental speech attribute unit (SAU), and then new speech attribute (NSA) units for acoustic modeling are generated by tied tri-SAU states. An improved cross-stitch network is adopted as a multitask learning (MTL) framework for integrating these… ▽ More The universal speech attributes for x-vector based speaker verification (SV) are addressed in this paper. The manner and place of articulation form the fundamental speech attribute unit (SAU), and then new speech attribute (NSA) units for acoustic modeling are generated by tied tri-SAU states. An improved cross-stitch network is adopted as a multitask learning (MTL) framework for integrating these universal speech attributes into the x-vector network training process. Experiments are conducted on common condition 5 (CC5) of the core-core and the 10 s-10 s tests of the NIST SRE10 evaluation set, and the proposed algorithm can achieve consistent improvements over the baseline x-vector on both these tasks. △ Less

Submitted 31 May, 2023; v1 submitted 13 October, 2020; originally announced October 2020.

Comments: Not a particularly high-quality work, so we request withdrawal

arXiv:2010.05151 [pdf, ps, other]

doi 10.3847/1538-4357/abc076

Gravitational-wave constraints on the cosmic opacity at $z\sim 5$: forecast from space gravitational-wave antenna DECIGO

Authors: Shuaibo Geng, Shuo Cao, Tonghua Liu, Marek Biesiada, **gzhao Qi, Yuting Liu, Zong-Hong Zhu

Abstract: Since gravitational waves (GWs) propagate freely through a perfect fluid, coalescing compact binary systems as standard sirens allow to measure the luminosity distance directly and provide distance measurements unaffected by the cosmic opacity. DECi-hertz Interferometer Gravitational-wave Observatory (DECIGO) is a future Japanese space gravitational-wave antenna sensitive to frequency range betwee… ▽ More Since gravitational waves (GWs) propagate freely through a perfect fluid, coalescing compact binary systems as standard sirens allow to measure the luminosity distance directly and provide distance measurements unaffected by the cosmic opacity. DECi-hertz Interferometer Gravitational-wave Observatory (DECIGO) is a future Japanese space gravitational-wave antenna sensitive to frequency range between target frequencies of LISA and ground-based detectors. Combining the predicted future GW observations from DECIGO and three current popular astrophysical probes (HII regions, SNe Ia Pantheon sample, quasar sample) in electromagnetic (EM) domains, one would be able to probe the opacity of the Universe at different redshifts. In this paper, we show that the cosmic opacity parameter can be constrained to a high precision ($Δε\sim 10^{-2}$) out to high redshifts ($z\sim$5). In order to reconstruct the evolution of cosmic opacity without assuming any particular functional form of it, the cosmic opacity tests should be applied to individual redshift bins independently. Therefore, we also calculate the optical depth at individual redshifts and averaged $τ(z)$ within redshift bins. Our findings indicate that, compared with the results obtained from the HII galaxies and Pantheon SNe Ia, there is an improvement in precision when the quasar sample is considered. While non-zero optical depth is statistically significant only for redshift ranges $0<z<0.5$, $1<z<2$, and $2.5<z<3.5$, such tendency is different from that obtained in the framework of its parametrized form. Therefore the importance of cosmic-opacity test without a prescribed phenomenological function should be emphasized. △ Less

Submitted 10 October, 2020; originally announced October 2020.

Comments: 11pages, 9figures, accepted for publication in ApJ

arXiv:2010.03755 [pdf, other]

Generalizable and Explainable Dialogue Generation via Explicit Action Learning

Authors: Xinting Huang, Jianzhong Qi, Yu Sun, Rui Zhang

Abstract: Response generation for task-oriented dialogues implicitly optimizes two objectives at the same time: task completion and language quality. Conditioned response generation serves as an effective approach to separately and better optimize these two objectives. Such an approach relies on system action annotations which are expensive to obtain. To alleviate the need of action annotations, latent acti… ▽ More Response generation for task-oriented dialogues implicitly optimizes two objectives at the same time: task completion and language quality. Conditioned response generation serves as an effective approach to separately and better optimize these two objectives. Such an approach relies on system action annotations which are expensive to obtain. To alleviate the need of action annotations, latent action learning is introduced to map each utterance to a latent representation. However, this approach is prone to over-dependence on the training data, and the generalization capability is thus restricted. To address this issue, we propose to learn natural language actions that represent utterances as a span of words. This explicit action representation promotes generalization via the compositional structure of language. It also enables an explainable generation process. Our proposed unsupervised approach learns a memory component to summarize system utterances into a short span of words. To further promote a compact action representation, we propose an auxiliary task that restores state annotations as the summarized dialogue context using the memory component. Our proposed approach outperforms latent action baselines on MultiWOZ, a benchmark multi-domain dataset. △ Less

Submitted 8 October, 2020; originally announced October 2020.

Comments: Accepted to Proceedings of EMNLP 2020 (Findings)

arXiv:2010.00882 [pdf, other]

Remote Sensing Image Scene Classification with Self-Supervised Paradigm under Limited Labeled Samples

Authors: Chao Tao, Ji Qi, Weipeng Lu, Hao Wang, Haifeng Li

Abstract: With the development of deep learning, supervised learning methods perform well in remote sensing images (RSIs) scene classification. However, supervised learning requires a huge number of annotated data for training. When labeled samples are not sufficient, the most common solution is to fine-tune the pre-training models using a large natural image dataset (e.g. ImageNet). However, this learning… ▽ More With the development of deep learning, supervised learning methods perform well in remote sensing images (RSIs) scene classification. However, supervised learning requires a huge number of annotated data for training. When labeled samples are not sufficient, the most common solution is to fine-tune the pre-training models using a large natural image dataset (e.g. ImageNet). However, this learning paradigm is not a panacea, especially when the target remote sensing images (e.g. multispectral and hyperspectral data) have different imaging mechanisms from RGB natural images. To solve this problem, we introduce new self-supervised learning (SSL) mechanism to obtain the high-performance pre-training model for RSIs scene classification from large unlabeled data. Experiments on three commonly used RSIs scene classification datasets demonstrated that this new learning paradigm outperforms the traditional dominant ImageNet pre-trained model. Moreover, we analyze the impacts of several factors in SSL on RSIs scene classification tasks, including the choice of self-supervised signals, the domain difference between the source and target dataset, and the amount of pre-training data. The insights distilled from our studies can help to foster the development of SSL in the remote sensing community. Since SSL could learn from unlabeled massive RSIs which are extremely easy to obtain, it will be a potentially promising way to alleviate dependence on labeled samples and thus efficiently solve many problems, such as global map**. △ Less

Submitted 2 October, 2020; originally announced October 2020.

Comments: 6 pages, 4 figures

arXiv:2009.14155 [pdf, other]

Resilience Analysis and Cascading FailureModeling of Power Systems under Extreme Temperatures

Authors: Seyyed Rashid Khazeiynasab, Junjian Qi

Abstract: In this paper, we propose an AC power flow based cascading failure model that explicitly considers external weather conditions, extreme temperatures in particular, and evaluates the impact of extreme temperature on the initiation and propagation of cascading blackouts. Specifically, load and dynamic line rating changes are modeled due to temperature disturbance, the probabilities for transmission… ▽ More In this paper, we propose an AC power flow based cascading failure model that explicitly considers external weather conditions, extreme temperatures in particular, and evaluates the impact of extreme temperature on the initiation and propagation of cascading blackouts. Specifically, load and dynamic line rating changes are modeled due to temperature disturbance, the probabilities for transmission line and generator outages are evaluated, and the timing for each type of events is carefully calculated to decide the actual event sequence. It should be emphasized that the correlated events, in the advent of external temperature changes, could together contribute to voltage instability. Besides, we model undervoltage load shedding and operator re-dispatch as control strategies for preventing the propagation of cascading failures. The effectiveness of the proposed model is verified by simulation results on the RTS-96 3-area system and it is found that temperature disturbances can lead to correlated load change and line/generator trip**, which together will greatly increase the risk of cascading and voltage instability. Critical temperature change, critical area with temperature disturbance, identification of most vulnerable buses, and comparison of different control strategies are also carefully investigated. △ Less

Submitted 29 September, 2020; originally announced September 2020.

arXiv:2009.05721 [pdf, other]

Short-Term and Long-Term Context Aggregation Network for Video Inpainting

Authors: Ang Li, Shanshan Zhao, Xingjun Ma, Mingming Gong, Jianzhong Qi, Rui Zhang, Dacheng Tao, Ramamohanarao Kotagiri

Abstract: Video inpainting aims to restore missing regions of a video and has many applications such as video editing and object removal. However, existing methods either suffer from inaccurate short-term context aggregation or rarely explore long-term frame information. In this work, we present a novel context aggregation network to effectively exploit both short-term and long-term frame information for vi… ▽ More Video inpainting aims to restore missing regions of a video and has many applications such as video editing and object removal. However, existing methods either suffer from inaccurate short-term context aggregation or rarely explore long-term frame information. In this work, we present a novel context aggregation network to effectively exploit both short-term and long-term frame information for video inpainting. In the encoding stage, we propose boundary-aware short-term context aggregation, which aligns and aggregates, from neighbor frames, local regions that are closely related to the boundary context of missing regions into the target frame. Furthermore, we propose dynamic long-term context aggregation to globally refine the feature map generated in the encoding stage using long-term frame features, which are dynamically updated throughout the inpainting process. Experiments show that it outperforms state-of-the-art methods with better inpainting results and fast inpainting speed. △ Less

Submitted 11 September, 2020; originally announced September 2020.

Comments: Accepted by ECCV 2020 as a spotlight paper

arXiv:2009.02827 [pdf]

MFL_COVID19: Quantifying Country-based Factors affecting Case Fatality Rate in Early Phase of COVID-19 Epidemic via Regularised Multi-task Feature Learning

Authors: Po Yang, Jun Qi, Xulong Wang, Yun Yang

Abstract: Recent outbreak of COVID-19 has led a rapid global spread around the world. Many countries have implemented timely intensive suppression to minimize the infections, but resulted in high case fatality rate (CFR) due to critical demand of health resources. Other country-based factors such as sociocultural issues, ageing population etc., has also influenced practical effectiveness of taking intervent… ▽ More Recent outbreak of COVID-19 has led a rapid global spread around the world. Many countries have implemented timely intensive suppression to minimize the infections, but resulted in high case fatality rate (CFR) due to critical demand of health resources. Other country-based factors such as sociocultural issues, ageing population etc., has also influenced practical effectiveness of taking interventions to improve morality in early phase. To better understand the relationship of these factors across different countries with COVID-19 CFR is of primary importance to prepare for potentially second wave of COVID-19 infections. In the paper, we propose a novel regularized multi-task learning based factor analysis approach for quantifying country-based factors affecting CFR in early phase of COVID-19 epidemic. We formulate the prediction of CFR progression as a ML regression problem with observed CFR and other countries-based factors. In this formulation, all CFR related factors were categorized into 6 sectors with 27 indicators. We proposed a hybrid feature selection method combining filter, wrapper and tree-based models to calibrate initial factors for a preliminary feature interaction. Then we adopted two typical single task model (Ridge and Lasso regression) and one state-of-the-art MTFL method (fused sparse group lasso) in our formulation. The fused sparse group Lasso (FSGL) method allows the simultaneous selection of a common set of country-based factors for multiple time points of COVID-19 epidemic and also enables incorporating temporal smoothness of each factor over the whole early phase period. Finally, we proposed one novel temporal voting feature selection scheme to balance the weight instability of multiple factors in our MTFL model. △ Less

Submitted 6 September, 2020; originally announced September 2020.

arXiv:2009.01003 [pdf, other]

Variational Inference-Based Dropout in Recurrent Neural Networks for Slot Filling in Spoken Language Understanding

Authors: Jun Qi, Xu Liu, Javier Tejedor

Abstract: This paper proposes to generalize the variational recurrent neural network (RNN) with variational inference (VI)-based dropout regularization employed for the long short-term memory (LSTM) cells to more advanced RNN architectures like gated recurrent unit (GRU) and bi-directional LSTM/GRU. The new variational RNNs are employed for slot filling, which is an intriguing but challenging task in spoken… ▽ More This paper proposes to generalize the variational recurrent neural network (RNN) with variational inference (VI)-based dropout regularization employed for the long short-term memory (LSTM) cells to more advanced RNN architectures like gated recurrent unit (GRU) and bi-directional LSTM/GRU. The new variational RNNs are employed for slot filling, which is an intriguing but challenging task in spoken language understanding. The experiments on the ATIS dataset suggest that the variational RNNs with the VI-based dropout regularization can significantly improve the naive dropout regularization RNNs-based baseline systems in terms of F-measure. Particularly, the variational RNN with bi-directional LSTM/GRU obtains the best F-measure score. △ Less

Submitted 23 August, 2020; originally announced September 2020.

Comments: conference paper, 5 pages

arXiv:2008.08458 [pdf, ps, other]

doi 10.1088/1674-1137/abeb06

Phenomenological studies on the $\bar{B}^0\rightarrow [K^-π^+]_{S/V}[π^+π^-]_{V/S} \rightarrow K^-π^+π^+π^-$ decay

Authors: **g-Juan Qi, Zhen-Yang Wang, Zhu-Feng Zhang, Xin-Heng Guo

Abstract: Within the quasi-two-body decay model, we study the localized $CP$ violation and branching fraction of the four-body decay $\bar{B}^0\rightarrow [K^-π^+]_{S/V}[π^+π^-]_{V/S} \rightarrow K^-π^+π^-π^+$ when $K^-π^+$ and $π^-π^+$ pair invariant masses are $0.35<m_{K^-π^+}<2.04 \, \mathrm{GeV}$ and $0<m_{π^-π^+}<1.06\, \mathrm{GeV}$, with the pairs being dominated by the $\bar{K}^*_0(700)^0$,… ▽ More Within the quasi-two-body decay model, we study the localized $CP$ violation and branching fraction of the four-body decay $\bar{B}^0\rightarrow [K^-π^+]_{S/V}[π^+π^-]_{V/S} \rightarrow K^-π^+π^-π^+$ when $K^-π^+$ and $π^-π^+$ pair invariant masses are $0.35<m_{K^-π^+}<2.04 \, \mathrm{GeV}$ and $0<m_{π^-π^+}<1.06\, \mathrm{GeV}$, with the pairs being dominated by the $\bar{K}^*_0(700)^0$, $\bar{K}^*(892)^0$, $\bar{K}^*(1410)^0$, $\bar{K}^*_0(1430)$ and $\bar{K}^*(1680)^0$, and $f_0(500)$, $ρ^0(770)$ , $ω(782)$ and $f_0(980)$ resonances, respectively. When dealing with the dynamical functions of these resonances, $f_0(500)$, $ρ^0(770)$, $f_0(980)$ and $\bar{K}^*_0(1430)$ are modeled with the Bugg model, Gounaris-Sakurai function, Flatt$\acute{\mathrm{e}}$ formalism and LASS lineshape, respectively, while others are described by the relativistic Breit-Wigner function. Adopting the end point divergence parameters $ρ_A\in[0,0.5]$ and $φ_A\in[0,2π]$, our predicted results are $\mathcal{A_{CP}}(\bar{B}^0\rightarrow K^-π^+π^+π^-)\in[-0.383,0.421]$ and $\mathcal{B}(\bar{B}^0\rightarrow K^-π^+π^+π^-)\in[7.36,199.69]\times10^{-8}$ based on the hypothetical $q\bar{q}$ structures for the scalar mesons in the QCD factorization approach. Meanwhile, we calculate the $CP$ violating asymmetries and branching fractions of the two-body decays $\bar{B}^0\rightarrow SV(VS)$ and all the individual four-body decays $\bar{B}^0\rightarrow SV(VS) \rightarrow K^-π^+π^-π^+$, respectively. Our theoretical results for the two-body decays $\bar{B}^0\rightarrow \bar{K}^*(892)^0$$f_0(980)$, $\bar{B}^0\rightarrow \bar{K}^*_0(1430)^0$$ω(782)$, $\bar{B}^0\rightarrow \bar{K}^*(892)^0f_0(980)$, $\bar{B}^0\rightarrow\bar{K}^*_0(1430)^0ρ$, △ Less

Submitted 12 January, 2021; v1 submitted 17 August, 2020; originally announced August 2020.

Comments: 20 pages, 4 tables

arXiv:2008.07281 [pdf, ps, other]

doi 10.1109/LSP.2020.3016837

On Mean Absolute Error for Deep Neural Network Based Vector-to-Vector Regression

Authors: Jun Qi, Jun Du, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee

Abstract: In this paper, we exploit the properties of mean absolute error (MAE) as a loss function for the deep neural network (DNN) based vector-to-vector regression. The goal of this work is two-fold: (i) presenting performance bounds of MAE, and (ii) demonstrating new properties of MAE that make it more appropriate than mean squared error (MSE) as a loss function for DNN based vector-to-vector regression… ▽ More In this paper, we exploit the properties of mean absolute error (MAE) as a loss function for the deep neural network (DNN) based vector-to-vector regression. The goal of this work is two-fold: (i) presenting performance bounds of MAE, and (ii) demonstrating new properties of MAE that make it more appropriate than mean squared error (MSE) as a loss function for DNN based vector-to-vector regression. First, we show that a generalized upper-bound for DNN-based vector- to-vector regression can be ensured by leveraging the known Lipschitz continuity property of MAE. Next, we derive a new generalized upper bound in the presence of additive noise. Finally, in contrast to conventional MSE commonly adopted to approximate Gaussian errors for regression, we show that MAE can be interpreted as an error modeled by Laplacian distribution. Speech enhancement experiments are conducted to corroborate our proposed theorems and validate the performance advantages of MAE over MSE for DNN based regression. △ Less

Submitted 12 August, 2020; originally announced August 2020.

Journal ref: IEEE Signal Processing Letters, 2020

arXiv:2008.06896 [pdf, other]

Adaptive Shape Servoing of Elastic Rods using Parameterized Regression Features and Auto-Tuning Motion Controls

Authors: Jiaming Qi, Guangtao Ran, Bohui Wang, Jian Liu, Wanyu Ma, Peng Zhou, David Navarro-Alarcon

Abstract: The robotic manipulation of deformable linear objects has shown great potential in a wide range of real-world applications. However, it presents many challenges due to the objects' complex nonlinearity and high-dimensional configuration. In this paper, we propose a new shape servoing framework to automatically manipulate elastic rods through visual feedback. Our new method uses parameterized regre… ▽ More The robotic manipulation of deformable linear objects has shown great potential in a wide range of real-world applications. However, it presents many challenges due to the objects' complex nonlinearity and high-dimensional configuration. In this paper, we propose a new shape servoing framework to automatically manipulate elastic rods through visual feedback. Our new method uses parameterized regression features to compute a compact (low-dimensional) feature vector that quantifies the object's shape, thus, enabling to establish an explicit shape servo-loop. To automatically deform the rod into a desired shape, the proposed adaptive controller iteratively estimates the differential transformation between the robot's motion and the relative shape changes; This valuable capability allows to effectively manipulate objects with unknown mechanical models. An auto-tuning algorithm is introduced to adjust the robot's sha** motions in real-time based on optimal performance criteria. To validate the proposed framework, a detailed experimental study with vision-guided robotic manipulators is presented. △ Less

Submitted 9 September, 2023; v1 submitted 16 August, 2020; originally announced August 2020.

Comments: 8 pages, 12 figures

arXiv:2008.05459 [pdf, other]

doi 10.1109/TSP.2020.2993164

Analyzing Upper Bounds on Mean Absolute Errors for Deep Neural Network Based Vector-to-Vector Regression

Authors: Jun Qi, Jun Du, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee

Abstract: In this paper, we show that, in vector-to-vector regression utilizing deep neural networks (DNNs), a generalized loss of mean absolute error (MAE) between the predicted and expected feature vectors is upper bounded by the sum of an approximation error, an estimation error, and an optimization error. Leveraging upon error decomposition techniques in statistical learning theory and non-convex optimi… ▽ More In this paper, we show that, in vector-to-vector regression utilizing deep neural networks (DNNs), a generalized loss of mean absolute error (MAE) between the predicted and expected feature vectors is upper bounded by the sum of an approximation error, an estimation error, and an optimization error. Leveraging upon error decomposition techniques in statistical learning theory and non-convex optimization theory, we derive upper bounds for each of the three aforementioned errors and impose necessary constraints on DNN models. Moreover, we assess our theoretical results through a set of image de-noising and speech enhancement experiments. Our proposed upper bounds of MAE for DNN based vector-to-vector regression are corroborated by the experimental results and the upper bounds are valid with and without the "over-parametrization" technique. △ Less

Submitted 4 August, 2020; originally announced August 2020.

Journal ref: IEEE Transactions on Signal Processing, Vol 68, pp. 3411-3422, 2020

arXiv:2008.03498 [pdf, ps, other]

doi 10.1103/PhysRevC.102.064629

Nuclear fission in intense laser fields

Authors: **tao Qi, Libin Fu, Xu Wang

Abstract: Rapid-advancing intense laser technologies enable the possibility of a direct laser-nucleus coupling. In this paper the effect of intense laser fields on a series of nuclear fission processes, including proton decay, alpha decay, and cluster decay, is theoretically studied with the help of nuclear double folding potentials. The results show that the half-lives of these decay processes can be modif… ▽ More Rapid-advancing intense laser technologies enable the possibility of a direct laser-nucleus coupling. In this paper the effect of intense laser fields on a series of nuclear fission processes, including proton decay, alpha decay, and cluster decay, is theoretically studied with the help of nuclear double folding potentials. The results show that the half-lives of these decay processes can be modified by non-negligible amounts, for example on the order of 0.01 or 0.1 percents in intense laser fields available in the forthcoming years. In addition to numerical results, an approximate analytical formula is derived to connect the laser-induced modification to the decay half-life and the decay energy. △ Less

Submitted 8 August, 2020; originally announced August 2020.

Journal ref: Phys. Rev. C 102, 064629 (2020)

arXiv:2007.13024 [pdf, other]

Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement

Authors: Jun Qi, Hu Hu, Yannan Wang, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee

Abstract: This paper investigates different trade-offs between the number of model parameters and enhanced speech qualities by employing several deep tensor-to-vector regression models for speech enhancement. We find that a hybrid architecture, namely CNN-TT, is capable of maintaining a good quality performance with a reduced model parameter size. CNN-TT is composed of several convolutional layers at the bo… ▽ More This paper investigates different trade-offs between the number of model parameters and enhanced speech qualities by employing several deep tensor-to-vector regression models for speech enhancement. We find that a hybrid architecture, namely CNN-TT, is capable of maintaining a good quality performance with a reduced model parameter size. CNN-TT is composed of several convolutional layers at the bottom for feature extraction to improve speech quality and a tensor-train (TT) output layer on the top to reduce model parameters. We first derive a new upper bound on the generalization power of the convolutional neural network (CNN) based vector-to-vector regression models. Then, we provide experimental evidence on the Edinburgh noisy speech corpus to demonstrate that, in single-channel speech enhancement, CNN outperforms DNN at the expense of a small increment of model sizes. Besides, CNN-TT slightly outperforms the CNN counterpart by utilizing only 32\% of the CNN model parameters. Besides, further performance improvement can be attained if the number of CNN-TT parameters is increased to 44\% of the CNN model size. Finally, our experiments of multi-channel speech enhancement on a simulated noisy WSJ0 corpus demonstrate that our proposed hybrid CNN-TT architecture achieves better results than both DNN and CNN models in terms of better-enhanced speech qualities and smaller parameter sizes. △ Less

Submitted 2 August, 2020; v1 submitted 25 July, 2020; originally announced July 2020.

Comments: Accepted to InterSpeech 2020

arXiv:2007.10993 [pdf, other]

doi 10.1093/mnras/stab1803

Gas-phase metallicity gradients of TNG50 star-forming galaxies

Authors: Z. S. Hemler, Paul Torrey, Jia Qi, Lars Hernquist, Mark Vogelsberger, Xiangcheng Ma, Lisa J. Kewley, Dylan Nelson, Annalisa Pillepich, Rüdiger Pakmor, Federico Marinacci

Abstract: We present the radial gas-phase, mass-weighted metallicity profiles and gradients of the TNG50 star-forming galaxy population measured at redshifts $z=$ 0--3. We investigate the redshift evolution of gradients and examine relations between gradient steepness and galaxy properties. We find that TNG50 gradients are predominantly negative at all redshifts, although we observe significant diversity am… ▽ More We present the radial gas-phase, mass-weighted metallicity profiles and gradients of the TNG50 star-forming galaxy population measured at redshifts $z=$ 0--3. We investigate the redshift evolution of gradients and examine relations between gradient steepness and galaxy properties. We find that TNG50 gradients are predominantly negative at all redshifts, although we observe significant diversity among these negative gradients. We determine that the gradient steepness of all galaxies increases approximately monotonically with redshift at a roughly constant rate. This rate does not vary significantly with galaxy mass. We observe a weak negative correlation between gradient steepness and galaxy stellar mass at redshifts $z\leq2$. However, when we normalize gradients by a characteristic radius defined by the galactic star formation distribution, we find that these normalized gradients remain invariant with both stellar mass and redshift. We place our results in the context of previous simulations and show that TNG50 high-redshift gradients are steeper than those of models featuring burstier feedback, which may further highlight high-redshift gradients as important discriminators of galaxy formation models. We also find that redshift $z=0$ and $z=0.5$ TNG50 gradients are consistent with the gradients observed in galaxies at these redshifts, although the preference for flat gradients observed in redshift $z\gtrsim1$ galaxies is not present in TNG50. If future JWST and ELT observations validate these flat gradients, it may indicate a need for simulation models to implement more powerful radial gas mixing within the ISM, possibly via turbulence and/or stronger winds △ Less

Submitted 21 July, 2020; originally announced July 2020.

Comments: 24 pages, 14 figures, submitted to MNRAS

arXiv:2007.04670 [pdf, other]

Multi-Granularity Modularized Network for Abstract Visual Reasoning

Authors: Xiangru Tang, Haoyuan Wang, Xiang Pan, Jiyang Qi

Abstract: Abstract visual reasoning connects mental abilities to the physical world, which is a crucial factor in cognitive development. Most toddlers display sensitivity to this skill, but it is not easy for machines. Aimed at it, we focus on the Raven Progressive Matrices Test, designed to measure cognitive reasoning. Recent work designed some black-boxes to solve it in an end-to-end fashion, but they are… ▽ More Abstract visual reasoning connects mental abilities to the physical world, which is a crucial factor in cognitive development. Most toddlers display sensitivity to this skill, but it is not easy for machines. Aimed at it, we focus on the Raven Progressive Matrices Test, designed to measure cognitive reasoning. Recent work designed some black-boxes to solve it in an end-to-end fashion, but they are incredibly complicated and difficult to explain. Inspired by cognitive studies, we propose a Multi-Granularity Modularized Network (MMoN) to bridge the gap between the processing of raw sensory information and symbolic reasoning. Specifically, it learns modularized reasoning functions to model the semantic rule from the visual grounding in a neuro-symbolic and semi-supervision way. To comprehensively evaluate MMoN, our experiments are conducted on the dataset of both seen and unseen reasoning rules. The result shows that MMoN is well suited for abstract visual reasoning and also explainable on the generalization test. △ Less

Submitted 9 July, 2020; v1 submitted 9 July, 2020; originally announced July 2020.

Showing 251–300 of 511 results for author: Qi, J