-
Self-Supervised Diffusion Model for 3-D Seismic Data Reconstruction
Authors:
Xinyang Wang,
Qianyu Ge,
Xintong Dong,
Shiqi Dong,
Tie Zhong
Abstract:
Seismic data reconstruction is an effective tool for compensating nonuniform and incomplete seismic geometry. Compared with methods for 2D seismic data, 3D reconstruction methods could consider more spatial structure correlation in seismic data. In the early studies, 3D reconstruction methods are mainly theory-driven and have some limitations due to their prior assumptions on the seismic data. To release these limitations, deep learning-based reconstruction methods rise and show potential in dealing with reconstruction problems. However, there are mainly two shortcomings in existing deep learning-methods. On the one hand, most of existing deep learning-based methods adopt the convolutional neural network, having some difficulties in dealing with data with complex or time-varying distributions. Recently, the diffusion model has been reported to possess the capability to solve data with complex distributions by gradually complicating the distribution of data to optimize the network. On the other hand, existing methods need enough paired-data to train the network, which are very hard to obtain especially for the starved 3D seismic data. Deep prior-based unsupervised and sampling-based self-supervised networks offer an available solution to this problem. In this paper, we develop a self-supervised diffusion model (S2DM) for 3D seismic data reconstruction. The proposed model mainly contains a diffusion restoration model and a variational time-spatial module. Extensive synthetic and field experiments demonstrate the superiority of the proposed S2DM algorithm.
Submitted 19 June, 2024;
originally announced June 2024.
-
Provable Optimality of the Square-Tooth Atomic Frequency Comb Quantum Memory
Authors:
Allen Zang,
Martin Suchara,
Tian Zhong
Abstract:
Atomic frequency comb (AFC) quantum memories are a promising technology for quantum repeater networks because they enable multi-mode, high-fidelity storage of photons with on-demand retrieval. The optimization of the retrieval efficiency of an AFC memory is important because it strongly impacts the entanglement generation rate in quantum networks. Despite initial theoretical analyses and recent experimental demonstrations, a rigorous proof of the universally optimal configuration for the highest AFC retrieval efficiency has not been presented. In this paper we offer a simple analytical proof which shows that the optimized square-tooth AFC provides the highest retrieval efficiency among all possible comb tooth shapes, under the physical constraint of maximal optical depth of an atomic ensemble. The optimality still holds even when the non-zero background absorption and the finite homogeneous broadening of atoms are considered. Our proof provides experimentalists with rigorous arguments how to create optimal AFC under realistic experimental conditions. Finally, we also identify other functional optimization problems where our proof technique is applicable, thus proving the optimality of the square function in more general scenarios.
Submitted 3 June, 2024;
originally announced June 2024.
-
Searching for $|V_{cd}|$ through the exclusive decay $D_s^+ \to K^0e^+ν_e$ within QCD Sum Rules
Authors:
Hai-Jiang Tian,
Yin-Long Yang,
Dan-Dan Hu,
Hai-Bing Fu,
Tao Zhong,
Xing-Gang Wu
Abstract:
In this paper, we carry out an investigation into the semileptonic decays $D_s^+ \to K^0\ell^+ν_\ell$ with $\ell=(e,μ)$ by employing the QCD light-cone sum rules approach. The vector transition form factor (TFF) $f_+^{D_s^+ K^0}(q^2)$ for $D_s^+\to K^0$ decay is calculated while considering its next-to-leading order contribution. Subsequently, we briefly introduce the twist-2, 3 kaon distribution amplitudes, which are calculated by using QCD sum rules within the framework of the background field theory. At the large recoil point, the TFF has $f_+^{D_s^+ K^0}(0)=0.692_{-0.026}^{+0.027}$. Then, we extrapolate $f_+^{D_s^+ K^0}(q^2)$ to the whole physical $q^2$-region via the simplified $z(q^2,t)$-series expansion, and the behavior of TFF $f_+^{D_s^+ K^0}(q^2)$ is exhibited in the numerical results part, including the theoretical and experimental predictions for comparison. In addition, we compute the differential branching fraction $\mathcal{B}(D_s^+ \to K^0\ell^+ν_\ell)$ with the electron and muon channels, which are expected to be $\mathcal{B}(D_s^+ \to K^0e^+ν_e)=3.379_{-0.275}^{+0.301}\times 10^{-3}$ and $\mathcal{B}(D_s^+ \to K^0μ^+ν_μ)=3.351_{-0.273}^{+0.299}\times 10^{-3}$ as well as contained other results for comparison. Our results show good agreement with the BESIII measurements and theoretical predictions. Furthermore, we present our prediction with respect to the CKM matrix element $|V_{cd}|$ by using the $\mathcal{B}(D_s^+ \to K^0e^+ν_e)$ result from BESIII Collaboration, yielding $|V_{cd}|=0.221_{-0.010}^{+0.008}$. Finally, we provide the ratio between $D_s^+ \to K^0e^+ν_e$ and $D_s^+ \to ηe^+ν_e$ channels, i.e. $\mathcal{R}_{K^0/η}^e=0.144_{-0.020}^{+0.028}$.
Submitted 12 May, 2024;
originally announced May 2024.
-
Adapting to Distribution Shift by Visual Domain Prompt Generation
Authors:
Zhixiang Chi,
Li Gu,
Tao Zhong,
Huan Liu,
Yuanhao Yu,
Konstantinos N Plataniotis,
Yang Wang
Abstract:
In this paper, we aim to adapt a model at test-time using a few unlabeled data to address distribution shifts. To tackle the challenges of extracting domain knowledge from a limited amount of data, it is crucial to utilize correlated information from pre-trained backbones and source domains. Previous studies fail to utilize recent foundation models with strong out-of-distribution generalization. Additionally, domain-centric designs are not flavored in their works. Furthermore, they employ the process of modelling source domains and the process of learning to adapt independently into disjoint training stages. In this work, we propose an approach on top of the pre-computed features of the foundation model. Specifically, we build a knowledge bank to learn the transferable knowledge from source domains. Conditioned on few-shot target data, we introduce a domain prompt generator to condense the knowledge bank into a domain-specific prompt. The domain prompt then directs the visual features towards a particular domain via a guidance module. Moreover, we propose a domain-aware contrastive loss and employ meta-learning to facilitate domain knowledge extraction. Extensive experiments are conducted to validate the domain knowledge extraction. The proposed method outperforms previous work on 5 large-scale benchmarks including WILDS and DomainNet.
Submitted 4 May, 2024;
originally announced May 2024.
-
Modeling Seismic Wave Propagation in TTI Media Using Residual Perfectly Matched Layer
Authors:
Yuqin Luo,
Xintong Dong,
Shiqi Dong,
Tie Zhong,
Yu Zhang,
Ying Wang,
Ning Hu
Abstract:
The perfectly matched layer (PML) is commonly used in wave propagation, radiation and diffraction problems in unbounded space domains. A new implementation scheme of PML is presented. The PML formulation is pre-defined, and the wave field absorption is achieved by calculating the residual between the PML equation and original equation through backward induction. Two forms of the Residual PML (RPML) are presented: RPML-1, which defines the residual as the difference between the original and PML equations, and RPML-2, which defines the residual as the difference between the original and PML wave fields. RPML-2 is the simplest and easiest to extend, as it does not alter the original equation and only has one time partial derivative term in the residual equation. Additionally, since the residual equation has no spatial partial derivative term, high-order spatial difference discretization is unnecessary, which results in higher accuracy and computational efficiency. Furthermore, simulating a wave field in TTI media requires a high absorption effect and stability of PML. The numerical simulation demonstrates that RPML-2 provides better absorption performance and stability compared to ADEPML and NPML. To meet the needs of wave field simulation for complex media, a multiaxial complex frequency shifted RPML-2 (MCFS-RPML-2) is introduced, which employs double damping profiles and complex frequency shift technology to achieve higher stability and absorption effects.
Submitted 22 April, 2024; v1 submitted 20 April, 2024;
originally announced April 2024.
-
Seismic Interpolation Transformer for Consecutively Missing Data: A Case Study in DAS-VSP Data
Authors:
Ming Cheng,
Jun Lin,
Xintong Dong,
Shaoping Lu,
Tie Zhong
Abstract:
Distributed optical fiber acoustic sensing (DAS) is a rapidly-developed seismic acquisition technology with advantages of low cost, high resolution, high sensitivity, and small interval, etc. Nonetheless, consecutively missing cases often appear in real seismic data acquired by DAS system due to some factors, including optical fiber damage and inferior coupling between cable and well. Recently, some deep-learning seismic interpolation methods based on convolutional neural network (CNN) have shown impressive performance in regular and random missing cases but still remain the consecutively missing case as a challenging task. The main reason is that the weight sharing makes it difficult for CNN to capture enough comprehensive features. In this paper, we propose a transformer-based interpolation method, called seismic interpolation transformer (SIT), to deal with the consecutively missing case. This proposed SIT is an encoder-decoder structure connected by some U-shaped swin-transformer blocks. In encoder and decoder part, the multi-head self-attention (MSA) mechanism is used to capture global features which is essential for the reconstruction of consecutively missing traces. The U-shaped swin-transformer blocks are utilized to perform feature extraction operations on feature maps with different resolutions. Moreover, we combine the loss based on structural similarity index (SSIM) and L1 norm to propose a novel loss function for SIT. In experiments, this proposed SIT outperforms U-Net and swin-transformer. Moreover, ablation studies also demonstrate the advantages of new network architecture and loss function.
Submitted 20 April, 2024;
originally announced April 2024.
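The abstract above says the SIT loss combines an SSIM-based term with the L1 norm but does not give the formula. A minimal sketch of one such combination follows, under assumptions that are not taken from the paper: the weight `alpha`, the function names, and the use of a single global SSIM window instead of a sliding window are all hypothetical.

```python
import numpy as np

def global_ssim(x, y, data_range=1.0, k1=0.01, k2=0.03):
    """SSIM over the whole array as one window (assumption: no sliding window)."""
    c1 = (k1 * data_range) ** 2  # stabilizer for the luminance term
    c2 = (k2 * data_range) ** 2  # stabilizer for the contrast/structure term
    mu_x, mu_y = x.mean(), y.mean()
    var_x, var_y = x.var(), y.var()
    cov_xy = ((x - mu_x) * (y - mu_y)).mean()
    return ((2 * mu_x * mu_y + c1) * (2 * cov_xy + c2)) / (
        (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)
    )

def ssim_l1_loss(pred, target, alpha=0.5, data_range=1.0):
    """Weighted sum of (1 - SSIM) and mean absolute error.

    alpha is a hypothetical mixing weight; the paper's actual weighting
    is not stated in the abstract.
    """
    l1 = np.abs(pred - target).mean()
    ssim_term = 1.0 - global_ssim(pred, target, data_range)
    return alpha * ssim_term + (1.0 - alpha) * l1
```

With this form the loss is zero when the reconstruction equals the target and grows as either the structure or the amplitudes diverge.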
-
First-Order Vortex Lattice Melting in Bilayer Ice: A Monte Carlo Method Study
Authors:
Telun Zhong,
Heyang Ma,
Peijun Zheng,
Jie Zhang,
Wanzhou Zhang
Abstract:
Inspired by the stable bilayer water ice grown in the laboratory (Nature 577, 60, 2020), we propose a model representing water ice as a two-layer six-vertex model. Using the loop update Monte Carlo method, we unveil meaningful findings. While the square lattice six-vertex model exhibits an antiferromagnetic to disordered phase transition known as the Berezinskii-Kosterlitz-Thouless transition, we observe a different scenario for the bilayer six-vertex model, where the transition type transforms into an Ising transition. We discover the emergence of vortices in the disordered phase, and to stabilize them, vortex excitation is induced. This leads to the presence of distinct 1/2 filling and 2/3 filling vortex lattice phases. More importantly, we identify the phase transitions between the vortex lattice phase and the disordered phase, as well as between the 1/2 and 2/3 vortex lattices, as being of first order. We also propose an experimental scheme for realizing a two-layer six-vertex model based on the artificial ice of particles in a double well trap array. Our findings provide valuable insight into the nature of phase transitions occurring in layered water ice and artificial spin ice systems in experimental setups.
Submitted 17 May, 2024; v1 submitted 9 April, 2024;
originally announced April 2024.
-
Benchmarking and Improving Compositional Generalization of Multi-aspect Controllable Text Generation
Authors:
Tianqi Zhong,
Zhaoyi Li,
Quan Wang,
Linqi Song,
Ying Wei,
Defu Lian,
Zhendong Mao
Abstract:
Compositional generalization, representing the model's ability to generate text with new attribute combinations obtained by recombining single attributes from the training data, is a crucial property for multi-aspect controllable text generation (MCTG) methods. Nonetheless, a comprehensive compositional generalization evaluation benchmark of MCTG is still lacking. We propose CompMCTG, a benchmark encompassing diverse multi-aspect labeled datasets and a crafted three-dimensional evaluation protocol, to holistically evaluate the compositional generalization of MCTG approaches. We observe that existing MCTG works generally confront a noticeable performance drop in compositional testing. To mitigate this issue, we introduce Meta-MCTG, a training framework incorporating meta-learning, where we enable models to learn how to generalize by simulating compositional generalization scenarios in the training phase. We demonstrate the effectiveness of Meta-MCTG through achieving obvious improvement (by at most 3.64%) for compositional testing performance in 94.4% cases.
Submitted 3 June, 2024; v1 submitted 5 April, 2024;
originally announced April 2024.
-
Superior and Pragmatic Talking Face Generation with Teacher-Student Framework
Authors:
Chao Liang,
Jianwen Jiang,
Tianyun Zhong,
Gaojie Lin,
Zhengkun Rong,
Jiaqi Yang,
Yongming Zhu
Abstract:
Talking face generation technology creates talking videos from arbitrary appearance and motion signal, with the "arbitrary" offering ease of use but also introducing challenges in practical applications. Existing methods work well with standard inputs but suffer serious performance degradation with intricate real-world ones. Moreover, efficiency is also an important concern in deployment. To comprehensively address these issues, we introduce SuperFace, a teacher-student framework that balances quality, robustness, cost and editability. We first propose a simple but effective teacher model capable of handling inputs of varying qualities to generate high-quality results. Building on this, we devise an efficient distillation strategy to acquire an identity-specific student model that maintains quality with significantly reduced computational load. Our experiments validate that SuperFace offers a more comprehensive solution than existing methods for the four mentioned objectives, especially in reducing FLOPs by 99\% with the student model. SuperFace can be driven by both video and audio and allows for localized facial attributes editing.
Submitted 26 March, 2024;
originally announced March 2024.
-
An improved light-cone harmonic oscillator model for the $φ$-meson longitudinal leading-twist light-cone distribution amplitude
Authors:
Dan-Dan Hu,
Xing-Gang Wu,
Long Zeng,
Hai-Bing Fu,
Tao Zhong
Abstract:
In the present paper, we study the properties of $φ$-meson longitudinal leading-twist light-cone distribution amplitude $φ_{2;φ}^{\|}(x,μ)$ by starting from a light-cone harmonic oscillator model for its wavefunction. To fix the input parameters, we derive the first ten $ξ$-moments of $φ_{2;φ}^{\|}(x,μ)$ by using the QCD sum rules approach under the background field theory. The shape of $φ_{2;φ}^{\|}(x,μ=2~{\rm GeV})$ tends to be a single-peak behavior, which is consistent with the latest Lattice QCD result. As an application, we derive the $D^+_s \to φ$ transition form factors (TFFs) by using the light-cone sum rules approach. At the large recoil point, we obtain $A_1(0) = 0.512_{-0.020}^{+0.030}$, $A_2(0) = 0.402_{-0.067}^{+0.078}$, $A_0(0) = 0.596_{-0.020}^{+0.025}$ and $V(0) = 0.882_{-0.036}^{+0.040}$. As for the two typical ratios $γ_V$ and $γ_2$, we obtain $γ_V = 1.723_{-0.021}^{+0.023}$ and $γ_2 = 0.785_{-0.104}^{+0.100}$. After extrapolating those TFFs to the physically allowable region, we then obtain the transverse, longitudinal and total decay widths for semi-leptonic decay $D^+_s\toφ\ell^+ν_{\ell}$. Then the branching fractions are ${\cal B}(D^+_s\to φe^+ν_e) = (2.367_{-0.132}^{+0.256})\times 10^{-3}$ and ${\cal B}(D^+_s\to φμ^+ν_μ) = (2.349_{-0.132}^{+0.255})\times 10^{-3}$, which show good agreement with the data issued by the BESIII, the CLEO, and the BABAR Collaborations. We finally calculate $D^+_s\toφ\ell^+ ν_\ell$ polarization and asymmetry parameters.
Submitted 15 March, 2024;
originally announced March 2024.
-
New constraints on Triton's atmosphere from the 6 October 2022 stellar occultation
Authors:
Ye Yuan,
Chen Zhang,
Fan Li,
Jian Chen,
Yanning Fu,
Chunhai Bai,
Xing Gao,
Yong Wang,
Tuhong Zhong,
Yixing Gao,
Liang Wang,
Donghua Chen,
Yixing Zhang,
Yang Zhang,
Wenpeng Xie,
Shupi Zhang,
Ding Liu,
Jun Cao,
Xiangdong Yin,
Xiaojun Mo,
Jing Liu,
Xinru Han,
Tong Liu,
Yuqiang Chen,
Zhendong Gao
, et al. (25 additional authors not shown)
Abstract:
The atmosphere of Triton was probed directly by observing a ground-based stellar occultation on 6 October 2022. This rare event yielded 23 positive light curves collected from 13 separate observation stations contributing to our campaign. The significance of this event lies in its potential to directly validate the modest pressure fluctuation on Triton, a phenomenon not definitively verified by previous observations, including only five stellar occultations, and the Voyager 2 radio occultation in 1989. Using an approach consistent with a comparable study, we precisely determined a surface pressure of $14.07_{-0.13}^{+0.21}~\mathrm{μbar}$ in 2022. This new pressure rules out any significant monotonic variation in pressure between 2017 and 2022 through direct observations, as it is in alignment with the 2017 value. Additionally, both the pressures in 2017 and 2022 align with the 1989 value. This provides further support for the conclusion drawn from the previous volatile transport model simulation, which is consistent with the observed alignment between the pressures in 1989 and 2017; that is to say, the pressure fluctuation is modest. Moreover, this conclusion suggests the existence of a northern polar cap extended down to at least $45^\circ$N$-60^\circ$N and the presence of nitrogen between $30^\circ$S and $0^\circ$.
Submitted 24 March, 2024; v1 submitted 14 March, 2024;
originally announced March 2024.
-
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis
Authors:
Zhenhui Ye,
Tianyun Zhong,
Yi Ren,
Jiaqi Yang,
Weichuang Li,
Jiawei Huang,
Ziyue Jiang,
Jinzheng He,
Rongjie Huang,
Jinglin Liu,
Chen Zhang,
Xiang Yin,
Zejun Ma,
Zhou Zhao
Abstract:
One-shot 3D talking portrait generation aims to reconstruct a 3D avatar from an unseen image, and then animate it with a reference video or audio to generate a talking portrait video. The existing methods fail to simultaneously achieve the goals of accurate 3D avatar reconstruction and stable talking face animation. Besides, while the existing works mainly focus on synthesizing the head part, it is also vital to generate natural torso and background segments to obtain a realistic talking portrait video. To address these limitations, we present Real3D-Portrait, a framework that (1) improves the one-shot 3D reconstruction power with a large image-to-plane model that distills 3D prior knowledge from a 3D face generative model; (2) facilitates accurate motion-conditioned animation with an efficient motion adapter; (3) synthesizes realistic video with natural torso movement and switchable background using a head-torso-background super-resolution model; and (4) supports one-shot audio-driven talking face generation with a generalizable audio-to-motion model. Extensive experiments show that Real3D-Portrait generalizes well to unseen identities and generates more realistic talking portrait videos compared to previous methods. Video samples and source code are available at https://real3dportrait.github.io .
Submitted 23 March, 2024; v1 submitted 16 January, 2024;
originally announced January 2024.
-
Machine Learning Insides OptVerse AI Solver: Design Principles and Applications
Authors:
Xijun Li,
Fangzhou Zhu,
Hui-Ling Zhen,
Weilin Luo,
Meng Lu,
Yimin Huang,
Zhenan Fan,
Zirui Zhou,
Yufei Kuang,
Zhihai Wang,
Zijie Geng,
Yang Li,
Haoyang Liu,
Zhiwu An,
Muming Yang,
Jianshu Li,
Jie Wang,
Junchi Yan,
Defeng Sun,
Tao Zhong,
Yong Zhang,
Jia Zeng,
Mingxuan Yuan,
Jianye Hao,
Jun Yao
, et al. (1 additional authors not shown)
Abstract:
In an era of digital ubiquity, efficient resource management and decision-making are paramount across numerous industries. To this end, we present a comprehensive study on the integration of machine learning (ML) techniques into Huawei Cloud's OptVerse AI Solver, which aims to mitigate the scarcity of real-world mathematical programming instances, and to surpass the capabilities of traditional optimization techniques. We showcase our methods for generating complex SAT and MILP instances utilizing generative models that mirror multifaceted structures of real-world problem. Furthermore, we introduce a training framework leveraging augmentation policies to maintain solvers' utility in dynamic environments. Besides the data generation and augmentation, our proposed approaches also include novel ML-driven policies for personalized solver strategies, with an emphasis on applications like graph convolutional networks for initial basis selection and reinforcement learning for advanced presolving and cut selection. Additionally, we detail the incorporation of state-of-the-art parameter tuning algorithms which markedly elevate solver performance. Compared with traditional solvers such as Cplex and SCIP, our ML-augmented OptVerse AI Solver demonstrates superior speed and precision across both established benchmarks and real-world scenarios, reinforcing the practical imperative and effectiveness of machine learning techniques in mathematical programming solvers.
Submitted 17 January, 2024; v1 submitted 11 January, 2024;
originally announced January 2024.
-
High-topological-number skyrmions and phase transition in two-dimensional frustrated $J_1$-$J_2$ magnets
Authors:
Hongliang Hu,
Zhong Shen,
Zheng Chen,
** Wu,
Tingting Zhong,
Changsheng Song
Abstract:
With the rapidly expanding field of two-dimensional (2D) magnetic materials, frustrated magnetic skyrmions are attracting growing interest. Here, based on the hexagonal close-packed (HCP) lattice of the $J_1$-$J_2$ Heisenberg spin model, we systematically investigate the frustrated skyrmions and phase transition by micromagnetic simulations and first-principles calculations. The results show that four spin phases of antiferromagnetic, labyrinth domain, skyrmion and ferromagnetic textures are determined by the identified ranges of $J_1$-$J_2$. Importantly, the skyrmion phase with an increasing topological number ($Q$) covers a wider $J_1$-$J_2$ area. Then, the diameter of skyrmions can be tuned by the frustration strength ($|J_2/J_1|$) or external magnetic field. Besides, a phase transition from Néel to Bloch type skyrmion is observed due to the change of the helicity with the variation of $|J_2/J_1|$. Furthermore, with increasing magnetic field, the skyrmions with high $Q$ ($\ge 3$) tend to split into ones with $Q=1$, thereby achieving a lower system energy. Additionally, we find that the CoCl$_2$ monolayer satisfies the requirement of the frustrated $J_1$-$J_2$ magnet, and the related magnetic behaviors agree with the above conclusions. The frustration-induced skyrmions are stable without the manipulation of temperature and magnetic field. Our results may open a possible way toward spintronic applications based on high-topological-number and nanoscale topological spin textures of skyrmions.
Submitted 20 January, 2024; v1 submitted 11 January, 2024;
originally announced January 2024.
-
M31N 2013-10c: A Newly Identified Recurrent Nova in M31
Authors:
Allen W. Shafter,
Kamil Hornoch,
Hana Kučáková,
Petr Fatka,
Jingyuan Zhao,
Xing Gao,
Shahidin Yaqup,
Tuhong Zhong,
Ali Esamdin,
Chunhai Bai,
Na Wang,
Paul Benni,
Aiden Luo,
Ilana Yousuf
Abstract:
The nova M31N 2023-11f (2023yoa) has been recently identified as the second eruption of a previously recognized nova, M31N 2013-10c, establishing the latter object as the 21st recurrent nova system thus far identified in M31. Here we present well sampled $R$-band lightcurves of both the 2013 and 2023 eruptions of this system. The photometric evolution of each eruption was quite similar as expected for the same progenitor system. The 2013 and 2023 eruptions each reached peak magnitudes just brighter than $R\sim16$, with fits to the declining branches of the eruptions yielding times to decline by two magnitudes of $t_2(R)=5.5\pm1.7$ and $t_2(R)=3.4\pm1.5$ days, respectively. M31N 2013-10c has an absolute magnitude at peak, $M_R=-8.8\pm0.2$, making it the most luminous known recurrent nova in M31.
Submitted 10 January, 2024;
originally announced January 2024.
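The abstract above quotes $t_2$, the time for the nova to fade by two magnitudes, obtained from fits to the declining branch of each light curve. A minimal sketch of how such a value can be derived is given below. It assumes a straight-line fit of magnitude against time, which the abstract does not confirm, and the function name is hypothetical.

```python
import numpy as np

def t2_from_linear_decline(days, mags):
    """Fit mag = m0 + rate * day to the declining branch and return t2 in days.

    Assumption: the decline is linear in magnitude versus time.
    Magnitudes increase as the source fades, so rate must be positive.
    """
    rate, _intercept = np.polyfit(days, mags, 1)  # rate in mag/day
    if rate <= 0:
        raise ValueError("light curve is not declining")
    return 2.0 / rate  # days needed to fade by two magnitudes
```

A steeper decline gives a smaller $t_2$, so a fade of 0.5 mag per day corresponds to $t_2 = 4$ days.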
-
Investigation for $D^+ \to π^+ ν\barν$ decay process within QCDSR approach
Authors:
Yu Chen,
Hai-Bing Fu,
Tao Zhong,
Sheng-Bo Wu,
Dong Huang
Abstract:
In the paper, we investigate the charmed meson rare decay process $D^+ \to π^+ν\barν$ by using the QCD sum rules approach. Firstly, the pion twist-2 and twist-3 distribution amplitude $ξ$-moments $\langleξ_{2;π}^n\rangle|_μ$ up to 10th-order and $\langle ξ_{3;π}^{(p,σ),n}\rangle|_μ$ up to fourth-order are calculated by using QCD sum rules under background field theory. After constructing the light-cone harmonic oscillator model for pion twist-2, 3 DAs, we get their behaviors by matching the calculated $ξ$-moments. Then, the $D\to π$ transition form factors are calculated by using the QCD light-cone sum rules approach. The vector form factor at the large recoil region is $f_+^{D\toπ}(0) = 0.627^{+0.120} _{-0.080}$. By taking the rapidly converging simplified $z(q^2,t)$ series expansion, we present the TFFs and the corresponding angular coefficients in the whole squared momentum transfer physical region. Furthermore, we display the semileptonic decay process $\bar D^0 \to π^+ e\bar ν_e$ differential decay widths and branching fraction with ${\cal B}(\bar D^0\toπ^+e\barν_e) = 0.308^{+0.155}_{-0.066} \times 10^{-2}$. The $\bar D^0\toπ^+e\barν_e$ differential/total predictions for forward-backward asymmetry, $q^2$-differential flat terms and lepton polarization asymmetry are also given. After considering the non-standard neutrino interactions, the prediction for the $D^+ \to π^+ ν\barν$ branching fraction is ${\cal B}(D^+ \to π^+ {ν}{\barν}) = 1.85^{+0.93}_{-0.46}\times10^{-8}$.
Submitted 5 January, 2024;
originally announced January 2024.
-
Understanding LLMs: A Comprehensive Overview from Training to Inference
Authors:
Yiheng Liu,
Hao He,
Tianle Han,
Xu Zhang,
Mengyuan Liu,
Jiaming Tian,
Yutong Zhang,
Jiaqi Wang,
Xiaohui Gao,
Tianyang Zhong,
Yi Pan,
Shaochen Xu,
Zihao Wu,
Zhengliang Liu,
Xin Zhang,
Shu Zhang,
Xintao Hu,
Tuo Zhang,
Ning Qiang,
Tianming Liu,
Bao Ge
Abstract:
The introduction of ChatGPT has led to a significant increase in the utilization of Large Language Models (LLMs) for addressing downstream tasks. There's an increasing focus on cost-efficient training and deployment within this context. Low-cost training and deployment of LLMs represent the future development trend. This paper reviews the evolution of large language model training techniques and inference deployment technologies aligned with this emerging trend. The discussion on training includes various aspects, including data preprocessing, training architecture, pre-training tasks, parallel training, and relevant content related to model fine-tuning. On the inference side, the paper covers topics such as model compression, parallel computation, memory scheduling, and structural optimization. It also explores LLMs' utilization and provides insights into their future development.
Submitted 5 January, 2024; v1 submitted 3 January, 2024;
originally announced January 2024.
-
Language Model is a Branch Predictor for Simultaneous Machine Translation
Authors:
Aoxiong Yin,
Tianyun Zhong,
Haoyuan Li,
Siliang Tang,
Zhou Zhao
Abstract:
The primary objective of simultaneous machine translation (SiMT) is to minimize latency while preserving the quality of the final translation. Drawing inspiration from CPU branch prediction techniques, we propose incorporating branch prediction techniques in SiMT tasks to reduce translation latency. Specifically, we utilize a language model as a branch predictor to predict potential branch directions, namely, future source words. Subsequently, we utilize the predicted source words to decode the output in advance. When the actual source word deviates from the predicted source word, we use the real source word to decode the output again, replacing the predicted output. To further reduce computational costs, we share the parameters of the encoder and the branch predictor, and utilize a pre-trained language model for initialization. Our proposed method can be seamlessly integrated with any SiMT model. Extensive experimental results demonstrate that our approach can improve translation quality and latency at the same time. Our code is available at https://github.com/YinAoXiong/simt_branch_predictor .
Submitted 22 December, 2023;
originally announced December 2023.
-
A Microservices Identification Method Based on Spectral Clustering for Industrial Legacy Systems
Authors:
Teng Zhong,
Yinglei Teng,
Shijun Ma,
Jiaxuan Chen,
Sicong Yu
Abstract:
The advent of the Industrial Internet of Things (IIoT) has imposed more stringent requirements on industrial software in terms of communication delay, scalability, and maintainability. Microservice architecture (MSA), a novel software architecture that has emerged from cloud computing and DevOps, presents itself as the most promising solution due to its independently deployable and loosely coupled nature. Currently, practitioners are inclined to migrate industrial legacy systems to MSA, despite the numerous challenges this presents. In this paper, we propose an automated microservice decomposition method for extracting microservice candidates based on spectral graph theory to address the problems associated with manual extraction, which is time-consuming, labor-intensive, and highly subjective. The method is divided into three steps. First, static and dynamic analysis tools are employed to extract dependency information from the legacy system. Subsequently, this information is transformed into a graph structure that captures inter-class structure and performance relationships in the legacy system. Finally, a graph-based clustering algorithm is utilized to identify potential microservice candidates that conform to the principles of high cohesion and low coupling. Comparative experiments with state-of-the-art methods demonstrate the significant advantages of our proposed method in terms of performance metrics. Moreover, practice shows that our method can yield favorable results even without the involvement of domain experts.
Submitted 20 December, 2023;
originally announced December 2023.
-
Holistic Evaluation of GPT-4V for Biomedical Imaging
Authors:
Zhengliang Liu,
Hanqi Jiang,
Tianyang Zhong,
Zihao Wu,
Chong Ma,
Yiwei Li,
Xiaowei Yu,
Yutong Zhang,
Yi Pan,
Peng Shu,
Yanjun Lyu,
Lu Zhang,
Junjie Yao,
Peixin Dong,
Chao Cao,
Zhenxiang Xiao,
Jiaqi Wang,
Huan Zhao,
Shaochen Xu,
Yaonai Wei,
Jingyuan Chen,
Haixing Dai,
Peilong Wang,
Hao He,
Zewei Wang
, et al. (25 additional authors not shown)
Abstract:
In this paper, we present a large-scale evaluation probing GPT-4V's capabilities and limitations for biomedical image analysis. GPT-4V represents a breakthrough in artificial general intelligence (AGI) for computer vision, with applications in the biomedical domain. We assess GPT-4V's performance across 16 medical imaging categories, including radiology, oncology, ophthalmology, pathology, and more. Tasks include modality recognition, anatomy localization, disease diagnosis, report generation, and lesion detection. The extensive experiments provide insights into GPT-4V's strengths and weaknesses. Results show GPT-4V's proficiency in modality and anatomy recognition but difficulty with disease diagnosis and localization. GPT-4V excels at diagnostic report generation, indicating strong image captioning skills. While promising for biomedical imaging AI, GPT-4V requires further enhancement and validation before clinical deployment. We emphasize responsible development and testing for trustworthy integration of biomedical AGI. This rigorous evaluation of GPT-4V on diverse medical images advances understanding of multimodal large language models (LLMs) and guides future work toward impactful healthcare applications.
Submitted 10 November, 2023;
originally announced December 2023.
-
Ophtha-LLaMA2: A Large Language Model for Ophthalmology
Authors:
Huan Zhao,
Qian Ling,
Yi Pan,
Tianyang Zhong,
Jin-Yu Hu,
Junjie Yao,
Fengqian Xiao,
Zhenxiang Xiao,
Yutong Zhang,
San-Hua Xu,
Shi-Nan Wu,
Min Kang,
Zihao Wu,
Zhengliang Liu,
Xi Jiang,
Tianming Liu,
Yi Shao
Abstract:
In recent years, pre-trained large language models (LLMs) have achieved tremendous success in the field of Natural Language Processing (NLP). Prior studies have primarily focused on general and generic domains, with relatively less research on specialized LLMs in the medical field. The specialization and high accuracy requirements for diagnosis in the medical field, as well as the challenges in collecting large-scale data, have constrained the application and development of LLMs in medical scenarios. In the field of ophthalmology, clinical diagnosis mainly relies on doctors' interpretation of reports and making diagnostic decisions. In order to take advantage of LLMs to provide decision support for doctors, we collected three modalities of ophthalmic report data and fine-tuned the LLaMA2 model, successfully constructing an LLM termed the "Ophtha-LLaMA2" specifically tailored for ophthalmic disease diagnosis. Inference test results show that even with a smaller fine-tuning dataset, Ophtha-LLaMA2 performs significantly better in ophthalmic diagnosis compared to other LLMs. It demonstrates that the Ophtha-LLaMA2 exhibits satisfying accuracy and efficiency in ophthalmic disease diagnosis, making it a valuable tool for ophthalmologists to provide improved diagnostic support for patients. This research provides a useful reference for the application of LLMs in the field of ophthalmology, while showcasing the immense potential and prospects in this domain.
Submitted 8 December, 2023;
originally announced December 2023.
-
Enhanced quantum sensing mediated by a cavity in open systems
Authors:
Quinn Langfitt,
Zain H. Saleem,
Tian Zhong,
Anil Shaji,
Stephen K. Gray
Abstract:
We simulate the dynamics of systems with $N$ = 1-20 qubits coupled to a cavity in order to assess their potential for quantum metrology of a parameter in the open systems limit. The qubits and the cavity are both allowed to have losses and the system is studied under various coupling strength regimes. The focus is primarily on the coupling between the qubits using the quantum Fisher information as the measured parameter. Some results on estimating the qubit-cavity detuning parameter are also presented. We investigate the scaling of the uncertainty in the estimate of the qubit-cavity coupling with the number of qubits and for different initial states of the qubits that act as the quantum probe. As initial probe states, we consider Dicke states with varying excitation numbers, the GHZ state, and separable X-polarized states. It is shown that in the strong coupling regime, i.e., when the coupling between the qubits and the cavity is greater than the decay parameters of both the qubits and the cavity, Dicke states with a large excitation number can achieve the Heisenberg limit, with the precision scaling improving as the excitation number increases. A particularly intriguing finding of our study is that in the weak coupling regime, as well as in situations where either the qubit or cavity decay parameters exceed the coupling, the separable $X$-polarized state is the best in terms of scaling and is even able to achieve the Heisenberg limit in these lossy regimes for the range of $N$ considered.
Submitted 7 December, 2023;
originally announced December 2023.
-
RINAS: Training with Dataset Shuffling Can Be General and Fast
Authors:
Tianle Zhong,
Jiechen Zhao,
Xindi Guo,
Qiang Su,
Geoffrey Fox
Abstract:
Deep learning datasets are expanding at an unprecedented pace, creating new challenges for data processing in model training pipelines. A crucial aspect of these pipelines is dataset shuffling, which significantly improves unbiased learning and convergence accuracy by adhering to the principles of random sampling. However, loading shuffled data for large datasets incurs significant overhead in the deep learning pipeline and severely impacts the end-to-end training throughput. To mitigate this, current deep learning systems often resort to partial dataset shuffling, sacrificing global randomness to maintain acceptable training throughput on large datasets, still leaving global shuffling efficiency issues not fully explored.
In this work, we present RINAS, a data loading framework that systematically addresses the performance bottleneck of loading global shuffled datasets. Our key contribution is to offer an intra-batch unordered data fetching approach, which unleashes unexplored parallelism of data loading. We implement RINAS under the PyTorch framework for common dataset libraries HuggingFace and TorchVision. Our experimental results show that RINAS improves the throughput of general language model training and vision model training by up to 59% and 89%, respectively.
Submitted 4 December, 2023;
originally announced December 2023.
-
RTP: Rethinking Tensor Parallelism with Memory Deduplication
Authors:
Cheng Luo,
Tianle Zhong,
Geoffrey Fox
Abstract:
In the evolving landscape of neural network models, one prominent challenge stands out: the significant memory overhead associated with training expansive models. Addressing this challenge, this study examines Rotated Tensor Parallelism (RTP), an approach that focuses on memory deduplication in distributed training environments. Its distinguishing features include a customized communication primitive and Flyweight Pattern initialization. Furthermore, RTP overlaps partition computation with partition weight communication, optimizing the training process. Our empirical evaluations underscore RTP's efficiency, revealing that its memory consumption during distributed training is remarkably close to the optimum, in which the memory overhead of a single machine is distributed equitably among multiple machines. The experimental results demonstrate that RTP achieves performance comparable to Distributed Data Parallel while supporting significantly larger models with near-linear scalability in terms of memory. Code of RTP is available at https://github.com/wdlctc/rtp.
Submitted 2 November, 2023;
originally announced November 2023.
-
Air-Decoding: Attribute Distribution Reconstruction for Decoding-Time Controllable Text Generation
Authors:
Tianqi Zhong,
Quan Wang,
Jingxuan Han,
Yongdong Zhang,
Zhendong Mao
Abstract:
Controllable text generation (CTG) aims to generate text with desired attributes, and decoding-time-based methods have shown promising performance on this task. However, in this paper, we identify the phenomenon of Attribute Collapse for the first time. It causes the fluency of generated text to rapidly decrease when the control strength exceeds a critical value, rendering the text completely unusable. This limitation hinders the effectiveness of decoding methods in achieving high levels of controllability. To address this problem, we propose a novel lightweight decoding framework named Air-Decoding. Its main idea is reconstructing the attribute distributions to balance the weights between attribute words and non-attribute words to generate more fluent text. Specifically, we train prefixes by prefix-tuning to obtain attribute distributions. Then we design a novel attribute distribution reconstruction method to balance the obtained distributions and use the reconstructed distributions to guide language models for generation, effectively avoiding the issue of Attribute Collapse. Experiments on multiple CTG tasks prove that our method achieves a new state-of-the-art control performance.
Submitted 1 November, 2023; v1 submitted 23 October, 2023;
originally announced October 2023.
-
QUIK: Towards End-to-End 4-Bit Inference on Generative Large Language Models
Authors:
Saleh Ashkboos,
Ilia Markov,
Elias Frantar,
Tingxuan Zhong,
Xincheng Wang,
Jie Ren,
Torsten Hoefler,
Dan Alistarh
Abstract:
Large Language Models (LLMs) from the GPT family have become extremely popular, leading to a race towards reducing their inference costs to allow for efficient local computation. Yet, the vast majority of existing work focuses on weight-only quantization, which can reduce runtime costs in the memory-bound one-token-at-a-time generative setting, but does not address them in compute-bound scenarios, such as batched inference or prompt processing. In this paper, we address the general quantization problem, where both weights and activations should be quantized. We show, for the first time, that the majority of inference computations for large generative models such as LLaMA, OPT, and Falcon can be performed with both weights and activations being cast to 4 bits, in a way that leads to practical speedups, while at the same time maintaining good accuracy. We achieve this via a hybrid quantization strategy called QUIK, which compresses most of the weights and activations to 4-bit, while keeping some outlier weights and activations in higher precision. The key feature of our scheme is that it is designed with computational efficiency in mind: we provide GPU kernels matching the QUIK format with highly efficient layer-wise runtimes, which lead to practical end-to-end throughput improvements of up to 3.4x relative to FP16 execution. We provide detailed studies for models from the OPT, LLaMA-2 and Falcon families, as well as a first instance of accurate inference using quantization plus 2:4 sparsity. Code is available at: https://github.com/IST-DASLab/QUIK.
Submitted 2 November, 2023; v1 submitted 13 October, 2023;
originally announced October 2023.
-
Dual epitaxial telecom spin-photon interfaces with correlated long-lived coherence
Authors:
Shobhit Gupta,
Yizhong Huang,
Shihan Liu,
Yuxiang Pei,
Natasha Tomm,
Richard J. Warburton,
Tian Zhong
Abstract:
Optically active solid-state spin qubits thrive as an appealing technology for quantum interconnect and quantum networking, owing to their atomic size, scalable creation, long-lived coherence, and ability to coherently interface with flying qubits. Trivalent erbium dopants in particular emerge as a compelling candidate with their telecom C band emission and shielded 4f intra-shell spin-optical transitions. However, prevailing top-down architecture for rare-earth qubits and devices has not allowed simultaneous long optical and spin coherence necessary for long-distance quantum networks. Here we demonstrate dual erbium telecom spin-photon interfaces in an epitaxial thin-film platform via wafer-scale bottom-up synthesis. Harnessing precise controls over the matrix purity, dopant placement, and symmetry unique to this platform, we simultaneously achieve millisecond erbium spin coherence time and $<$3 kilohertz optical dephasing rate in an inversion-symmetry protected site and realize both optical and microwave control in a fiber-integrated package for rapid scaling up. These results demonstrate a significant prospect for high-quality rare-earth qubits and quantum memories assembled using a bottom-up method and pave the way for the large-scale development of quantum light-matter interfaces for telecommunication quantum networks.
Submitted 10 October, 2023;
originally announced October 2023.
-
ChatRadio-Valuer: A Chat Large Language Model for Generalizable Radiology Report Generation Based on Multi-institution and Multi-system Data
Authors:
Tianyang Zhong,
Wei Zhao,
Yutong Zhang,
Yi Pan,
Peixin Dong,
Zuowei Jiang,
Xiaoyan Kui,
Youlan Shang,
Li Yang,
Yaonai Wei,
Longtao Yang,
Hao Chen,
Huan Zhao,
Yuxiao Liu,
Ning Zhu,
Yiwei Li,
Yisong Wang,
Jiaqi Yao,
Jiaqi Wang,
Ying Zeng,
Lei He,
Chao Zheng,
Zhixue Zhang,
Ming Li,
Zhengliang Liu
, et al. (17 additional authors not shown)
Abstract:
Radiology report generation, as a key step in medical image analysis, is critical to the quantitative analysis of clinically informed decision-making levels. However, complex and diverse radiology reports with cross-source heterogeneity pose a huge generalizability challenge to current methods under massive data volume, mainly because the style and normativity of radiology reports differ markedly among institutions, body regions inspected, and radiologists. Recently, the advent of large language models (LLMs) has offered great potential for recognizing signs of health conditions. To resolve the above problem, we collaborate with the Second Xiangya Hospital in China and propose ChatRadio-Valuer, an LLM-based tailored model for automatic radiology report generation that learns generalizable representations and provides a basis pattern for model adaptation in sophisticated analysts' cases. Specifically, ChatRadio-Valuer is trained on the radiology reports from a single institution by means of supervised fine-tuning, and then adapted to disease diagnosis tasks for human multi-system evaluation (i.e., chest, abdomen, muscle-skeleton, head, and maxillofacial & neck) from six different institutions in clinical-level events. The clinical dataset utilized in this study encompasses a total of 332,673 observations. The comprehensive results on engineering indicators, clinical efficacy, and deployment cost metrics show that ChatRadio-Valuer consistently outperforms state-of-the-art models, especially ChatGPT (GPT-3.5-Turbo) and GPT-4, in terms of disease diagnosis from radiology reports. ChatRadio-Valuer provides an effective avenue to boost model generalization performance and alleviate the annotation workload of experts to enable the promotion of clinical AI applications in radiology reports.
Submitted 9 October, 2023; v1 submitted 8 October, 2023;
originally announced October 2023.
-
Chat2Brain: A Method for Mapping Open-Ended Semantic Queries to Brain Activation Maps
Authors:
Yaonai Wei,
Tuo Zhang,
Han Zhang,
Tianyang Zhong,
Lin Zhao,
Zhengliang Liu,
Chong Ma,
Songyao Zhang,
Muheng Shang,
Lei Du,
Xiao Li,
Tianming Liu,
Junwei Han
Abstract:
Over decades, neuroscience has accumulated a wealth of research results in the text modality that can be used to explore cognitive processes. Meta-analysis is a typical method that successfully establishes a link from text queries to brain activation maps using these research results, but it still relies on an ideal query environment. In practical applications, text queries used for meta-analyses may encounter issues such as semantic redundancy and ambiguity, resulting in an inaccurate mapping to brain images. On the other hand, large language models (LLMs) like ChatGPT have shown great potential in tasks such as context understanding and reasoning, displaying a high degree of consistency with human natural language. Hence, LLMs could improve the connection between the text modality and neuroscience, resolving existing challenges of meta-analyses. In this study, we propose a method called Chat2Brain that combines LLMs with a basic text-2-image model, known as Text2Brain, to map open-ended semantic queries to brain activation maps in data-scarce and complex query environments. By utilizing the understanding and reasoning capabilities of LLMs, the performance of the mapping model is optimized by transferring text queries to semantic queries. We demonstrate that Chat2Brain can synthesize anatomically plausible neural activation patterns for more complex tasks of text queries.
Submitted 10 September, 2023;
originally announced September 2023.
-
$ρ$-meson longitudinal leading-twist distribution amplitude revisited and the $D\to ρ$ semileptonic decay
Authors:
Tao Zhong,
Ya-Hong Dai,
Hai-Bing Fu
Abstract:
Motivated by our previous work [Phys. Rev. D \textbf{104}, no.1, 016021 (2021)] on the pionic leading-twist distribution amplitude (DA), we revisit the $ρ$-meson leading-twist longitudinal DA $φ_{2;ρ}^\|(x,μ)$ in this paper. A model proposed by Chang based on the Dyson-Schwinger equations (DSEs) is adopted to describe the behavior of $φ_{2;ρ}^\|(x,μ)$. On the other hand, the $ξ$-moments of $φ_{2;ρ}^\|(x,μ)$ are calculated with the QCD sum rules in the framework of the background field theory. The sum rule formulas for those moments are improved. More accurate values for the first five nonzero $ξ$-moments at typical scales $μ=1, 1.4, 2, 3~{\rm GeV}$ are given, e.g., at $μ= 1~{\rm GeV}$, $\langleξ^2\rangle_{2;ρ}^\| = 0.220(6)$, $\langleξ^4\rangle_{2;ρ}^\| = 0.103(4)$, $\langleξ^6\rangle_{2;ρ}^\| = 0.066(5)$, $\langleξ^8\rangle_{2;ρ}^\| = 0.046(4)$ and $\langleξ^{10}\rangle_{2;ρ}^\| = 0.035(3)$. By fitting those values with the least squares method, the DSE model for $φ_{2;ρ}^\|(x,μ)$ is determined. By taking the left-handed current light-cone sum rule approach, we get the transition form factors at the large recoil region, {\it i.e.} $A_1(0) = 0.498^{+0.014}_{-0.012}$, $A_2(0)=0.460^{+0.055}_{-0.047}$, $V(0) = 0.800^{+0.015}_{-0.014}$, and the ratios $r_2 = 0.923^{+0.133}_{-0.119}$, $r_V = 1.607^{+0.071}_{-0.071}$. After making the extrapolation with a rapidly converging series based on the $z(t)$-expansion, we present the decay widths for the semileptonic decays $D\toρ\ell^+ν_\ell$. Finally, the branching fractions are $\mathcal{B}(D^0\to ρ^- e^+ ν_e) = 1.889^{+0.176}_{-0.170}\pm 0.005$, $\mathcal{B}(D^+ \to ρ^0 e^+ ν_e) = 2.380^{+0.221}_{-0.214}\pm 0.012$, $\mathcal{B}(D^0\to ρ^- μ^+ ν_μ) = 1.881^{+0.174}_{-0.168}\pm 0.005$, $\mathcal{B}(D^+ \to ρ^0 μ^+ ν_μ) =2.369^{+0.219}_{-0.211}\pm 0.011$.
Submitted 27 August, 2023;
originally announced August 2023.
-
Nonconvex optimization for optimum retrieval of the transmission matrix of a multimode fiber
Authors:
Shengfu Cheng,
Xuyu Zhang,
Tianting Zhong,
Huanhao Li,
Haoran Li,
Lei Gong,
Honglin Liu,
Puxiang Lai
Abstract:
The transmission matrix (TM) allows light control through complex media such as multimode fibers (MMFs), and has gained great attention in areas like biophotonics over the past decade. The measurement of a complex-valued TM is highly desired as it supports full modulation of the light field, yet it is demanding because a holographic setup is usually required. Efforts have been made to retrieve a TM directly from intensity measurements with several representative phase retrieval algorithms, which still suffer from limitations such as slow or suboptimal recovery, especially in noisy environments. Here, a modified nonconvex optimization approach is proposed. Numerical evaluations show that the nonconvex method offers optimum focusing efficiency with less running time or a lower sampling rate. A comparative test under different signal-to-noise levels further indicates its improved robustness for TM retrieval. Experimentally, the optimum retrieval of the TM of an MMF is collectively validated by multiple groups of single-spot and multi-spot focusing demonstrations. Focus scanning on the working plane of the MMF is also conducted, where our method achieves 93.6% of the efficiency of the gold-standard holography method when the sampling rate is 8. Based on the recovered TM, image transmission through the MMF with high fidelity can be realized via another phase retrieval. Thanks to parallel operation and GPU acceleration, the nonconvex approach can retrieve an 8685$\times$1024 TM (sampling rate=8) in 42.3 s on a regular computer. In brief, the proposed method provides optimum efficiency and fast implementation for TM retrieval, which will facilitate wide applications in deep-tissue optical imaging, manipulation and treatment.
Submitted 2 August, 2023;
originally announced August 2023.
-
Evaluating Large Language Models for Radiology Natural Language Processing
Authors:
Zhengliang Liu,
Tianyang Zhong,
Yiwei Li,
Yutong Zhang,
Yi Pan,
Zihao Zhao,
Peixin Dong,
Chao Cao,
Yuxiao Liu,
Peng Shu,
Yaonai Wei,
Zihao Wu,
Chong Ma,
Jiaqi Wang,
Sheng Wang,
Mengyue Zhou,
Zuowei Jiang,
Chunlin Li,
Jason Holmes,
Shaochen Xu,
Lu Zhang,
Haixing Dai,
Kai Zhang,
Lin Zhao,
Yuanhao Chen
, et al. (20 additional authors not shown)
Abstract:
The rise of large language models (LLMs) has marked a pivotal shift in the field of natural language processing (NLP). LLMs have revolutionized a multitude of domains, and they have made a significant impact in the medical field. Large language models are now more abundant than ever, and many of these models exhibit bilingual capabilities, being proficient in both English and Chinese. However, a comprehensive evaluation of these models remains to be conducted. This lack of assessment is especially apparent within the context of radiology NLP. This study seeks to bridge this gap by critically evaluating thirty-two LLMs in interpreting radiology reports, a crucial component of radiology NLP. Specifically, the ability to derive impressions from radiologic findings is assessed. The outcomes of this evaluation provide key insights into the performance, strengths, and weaknesses of these LLMs, informing their practical applications within the medical domain.
Submitted 27 July, 2023; v1 submitted 25 July, 2023;
originally announced July 2023.
-
Gloss Attention for Gloss-free Sign Language Translation
Authors:
Aoxiong Yin,
Tianyun Zhong,
Li Tang,
Weike Jin,
Tao Jin,
Zhou Zhao
Abstract:
Most sign language translation (SLT) methods to date require gloss annotations to provide additional supervision; however, gloss annotations are not easy to acquire. To solve this problem, we first analyze existing models to confirm how gloss annotations make SLT easier. We find that gloss provides the model with two kinds of information: 1) it helps the model implicitly learn the location of semantic boundaries in continuous sign language videos, and 2) it helps the model understand the sign language video globally. We then propose \emph{gloss attention}, which enables the model to keep its attention within video segments that have the same semantics locally, just as gloss helps existing models do. Furthermore, we transfer the knowledge of sentence-to-sentence similarity from the natural language model to our gloss attention SLT network (GASLT) to help it understand sign language videos at the sentence level. Experimental results on multiple large-scale sign language datasets show that our proposed GASLT model significantly outperforms existing methods. Our code is provided at \url{https://github.com/YinAoXiong/GASLT}.
Submitted 14 July, 2023;
originally announced July 2023.
-
Properties of the $η_q$ leading-twist distribution amplitude and its effects to the $B/D^+ \toη^{(\prime)}\ell^+ ν_\ell$ decays
Authors:
Dan-Dan Hu,
Xing-Gang Wu,
Hai-Bing Fu,
Tao Zhong,
Zai-Hui Wu,
Long Zeng
Abstract:
The $η^{(\prime)}$-mesons in the quark-flavor basis are mixtures of two mesonic states $|η_{q}\rangle=|\bar u u+\bar d d\rangle/\sqrt 2$ and $|η_{s}\rangle=|\bar s s\rangle$. In previous work, we made a detailed study of the $η_{s}$ leading-twist distribution amplitude. As a follow-up, in the present paper we fix the $η_q$ leading-twist distribution amplitude by using the light-cone harmonic oscillator model for its wave function and by using the QCD sum rules within the QCD background field to calculate its moments. The input parameters of the $η_q$ leading-twist distribution amplitude $φ_{2;η_q}$ at an initial scale $μ_0\sim 1$ GeV are then fixed by using those moments. The sum rules for the $0_{\rm th}$-order moment can also be used to fix the magnitude of the $η_q$ decay constant, which gives $f_{η_q}=0.141\pm0.005$ GeV. As an application of the derived $φ_{2;η_q}$, we calculate the transition form factors $B(D)^+ \toη^{(\prime)}$ by using the QCD light-cone sum rules up to twist-4 accuracy and by including the next-to-leading order QCD corrections to the twist-2 part, and then fix the related CKM matrix element and the decay width for the semi-leptonic decays $B(D)^+ \toη^{(\prime)}\ell^+ ν_\ell$.
Submitted 5 December, 2023; v1 submitted 10 July, 2023;
originally announced July 2023.
-
Fast-Grasp'D: Dexterous Multi-finger Grasp Generation Through Differentiable Simulation
Authors:
Dylan Turpin,
Tao Zhong,
Shutong Zhang,
Guanglei Zhu,
Jingzhou Liu,
Ritvik Singh,
Eric Heiden,
Miles Macklin,
Stavros Tsogkas,
Sven Dickinson,
Animesh Garg
Abstract:
Multi-finger grasping relies on high-quality training data, which is hard to obtain: human data is hard to transfer and synthetic data relies on simplifying assumptions that reduce grasp quality. By making grasp simulation differentiable, and contact dynamics amenable to gradient-based optimization, we accelerate the search for high-quality grasps with fewer limiting assumptions. We present Grasp'D-1M: a large-scale dataset for multi-finger robotic grasping, synthesized with Fast-Grasp'D, a novel differentiable grasping simulator. Grasp'D-1M contains one million training examples for three robotic hands (three, four and five-fingered), each with multimodal visual inputs (RGB+depth+segmentation, available in mono and stereo). Grasp synthesis with Fast-Grasp'D is 10x faster than GraspIt! and 20x faster than the prior Grasp'D differentiable simulator. Generated grasps are more stable and contact-rich than GraspIt! grasps, regardless of the distance threshold used for contact generation. We validate the usefulness of our dataset by retraining an existing vision-based grasping pipeline on Grasp'D-1M, and showing a dramatic increase in model performance, predicting grasps with 30% more contact, a 33% higher epsilon metric, and 35% lower simulated displacement. Additional details at https://dexgrasp.github.io.
Submitted 13 June, 2023;
originally announced June 2023.
-
Investigating $D_s^+ \to π^0 \ell^+ ν_\ell$ decay process within QCD sum rule approach
Authors:
Hai-Jiang Tian,
Hai-Bing Fu,
Tao Zhong,
Xuan Luo,
Dan-Dan Hu,
Yin-Long Yang
Abstract:
In this paper, the semileptonic decays $D_s^+ \to π^0\ell^+ ν_\ell$ with $\ell=(e,μ)$ are investigated by using the light-cone sum rule approach. Firstly, the neutral meson mixing scheme between $π^0$, $η$, $η^\prime$ and pseudoscalar gluonium $G$ is discussed in a unified way, which leads to a direct connection between the two channels $D_s^+\to π^0\ell^+ν_\ell$ and $D_s^+ \to η\ell^+ν_\ell$ via the $π^0-η$ mixing angle. We then calculate the $D_s\to π^0$ transition form factors (TFFs) within the QCD light-cone sum rule approach up to next-to-leading order correction. At the large recoil point, we have $f_+^{D_s^+π^0}(0)=0.0113_{-0.0019}^{+0.0024}$ and $f_-^{D_s^+π^0}(0)=0.0020_{-0.0009}^{+0.0008}$. Furthermore, the TFFs are extrapolated to the whole physical $q^2$-region by using the simplified $z(q^2)$-series expansion. The behaviors of the TFFs and the three related angular coefficient functions $a_{θ_\ell}(q^2)$, $b_{θ_\ell}(q^2)$ and $c_{θ_\ell}(q^2)$ are given. The differential decay widths for $D_s^+ \to π^0\ell^+ ν_\ell$ with respect to $q^2$ and $\cosθ_\ell$ are presented, and they lead to the branching fractions ${\cal B}(D_s^+\to π^0e^+ν_e) =2.60_{-0.51}^{+0.57}\times 10^{-5}$ and ${\cal B}(D_s^+\to π^0μ^+ν_μ)= 2.58_{-0.51}^{+0.56}\times 10^{-5}$. These results agree well with the recent BESIII measurements and theoretical predictions. The differential distributions and integrated predictions for three angular observables, {\it i.e.} forward-backward asymmetries, $q^2$-differential flat terms and lepton polarization asymmetries, are then given separately. Lastly, we estimate the ratio for different decay channels ${\cal R}_{π^0/η}^{\ell}=1.108_{-0.071}^{+0.039}\times 10^{-3}$.
Submitted 11 October, 2023; v1 submitted 13 June, 2023;
originally announced June 2023.
-
Embrace Opportunities and Face Challenges: Using ChatGPT in Undergraduate Students' Collaborative Interdisciplinary Learning
Authors:
Gaoxia Zhu,
Xiuyi Fan,
Chenyu Hou,
Tianlong Zhong,
Peter Seow,
Annabel Chen Shen-Hsing,
Preman Rajalingam,
Low Kin Yew,
Tan Lay Poh
Abstract:
ChatGPT, launched in November 2022, has gained widespread attention from students and educators globally, with an online report by Hu (2023) describing it as the fastest-growing consumer application in history. While discussions on the use of ChatGPT in higher education are abundant, empirical studies on its impact on collaborative interdisciplinary learning are rare. To investigate its potential, we conducted a quasi-experimental study with 130 undergraduate students (STEM and non-STEM) learning digital literacy with or without ChatGPT over two weeks. Weekly surveys were conducted on collaborative interdisciplinary problem-solving, physical and cognitive engagement, and individual reflections on ChatGPT use. Analysis of survey responses showed significant main effects of topics on collaborative interdisciplinary problem-solving and physical and cognitive engagement, a marginal interaction effect between disciplinary backgrounds and ChatGPT conditions for cognitive engagement, and a significant interaction effect for physical engagement. Sentiment analysis of student reflections suggested no significant difference between STEM and non-STEM students' opinions towards ChatGPT. Qualitative analysis of reflections generated eight positive themes, including efficiency, addressing knowledge gaps, and generating human-like responses, and eight negative themes, including generic responses, lack of innovation, and being counterproductive to self-discipline and thinking. Our findings suggest that ChatGPT use needs to be optimized by considering the topics being taught and the disciplinary backgrounds of students rather than applying it uniformly. These findings have implications for both pedagogical research and practices.
Submitted 23 May, 2023;
originally announced May 2023.
-
Entanglement Distribution in Quantum Repeater with Purification and Optimized Buffer Time
Authors:
Allen Zang,
Xinan Chen,
Alexander Kolar,
Joaquin Chung,
Martin Suchara,
Tian Zhong,
Rajkumar Kettimuthu
Abstract:
Quantum repeater networks that allow long-distance entanglement distribution will be the backbone of distributed quantum information processing. In this paper we explore entanglement distribution using quantum repeaters with optimized buffer time, equipped with noisy quantum memories and performing imperfect entanglement purification and swapping. We observe that increasing the number of memories on end nodes leads to a higher entanglement distribution rate per memory and higher probability of high-fidelity entanglement distribution, at least for the case with perfect operations. When imperfect operations are considered, however, we make the surprising observation that the per-memory entanglement rate decreases with increasing number of memories. Our results suggest that building quantum repeaters that perform well under realistic conditions requires careful modeling and design that takes into consideration the operations and resources that are finite and imperfect.
Submitted 23 May, 2023;
originally announced May 2023.
-
ChatABL: Abductive Learning via Natural Language Interaction with ChatGPT
Authors:
Tianyang Zhong,
Yaonai Wei,
Li Yang,
Zihao Wu,
Zhengliang Liu,
Xiaozheng Wei,
Wenjun Li,
Junjie Yao,
Chong Ma,
Xiang Li,
Dajiang Zhu,
Xi Jiang,
Junwei Han,
Dinggang Shen,
Tianming Liu,
Tuo Zhang
Abstract:
Large language models (LLMs) such as ChatGPT have recently demonstrated significant potential in mathematical abilities, providing a valuable reasoning paradigm consistent with human natural language. However, LLMs currently have difficulty in bridging perception, language understanding and reasoning capabilities due to the incompatibility of the underlying information flow among them, making it challenging to accomplish tasks autonomously. On the other hand, abductive learning (ABL) frameworks for integrating the two abilities of perception and reasoning have seen significant success in inverse decipherment of incomplete facts, but they are limited by the lack of semantic understanding of logical reasoning rules and the dependence on complicated domain knowledge representation. This paper presents a novel method (ChatABL) for integrating LLMs into the ABL framework, aiming at unifying the three abilities in a more user-friendly and understandable manner. The proposed method uses the strengths of LLMs' understanding and logical reasoning to correct the incomplete logical facts for optimizing the performance of the perceptual module, by summarizing and reorganizing reasoning rules represented in natural language format. Similarly, the perceptual module provides necessary reasoning examples for LLMs in natural language format. The variable-length handwritten equation deciphering task, an abstract expression of the Mayan calendar decoding, is used as a testbed to demonstrate that ChatABL has reasoning ability beyond most existing state-of-the-art methods, which is well supported by comparative studies. To the best of our knowledge, the proposed ChatABL is the first attempt to explore a new pattern for further approaching human-level cognitive ability via natural language interaction with ChatGPT.
Submitted 21 April, 2023;
originally announced April 2023.
-
Quantum Optical Memory for Entanglement Distribution
Authors:
Yisheng Lei,
Faezeh Kimiaee Asadi,
Tian Zhong,
Alexander Kuzmich,
Christoph Simon,
Mahdi Hosseini
Abstract:
Optical photons are powerful carriers of quantum information, which can be delivered in free space by satellites or in fibers on the ground over long distances. Entanglement of quantum states over long distances can empower quantum computing, quantum communications, and quantum sensing. Quantum optical memories can effectively store and manipulate quantum states, which makes them indispensable elements in future long-distance quantum networks. Over the past two decades, quantum optical memories with high fidelity, high efficiencies, long storage times, and promising multiplexing capabilities have been developed, especially at the single photon level. In this review, we introduce the working principles of commonly used quantum memory protocols and summarize the recent advances in quantum memory demonstrations. We also offer a vision for future quantum optical memory devices that may enable entanglement distribution over long distances.
Submitted 18 April, 2023;
originally announced April 2023.
-
Supercharging Distributed Computing Environments For High Performance Data Engineering
Authors:
Niranda Perera,
Kaiying Shan,
Supun Kamburugamuwe,
Thejaka Amila Kanewela,
Chathura Widanage,
Arup Sarker,
Mills Staylor,
Tianle Zhong,
Vibhatha Abeykoon,
Geoffrey Fox
Abstract:
The data engineering and data science community has embraced the idea of using Python & R dataframes for regular applications. Driven by the big data revolution and artificial intelligence, these applications are now essential in order to process terabytes of data. They can easily exceed the capabilities of a single machine, but also demand significant developer time & effort. Therefore it is essential to design scalable dataframe solutions. There have been multiple attempts to tackle this problem, the most notable being the dataframe systems developed using distributed computing environments such as Dask and Ray. Even though Dask/Ray distributed computing features look very promising, we perceive that the Dask Dataframes/Ray Datasets still have room for optimization. In this paper, we present CylonFlow, an alternative distributed dataframe execution methodology that enables state-of-the-art performance and scalability on the same Dask/Ray infrastructure (thereby supercharging them!). To achieve this, we integrate a high performance dataframe system Cylon, which was originally based on an entirely different execution paradigm, into Dask and Ray. Our experiments show that on a pipeline of dataframe operators, CylonFlow achieves 30x more distributed performance than Dask Dataframes. Interestingly, it also enables superior sequential performance due to the native C++ execution of Cylon. We believe the success of Cylon & CylonFlow extends beyond the data engineering domain, and can be used to consolidate high performance computing and distributed computing ecosystems.
Submitted 19 January, 2023;
originally announced January 2023.
-
Transition metal ion ensembles in crystals as solid-state coherent spin-photon interfaces: The case of nickel in magnesium oxide
Authors:
E. Poem,
S. Gupta,
I. Morris,
K. Klink,
L. Singh,
T. Zhong,
J. N. Becker,
O. Firstenberg
Abstract:
We present general guidelines for finding solid-state systems that could serve as coherent electron spin-photon interfaces even at relatively high temperatures, where phonons are abundant but cooling is easier, and show that transition metal ions in various crystals could comply with these guidelines. As an illustrative example, we focus on divalent nickel ions in magnesium oxide. We perform electron spin resonance spectroscopy and polarization-sensitive magneto-optical fluorescence spectroscopy of a dense ensemble of these ions and find that (i) the ground-state electron spin stays coherent at liquid-helium temperatures for several microseconds, and (ii) there exist energetically well-isolated excited states which can couple to two ground-state spin sub-levels via optical transitions of orthogonal polarizations. The latter implies that fast, coherent optical control over the electron spin is possible. We then propose schemes for optical initialization and control of the ground-state electron spin using polarized optical pulses, as well as two schemes for implementing a noise-free, broadband quantum-optical memory at near-telecom wavelengths in this material system.
Submitted 22 August, 2023; v1 submitted 30 December, 2022;
originally announced December 2022.
-
Hybrid Cloud and HPC Approach to High-Performance Dataframes
Authors:
Kaiying Shan,
Niranda Perera,
Damitha Lenadora,
Tianle Zhong,
Arup Sarker,
Supun Kamburugamuve,
Thejaka Amila Kanewela,
Chathura Widanage,
Geoffrey Fox
Abstract:
Data pre-processing is a fundamental component in any data-driven application. With the increasing complexity of data processing operations and volume of data, Cylon, a distributed dataframe system, is developed to facilitate data processing both as a standalone application and as a library, especially for Python applications. While Cylon shows promising performance results, we experienced difficulties trying to integrate with frameworks incompatible with the traditional Message Passing Interface (MPI). While MPI implementations encompass scalable and efficient communication routines, their process launching mechanisms work well with mainstream HPC systems but are incompatible with some environments that adopt their own resource management systems. In this work, we alleviated this issue by directly integrating the Unified Communication X (UCX) framework, which supports a variety of classic HPC and non-HPC process-bootstrapping mechanisms, as our communication framework. While we experimented with our methodology on Cylon, the same technique can be used to bring MPI communication to other applications that do not employ MPI's built-in process management approach.
Submitted 29 December, 2022; v1 submitted 28 December, 2022;
originally announced December 2022.
-
Simulation of Entanglement Generation between Absorptive Quantum Memories
Authors:
Allen Zang,
Alexander Kolar,
Joaquin Chung,
Martin Suchara,
Tian Zhong,
Rajkumar Kettimuthu
Abstract:
Quantum entanglement is an essential resource for quantum networks. However, the generation of entanglement between physical devices at remote network nodes is a challenging task towards practical implementation of quantum networks. In this work, we use the open-source Simulator of QUantum Network Communication (SeQUeNCe), developed by our team, to simulate entanglement generation between two atomic frequency comb (AFC) absorptive quantum memories to be deployed on the Argonne-Chicago quantum network. We realize the representation of photonic quantum states within truncated Fock spaces in SeQUeNCe and build models for a spontaneous parametric down-conversion (SPDC) source, AFC absorptive quantum memories, and measurement devices with non-number-resolving photon detectors. Based on these developments, we observe varying fidelity with SPDC source mean photon number, and varying entanglement generation rate with both mean photon number and memory mode number. We also simulate tomographic reconstruction of the effective density matrix for the bipartite photonic states retrieved from quantum memories. Our work extends the usability of the SeQUeNCe simulator with new hardware modules and Fock state representation that will improve the simulation of near term quantum network hardware and protocols.
Submitted 17 December, 2022;
originally announced December 2022.
-
Revisiting $D$-meson twist-2, 3 distribution amplitudes
Authors:
Tao Zhong,
Dong Huang,
Hai-Bing Fu
Abstract:
Due to the significant difference between the experimental measurements and the theoretical predictions of the Standard Model (SM) for the value of $\mathcal{R}(D)$ of the semileptonic decay $B\to D\ell\barν_{\ell}$, it has been speculated that this may be evidence of new physics beyond the SM. Usually, the $D$-meson twist-2, 3 distribution amplitudes (DAs) $φ_{2;D}(x,μ)$, $φ_{3;D}^p(x,μ)$ and $φ_{3;D}^σ(x,μ)$ are the main error sources when using perturbative QCD factorization and light-cone QCD sum rules to study $B\to D\ell\barν_{\ell}$. Therefore, it is important to get more reasonable and accurate behaviors for those DAs. Motivated by our previous work [Phys. Rev. D 104, no.1, 016021 (2021)] on the pionic leading-twist DA, we revisit the $D$-meson twist-2, 3 DAs $φ_{2;D}(x,μ)$, $φ_{3;D}^p(x,μ)$ and $φ_{3;D}^σ(x,μ)$. New sum rule formulae for the $ξ$-moments of these three DAs are suggested to obtain more accurate values. The light-cone harmonic oscillator models for those DAs are improved, and their model parameters are determined by fitting the values of the $ξ$-moments with the least squares method.
Submitted 6 April, 2023; v1 submitted 8 December, 2022;
originally announced December 2022.
-
Searching for $a_0(980)$-meson parton distribution function
Authors:
Zai-Hui Wu,
Hai-Bing Fu,
Tao Zhong,
Yu Chen,
Ya-Hong Dai
Abstract:
In this paper, we calculate the scalar $a_0(980)$-meson leading-twist wavefunction by using the light-cone harmonic oscillator (LCHO) model, whose parameters are determined by fitting the $ξ$-moments $\langleξ_{a_0}^n\rangle_ζ$ of its light-cone distribution amplitudes. Then, the $a_0(980)$-meson leading-twist light-cone distribution amplitudes at three different scales $ζ= (1.0, 2.0, 5.2)~{\rm GeV}$ are given. After constructing the relationship between the $a_0(980)$-meson leading-twist parton distribution functions/valence quark distribution function and its LCHO wavefunction, we exhibit $q^{a_0}(x,ζ)$ and $x q^{a_0}(x,ζ)$ at different scales. Furthermore, we also calculate the Mellin moments of the $a_0(980)$-meson's valence quark distribution function $\langle x^n q^{a_0}\rangle_ζ$ with $n = (1,2,3)$, i.e. $\langle x q^{a_0}\rangle_{ζ_5} = 0.026$, $\langle x^2 q^{a_0}\rangle_{ζ_5} = 0.017$ and $\langle x^3 q^{a_0}\rangle_{ζ_5} = 0.012$. Finally, the scale evolution of the ratio of the Mellin moments $x^n_{a_0}(ζ,ζ_k)$ is presented.
Submitted 7 December, 2022;
originally announced December 2022.
-
$K_0^\ast(1430)$ Twist-2 Distribution Amplitude and $B_s,D_s \to K_0^\ast(1430)$ Transition Form Factors
Authors:
Dong Huang,
Tao Zhong,
Hai-Bing Fu,
Zai-Hui Wu,
Xing-Gang Wu,
Hong Tong
Abstract:
Based on the scenario that the $K_0^\ast(1430)$ is viewed as the ground state of $s\bar{q}$ or $q\bar{s}$, we study the $K_0^\ast(1430)$ leading-twist distribution amplitude (DA) $φ_{2;K_0^\ast}(x,μ)$ with the QCD sum rules in the framework of background field theory. A more reasonable sum rule formula for the $ξ$-moments $\langleξ^n\rangle_{2;K_0^\ast}$ is suggested, which eliminates the influence of the fact that the sum rule of $\langleξ^0_p\rangle_{3;K_0^\ast}$ cannot be normalized in the whole Borel region. More accurate values of the first ten $ξ$-moments, $\langleξ^n\rangle_{2;K_0^\ast} (n = 1,2,\cdots,10)$, are evaluated. A new light-cone harmonic oscillator (LCHO) model for the $K_0^\ast(1430)$ leading-twist DA is established for the first time. By fitting the resulting values of $\langleξ^n\rangle_{2;K_0^\ast} (n = 1,2,\cdots,10)$ via the least squares method, the behavior of the $K_0^\ast(1430)$ leading-twist DA described with the LCHO model is determined. Further, by adopting the light-cone QCD sum rules, we calculate the $B_s,D_s \to K_0^\ast(1430)$ transition form factors and branching fractions of the semileptonic decays $B_s,D_s \to K_0^\ast(1430) \ell ν_\ell$. The corresponding numerical results can be used to extract the Cabibbo-Kobayashi-Maskawa matrix elements by combining them with the relevant experimental data in the future.
Submitted 1 August, 2023; v1 submitted 11 November, 2022;
originally announced November 2022.
-
$a_0(980)$-meson twist-2 distribution amplitude within the QCD sum rules and investigation of $D \to a_0(980) (\toηπ) e^+ν_e$
Authors:
Zai-Hui Wu,
Hai-Bing Fu,
Tao Zhong,
Dong Huang,
Dan-Dan Hu,
Xing-Gang Wu
Abstract:
In this paper, the moments of the $a_0(980)$-meson twist-2 light-cone distribution amplitudes are studied in detail using the QCD sum rules approach within background field theory. Up to 9th-order accuracy, we present $\langleξ_{2;a_0}^n\rangle|_{μ_0}$ at the initial scale $μ_0 = 1~{\rm GeV}$, i.e. $\langleξ^1_{2;a_0}\rangle|_{μ_0} = -0.307(43)$, $\langleξ^3_{2;a_0}\rangle|_{μ_0} = -0.181(34)$, $\langleξ^5_{2;a_0}\rangle|_{μ_0} = -0.078(28)$, $\langleξ^7_{2;a_0}\rangle|_{μ_0} = -0.049(26)$, and $\langleξ^9_{2;a_0}\rangle|_{μ_0} = -0.036(24)$. An improved light-cone harmonic oscillator model for the $a_0(980)$-meson twist-2 light-cone distribution amplitudes is adopted, whose parameters are fixed by the least squares method based on the $\langleξ_{2;a_0}^n\rangle|_{μ_0}$, and the goodness of fit reaches $95.4\%$. Then, we calculate the $D\to a_0(980)$ transition form factors within the light-cone sum rules approach, and at the largest recoil point we obtain $f_+^{D\to a_0}(0) = 1.058^{+0.068}_{-0.035}$ and $f_-^{D\to a_0}(0) = 0.764^{+0.044}_{-0.036}$. As a further application, the branching fractions of the $D\to a_0(980)\ell\barν_\ell$ semileptonic decays are given. Taking the decay $a_0(980)\to ηπ$ into consideration, we obtain ${\cal B}(D^0 \to a_0(980)^- (\to ηπ^-) e^+ν_e) =(1.330^{+0.216}_{-0.134})\times10^{-4}$ and ${\cal B}(D^+\to a_0(980)^0(\to ηπ^0)e^+ν_e)=(1.675^{+0.272}_{-0.169})\times10^{-4}$, which are consistent with the BESIII collaboration and PDG data within errors. Finally, we present the angular observables of the semileptonic decay $D\to a_0(980)\ell\barν_\ell$: the forward-backward asymmetries, the $q^2$-differential flat terms, and the lepton polarization asymmetry.
Submitted 2 May, 2023; v1 submitted 10 November, 2022;
originally announced November 2022.
-
Meta-DMoE: Adapting to Domain Shift by Meta-Distillation from Mixture-of-Experts
Authors:
Tao Zhong,
Zhixiang Chi,
Li Gu,
Yang Wang,
Yuanhao Yu,
** Tang
Abstract:
In this paper, we tackle the problem of domain shift. Most existing methods perform training on multiple source domains using a single model, and the same trained model is used on all unseen target domains. Such solutions are sub-optimal because each target domain exhibits its own specialty, to which the model is not adapted. Furthermore, expecting single-model training to learn extensive knowledge from multiple source domains is counterintuitive. The model is biased toward learning only domain-invariant features, which may result in negative knowledge transfer. In this work, we propose a novel framework for unsupervised test-time adaptation, which is formulated as a knowledge distillation process to address domain shift. Specifically, we incorporate Mixture-of-Experts (MoE) as teachers, where each expert is trained separately on a different source domain to maximize its specialty. Given a test-time target domain, a small set of unlabeled data is sampled to query the knowledge from the MoE. As the source domains are correlated with the target domains, a transformer-based aggregator then combines the domain knowledge by examining the interconnection among them. The output is treated as a supervision signal to adapt a student prediction network toward the target domain. We further employ meta-learning to force the aggregator to distill positive knowledge and the student network to achieve fast adaptation. Extensive experiments demonstrate that the proposed method outperforms the state-of-the-art and validate the effectiveness of each proposed component. Our code is available at https://github.com/n3il666/Meta-DMoE.
Submitted 11 January, 2023; v1 submitted 7 October, 2022;
originally announced October 2022.
-
Constraint of $ξ$-moments calculated with QCD sum rules on the pion distribution amplitude models
Authors:
Tao Zhong,
Zhi-Hao Zhu,
Hai-Bing Fu
Abstract:
So far, the behavior of the pionic leading-twist distribution amplitude (DA) $φ_{2;π}(x,μ)$ $-$ which is a universal physical quantity and enters the high-energy processes involving the pion based on the factorization theorem $-$ has not been consistently determined. The form of $φ_{2;π}(x,μ)$ is usually described by phenomenological models and constrained by the experimental data of the exclusive processes containing the pion or by the moments calculated with the QCD sum rules and lattice QCD theory. An appropriate model is therefore very important for determining the exact behavior of $φ_{2;π}(x,μ)$. In this paper, by adopting the least squares method to fit the $ξ$-moments calculated with QCD sum rules based on the background field theory, we analyze several models of the pionic leading-twist DA commonly used in the literature, such as the truncated form of the Gegenbauer polynomial series, the light-cone harmonic oscillator model, the form from the Dyson-Schwinger equations, the model from the light-front holographic AdS/QCD, and a simple power-law parametrization form.
Submitted 6 September, 2022;
originally announced September 2022.