-
Study of excited $Ξ$ states in $ψ(3686)\rightarrow{}K^{-}Λ\overlineΞ^{+}+c.c.$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
J. Bloms,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (587 additional authors not shown)
Abstract:
Based on a sample of $(448.1\pm2.9)\times10^{6}$ $ψ(3686)$ events collected with the BESIII detector at BEPCII, the decays of $ψ(3686)\to{}K^{-}Λ\overlineΞ^{+} + c.c.$ with $\overlineΞ^+ \to \overlineΛ π^+$, $\overlineΛ\to \overline{p} π^+$ are studied.Two excited hyperons, $Ξ(1690)^-$ and $Ξ(1820)^-$, are observed with large significance ($ \gg 10 σ$) in the $K^{-}Λ$ invariant mass distributions.…
▽ More
Based on a sample of $(448.1\pm2.9)\times10^{6}$ $ψ(3686)$ events collected with the BESIII detector at BEPCII, the decays of $ψ(3686)\to{}K^{-}Λ\overlineΞ^{+} + c.c.$ with $\overlineΞ^+ \to \overlineΛ π^+$, $\overlineΛ\to \overline{p} π^+$ are studied.Two excited hyperons, $Ξ(1690)^-$ and $Ξ(1820)^-$, are observed with large significance ($ \gg 10 σ$) in the $K^{-}Λ$ invariant mass distributions. A partial wave analysis is performed, and the spin-parities of $Ξ(1690)^-$ and $Ξ(1820)^-$ are determined to be $\frac{1}{2}^{-}$ and $\frac{3}{2}^{-}$, respectively. The masses, widths, and product branching fractions of $Ξ(1690)^-$ and $Ξ(1820)^-$ are also measured.
△ Less
Submitted 28 April, 2024; v1 submitted 29 August, 2023;
originally announced August 2023.
-
LAMBO: Large Language Model Empowered Edge Intelligence
Authors:
Li Dong,
Feibo Jiang,
Yubo Peng,
Kezhi Wang,
Kun Yang,
Cunhua Pan,
Robert Schober
Abstract:
Next-generation edge intelligence is anticipated to bring huge benefits to various applications, e.g., offloading systems. However, traditional deep offloading architectures face several issues, including heterogeneous constraints, partial perception, uncertain generalization, and lack of tractability. In this context, the integration of offloading with large language models (LLMs) presents numero…
▽ More
Next-generation edge intelligence is anticipated to bring huge benefits to various applications, e.g., offloading systems. However, traditional deep offloading architectures face several issues, including heterogeneous constraints, partial perception, uncertain generalization, and lack of tractability. In this context, the integration of offloading with large language models (LLMs) presents numerous advantages. Therefore, we propose an LLM-Based Offloading (LAMBO) framework for mobile edge computing (MEC), which comprises four components: (i) Input embedding (IE), which is used to represent the information of the offloading system with constraints and prompts through learnable vectors with high quality; (ii) Asymmetric encoderdecoder (AED) model, which is a decision-making module with a deep encoder and a shallow decoder. It can achieve high performance based on multi-head self-attention schemes; (iii) Actor-critic reinforcement learning (ACRL) module, which is employed to pre-train the whole AED for different optimization tasks under corresponding prompts; and (iv) Active learning from expert feedback (ALEF), which can be used to finetune the decoder part of the AED while adapting to dynamic environmental changes. Our simulation results corroborate the advantages of the proposed LAMBO framework.
△ Less
Submitted 29 August, 2023;
originally announced August 2023.
-
Generations of Knowledge Graphs: The Crazy Ideas and the Business Impact
Authors:
Xin Luna Dong
Abstract:
Knowledge Graphs (KGs) have been used to support a wide range of applications, from web search to personal assistant. In this paper, we describe three generations of knowledge graphs: entity-based KGs, which have been supporting general search and question answering (e.g., at Google and Bing); text-rich KGs, which have been supporting search and recommendations for products, bio-informatics, etc.…
▽ More
Knowledge Graphs (KGs) have been used to support a wide range of applications, from web search to personal assistant. In this paper, we describe three generations of knowledge graphs: entity-based KGs, which have been supporting general search and question answering (e.g., at Google and Bing); text-rich KGs, which have been supporting search and recommendations for products, bio-informatics, etc. (e.g., at Amazon and Alibaba); and the emerging integration of KGs and LLMs, which we call dual neural KGs. We describe the characteristics of each generation of KGs, the crazy ideas behind the scenes in constructing such KGs, and the techniques developed over time to enable industry impact. In addition, we use KGs as examples to demonstrate a recipe to evolve research ideas from innovations to production practice, and then to the next level of innovations, to advance both science and business.
△ Less
Submitted 27 August, 2023;
originally announced August 2023.
-
Search for the light hadron decay $χ_{c1}(3872) \to π^{+}π^{-}η$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (600 additional authors not shown)
Abstract:
With a data sample corresponding to an integrated luminosity of 11.5~fb$^{-1}$
collected with the BESIII detector operating at the BEPCII storage ring, for the first time the light hadron decay $χ_{c1}(3872) \rightarrow π^{+}π^{-}η$
is searched for. While no significant signal is observed, the upper limits at the 90\% confidence level for…
▽ More
With a data sample corresponding to an integrated luminosity of 11.5~fb$^{-1}$
collected with the BESIII detector operating at the BEPCII storage ring, for the first time the light hadron decay $χ_{c1}(3872) \rightarrow π^{+}π^{-}η$
is searched for. While no significant signal is observed, the upper limits at the 90\% confidence level for
$σ[e^{+}e^{-} \rightarrow γχ_{c1}(3872)] \mathcal{B}[χ_{c1}(3872) \rightarrow π^{+}π^{-}η]$ at center-of-mass energies from 4.13 to 4.34 GeV are determined.
By normalizing to the $χ_{c1}(3872)\toπ^+π^- J/ψ$ decay channel, a 90\% confidence level upper limit for the branching fraction ratio
$\mathcal{R}=\mathcal{B}[χ_{c1}(3872) \rightarrowπ^{+}π^{-}η]/\mathcal{B}[χ_{c1}(3872) \rightarrow π^{+}π^{-} J/ψ] < 0.12$ is given.
These measurements provide important inputs for understanding the internal structure of the $χ_{c1}(3872)$ resonance.
△ Less
Submitted 19 January, 2024; v1 submitted 26 August, 2023;
originally announced August 2023.
-
Improved measurement of the branching fractions for $J/ψ\toγπ^0$, $γη$ and $γη^\prime$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (598 additional authors not shown)
Abstract:
Using a data sample of $(1.0087\pm 0.0044)\times 10^{10}$ $J/ψ$ events collected with the BESIII detector, the decays of $J/ψ\toγπ^{0} (η, η^\prime)\toγγγ$ are studied. Newly measured branching fractions are $\mathcal{B}$$(J/ψ\toγπ^{0})$=$(3.34\pm 0.02\pm 0.09)\times 10^{-5}$, $\mathcal{B}$$(J/ψ\toγη)$=$(1.096\pm 0.001\pm0.019)\times 10^{-3}$ and $\mathcal{B}$$(J/ψ\toγη^\prime)$=…
▽ More
Using a data sample of $(1.0087\pm 0.0044)\times 10^{10}$ $J/ψ$ events collected with the BESIII detector, the decays of $J/ψ\toγπ^{0} (η, η^\prime)\toγγγ$ are studied. Newly measured branching fractions are $\mathcal{B}$$(J/ψ\toγπ^{0})$=$(3.34\pm 0.02\pm 0.09)\times 10^{-5}$, $\mathcal{B}$$(J/ψ\toγη)$=$(1.096\pm 0.001\pm0.019)\times 10^{-3}$ and $\mathcal{B}$$(J/ψ\toγη^\prime)$=$(5.40\pm 0.01\pm0.11)\times 10^{-3}$, where the first uncertainties are statistical and the second are systematic. These results are consistent with the world average values within two standard deviations. The ratio of partial widths $Γ(J/ψ\toγη^\prime)/Γ(J/ψ\toγη)$ is measured to be $4.93 \pm 0.13$. The singlet-octet pseudoscalar mixing angle $θ_P$ is determined to be $θ_P = -(22.11 \pm0.26)^\circ$ or $-(19.34 \pm 0.34)^\circ$ with two different phenomenological models.
△ Less
Submitted 25 August, 2023;
originally announced August 2023.
-
Stable higher-charge vortex solitons in the cubic-quintic medium with a ring potential
Authors:
Liangwei Dong,
Ming**g Fan,
Boris A. Malomed
Abstract:
We put forward a model for trap** stable optical vortex solitons (VSs) with high topological charges $m$. The cubic-quintic nonlinear medium with an imprinted ring-shaped modulation of the refractive index is shown to support two branches of VSs, which are controlled by the radius, width and depth of the modulation profile. While the lower-branch VSs are unstable in their nearly whole existence…
▽ More
We put forward a model for trap** stable optical vortex solitons (VSs) with high topological charges $m$. The cubic-quintic nonlinear medium with an imprinted ring-shaped modulation of the refractive index is shown to support two branches of VSs, which are controlled by the radius, width and depth of the modulation profile. While the lower-branch VSs are unstable in their nearly whole existence domain, the upper branch is completely stable. Vortex solitons with $m\leq 12$ obey the anti-Vakhitov-Kolokolov stability criterion. The results suggest possibilities for the creation of stable narrow optical VSs with a low power, carrying higher vorticities.
△ Less
Submitted 22 August, 2023;
originally announced August 2023.
-
Instruction Tuning for Large Language Models: A Survey
Authors:
Shengyu Zhang,
Linfeng Dong,
Xiaoya Li,
Sen Zhang,
Xiaofei Sun,
Shuhe Wang,
Jiwei Li,
Runyi Hu,
Tianwei Zhang,
Fei Wu,
Guoyin Wang
Abstract:
This paper surveys research works in the quickly advancing field of instruction tuning (IT), a crucial technique to enhance the capabilities and controllability of large language models (LLMs). Instruction tuning refers to the process of further training LLMs on a dataset consisting of \textsc{(instruction, output)} pairs in a supervised fashion, which bridges the gap between the next-word predict…
▽ More
This paper surveys research works in the quickly advancing field of instruction tuning (IT), a crucial technique to enhance the capabilities and controllability of large language models (LLMs). Instruction tuning refers to the process of further training LLMs on a dataset consisting of \textsc{(instruction, output)} pairs in a supervised fashion, which bridges the gap between the next-word prediction objective of LLMs and the users' objective of having LLMs adhere to human instructions. In this work, we make a systematic review of the literature, including the general methodology of IT, the construction of IT datasets, the training of IT models, and applications to different modalities, domains and applications, along with an analysis on aspects that influence the outcome of IT (e.g., generation of instruction outputs, size of the instruction dataset, etc). We also review the potential pitfalls of IT along with criticism against it, along with efforts pointing out current deficiencies of existing strategies and suggest some avenues for fruitful research. Project page: github.com/xiaoya-li/Instruction-Tuning-Survey
△ Less
Submitted 13 March, 2024; v1 submitted 21 August, 2023;
originally announced August 2023.
-
Head-to-Tail: How Knowledgeable are Large Language Models (LLMs)? A.K.A. Will LLMs Replace Knowledge Graphs?
Authors:
Kai Sun,
Yifan Ethan Xu,
Hanwen Zha,
Yue Liu,
Xin Luna Dong
Abstract:
Since the recent prosperity of Large Language Models (LLMs), there have been interleaved discussions regarding how to reduce hallucinations from LLM responses, how to increase the factuality of LLMs, and whether Knowledge Graphs (KGs), which store the world knowledge in a symbolic form, will be replaced with LLMs. In this paper, we try to answer these questions from a new angle: How knowledgeable…
▽ More
Since the recent prosperity of Large Language Models (LLMs), there have been interleaved discussions regarding how to reduce hallucinations from LLM responses, how to increase the factuality of LLMs, and whether Knowledge Graphs (KGs), which store the world knowledge in a symbolic form, will be replaced with LLMs. In this paper, we try to answer these questions from a new angle: How knowledgeable are LLMs?
To answer this question, we constructed Head-to-Tail, a benchmark that consists of 18K question-answer (QA) pairs regarding head, torso, and tail facts in terms of popularity. We designed an automated evaluation method and a set of metrics that closely approximate the knowledge an LLM confidently internalizes. Through a comprehensive evaluation of 16 publicly available LLMs, we show that existing LLMs are still far from being perfect in terms of their grasp of factual knowledge, especially for facts of torso-to-tail entities.
△ Less
Submitted 2 April, 2024; v1 submitted 20 August, 2023;
originally announced August 2023.
-
Language-guided Human Motion Synthesis with Atomic Actions
Authors:
Yuanhao Zhai,
Mingzhen Huang,
Tianyu Luan,
Lu Dong,
Ifeoma Nwogu,
Siwei Lyu,
David Doermann,
Junsong Yuan
Abstract:
Language-guided human motion synthesis has been a challenging task due to the inherent complexity and diversity of human behaviors. Previous methods face limitations in generalization to novel actions, often resulting in unrealistic or incoherent motion sequences. In this paper, we propose ATOM (ATomic mOtion Modeling) to mitigate this problem, by decomposing actions into atomic actions, and emplo…
▽ More
Language-guided human motion synthesis has been a challenging task due to the inherent complexity and diversity of human behaviors. Previous methods face limitations in generalization to novel actions, often resulting in unrealistic or incoherent motion sequences. In this paper, we propose ATOM (ATomic mOtion Modeling) to mitigate this problem, by decomposing actions into atomic actions, and employing a curriculum learning strategy to learn atomic action composition. First, we disentangle complex human motions into a set of atomic actions during learning, and then assemble novel actions using the learned atomic actions, which offers better adaptability to new actions. Moreover, we introduce a curriculum learning training strategy that leverages masked motion modeling with a gradual increase in the mask ratio, and thus facilitates atomic action assembly. This approach mitigates the overfitting problem commonly encountered in previous methods while enforcing the model to learn better motion representations. We demonstrate the effectiveness of ATOM through extensive experiments, including text-to-motion and action-to-motion synthesis tasks. We further illustrate its superiority in synthesizing plausible and coherent text-guided human motion sequences.
△ Less
Submitted 18 August, 2023;
originally announced August 2023.
-
Study of $e^+e^-\toηφ$ at center-of-mass energies from 3.773 to 4.600 GeV
Authors:
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
J. Bloms,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (600 additional authors not shown)
Abstract:
We present a study of the process $e^{+}e^{-}\toηφ$ using data samples collected with the BESIII detector corresponding to an integrated luminosity of 15.03 fb$^{-1}$ at 23 center-of-mass energies from 3.773 to 4.600 GeV. The Born cross sections are measured at each energy and a coherent fit to cross-section lineshape is performed using a Breit-Wigner parametrization to search for charmonium-like…
▽ More
We present a study of the process $e^{+}e^{-}\toηφ$ using data samples collected with the BESIII detector corresponding to an integrated luminosity of 15.03 fb$^{-1}$ at 23 center-of-mass energies from 3.773 to 4.600 GeV. The Born cross sections are measured at each energy and a coherent fit to cross-section lineshape is performed using a Breit-Wigner parametrization to search for charmonium-like vector states. No significant signals of the $Y(4230)$ and $Y(4360)$ resonances are observed.
△ Less
Submitted 24 October, 2023; v1 submitted 16 August, 2023;
originally announced August 2023.
-
Search for the lepton number violation decay $φ\to π^+ π^+ e^- e^-$ via $J/ψ\to φη$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
J. Bloms,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (584 additional authors not shown)
Abstract:
Using $(1.0087\pm0.0044)\times10^{10}$ $J/ψ$ events collected by the BESIII detector at the BEPCII collider, we search for the lepton number violation decay $φ\to π^+ π^+ e^- e^-$ via $J/ψ\to φη$. No signal is found and the upper limit on the branching fraction of $φ\to π^+ π^+ e^- e^-$ is set to be $9.7\times10^{-6}$ at the 90\% confidence level.
Using $(1.0087\pm0.0044)\times10^{10}$ $J/ψ$ events collected by the BESIII detector at the BEPCII collider, we search for the lepton number violation decay $φ\to π^+ π^+ e^- e^-$ via $J/ψ\to φη$. No signal is found and the upper limit on the branching fraction of $φ\to π^+ π^+ e^- e^-$ is set to be $9.7\times10^{-6}$ at the 90\% confidence level.
△ Less
Submitted 10 August, 2023;
originally announced August 2023.
-
Breaking Speaker Recognition with PaddingBack
Authors:
Zhe Ye,
Diqun Yan,
Li Dong,
Kailai Shen
Abstract:
Machine Learning as a Service (MLaaS) has gained popularity due to advancements in Deep Neural Networks (DNNs). However, untrusted third-party platforms have raised concerns about AI security, particularly in backdoor attacks. Recent research has shown that speech backdoors can utilize transformations as triggers, similar to image backdoors. However, human ears can easily be aware of these transfo…
▽ More
Machine Learning as a Service (MLaaS) has gained popularity due to advancements in Deep Neural Networks (DNNs). However, untrusted third-party platforms have raised concerns about AI security, particularly in backdoor attacks. Recent research has shown that speech backdoors can utilize transformations as triggers, similar to image backdoors. However, human ears can easily be aware of these transformations, leading to suspicion. In this paper, we propose PaddingBack, an inaudible backdoor attack that utilizes malicious operations to generate poisoned samples, rendering them indistinguishable from clean ones. Instead of using external perturbations as triggers, we exploit the widely-used speech signal operation, padding, to break speaker recognition systems. Experimental results demonstrate the effectiveness of our method, achieving a significant attack success rate while retaining benign accuracy. Furthermore, PaddingBack demonstrates the ability to resist defense methods and maintain its stealthiness against human perception.
△ Less
Submitted 11 March, 2024; v1 submitted 8 August, 2023;
originally announced August 2023.
-
Measurement of the $e^+e^- \to Λ\barΣ^0 + c.c.$ cross sections at $\sqrt{s}$ from 2.3094 to 3.0800 GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
J. Bloms,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (601 additional authors not shown)
Abstract:
The Born cross sections and effective form factors of the process $e^+e^-\toΛ\barΣ^0 + c.c.$ are measured at 14 center-of-mass energy points from 2.3094 to 3.0800 GeV, based on data corresponding to an integrated luminosity of $(478.5 \pm 4.8)\ \text{pb}^{-1}$ collected with the BESIII detector. A non-zero Born cross section is observed at the center-of-mass energy of 2.3094 GeV with a statistical…
▽ More
The Born cross sections and effective form factors of the process $e^+e^-\toΛ\barΣ^0 + c.c.$ are measured at 14 center-of-mass energy points from 2.3094 to 3.0800 GeV, based on data corresponding to an integrated luminosity of $(478.5 \pm 4.8)\ \text{pb}^{-1}$ collected with the BESIII detector. A non-zero Born cross section is observed at the center-of-mass energy of 2.3094 GeV with a statistical significance of more than five standard deviations, and the cross sections at other energies are obtained with improved precision compared to earlier measurements from the BaBar Collaboration. The Born cross-section lineshape is described better by a shape with a plateau near the threshold than by a pQCD motivated functional form.
△ Less
Submitted 7 August, 2023;
originally announced August 2023.
-
Universal Defensive Underpainting Patch: Making Your Text Invisible to Optical Character Recognition
Authors:
JiaCheng Deng,
Li Dong,
Jiahao Chen,
Diqun Yan,
Rangding Wang,
Dengpan Ye,
Lingchen Zhao,
**yu Tian
Abstract:
Optical Character Recognition (OCR) enables automatic text extraction from scanned or digitized text images, but it also makes it easy to pirate valuable or sensitive text from these images. Previous methods to prevent OCR piracy by distorting characters in text images are impractical in real-world scenarios, as pirates can capture arbitrary portions of the text images, rendering the defenses inef…
▽ More
Optical Character Recognition (OCR) enables automatic text extraction from scanned or digitized text images, but it also makes it easy to pirate valuable or sensitive text from these images. Previous methods to prevent OCR piracy by distorting characters in text images are impractical in real-world scenarios, as pirates can capture arbitrary portions of the text images, rendering the defenses ineffective. In this work, we propose a novel and effective defense mechanism termed the Universal Defensive Underpainting Patch (UDUP) that modifies the underpainting of text images instead of the characters. UDUP is created through an iterative optimization process to craft a small, fixed-size defensive patch that can generate non-overlap** underpainting for text images of any size. Experimental results show that UDUP effectively defends against unauthorized OCR under the setting of any screenshot range or complex image background. It is agnostic to the content, size, colors, and languages of characters, and is robust to typical image operations such as scaling and compressing. In addition, the transferability of UDUP is demonstrated by evading several off-the-shelf OCRs. The code is available at https://github.com/QRICKDD/UDUP.
△ Less
Submitted 4 August, 2023;
originally announced August 2023.
-
Determination of the $Σ^{+}$ Timelike Electromagnetic Form Factors
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (604 additional authors not shown)
Abstract:
Based on data samples collected with the BESIII detector at the BEPCII collider, the process $e^{+}e^{-} \to Σ^{+}\barΣ^{-}$ is studied at center-of-mass energies $\sqrt{s}$ = 2.3960, 2.6454, and 2.9000 GeV. Using a fully differential angular description of the final state particles, both the relative magnitude and phase information of the $Σ^{+}$ electromagnetic form factors in the timelike regio…
▽ More
Based on data samples collected with the BESIII detector at the BEPCII collider, the process $e^{+}e^{-} \to Σ^{+}\barΣ^{-}$ is studied at center-of-mass energies $\sqrt{s}$ = 2.3960, 2.6454, and 2.9000 GeV. Using a fully differential angular description of the final state particles, both the relative magnitude and phase information of the $Σ^{+}$ electromagnetic form factors in the timelike region are extracted. The relative phase between the electric and magnetic form factors is determined to be $\sinΔΦ$ = -0.67~$\pm$~0.29~(stat)~$\pm$~0.18~(syst) at $\sqrt{s}$ = 2.3960 GeV, $ΔΦ$ = 55$^{\circ}$~$\pm$~19$^{\circ}$~(stat) $\pm$~14$^{\circ}$~(syst) at $\sqrt{s}$ = 2.6454 GeV, and 78$^{\circ}$~$\pm$~22$^{\circ}$~(stat) $\pm$~9$^{\circ}$~(syst) at $\sqrt{s}$ = 2.9000 GeV. For the first time, the phase of the hyperon electromagnetic form factors is explored in a wide range of four-momentum transfer. The evolution of the phase along with four-momentum transfer is an important input for understanding its asymptotic behavior and the dynamics of baryons.
△ Less
Submitted 5 March, 2024; v1 submitted 29 July, 2023;
originally announced July 2023.
-
Observation of the decay $J/ψ\to e^+ e^- η(1405)$ with $η(1405) \to π^0 f_0(980)$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (601 additional authors not shown)
Abstract:
Using a data sample of $(10087\pm44)\times 10^6$ $J/ψ$ events collected by the BESIII detector in 2009, 2012, 2018 and 2019, the electromagnetic Dalitz process $J/ψ\to e^+ e^- η(1405)$ is observed via the decay $η(1405) \to π^0 f_0(980)$, $f_0(980) \to π^+ π^-$, with a significance of about $9.6σ$. The branching fraction of this decay is measured to be…
▽ More
Using a data sample of $(10087\pm44)\times 10^6$ $J/ψ$ events collected by the BESIII detector in 2009, 2012, 2018 and 2019, the electromagnetic Dalitz process $J/ψ\to e^+ e^- η(1405)$ is observed via the decay $η(1405) \to π^0 f_0(980)$, $f_0(980) \to π^+ π^-$, with a significance of about $9.6σ$. The branching fraction of this decay is measured to be ${\mathcal B}(J/ψ\to e^+ e^- π^0 η(1405) \to e^+ e^- π^0 f_0(980) \to e^+ e^- π^0 π^+ π^-)=(2.02\pm0.24(\rm{stat.})\pm0.09(\rm{syst.}))\times 10^{-7}$. The branching-fraction ratio ${\mathcal B}(J/ψ\to e^+ e^- η(1405))$/${\mathcal B}(J/ψ\to γη(1405))$ is determined to be $(1.35\pm0.19(\rm{stat.})\pm0.06(\rm{syst.}))\times10^{-2}$. Furthermore, an $e^+e^-$ invariant-mass dependent transition form factor of $J/ψ\to e^+ e^-η(1405)$ is presented for the first time. The obtained result provides input for different theoretical models, and is valuable for the improved understanding the intrinsic structure of the $η(1405)$ meson.
△ Less
Submitted 27 July, 2023;
originally announced July 2023.
-
Improved measurement of the branching fraction of $D_s^+\toμ^+ν_μ$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
J. Bloms,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (598 additional authors not shown)
Abstract:
Using $e^+e^-$ collision data with an integrated luminosity of $7.33~\mathrm{fb}^{-1}$ collected at center-of-mass energies between 4.128 and 4.226 GeV with the BESIII detector operating at the BEPCII collider, the branching fraction of the leptonic decay $D_s^+\toμ^+ν_μ$ is measured to be $(0.5294\pm0.0108_{\rm stat}\pm0.0085_{\rm syst})$\%. Based on this, the product of the $D_s^+$ decay constan…
▽ More
Using $e^+e^-$ collision data with an integrated luminosity of $7.33~\mathrm{fb}^{-1}$ collected at center-of-mass energies between 4.128 and 4.226 GeV with the BESIII detector operating at the BEPCII collider, the branching fraction of the leptonic decay $D_s^+\toμ^+ν_μ$ is measured to be $(0.5294\pm0.0108_{\rm stat}\pm0.0085_{\rm syst})$\%. Based on this, the product of the $D_s^+$ decay constant $f_{D_s^+}$ and the magnitude of the $c\to s$ quark mixing matrix element $|V_{cs}|$ is determined to be $f_{D_s^+}|V_{cs}|=241.8\pm2.5_{\rm stat}\pm2.2_{\rm syst}~\mathrm{MeV}$. Using the value of $|V_{cs}|$ given by the global standard model fit, $f_{D_s^+}$ is found to be $248.4\pm2.5_{\rm stat}\pm2.2_{\rm syst}$\,MeV. Alternatively, using the value of $f_{D_s^+}$ from a recent lattice quantum chromodynamics calculation, $|V_{cs}|$ is determined to be $0.968\pm0.010_{\rm stat}\pm0.009_{\rm syst}$.
△ Less
Submitted 26 July, 2023;
originally announced July 2023.
-
Observation of $D^+_s\to η^\prime μ^+ν_μ$, Precision Test of Lepton Flavor Universality with $D^+_s\to η^{(\prime)} \ell^+ν_\ell$, and First Measurements of $D^+_s\to η^{(\prime)}μ^+ν_μ$ Decay Dynamics
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
J. Bloms,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (584 additional authors not shown)
Abstract:
By analyzing 7.33 fb$^{-1}$ of $e^+e^-$ annihilation data collected at center-of-mass energies between 4.128 and 4.226 GeV with the BESIII detector, we report the observation of the semileptonic decay $D^+_s\to η^\prime μ^+ν_μ$, with a statistical significance larger than 10$σ$, and the measurements of the $D_s^+ \to ημ^+ν_μ$ and $D_s^+ \to η^\primeμ^+ν_μ$ decay dynamics for the first time. The br…
▽ More
By analyzing 7.33 fb$^{-1}$ of $e^+e^-$ annihilation data collected at center-of-mass energies between 4.128 and 4.226 GeV with the BESIII detector, we report the observation of the semileptonic decay $D^+_s\to η^\prime μ^+ν_μ$, with a statistical significance larger than 10$σ$, and the measurements of the $D_s^+ \to ημ^+ν_μ$ and $D_s^+ \to η^\primeμ^+ν_μ$ decay dynamics for the first time. The branching fractions of $D_s^+ \to ημ^+ν_μ$ and $D_s^+ \to η^\primeμ^+ν_μ$ are determined to be $(2.235\pm0.051_{\rm stat}\pm0.052_{\rm syst})\%$ and $(0.801\pm0.055_{\rm stat}\pm0.028_{\rm syst})\%$, respectively, with precision improved by factors of 6.0 and 6.6 compared to the previous best measurements. Combined with the results for the decays $D_s^+ \to ηe^+ν_e$ and $D_s^+ \to η^\prime e^+ν_e$, the ratios of the decay widths are examined both inclusively and in several $\ell^+ν_\ell$ four-momentum transfer ranges. No evidence for lepton flavor universality violation is found within the current statistics. The products of the hadronic form factors $f_{+,0}^{η^{(\prime)}}(0)$ and the $c\to s$ Cabibbo-Kobayashi-Maskawa matrix element $|V_{cs}|$ are determined. The results based on the two-parameter series expansion are $f^η_{+,0}(0)|V_{cs}| = 0.452\pm0.010_{\rm stat}\pm0.007_{\rm syst}$ and $f^{η^{\prime}}_{+,0}(0)|V_{cs}| = 0.504\pm0.037_{\rm stat}\pm0.012_{\rm syst}$, which help to constrain present models on $f_{+,0}^{η^{(\prime)}}(0)$. The forward-backward asymmetries are determined to be $\langle A_{\rm FB}^η\rangle=-0.059\pm0.031_{\rm stat}\pm0.005_{\rm syst}$ and $\langle A_{\rm FB}^{η^\prime}\rangle=-0.064\pm0.079_{\rm stat}\pm0.006_{\rm syst}$ for the first time, which are consistent with the theoretical calculation.
△ Less
Submitted 28 February, 2024; v1 submitted 24 July, 2023;
originally announced July 2023.
-
Measurement of $e^{+}e^{-}\toφη'$ cross sections at center-of-mass energies from 3.508 to 4.951 GeV and search for the decay $ψ(3770)\toφη'$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (600 additional authors not shown)
Abstract:
The cross sections of the $e^{+}e^{-}\toφη'$ process at center-of-mass energies from 3.508 to 4.951 GeV are measured with high precision using 26.1 fb$^{-1}$ data collected with the BESIII detector operating at the BEPCII storage ring. The cross sections are of the order of a few picobarn, and decrease as the center-of-mass energy increases as $s^{-n/2}$ with $n=4.35\pm 0.14$. This result is in ag…
▽ More
The cross sections of the $e^{+}e^{-}\toφη'$ process at center-of-mass energies from 3.508 to 4.951 GeV are measured with high precision using 26.1 fb$^{-1}$ data collected with the BESIII detector operating at the BEPCII storage ring. The cross sections are of the order of a few picobarn, and decrease as the center-of-mass energy increases as $s^{-n/2}$ with $n=4.35\pm 0.14$. This result is in agreement with the Nambu-Jona-Lasinio model prediction of $n=3.5\pm 0.9$. In addition, the charmless decay $ψ(3770)\toφη'$ is searched for by fitting the measured cross sections, yet no significant signal is observed. The upper limit of ${\cal B}(ψ(3770)\toφη')$ at the 90\% confidence level is determined to be $2.3\times 10^{-5}$.
△ Less
Submitted 11 September, 2023; v1 submitted 24 July, 2023;
originally announced July 2023.
-
First Observation of a Three-Resonance Structure in $e^+e^-\rightarrow$Nonopen Charm Hadrons
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (608 additional authors not shown)
Abstract:
We report the measurement of the inclusive cross sections for $e^+e^-$$\rightarrow$nOCH (where nOCH denotes non-open charm hadrons) with improved precision at center-of-mass (c.m.) energies from 3.645 to 3.871 GeV. We observe three resonances: $\mathcal R(3760)$, $\mathcal R(3780)$, and $\mathcal R(3810)$ with significances of $8.1σ$, $13.7σ$, and $8.8σ$, respectively. The $\mathcal R(3810)$ state…
▽ More
We report the measurement of the inclusive cross sections for $e^+e^-$$\rightarrow$nOCH (where nOCH denotes non-open charm hadrons) with improved precision at center-of-mass (c.m.) energies from 3.645 to 3.871 GeV. We observe three resonances: $\mathcal R(3760)$, $\mathcal R(3780)$, and $\mathcal R(3810)$ with significances of $8.1σ$, $13.7σ$, and $8.8σ$, respectively. The $\mathcal R(3810)$ state is observed for the first time, while the $\mathcal R(3760)$ and $\mathcal R(3780)$ states are observed for the first time in the nOCH cross sections. Two sets of resonance parameters describe the energy-dependent line shape of the cross sections well. In set I [set II], the $\mathcal R(3810)$ state has mass $(3805.7 \pm 1.1 \pm 2.7)$ [$(3805.7 \pm 1.1 \pm 2.7)$] MeV/$c^2$, total width $(11.6 \pm 2.9 \pm 1.9)$ [$(11.5 \pm 2.8 \pm 1.9)$] MeV, and an electronic width multiplied by the nOCH decay branching fraction of $(10.9\pm 3.8\pm 2.5)$ [$(11.0\pm 3.4\pm 2.5)$] eV. In addition, we measure the branching fractions ${\mathcal B}[{\mathcal R}(3760)$$\rightarrow$nOCH$]=(25.2 \pm 16.1 \pm 30.4)\% [(6.4 \pm 4.8 \pm 7.7)\%]$ and ${\mathcal B}[\mathcal R(3780)$$\rightarrow$nOCH$]=(12.3 \pm 6.6 \pm 8.3)\% [(10.4 \pm 4.8 \pm 7.0)\%]$ for the first time. The $\mathcal R(3760)$ state can be interpreted as an open-charm (OC) molecular state, but containing a simple four-quark state component. The $\mathcal R(3810)$ state can be interpreted as a hadrocharmonium state.
△ Less
Submitted 11 May, 2024; v1 submitted 20 July, 2023;
originally announced July 2023.
-
Comprehensive study of the blazars from Fermi-LAT LCR: The log-normal flux distribution and linear RMS-Flux relation
Authors:
Na Wang,
Ting-Feng Yi,
Liang Wang,
Li-Sheng Mao,
Zhi-Yuan Pu,
Gong-Ming Ning,
Wei-Tian Huang,
He Lu,
Shun Zhang,
Yu-Tong Chen,
Liang Dong
Abstract:
Fermi-LAT LCR provide continuous and regularly-sampled gamma-ray light curves, spanning about 14 years, for a large sample of blazars. The log-normal flux distribution and linear RMS-Flux relation of the light curves for a few of Fermi blazar have been examined in previous studies. However, the probability that blazars exhibit log-normal flux distribution and linear RMS-Flux relation in their gamm…
▽ More
Fermi-LAT LCR provide continuous and regularly-sampled gamma-ray light curves, spanning about 14 years, for a large sample of blazars. The log-normal flux distribution and linear RMS-Flux relation of the light curves for a few of Fermi blazar have been examined in previous studies. However, the probability that blazars exhibit log-normal flux distribution and linear RMS-Flux relation in their gamma-ray light curves has not been systematically explored. In this study, we comprehensively research on the distribution of gamma-ray flux and the statistical characteristics on a large sample of 1414 variable blazars from the Fermi-LAT LCR catalog, including 572 FSRQs, 477 BL Lacs, and 365 BCUs, and statistically compare their flux distributions with normal and log-normal distributions. The results indicate that the probability of not reject log-normal is 42.05% for the large sample, and there is still 2.05% probability of not reject normality, based on the joint of Kolmogorov-Smirnov, Shapiro-Wilk and Normality tests. We further find that the probability that BL Lacs conforms to the log-normal distribution is higher than that of FSRQs. Besides, after removing sources with less than 200 data points from this large sample, a sample of 549 blazars, which is still a large sample comparing to the previous studies, was obtained. Basing on dividing the light curves into segments every 20 points (or 40 points, or one year), we fitted the linear RMS-Flux relation of this three different sets, and found that the Pearson correlation coefficients are all close to 1 of the most blazars. This result indicates a strong linear correlation between the RMS and the flux of this 549 blazars. The log-normal distribution and linear RMS-Flux relation indicate that the variability of gamma-ray flux for most blazars is non-linear and multiplicative process.
△ Less
Submitted 19 July, 2023;
originally announced July 2023.
-
NEAT: Distilling 3D Wireframes from Neural Attraction Fields
Authors:
Nan Xue,
Bin Tan,
Yuxi Xiao,
Liang Dong,
Gui-Song Xia,
Tianfu Wu,
Yujun Shen
Abstract:
This paper studies the problem of structured 3D reconstruction using wireframes that consist of line segments and junctions, focusing on the computation of structured boundary geometries of scenes. Instead of leveraging matching-based solutions from 2D wireframes (or line segments) for 3D wireframe reconstruction as done in prior arts, we present NEAT, a rendering-distilling formulation using neur…
▽ More
This paper studies the problem of structured 3D reconstruction using wireframes that consist of line segments and junctions, focusing on the computation of structured boundary geometries of scenes. Instead of leveraging matching-based solutions from 2D wireframes (or line segments) for 3D wireframe reconstruction as done in prior arts, we present NEAT, a rendering-distilling formulation using neural fields to represent 3D line segments with 2D observations, and bipartite matching for perceiving and distilling of a sparse set of 3D global junctions. The proposed {NEAT} enjoys the joint optimization of the neural fields and the global junctions from scratch, using view-dependent 2D observations without precomputed cross-view feature matching. Comprehensive experiments on the DTU and BlendedMVS datasets demonstrate our NEAT's superiority over state-of-the-art alternatives for 3D wireframe reconstruction. Moreover, the distilled 3D global junctions by NEAT, are a better initialization than SfM points, for the recently-emerged 3D Gaussian Splatting for high-fidelity novel view synthesis using about 20 times fewer initial 3D points. Project page: \url{https://xuenan.net/neat}.
△ Less
Submitted 3 April, 2024; v1 submitted 14 July, 2023;
originally announced July 2023.
-
Measurement of the branching fractions of the singly Cabibbo-suppressed decays $Λ_{c}^{+}\to pη$ and $Λ_{c}^{+}\to pω$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
J. Bloms,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (608 additional authors not shown)
Abstract:
Based on 4.5 $\mbox{fb$^{-1}$}$ $e^{+}e^{-}$ collision data collected with BESIII detector at seven energy points between 4.600 and 4.699 GeV, the branching fractions for $Λ_{c}^{+}\to pη$ and $Λ_{c}^{+}\to pω$ were measured by means of single-tag method. The branching fractions of $Λ_{c}^{+}\to pη$ and $Λ_{c}^{+}\to pω$ are determined to be…
▽ More
Based on 4.5 $\mbox{fb$^{-1}$}$ $e^{+}e^{-}$ collision data collected with BESIII detector at seven energy points between 4.600 and 4.699 GeV, the branching fractions for $Λ_{c}^{+}\to pη$ and $Λ_{c}^{+}\to pω$ were measured by means of single-tag method. The branching fractions of $Λ_{c}^{+}\to pη$ and $Λ_{c}^{+}\to pω$ are determined to be $(1.57\pm0.11_{\rm {stat}}\pm0.04_{\rm{syst}})\times10^{-3}$ and $(1.11\pm0.20_{\rm{stat}}\pm0.07_{\rm{syst}})\times10^{-3}$, with a statistical significance of greater than 10 $σ$ and 5.7 $σ$, respectively. These results are consistent with the previous measurements by BESIII, LHCb and Belle, and the result of $Λ_{c}^{+}\to pη$ is the most precise to date.
△ Less
Submitted 17 October, 2023; v1 submitted 18 July, 2023;
originally announced July 2023.
-
Retentive Network: A Successor to Transformer for Large Language Models
Authors:
Yutao Sun,
Li Dong,
Shaohan Huang,
Shuming Ma,
Yuqing Xia,
Jilong Xue,
Jianyong Wang,
Furu Wei
Abstract:
In this work, we propose Retentive Network (RetNet) as a foundation architecture for large language models, simultaneously achieving training parallelism, low-cost inference, and good performance. We theoretically derive the connection between recurrence and attention. Then we propose the retention mechanism for sequence modeling, which supports three computation paradigms, i.e., parallel, recurre…
▽ More
In this work, we propose Retentive Network (RetNet) as a foundation architecture for large language models, simultaneously achieving training parallelism, low-cost inference, and good performance. We theoretically derive the connection between recurrence and attention. Then we propose the retention mechanism for sequence modeling, which supports three computation paradigms, i.e., parallel, recurrent, and chunkwise recurrent. Specifically, the parallel representation allows for training parallelism. The recurrent representation enables low-cost $O(1)$ inference, which improves decoding throughput, latency, and GPU memory without sacrificing performance. The chunkwise recurrent representation facilitates efficient long-sequence modeling with linear complexity, where each chunk is encoded parallelly while recurrently summarizing the chunks. Experimental results on language modeling show that RetNet achieves favorable scaling results, parallel training, low-cost deployment, and efficient inference. The intriguing properties make RetNet a strong successor to Transformer for large language models. Code will be available at https://aka.ms/retnet.
△ Less
Submitted 9 August, 2023; v1 submitted 17 July, 2023;
originally announced July 2023.
-
Measurement of the Energy-Dependent Electromagnetic Form Factors of a Charmed Baryon
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (598 additional authors not shown)
Abstract:
We study the process $e^{+}e^{-}\toΛ_{c}^{+}\barΛ_c^{-}$ at twelve center-of-mass energies from $4.6119$ to $4.9509~\mathrm{GeV}$ using data samples collected by the BESIII detector at the BEPCII collider. The Born cross sections and effective form factors ($|G_{\mathrm{eff}}|$) are determined with unprecedented precision after combining the single and double-tag methods based on the decay process…
▽ More
We study the process $e^{+}e^{-}\toΛ_{c}^{+}\barΛ_c^{-}$ at twelve center-of-mass energies from $4.6119$ to $4.9509~\mathrm{GeV}$ using data samples collected by the BESIII detector at the BEPCII collider. The Born cross sections and effective form factors ($|G_{\mathrm{eff}}|$) are determined with unprecedented precision after combining the single and double-tag methods based on the decay process $Λ_{c}^{+}\to pK^{-}π^{+}$. Flat cross sections around $4.63~\mathrm{GeV}$ are obtained and no indication of the resonant structure $Y(4630)$, as reported by Belle, is found. In addition, no oscillatory behavior is discerned in the $|G_{\mathrm{eff}}|$ energy-dependence of $Λ_{c}^{+}$, in contrast to what is seen for the proton and neutron cases. Analyzing the cross section together with the polar-angle distribution of the $Λ_{c}^{+}$ baryon at each energy point, the moduli of electric and magnetic form factors ($|G_{E}|$ and $|G_{M}|$) are extracted and separated. For the first time, the energy-dependence of the form factor ratio $|G_{E}/G_{M}|$ is observed, which can be well described by an oscillatory function.
△ Less
Submitted 14 July, 2023;
originally announced July 2023.
-
Large AI Model-Based Semantic Communications
Authors:
Feibo Jiang,
Yubo Peng,
Li Dong,
Kezhi Wang,
Kun Yang,
Cunhua Pan,
Xiaohu You
Abstract:
Semantic communication (SC) is an emerging intelligent paradigm, offering solutions for various future applications like metaverse, mixed-reality, and the Internet of everything. However, in current SC systems, the construction of the knowledge base (KB) faces several issues, including limited knowledge representation, frequent knowledge updates, and insecure knowledge sharing. Fortunately, the de…
▽ More
Semantic communication (SC) is an emerging intelligent paradigm, offering solutions for various future applications like metaverse, mixed-reality, and the Internet of everything. However, in current SC systems, the construction of the knowledge base (KB) faces several issues, including limited knowledge representation, frequent knowledge updates, and insecure knowledge sharing. Fortunately, the development of the large AI model provides new solutions to overcome above issues. Here, we propose a large AI model-based SC framework (LAM-SC) specifically designed for image data, where we first design the segment anything model (SAM)-based KB (SKB) that can split the original image into different semantic segments by universal semantic knowledge. Then, we present an attention-based semantic integration (ASI) to weigh the semantic segments generated by SKB without human participation and integrate them as the semantic-aware image. Additionally, we propose an adaptive semantic compression (ASC) encoding to remove redundant information in semantic features, thereby reducing communication overhead. Finally, through simulations, we demonstrate the effectiveness of the LAM-SC framework and the significance of the large AI model-based KB development in future SC paradigms.
△ Less
Submitted 7 July, 2023;
originally announced July 2023.
-
Studies of the decay $D^+_s\to K^+K^- μ^+ ν_μ$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
J. Bloms,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (598 additional authors not shown)
Abstract:
The $D^+_s\to K^+K^-μ^+ν_μ$ decay is studied based on 7.33 fb$^{-1}$ of $e^+e^-$ collision data collected with the BESIII detector at center-of-mass energies in the range from 4.128 to 4.226 GeV. The absolute branching fraction is measured as ${\mathcal B}(D^+_s\to φμ^+ν_μ) = (2.25\pm 0.09 \pm 0.07) \times10^{-2}$, the most precise measurement to date. Combining with the world average of…
▽ More
The $D^+_s\to K^+K^-μ^+ν_μ$ decay is studied based on 7.33 fb$^{-1}$ of $e^+e^-$ collision data collected with the BESIII detector at center-of-mass energies in the range from 4.128 to 4.226 GeV. The absolute branching fraction is measured as ${\mathcal B}(D^+_s\to φμ^+ν_μ) = (2.25\pm 0.09 \pm 0.07) \times10^{-2}$, the most precise measurement to date. Combining with the world average of ${\mathcal B}(D^+_s\to φe^+ν_e)$, the ratio of the branching fractions obtained is$\frac{{\mathcal B}(D^+_s\to φμ^+ν_μ)}{{\mathcal B}(D^+_s\to φe^+ν_e)} = 0.94\pm0.08$, in agreement with lepton universality. By performing a partial wave analysis, the hadronic form factor ratios at $q^{2}=0$ are extracted, finding $r_{V}=\frac{V(0)}{A_{1}(0)}=1.58\pm0.17\pm0.02$ and $r_{2}=\frac{A_{2}(0)}{A_{1}(0)}=0.71\pm0.14\pm0.02$, where the first uncertainties are statistical and the second are systematic. No significant $S$-wave contribution from $f_0(980)\to K^+K^-$ is found. The upper limit $\mathcal{B}(D_s^+\to f_0(980)μ^{+}ν_μ) \cdot{\mathcal B}(f_0(980)\to K^+K^-) < 5.45 \times 10^{-4}$ is set at 90\% confidence level.
△ Less
Submitted 18 July, 2023; v1 submitted 6 July, 2023;
originally announced July 2023.
-
LongNet: Scaling Transformers to 1,000,000,000 Tokens
Authors:
Jiayu Ding,
Shuming Ma,
Li Dong,
Xingxing Zhang,
Shaohan Huang,
Wenhui Wang,
Nanning Zheng,
Furu Wei
Abstract:
Scaling sequence length has become a critical demand in the era of large language models. However, existing methods struggle with either computational complexity or model expressivity, rendering the maximum sequence length restricted. To address this issue, we introduce LongNet, a Transformer variant that can scale sequence length to more than 1 billion tokens, without sacrificing the performance…
▽ More
Scaling sequence length has become a critical demand in the era of large language models. However, existing methods struggle with either computational complexity or model expressivity, rendering the maximum sequence length restricted. To address this issue, we introduce LongNet, a Transformer variant that can scale sequence length to more than 1 billion tokens, without sacrificing the performance on shorter sequences. Specifically, we propose dilated attention, which expands the attentive field exponentially as the distance grows. LongNet has significant advantages: 1) it has a linear computation complexity and a logarithm dependency between any two tokens in a sequence; 2) it can be served as a distributed trainer for extremely long sequences; 3) its dilated attention is a drop-in replacement for standard attention, which can be seamlessly integrated with the existing Transformer-based optimization. Experiments results demonstrate that LongNet yields strong performance on both long-sequence modeling and general language tasks. Our work opens up new possibilities for modeling very long sequences, e.g., treating a whole corpus or even the entire Internet as a sequence.
△ Less
Submitted 19 July, 2023; v1 submitted 5 July, 2023;
originally announced July 2023.
-
Measurement of $e^+e^-\to pK^-\barΛ+c.c.$ cross sections between 4.009 GeV and 4.951 GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
J. Bloms,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (599 additional authors not shown)
Abstract:
Using $e^+e^-$ collision datasets corresponding to total integrated luminosity of 21.7 fb$^{-1}$ collected with the BESIII detector at the BEPCII collider at center-of-mass energies ranging from 4.009 GeV to 4.951 GeV, the energy-dependent cross sections of $e^+e^-\to pK^-\barΛ+c.c.$ are measured for the first time. By fitting these energy-dependent cross sections, we search for the excited $ψ$ st…
▽ More
Using $e^+e^-$ collision datasets corresponding to total integrated luminosity of 21.7 fb$^{-1}$ collected with the BESIII detector at the BEPCII collider at center-of-mass energies ranging from 4.009 GeV to 4.951 GeV, the energy-dependent cross sections of $e^+e^-\to pK^-\barΛ+c.c.$ are measured for the first time. By fitting these energy-dependent cross sections, we search for the excited $ψ$ states $ψ(4160)$ and $ψ(4415)$, and the vector charmonium-like states $ψ(4230)$, $ψ(4360)$, and $ψ(4660)$. No evidence for these is observed and the upper limits on the branching fractions of these states decaying into $pK^-\bar Λ+c.c.$ are set at the 90\% confidence level.
△ Less
Submitted 5 July, 2023;
originally announced July 2023.
-
Search for the semi-muonic charmonium decay $J/ψ\to D^{-}μ^{+}ν_μ+c.c.$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (603 additional authors not shown)
Abstract:
Using $(10087\pm44)\times10^{6}$ $J/ψ$ events collected with the BESIII detector at the BEPCII $e^+e^-$ storage ring at the center-of-mass energy of $\sqrt{s}=3.097~\rm{GeV}$, we present a search for the rare semi-muonic charmonium decay $J/ψ\to D^{-}μ^{+}ν_μ+c.c.$. Since no significant signal is observed, we set an upper limit of the branching fraction to be…
▽ More
Using $(10087\pm44)\times10^{6}$ $J/ψ$ events collected with the BESIII detector at the BEPCII $e^+e^-$ storage ring at the center-of-mass energy of $\sqrt{s}=3.097~\rm{GeV}$, we present a search for the rare semi-muonic charmonium decay $J/ψ\to D^{-}μ^{+}ν_μ+c.c.$. Since no significant signal is observed, we set an upper limit of the branching fraction to be $\mathcal{B}(J/ψ\to D^{-}μ^{+}ν_μ+c.c.)<5.6\times10^{-7}$ at $90\%$ confidence level. This is the first search for the weak decay of charmonium with a muon in the final state.
△ Less
Submitted 12 December, 2023; v1 submitted 5 July, 2023;
originally announced July 2023.
-
Fake the Real: Backdoor Attack on Deep Speech Classification via Voice Conversion
Authors:
Zhe Ye,
Terui Mao,
Li Dong,
Diqun Yan
Abstract:
Deep speech classification has achieved tremendous success and greatly promoted the emergence of many real-world applications. However, backdoor attacks present a new security threat to it, particularly with untrustworthy third-party platforms, as pre-defined triggers set by the attacker can activate the backdoor. Most of the triggers in existing speech backdoor attacks are sample-agnostic, and ev…
▽ More
Deep speech classification has achieved tremendous success and greatly promoted the emergence of many real-world applications. However, backdoor attacks present a new security threat to it, particularly with untrustworthy third-party platforms, as pre-defined triggers set by the attacker can activate the backdoor. Most of the triggers in existing speech backdoor attacks are sample-agnostic, and even if the triggers are designed to be unnoticeable, they can still be audible. This work explores a backdoor attack that utilizes sample-specific triggers based on voice conversion. Specifically, we adopt a pre-trained voice conversion model to generate the trigger, ensuring that the poisoned samples does not introduce any additional audible noise. Extensive experiments on two speech classification tasks demonstrate the effectiveness of our attack. Furthermore, we analyzed the specific scenarios that activated the proposed backdoor and verified its resistance against fine-tuning.
△ Less
Submitted 27 June, 2023;
originally announced June 2023.
-
Kosmos-2: Grounding Multimodal Large Language Models to the World
Authors:
Zhiliang Peng,
Wenhui Wang,
Li Dong,
Yaru Hao,
Shaohan Huang,
Shuming Ma,
Furu Wei
Abstract:
We introduce Kosmos-2, a Multimodal Large Language Model (MLLM), enabling new capabilities of perceiving object descriptions (e.g., bounding boxes) and grounding text to the visual world. Specifically, we represent refer expressions as links in Markdown, i.e., ``[text span](bounding boxes)'', where object descriptions are sequences of location tokens. Together with multimodal corpora, we construct…
▽ More
We introduce Kosmos-2, a Multimodal Large Language Model (MLLM), enabling new capabilities of perceiving object descriptions (e.g., bounding boxes) and grounding text to the visual world. Specifically, we represent refer expressions as links in Markdown, i.e., ``[text span](bounding boxes)'', where object descriptions are sequences of location tokens. Together with multimodal corpora, we construct large-scale data of grounded image-text pairs (called GrIT) to train the model. In addition to the existing capabilities of MLLMs (e.g., perceiving general modalities, following instructions, and performing in-context learning), Kosmos-2 integrates the grounding capability into downstream applications. We evaluate Kosmos-2 on a wide range of tasks, including (i) multimodal grounding, such as referring expression comprehension, and phrase grounding, (ii) multimodal referring, such as referring expression generation, (iii) perception-language tasks, and (iv) language understanding and generation. This work lays out the foundation for the development of Embodiment AI and sheds light on the big convergence of language, multimodal perception, action, and world modeling, which is a key step toward artificial general intelligence. Code and pretrained models are available at https://aka.ms/kosmos-2.
△ Less
Submitted 13 July, 2023; v1 submitted 26 June, 2023;
originally announced June 2023.
-
Precise measurement of the branching fractions of $J/ψ\rightarrow\barΛπ^{+}Σ^{-}+c.c.$ and $J/ψ\rightarrow\barΛπ^{-}Σ^{+}+c.c.$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (600 additional authors not shown)
Abstract:
Based on a data sample of $(10087\pm44)\times10^6$ $J/ψ$ events collected with the BESIII detector, the branching fraction of $J/ψ\rightarrow\barΛπ^{+}Σ^{-}+c.c.$ is measured to be $(1.221\pm 0.002\pm 0.038)\times10^{-3}$, and the branching fraction of its isospin partner mode $J/ψ\rightarrow\barΛπ^{-}Σ^{+}+c.c.$ is measured to be $(1.244\pm 0.002\pm 0.045)\times10^{-3}$ with improved precision. H…
▽ More
Based on a data sample of $(10087\pm44)\times10^6$ $J/ψ$ events collected with the BESIII detector, the branching fraction of $J/ψ\rightarrow\barΛπ^{+}Σ^{-}+c.c.$ is measured to be $(1.221\pm 0.002\pm 0.038)\times10^{-3}$, and the branching fraction of its isospin partner mode $J/ψ\rightarrow\barΛπ^{-}Σ^{+}+c.c.$ is measured to be $(1.244\pm 0.002\pm 0.045)\times10^{-3}$ with improved precision. Here the first uncertainties are statistical and the second ones systematic. The isospin symmetry of the $Σ$ baryon in charmonium hadronic decay and the "$12\%$ rule" are tested, and no violation is found. The potential of using these channels as $Σ$ baryon sources for nuclear physics research is studied, and the momentum and angular distributions of these sources are provided.
△ Less
Submitted 24 December, 2023; v1 submitted 17 June, 2023;
originally announced June 2023.
-
Pushing the Limits of ChatGPT on NLP Tasks
Authors:
Xiaofei Sun,
Linfeng Dong,
Xiaoya Li,
Zhen Wan,
Shuhe Wang,
Tianwei Zhang,
Jiwei Li,
Fei Cheng,
Lingjuan Lyu,
Fei Wu,
Guoyin Wang
Abstract:
Despite the success of ChatGPT, its performances on most NLP tasks are still well below the supervised baselines. In this work, we looked into the causes, and discovered that its subpar performance was caused by the following factors: (1) token limit in the prompt does not allow for the full utilization of the supervised datasets; (2) mismatch between the generation nature of ChatGPT and NLP tasks…
▽ More
Despite the success of ChatGPT, its performances on most NLP tasks are still well below the supervised baselines. In this work, we looked into the causes, and discovered that its subpar performance was caused by the following factors: (1) token limit in the prompt does not allow for the full utilization of the supervised datasets; (2) mismatch between the generation nature of ChatGPT and NLP tasks; (3) intrinsic pitfalls of LLMs models, e.g., hallucination, overly focus on certain keywords, etc.
In this work, we propose a collection of general modules to address these issues, in an attempt to push the limits of ChatGPT on NLP tasks. Our proposed modules include (1) a one-input-multiple-prompts strategy that employs multiple prompts for one input to accommodate more demonstrations; (2) using fine-tuned models for better demonstration retrieval; (3) transforming tasks to formats that are more tailored to the generation nature; (4) employing reasoning strategies that are tailored to addressing the task-specific complexity; (5) the self-verification strategy to address the hallucination issue of LLMs; (6) the paraphrase strategy to improve the robustness of model predictions.
We conduct experiments on 21 datasets of 10 representative NLP tasks, including question answering, commonsense reasoning, natural language inference, sentiment analysis, named entity recognition, entity-relation extraction, event extraction, dependency parsing, semantic role labeling, and part-of-speech tagging. Using the proposed assemble of techniques, we are able to significantly boost the performance of ChatGPT on the selected NLP tasks, achieving performances comparable to or better than supervised baselines, or even existing SOTA performances.
△ Less
Submitted 9 October, 2023; v1 submitted 16 June, 2023;
originally announced June 2023.
-
Semi-Offline Reinforcement Learning for Optimized Text Generation
Authors:
Changyu Chen,
Xiting Wang,
Yiqiao **,
Victor Ye Dong,
Li Dong,
Jie Cao,
Yi Liu,
Rui Yan
Abstract:
In reinforcement learning (RL), there are two major settings for interacting with the environment: online and offline. Online methods explore the environment at significant time cost, and offline methods efficiently obtain reward signals by sacrificing exploration capability. We propose semi-offline RL, a novel paradigm that smoothly transits from offline to online settings, balances exploration c…
▽ More
In reinforcement learning (RL), there are two major settings for interacting with the environment: online and offline. Online methods explore the environment at significant time cost, and offline methods efficiently obtain reward signals by sacrificing exploration capability. We propose semi-offline RL, a novel paradigm that smoothly transits from offline to online settings, balances exploration capability and training cost, and provides a theoretical foundation for comparing different RL settings. Based on the semi-offline formulation, we present the RL setting that is optimal in terms of optimization cost, asymptotic error, and overfitting error bound. Extensive experiments show that our semi-offline approach is efficient and yields comparable or often better performance compared with state-of-the-art methods.
△ Less
Submitted 16 June, 2023;
originally announced June 2023.
-
MiniLLM: Knowledge Distillation of Large Language Models
Authors:
Yuxian Gu,
Li Dong,
Furu Wei,
Minlie Huang
Abstract:
Knowledge Distillation (KD) is a promising technique for reducing the high computational demand of large language models (LLMs). However, previous KD methods are primarily applied to white-box classification models or training small models to imitate black-box model APIs like ChatGPT. How to effectively distill the knowledge of white-box LLMs into small models is still under-explored, which become…
▽ More
Knowledge Distillation (KD) is a promising technique for reducing the high computational demand of large language models (LLMs). However, previous KD methods are primarily applied to white-box classification models or training small models to imitate black-box model APIs like ChatGPT. How to effectively distill the knowledge of white-box LLMs into small models is still under-explored, which becomes more important with the prosperity of open-source LLMs. In this work, we propose a KD approach that distills LLMs into smaller language models. We first replace the forward Kullback-Leibler divergence (KLD) objective in the standard KD approaches with reverse KLD, which is more suitable for KD on generative language models, to prevent the student model from overestimating the low-probability regions of the teacher distribution. Then, we derive an effective optimization approach to learn this objective. The student models are named MiniLLM. Extensive experiments in the instruction-following setting show that MiniLLM generates more precise responses with higher overall quality, lower exposure bias, better calibration, and higher long-text generation performance than the baselines. Our method is scalable for different model families with 120M to 13B parameters. Our code, data, and model checkpoints can be found in https://github.com/microsoft/LMOps/tree/main/minillm.
△ Less
Submitted 9 April, 2024; v1 submitted 14 June, 2023;
originally announced June 2023.
-
Augmenting Language Models with Long-Term Memory
Authors:
Weizhi Wang,
Li Dong,
Hao Cheng,
Xiaodong Liu,
Xifeng Yan,
Jianfeng Gao,
Furu Wei
Abstract:
Existing large language models (LLMs) can only afford fix-sized inputs due to the input length limit, preventing them from utilizing rich long-context information from past inputs. To address this, we propose a framework, Language Models Augmented with Long-Term Memory (LongMem), which enables LLMs to memorize long history. We design a novel decoupled network architecture with the original backbon…
▽ More
Existing large language models (LLMs) can only afford fix-sized inputs due to the input length limit, preventing them from utilizing rich long-context information from past inputs. To address this, we propose a framework, Language Models Augmented with Long-Term Memory (LongMem), which enables LLMs to memorize long history. We design a novel decoupled network architecture with the original backbone LLM frozen as a memory encoder and an adaptive residual side-network as a memory retriever and reader. Such a decoupled memory design can easily cache and update long-term past contexts for memory retrieval without suffering from memory staleness. Enhanced with memory-augmented adaptation training, LongMem can thus memorize long past context and use long-term memory for language modeling. The proposed memory retrieval module can handle unlimited-length context in its memory bank to benefit various downstream tasks. Typically, LongMem can enlarge the long-form memory to 65k tokens and thus cache many-shot extra demonstration examples as long-form memory for in-context learning. Experiments show that our method outperforms strong long-context models on ChapterBreak, a challenging long-context modeling benchmark, and achieves remarkable improvements on memory-augmented in-context learning over LLMs. The results demonstrate that the proposed method is effective in hel** language models to memorize and utilize long-form contents. Our code is open-sourced at https://aka.ms/LongMem.
△ Less
Submitted 12 June, 2023;
originally announced June 2023.
-
Language-specific Acoustic Boundary Learning for Mandarin-English Code-switching Speech Recognition
Authors:
Zhiyun Fan,
Linhao Dong,
Chen Shen,
Zhenlin Liang,
Jun Zhang,
Lu Lu,
Zejun Ma
Abstract:
Code-switching speech recognition (CSSR) transcribes speech that switches between multiple languages or dialects within a single sentence. The main challenge in this task is that different languages often have similar pronunciations, making it difficult for models to distinguish between them. In this paper, we propose a method for solving the CSSR task from the perspective of language-specific aco…
▽ More
Code-switching speech recognition (CSSR) transcribes speech that switches between multiple languages or dialects within a single sentence. The main challenge in this task is that different languages often have similar pronunciations, making it difficult for models to distinguish between them. In this paper, we propose a method for solving the CSSR task from the perspective of language-specific acoustic boundary learning. We introduce language-specific weight estimators (LSWE) to model acoustic boundary learning in different languages separately. Additionally, a non-autoregressive (NAR) decoder and a language change detection (LCD) module are employed to assist in training. Evaluated on the SEAME corpus, our method achieves a state-of-the-art mixed error rate (MER) of 16.29% and 22.81% on the test_man and test_sge sets. We also demonstrate the effectiveness of our method on a 9000-hour in-house meeting code-switching dataset, where our method achieves a relatively 7.9% MER reduction.
△ Less
Submitted 8 June, 2023;
originally announced June 2023.
-
Precision Measurements of $D_s^+ \to ηe^+ ν_e$ and $D_s^+ \to η^\prime e^+ ν_e$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
J. Bloms,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (591 additional authors not shown)
Abstract:
Precision measurements of the semileptonic decays $D_s^+ \to ηe^+ ν_e$ and $D_s^+ \to η^\prime e^+ ν_e$ are performed with 7.33\,fb$^{-1}$ of $e^+e^-$ collision data collected at center-of-mass energies between 4.128 and 4.226 GeV with the BESIII detector. The branching fractions obtained are $\mathcal{B}(D_s^+ \to ηe^{+} ν_e)$ = $(2.255\pm0.039_{\rm stat}\pm 0.051_{\rm syst})\%$ and…
▽ More
Precision measurements of the semileptonic decays $D_s^+ \to ηe^+ ν_e$ and $D_s^+ \to η^\prime e^+ ν_e$ are performed with 7.33\,fb$^{-1}$ of $e^+e^-$ collision data collected at center-of-mass energies between 4.128 and 4.226 GeV with the BESIII detector. The branching fractions obtained are $\mathcal{B}(D_s^+ \to ηe^{+} ν_e)$ = $(2.255\pm0.039_{\rm stat}\pm 0.051_{\rm syst})\%$ and $\mathcal{B}(D_s^+ \to η^{\prime} e^{+} ν_e)$ = $(0.810\pm0.038_{\rm stat}\pm 0.024_{\rm syst})\%$. Combining these results with the $\mathcal{B}(D^+\toηe^+ ν_e)$ and $\mathcal{B}(D^+\toη^\prime e^+ ν_e)$ obtained from previous BESIII measurements, the $η-η^\prime$ mixing angle in the quark flavor basis is determined to be $φ_{\rm P} = (40.0\pm2.0_{\rm stat}\pm0.6_{\rm syst})^\circ$. Moreover, from the fits to the partial decay rates of $D_s^+ \to ηe^+ ν_e$ and $D_s^+ \to η^\prime e^+ ν_e$, the products of the hadronic transition form factors $f_+^{η^{(\prime)}}(0)$ and the modulus of the $c\to s$ Cabibbo-Kobayashi-Maskawa matrix element $|V_{cs}|$ are determined by using different hadronic transition form factor parametrizations. Based on the two-parameter series expansion, the products $f^η_+(0)|V_{cs}| = 0.4519\pm0.0071_{\rm stat}\pm0.0065_{\rm syst}$ and $f^{η^\prime}_+(0)|V_{cs}| = 0.525\pm0.024_{\rm stat}\pm0.009_{\rm syst}$ are extracted. All results determined in this work supersede those measured in the previous BESIII analyses based on the 3.19 fb$^{-1}$ subsample of data at 4.178 GeV.
△ Less
Submitted 27 October, 2023; v1 submitted 8 June, 2023;
originally announced June 2023.
-
Propagation of very-high-energy $γ$-rays from distant blazars
Authors:
L. J. Dong,
Y. G. Zheng,
S. J. Kang
Abstract:
We re-derive the possible dependence of the redshift with very high energy (VHE) $γ$-ray photon index. The results suggest that the universe to VHE $γ$-rays is becoming more transparent than usually expected. We introduce the extragalactic background light (EBL) plus the photon to axion-like particle (ALP) oscillations to explain this phenomenon. We concentrate our analysis on 70 blazars up to red…
▽ More
We re-derive the possible dependence of the redshift with very high energy (VHE) $γ$-ray photon index. The results suggest that the universe to VHE $γ$-rays is becoming more transparent than usually expected. We introduce the extragalactic background light (EBL) plus the photon to axion-like particle (ALP) oscillations to explain this phenomenon. We concentrate our analysis on 70 blazars up to redshift $z \simeq 1$. Assuming this correlation is solely the result of photon-photon absorption of VHE photons with the EBL, which finds the deviations between the predictions and observations, especially at redshifts $0.2 < z < 1$. We then discuss the implications of photon-ALP oscillations for the VHE $γ$-ray spectra of blazars. A strong evidence shows that: 1) the EBL attenuation results that the VHE $γ$-ray photon index increases non-linearly at the ranges of redshift, $0.03 < z < 0.2$; 2) the photon-ALP oscillation results in a attractive characteristic in the VHE $γ$-ray photon index at the ranges of redshift, $0.2 < z < 1$. We suggest that both the EBL absorption and photon-ALP oscillation can influence on the propagation of VHE $γ$-rays from distant blazars.
△ Less
Submitted 7 June, 2023;
originally announced June 2023.
-
GaitMPL: Gait Recognition with Memory-Augmented Progressive Learning
Authors:
Huanzhang Dou,
Pengyi Zhang,
Yuhan Zhao,
Lin Dong,
Zequn Qin,
Xi Li
Abstract:
Gait recognition aims at identifying the pedestrians at a long distance by their biometric gait patterns. It is inherently challenging due to the various covariates and the properties of silhouettes (textureless and colorless), which result in two kinds of pair-wise hard samples: the same pedestrian could have distinct silhouettes (intra-class diversity) and different pedestrians could have simila…
▽ More
Gait recognition aims at identifying the pedestrians at a long distance by their biometric gait patterns. It is inherently challenging due to the various covariates and the properties of silhouettes (textureless and colorless), which result in two kinds of pair-wise hard samples: the same pedestrian could have distinct silhouettes (intra-class diversity) and different pedestrians could have similar silhouettes (inter-class similarity). In this work, we propose to solve the hard sample issue with a Memory-augmented Progressive Learning network (GaitMPL), including Dynamic Reweighting Progressive Learning module (DRPL) and Global Structure-Aligned Memory bank (GSAM). Specifically, DRPL reduces the learning difficulty of hard samples by easy-to-hard progressive learning. GSAM further augments DRPL with a structure-aligned memory mechanism, which maintains and models the feature distribution of each ID. Experiments on two commonly used datasets, CASIA-B and OU-MVLP, demonstrate the effectiveness of GaitMPL. On CASIA-B, we achieve the state-of-the-art performance, i.e., 88.0% on the most challenging condition (Clothing) and 93.3% on the average condition, which outperforms the other methods by at least 3.8% and 1.4%, respectively.
△ Less
Submitted 6 June, 2023;
originally announced June 2023.
-
Hyperuniform organization in human settlements
Authors:
Lei Dong
Abstract:
Quantifying the spatial organization of human settlements is fundamental to understanding the complexity of urban systems. However, the quantitative patterns of the distribution of villages, towns, and cities that lie between random and regular, are still largely unknown. Here, by analyzing the geographic location of settlements in diverse regions, we show that the apparently complex urban systems…
▽ More
Quantifying the spatial organization of human settlements is fundamental to understanding the complexity of urban systems. However, the quantitative patterns of the distribution of villages, towns, and cities that lie between random and regular, are still largely unknown. Here, by analyzing the geographic location of settlements in diverse regions, we show that the apparently complex urban systems can be characterized by disordered hyperuniformity (with small density fluctuations), an intriguing pattern that has been identified in many physical and biological systems, but has rarely been documented in socio-economic systems. By introducing the mechanisms of spatial matching and competition, we develop a growth model that shows how settlements evolve towards hyperuniformity. Our model also predicts the heavy-tail population distribution across settlements, in agreement with empirical observations. These results provide insights into the self-organization of cities, and reveal the universality of spatial organization shared by social, physical, and biological systems.
△ Less
Submitted 4 July, 2023; v1 submitted 7 June, 2023;
originally announced June 2023.
-
Study of $Λ_c^+\rightarrow Λμ^+ν_μ$ and Test of Lepton Flavor Universality with $Λ_c^+\rightarrow Λ\ell^+ν_{\ell}$ Decays
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
J. Bloms,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (604 additional authors not shown)
Abstract:
The measurement of the Cabibbo-favored semileptonic decay $Λ_c^+\rightarrow Λμ^+ν_μ$ is reported using $4.5~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data collected at center-of-mass energies ranging from 4.600~GeV to 4.699~GeV. The branching fraction of the decay is measured to be $\mathcal{B}(Λ_c^+\rightarrow Λμ^+ν_μ)=(3.48\pm0.14_{\rm stat.}\pm0.10_{\rm syst.})\%$, three times more precise tha…
▽ More
The measurement of the Cabibbo-favored semileptonic decay $Λ_c^+\rightarrow Λμ^+ν_μ$ is reported using $4.5~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data collected at center-of-mass energies ranging from 4.600~GeV to 4.699~GeV. The branching fraction of the decay is measured to be $\mathcal{B}(Λ_c^+\rightarrow Λμ^+ν_μ)=(3.48\pm0.14_{\rm stat.}\pm0.10_{\rm syst.})\%$, three times more precise than the prior world average result. Tests of lepton flavor universality using $Λ_c^+\rightarrow Λ\ell^+ν_{\ell}$ ($\ell=e, μ$) decays are reported for the first time, based on measurements of the differential decay rates and the forward-backward asymmetries in separate four-momentum transfer regions. The results are compatible with Standard Model predictions. Furthermore, we improve the determination of the form-factor parameters in $Λ_c^+\rightarrow Λ\ell^+ν_{\ell}$ decays, which provide stringent tests and calibration for lattice quantum chromodynamics (LQCD) calculations.
△ Less
Submitted 5 June, 2023;
originally announced June 2023.
-
Can LLMs like GPT-4 outperform traditional AI tools in dementia diagnosis? Maybe, but not today
Authors:
Zhuo Wang,
Rongzhen Li,
Bowen Dong,
Jie Wang,
Xiuxing Li,
Ning Liu,
Chenhui Mao,
Wei Zhang,
Liling Dong,
**g Gao,
Jianyong Wang
Abstract:
Recent investigations show that large language models (LLMs), specifically GPT-4, not only have remarkable capabilities in common Natural Language Processing (NLP) tasks but also exhibit human-level performance on various professional and academic benchmarks. However, whether GPT-4 can be directly used in practical applications and replace traditional artificial intelligence (AI) tools in speciali…
▽ More
Recent investigations show that large language models (LLMs), specifically GPT-4, not only have remarkable capabilities in common Natural Language Processing (NLP) tasks but also exhibit human-level performance on various professional and academic benchmarks. However, whether GPT-4 can be directly used in practical applications and replace traditional artificial intelligence (AI) tools in specialized domains requires further experimental validation. In this paper, we explore the potential of LLMs such as GPT-4 to outperform traditional AI tools in dementia diagnosis. Comprehensive comparisons between GPT-4 and traditional AI tools are conducted to examine their diagnostic accuracy in a clinical setting. Experimental results on two real clinical datasets show that, although LLMs like GPT-4 demonstrate potential for future advancements in dementia diagnosis, they currently do not surpass the performance of traditional AI tools. The interpretability and faithfulness of GPT-4 are also evaluated by comparison with real doctors. We discuss the limitations of GPT-4 in its current state and propose future research directions to enhance GPT-4 in dementia diagnosis.
△ Less
Submitted 2 June, 2023;
originally announced June 2023.
-
CIF-PT: Bridging Speech and Text Representations for Spoken Language Understanding via Continuous Integrate-and-Fire Pre-Training
Authors:
Linhao Dong,
Zhecheng An,
Peihao Wu,
Jun Zhang,
Lu Lu,
Zejun Ma
Abstract:
Speech or text representation generated by pre-trained models contains modal-specific information that could be combined for benefiting spoken language understanding (SLU) tasks. In this work, we propose a novel pre-training paradigm termed Continuous Integrate-and-Fire Pre-Training (CIF-PT). It relies on a simple but effective frame-to-token alignment: continuous integrate-and-fire (CIF) to bridg…
▽ More
Speech or text representation generated by pre-trained models contains modal-specific information that could be combined for benefiting spoken language understanding (SLU) tasks. In this work, we propose a novel pre-training paradigm termed Continuous Integrate-and-Fire Pre-Training (CIF-PT). It relies on a simple but effective frame-to-token alignment: continuous integrate-and-fire (CIF) to bridge the representations between speech and text. It jointly performs speech-to-text training and language model distillation through CIF as the pre-training (PT). Evaluated on SLU benchmark SLURP dataset, CIF-PT outperforms the state-of-the-art model by 1.94% of accuracy and 2.71% of SLU-F1 on the tasks of intent classification and slot filling, respectively. We also observe the cross-modal representation extracted by CIF-PT obtains better performance than other neural interfaces for the tasks of SLU, including the dominant speech representation learned from self-supervised pre-training.
△ Less
Submitted 27 May, 2023;
originally announced May 2023.
-
Amplitude analysis and branching fraction measurement of the decay $D^{+} \to K_S^0π^+π^0π^0$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (601 additional authors not shown)
Abstract:
Using 2.93 $\rm{fb}^{-1}$ of $e^+e^-$ collision data collected with the BESIII detector at the center-of-mass energy 3.773\,GeV, we perform the first amplitude analysis of the decay $D^+\to K_S^0π^+π^0π^0$ and determine the relative magnitudes and phases of different intermediate processes. The absolute branching fraction of $D^+\to K_S^0π^+π^0π^0$ is measured to be…
▽ More
Using 2.93 $\rm{fb}^{-1}$ of $e^+e^-$ collision data collected with the BESIII detector at the center-of-mass energy 3.773\,GeV, we perform the first amplitude analysis of the decay $D^+\to K_S^0π^+π^0π^0$ and determine the relative magnitudes and phases of different intermediate processes. The absolute branching fraction of $D^+\to K_S^0π^+π^0π^0$ is measured to be $(2.888\pm0.058_{\rm stat.}\pm0.069_{\rm syst.})\%$. The dominant intermediate processes are $D^+\to K_S^0a_1(1260)^+(\to ρ^+π^0)$ and $D^+\to \bar{K}^{*0}ρ^+$, with branching fractions of $(8.66\pm1.04_{\rm stat.}\pm1.39_{\rm syst.})\!\times \!10^{-3}$ and $(9.70\pm0.81_{\rm stat.}\pm0.53_{\rm syst.})\!\times \!10^{-3}$, respectively.
△ Less
Submitted 5 August, 2023; v1 submitted 25 May, 2023;
originally announced May 2023.
-
Determination of spin and parity of $D^{*}_{(s)}$ mesons
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
J. Bloms,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (598 additional authors not shown)
Abstract:
The spin and parity of the charmed mesons $D_{s}^{*+}$, $D^{*0}$ and $D^{*+}$ are determined for the first time to be $J^P=1^{-}$ with significances greater than 10$σ$ over other hypotheses of $2^{+}$ and $3^{-}$, using an $e^+e^-$ collision data sample with an integrated luminosity of 3.19 fb$^{-1}$ collected by the BESIII detector at a center-of-mass energy of 4.178 GeV. Different spin-parity hy…
▽ More
The spin and parity of the charmed mesons $D_{s}^{*+}$, $D^{*0}$ and $D^{*+}$ are determined for the first time to be $J^P=1^{-}$ with significances greater than 10$σ$ over other hypotheses of $2^{+}$ and $3^{-}$, using an $e^+e^-$ collision data sample with an integrated luminosity of 3.19 fb$^{-1}$ collected by the BESIII detector at a center-of-mass energy of 4.178 GeV. Different spin-parity hypotheses for $D_{s}^{*+}$, $D^{*0}$, and $D^{*+}$ mesons are tested via a helicity amplitude analysis of the processes $e^+e^-\to D^{*+}_{s}D^{-}_{s}$, $D^{*0}D^{0}$ and $D^{*+}D^{-}$, with $D^{*+}_{s}\to D^{+}_{s} γ$, $D^{*0}\to D^{0}π^{0}$, and $D^{*+}\to D^{+}π^{0}$. The results confirm the quark model predictions.
△ Less
Submitted 23 May, 2023;
originally announced May 2023.
-
Production of doubly-charged $Δ$ baryon in $e^{+}e^{-}$ annihilation at energies from 2.3094 to 2.6464 GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
R. Baldini Ferroli,
I. Balossino,
Y. Ban,
V. Batozskaya,
D. Becker,
K. Begzsuren,
N. Berger,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
J. Bloms,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (579 additional authors not shown)
Abstract:
The processes $e^{+}e^{-} \to Δ^{++}\barΔ^{--}$ and $e^{+}e^{-}\to Δ^{++} \bar{p} π^{-} + c.c.$ are studied for the first time with $179~{\rm pb}^{-1}$ of $e^{+}e^{-}$ annihilation data collected with the BESIII detector at center-of-mass energies from $2.3094$ GeV to $2.6464$ GeV. No significant signal for the $e^{+}e^{-}\to Δ^{++}\barΔ^{--}$ process is observed and the upper limit of the Born cr…
▽ More
The processes $e^{+}e^{-} \to Δ^{++}\barΔ^{--}$ and $e^{+}e^{-}\to Δ^{++} \bar{p} π^{-} + c.c.$ are studied for the first time with $179~{\rm pb}^{-1}$ of $e^{+}e^{-}$ annihilation data collected with the BESIII detector at center-of-mass energies from $2.3094$ GeV to $2.6464$ GeV. No significant signal for the $e^{+}e^{-}\to Δ^{++}\barΔ^{--}$ process is observed and the upper limit of the Born cross section is estimated at each energy point. For the process $e^{+}e^{-} \to Δ^{++} \bar{p} π^{-} + c.c.$, a significant signal is observed at center-of-mass energies near 2.6454 GeV and the corresponding Born cross section is reported.
△ Less
Submitted 14 July, 2023; v1 submitted 20 May, 2023;
originally announced May 2023.
-
Search for a scalar partner of the $X(3872)$ via $ψ(3770)$ decays into $γηη'$ and $γπ^{+}π^{-}J/ψ$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
J. Bloms,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (599 additional authors not shown)
Abstract:
Using a data sample corresponding to an integrated luminosity of 2.93 fb$^{-1}$ collected at a center-of-mass energy of 3.773~GeV with the BESIII detector at the BEPCII collider, we search for a scalar partner of the $X(3872)$, denoted as $X(3700)$, via $ψ(3770)\to γηη'$ and $γπ^{+}π^{-}J/ψ$ processes. No significant signals are observed and the upper limits of the product branching fractions…
▽ More
Using a data sample corresponding to an integrated luminosity of 2.93 fb$^{-1}$ collected at a center-of-mass energy of 3.773~GeV with the BESIII detector at the BEPCII collider, we search for a scalar partner of the $X(3872)$, denoted as $X(3700)$, via $ψ(3770)\to γηη'$ and $γπ^{+}π^{-}J/ψ$ processes. No significant signals are observed and the upper limits of the product branching fractions $ {\cal B}(ψ(3770)\toγX(3700))\cdot {\cal B}(X(3700)\to ηη')$ and ${\cal B}(ψ(3770)\toγX(3700))\cdot {\cal B}(X(3700)\toπ^{+}π^{-}J/ψ)$ are determined at the 90\% confidence level, for the narrow $X(3700)$ with a mass ranging from 3710 to 3740 MeV/$c^2$, which are from 0.8 to 1.8 $(\times 10^{-5})$ and 0.9 to 3.4 $(\times 10^{-5})$, respectively.
△ Less
Submitted 6 September, 2023; v1 submitted 19 May, 2023;
originally announced May 2023.
-
Tests of $CP$ symmetry in the entangled $Ξ^0-\barΞ^0$ Pairs
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (599 additional authors not shown)
Abstract:
The $J/ψ\to Ξ^0 \barΞ^{0}$ process and subsequent decays are investigated using $(10087 \pm 44)\times 10^6$ $J/ψ$ events collected at the BESIII experiment. The decay parameters of $Ξ^0$ and $\barΞ^0$ are measured with greatly improved precision over previous measurements to be $α_Ξ = -0.3750 \pm 0.0034 \pm 0.0016$, $\barα_Ξ = 0.3790 \pm 0.0034 \pm 0.0021$, $φ_Ξ = 0.0051 \pm 0.0096 \pm 0.0018$~rad…
▽ More
The $J/ψ\to Ξ^0 \barΞ^{0}$ process and subsequent decays are investigated using $(10087 \pm 44)\times 10^6$ $J/ψ$ events collected at the BESIII experiment. The decay parameters of $Ξ^0$ and $\barΞ^0$ are measured with greatly improved precision over previous measurements to be $α_Ξ = -0.3750 \pm 0.0034 \pm 0.0016$, $\barα_Ξ = 0.3790 \pm 0.0034 \pm 0.0021$, $φ_Ξ = 0.0051 \pm 0.0096 \pm 0.0018$~rad, $\barφ_Ξ = -0.0053 \pm 0.0097 \pm 0.0019$~rad, where the first and the second uncertainties are statistical and systematic, respectively. From these measurements, precise $CP$ symmetry tests in $Ξ^0$ decay are performed, and $A^Ξ_{CP} = (-5.4 \pm 6.5 \pm 3.1) \times 10^{-3}$ and $Δφ^Ξ_{CP} = (-0.1 \pm 6.9 \pm 0.9) \times 10^{-3}$~rad are consistent with $CP$ conservation. The sequential decay also enables a separation of weak and strong phase differences, which are found for the first time to be $ξ_{P}-ξ_{S} = (0.0 \pm 1.7 \pm 0.2) \times 10^{-2}$~rad and $δ_{P}-δ_{S} = (-1.3 \pm 1.7 \pm 0.4)\times 10^{-2}$~rad, respectively. In addition, we measure the $Λ$ decay parameters and test $CP$ symmetry in $Λ$ decays.
△ Less
Submitted 14 August, 2023; v1 submitted 16 May, 2023;
originally announced May 2023.