-
Measurement of the cross sections for $e^+e^-\toηπ^+π^-$ at center-of-mass energies between 2.00 and 3.08 GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (605 additional authors not shown)
Abstract:
Using data samples collected at center-of-mass energies between 2.000 and 3.080 GeV with the BESIII detector operating at the BEPCII collider, a partial-wave analysis is performed on the process $e^+e^-\toηπ^+π^-$. In addition to the dominant $e^+e^-\toρη$ component, the $e^+e^-\to a_2(1320)π$ process is also sizeable, contributing up to 24% of the total reaction. The measured cross sections of th…
▽ More
Using data samples collected at center-of-mass energies between 2.000 and 3.080 GeV with the BESIII detector operating at the BEPCII collider, a partial-wave analysis is performed on the process $e^+e^-\toηπ^+π^-$. In addition to the dominant $e^+e^-\toρη$ component, the $e^+e^-\to a_2(1320)π$ process is also sizeable, contributing up to 24% of the total reaction. The measured cross sections of the process $e^+e^-\toηπ^+π^-$ are systematically higher than those of BaBar by more than $3σ$ at center-of-mass energies between 2.000 and 2.300 GeV. In the cross section lineshape for $e^+e^-\to a_2(1320)π$, a resonant structure is observed with a significance of $5.5σ$, with $M=(2044\pm31\pm4)$ MeV/$c^2$, $Γ=(163\pm69\pm24)$ MeV and $\mathcal{B_{R}}\cdotΓ_{e^+e^-}^{R}=(34.6\pm17.1\pm6.0)$ eV or $(137.1\pm73.3\pm2.1)$ eV. In the cross section lineshape for $e^+e^-\toρη$, an evidence of a dip structure around 2180 MeV/$c^2$ is observed with statistical significance of $3.0σ$.
△ Less
Submitted 28 November, 2023; v1 submitted 16 October, 2023;
originally announced October 2023.
-
A Thorough Search for Short Timescale Periodicity in Five Repeating FRBs
Authors:
Chen Du,
Yong-Feng Huang,
Zhi-Bin Zhang,
Alexander Rodin,
Viktoriya Fedorova,
Abdusattar Kurban,
Di Li
Abstract:
Fast Radio Bursts (FRBs) are bright radio transients with millisecond durations which typically occur at extragalactic distances. The association of FRB 20200428 with the Galactic magnetar SGR J1935+2154 strongly indicates that they could originate from neutron stars, which naturally leads to the expectation that periodicity connected with the spinning of magnetars should exist in the activities o…
▽ More
Fast Radio Bursts (FRBs) are bright radio transients with millisecond durations which typically occur at extragalactic distances. The association of FRB 20200428 with the Galactic magnetar SGR J1935+2154 strongly indicates that they could originate from neutron stars, which naturally leads to the expectation that periodicity connected with the spinning of magnetars should exist in the activities of repeating FRBs. However, previous studies have failed to find any signatures supporting such a conjecture. Here we perform a thorough search for short timescale periodicity in the five most active repeating sources, i.e. FRBs 20121102A, 20180916B, 20190520B, 20200120E, and 20201124A. Three different methods are employed, including the phase folding algorithm, the Schuster periodogram and the Lomb-Scargle periodogram. For the two most active repeaters from which more than 1600 bursts have been detected, i.e. FRB 20121102A and FRB 20201124A, more in-depth period searches are conducted by considering various burst properties such as the pulse width, peak flux, fluence, and the brightness temperature. For these two repeaters, we have also selected those days on which a large number of bursts were detected and performed periodicity analysis based on the single-day bursts. No periodicity in a period range of 1 ms-1000 s is found in all the efforts, although possible existence of a very short period between 1 ms-10 ms still could not be completely excluded for FRBs 20200120E and 20201124A due to limited timing accuracy of currently available observations. Implications of such a null result on the theoretical models of FRBs are discussed.
△ Less
Submitted 27 October, 2023; v1 submitted 13 October, 2023;
originally announced October 2023.
-
Search for $J/ψ$ weak decays containing $D$ meson
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (600 additional authors not shown)
Abstract:
Using a sample of about 10 billion $J/ψ$ events with the BESIII detector, we search for the weak decays of $J/ψ\to \bar{D}^0π^0 + c.c.$, $J/ψ\to \bar{D}^0η+ c.c.$, $J/ψ\to \bar{D}^0ρ^0 + c.c.$, $J/ψ\to D^-π^+ + c.c.$, and $J/ψ\to D^-ρ^+ + c.c.$. Since no significant signal is observed, we set the upper limits of the branching fractions of these decays to be…
▽ More
Using a sample of about 10 billion $J/ψ$ events with the BESIII detector, we search for the weak decays of $J/ψ\to \bar{D}^0π^0 + c.c.$, $J/ψ\to \bar{D}^0η+ c.c.$, $J/ψ\to \bar{D}^0ρ^0 + c.c.$, $J/ψ\to D^-π^+ + c.c.$, and $J/ψ\to D^-ρ^+ + c.c.$. Since no significant signal is observed, we set the upper limits of the branching fractions of these decays to be $\mathcal{B}(J/ψ\to \bar{D}^0π^0 + c.c.) < 4.7 \times 10^{-7}$, $\mathcal{B}(J/ψ\to \bar{D}^0η+ c.c.) < 6.8 \times 10^{-7}$, $\mathcal{B}(J/ψ\to \bar{D}^0ρ^0 + c.c.) < 5.2 \times 10^{-7}$, $\mathcal{B}(J/ψ\to D^-π^+ + c.c.) < 7.0 \times 10^{-8}$, and $\mathcal{B}(J/ψ\to D^-ρ^+ + c.c.) < 6.0 \times 10^{-7}$ at the 90\% confidence level.
△ Less
Submitted 11 October, 2023;
originally announced October 2023.
-
What Makes for Robust Multi-Modal Models in the Face of Missing Modalities?
Authors:
Siting Li,
Chenzhuang Du,
Yue Zhao,
Yu Huang,
Hang Zhao
Abstract:
With the growing success of multi-modal learning, research on the robustness of multi-modal models, especially when facing situations with missing modalities, is receiving increased attention. Nevertheless, previous studies in this domain exhibit certain limitations, as they often lack theoretical insights or their methodologies are tied to specific network architectures or modalities. We model th…
▽ More
With the growing success of multi-modal learning, research on the robustness of multi-modal models, especially when facing situations with missing modalities, is receiving increased attention. Nevertheless, previous studies in this domain exhibit certain limitations, as they often lack theoretical insights or their methodologies are tied to specific network architectures or modalities. We model the scenarios of multi-modal models encountering missing modalities from an information-theoretic perspective and illustrate that the performance ceiling in such scenarios can be approached by efficiently utilizing the information inherent in non-missing modalities. In practice, there are two key aspects: (1) The encoder should be able to extract sufficiently good features from the non-missing modality; (2) The extracted features should be robust enough not to be influenced by noise during the fusion process across modalities. To this end, we introduce Uni-Modal Ensemble with Missing Modality Adaptation (UME-MMA). UME-MMA employs uni-modal pre-trained weights for the multi-modal model to enhance feature extraction and utilizes missing modality data augmentation techniques to better adapt to situations with missing modalities. Apart from that, UME-MMA, built on a late-fusion learning framework, allows for the plug-and-play use of various encoders, making it suitable for a wide range of modalities and enabling seamless integration of large-scale pre-trained encoders to further enhance performance. And we demonstrate UME-MMA's effectiveness in audio-visual datasets~(e.g., AV-MNIST, Kinetics-Sound, AVE) and vision-language datasets~(e.g., MM-IMDB, UPMC Food101).
△ Less
Submitted 10 October, 2023;
originally announced October 2023.
-
Improving Discriminative Multi-Modal Learning with Large-Scale Pre-Trained Models
Authors:
Chenzhuang Du,
Yue Zhao,
Chonghua Liao,
Jiacheng You,
Jie Fu,
Hang Zhao
Abstract:
This paper investigates how to better leverage large-scale pre-trained uni-modal models to further enhance discriminative multi-modal learning. Even when fine-tuned with only uni-modal data, these models can outperform previous multi-modal models in certain tasks. It's clear that their incorporation into multi-modal learning would significantly improve performance. However, multi-modal learning wi…
▽ More
This paper investigates how to better leverage large-scale pre-trained uni-modal models to further enhance discriminative multi-modal learning. Even when fine-tuned with only uni-modal data, these models can outperform previous multi-modal models in certain tasks. It's clear that their incorporation into multi-modal learning would significantly improve performance. However, multi-modal learning with these models still suffers from insufficient learning of uni-modal features, which weakens the resulting multi-modal model's generalization ability. While fine-tuning uni-modal models separately and then aggregating their predictions is straightforward, it doesn't allow for adequate adaptation between modalities, also leading to sub-optimal results. To this end, we introduce Multi-Modal Low-Rank Adaptation learning (MMLoRA). By freezing the weights of uni-modal fine-tuned models, adding extra trainable rank decomposition matrices to them, and subsequently performing multi-modal joint training, our method enhances adaptation between modalities and boosts overall performance. We demonstrate the effectiveness of MMLoRA on three dataset categories: audio-visual (e.g., AVE, Kinetics-Sound, CREMA-D), vision-language (e.g., MM-IMDB, UPMC Food101), and RGB-Optical Flow (UCF101).
△ Less
Submitted 8 October, 2023;
originally announced October 2023.
-
Measurement of $e^{+}e^{-}\rightarrowηJ/ψ$ Cross Section from $\sqrt{s}=$ 3.808 GeV to 4.951 GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (608 additional authors not shown)
Abstract:
Using data samples with an integrated luminosity of 22.42 fb$^{-1}$ collected by the BESIII detector operating at the BEPCII storage ring, we measure the cross sections of the $e^{+}e^{-}\rightarrow\etaJ/ψ$ process at center-of-mass energies from 3.808 to 4.951 GeV. Three structures are observed in the line shape of the measured cross sections. A maximum-likelihood fit with $ψ(4040)$, two addition…
▽ More
Using data samples with an integrated luminosity of 22.42 fb$^{-1}$ collected by the BESIII detector operating at the BEPCII storage ring, we measure the cross sections of the $e^{+}e^{-}\rightarrow\etaJ/ψ$ process at center-of-mass energies from 3.808 to 4.951 GeV. Three structures are observed in the line shape of the measured cross sections. A maximum-likelihood fit with $ψ(4040)$, two additional resonances, and a non-resonant component is performed. The mass and width of the first additional state are $(4219.7\pm2.5\pm4.5) \rm{MeV}/\rm{c}^2$ and $(80.7\pm4.4\pm1.4) \rm{MeV}$, respectively, consistent with the $ψ(4230)$. For the second state, the mass and width are $(4386\pm13\pm17) \rm{MeV}/\rm{c}^2$ and $(177\pm32\pm13) \rm{MeV}$, respectively, consistent with the $ψ(4360)$. The first uncertainties are statistical and the second ones are systematic. The statistical significance of $ψ(4040)$ is $8.0σ$ and those for $ψ(4230)$ and $ψ(4360)$ are more than $10.0σ$.
△ Less
Submitted 5 October, 2023;
originally announced October 2023.
-
On Memorization in Diffusion Models
Authors:
Xiangming Gu,
Chao Du,
Tianyu Pang,
Chongxuan Li,
Min Lin,
Ye Wang
Abstract:
Due to their capacity to generate novel and high-quality samples, diffusion models have attracted significant research interest in recent years. Notably, the typical training objective of diffusion models, i.e., denoising score matching, has a closed-form optimal solution that can only generate training data replicating samples. This indicates that a memorization behavior is theoretically expected…
▽ More
Due to their capacity to generate novel and high-quality samples, diffusion models have attracted significant research interest in recent years. Notably, the typical training objective of diffusion models, i.e., denoising score matching, has a closed-form optimal solution that can only generate training data replicating samples. This indicates that a memorization behavior is theoretically expected, which contradicts the common generalization ability of state-of-the-art diffusion models, and thus calls for a deeper understanding. Looking into this, we first observe that memorization behaviors tend to occur on smaller-sized datasets, which motivates our definition of effective model memorization (EMM), a metric measuring the maximum size of training data at which a learned diffusion model approximates its theoretical optimum. Then, we quantify the impact of the influential factors on these memorization behaviors in terms of EMM, focusing primarily on data distribution, model configuration, and training procedure. Besides comprehensive empirical results identifying the influential factors, we surprisingly find that conditioning training data on uninformative random labels can significantly trigger the memorization in diffusion models. Our study holds practical significance for diffusion model users and offers clues to theoretical research in deep generative models. Code is available at https://github.com/sail-sg/DiffMemorize.
△ Less
Submitted 4 October, 2023;
originally announced October 2023.
-
First measurement of $ΛN$ inelastic scattering with $Λ$ from $e^{+} e^{-} \rightarrow J/ψ\to Λ\barΛ$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (626 additional authors not shown)
Abstract:
Using an $e^+ e^-$ collision data sample of $(10087 \pm 44)\times10^6 ~J/ψ$ events taken at the center-of-mass energy of $3.097~\rm{GeV}$ by the BESIII detector at the BEPCII collider, the process $Λ+N \rightarrow Σ^+ + X$ is studied for the first time employing a novel method. The $Σ^{+}$ hyperons are produced by the collisions of $Λ$ hyperons from $J/ψ$ decays with nuclei in the material of the…
▽ More
Using an $e^+ e^-$ collision data sample of $(10087 \pm 44)\times10^6 ~J/ψ$ events taken at the center-of-mass energy of $3.097~\rm{GeV}$ by the BESIII detector at the BEPCII collider, the process $Λ+N \rightarrow Σ^+ + X$ is studied for the first time employing a novel method. The $Σ^{+}$ hyperons are produced by the collisions of $Λ$ hyperons from $J/ψ$ decays with nuclei in the material of the BESIII detector. The total cross section of $Λ+ ^{9}{\rm Be} \rightarrow Σ^+ + X$ is measured to be $σ= (37.3 \pm 4.7 \pm 3.5)~{\rm mb}$ at $Λ$ beam momenta within $[1.057, 1.091]~{\rm GeV}/c$, where the uncertainties are statistical and systematic, respectively. This analysis is the first study of $Λ$-nucleon interactions at an $e^+ e^-$ collider, providing information and constraints relevant for the strong-interaction potential, the origin of color confinement, the unified model for baryon-baryon interactions, and the internal structure of neutron stars.
△ Less
Submitted 1 October, 2023;
originally announced October 2023.
-
Updated measurements of the M1 transition $ψ(3686) \to γη_{c}(2S)$ with $η_{c}(2S) \to K \bar{K} π$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (609 additional authors not shown)
Abstract:
Based on a data sample of $(27.08 \pm 0.14 ) \times 10^8~ψ(3686)$ events collected with the BESIII detector at the BEPCII collider, the M1 transition $ψ(3686) \to γη_{c}(2S)$ with $η_{c}(2S) \to K\bar{K}π$ is studied, where $K\bar{K}π$ is $K^{+} K^{-} π^{0}$ or $K_{S}^{0}K^{\pm}π^{\mp}$. The mass and width of the $η_{c}(2S)$ are measured to be $(3637.8 \pm 0.8 (\rm {stat}) \pm 0.2 (\rm {syst}))$ M…
▽ More
Based on a data sample of $(27.08 \pm 0.14 ) \times 10^8~ψ(3686)$ events collected with the BESIII detector at the BEPCII collider, the M1 transition $ψ(3686) \to γη_{c}(2S)$ with $η_{c}(2S) \to K\bar{K}π$ is studied, where $K\bar{K}π$ is $K^{+} K^{-} π^{0}$ or $K_{S}^{0}K^{\pm}π^{\mp}$. The mass and width of the $η_{c}(2S)$ are measured to be $(3637.8 \pm 0.8 (\rm {stat}) \pm 0.2 (\rm {syst}))$ MeV/$c^{2}$ and $(10.5 \pm 1.7 (\rm {stat}) \pm 3.5 (\rm {syst}))$ MeV, respectively. The product branching fraction $\mathcal{B}\left(ψ(3686) \rightarrow γη_{c}(2 S)\right) \times \mathcal{B}(η_{c}(2 S) \rightarrow K \bar{K} π)$ is determined to be $(0.97 \pm 0.06 (\rm {stat}) \pm 0.09 (\rm {syst})) \times 10^{-5}$. Using $\mathcal{BR}(η_{c}(2S)\to K\bar{K}π)=(1.86^{+0.68}_{-0.49})\%$, we obtain the branching fraction of the radiative transition to be $\mathcal{BR}(ψ(3686) \to γη_{c}(2S)) = (5.2 \pm 0.3 (\rm {stat}) \pm 0.5 (\rm {syst}) ^{+1.9}_{-1.4} (extr)) \times 10^{-4}$, where the third uncertainty is due to the quoted $\mathcal{BR}(η_{c}(2S) \to K\bar{K}π)$.
△ Less
Submitted 26 September, 2023;
originally announced September 2023.
-
Investigation of the $ΔI = 1/2$ rule and test of CP violation through the measurement of decay asymmetry parameters in $Ξ^-$ decays
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (604 additional authors not shown)
Abstract:
Using $(10087\pm44)\times 10^{6}$ $J/ψ$ events collected with the BESIII detector, numerous $Ξ^-$ and $Λ$ decay asymmetry parameters are simultaneously determined from the process $J/ψ\to Ξ^- \barΞ^+ \to Λ(pπ^-) π^- \barΛ(\bar{n} π^0) π^+$ and its charge-conjugate channel. The precisions of $α_0$ for $Λ\to nπ^0$ and $\barα_0$ for $\barΛ \to \bar{n}π^0$ compared to world averages are improved by fa…
▽ More
Using $(10087\pm44)\times 10^{6}$ $J/ψ$ events collected with the BESIII detector, numerous $Ξ^-$ and $Λ$ decay asymmetry parameters are simultaneously determined from the process $J/ψ\to Ξ^- \barΞ^+ \to Λ(pπ^-) π^- \barΛ(\bar{n} π^0) π^+$ and its charge-conjugate channel. The precisions of $α_0$ for $Λ\to nπ^0$ and $\barα_0$ for $\barΛ \to \bar{n}π^0$ compared to world averages are improved by factors of 4 and 1.7, respectively. The ratio of decay asymmetry parameters of $Λ\to nπ^0$ to that of $Λ\to pπ^-$, $\langle α_0 \rangle/ \langle α_{Λ-} \rangle $, is determined to be $ 0.873 \pm 0.012^{+0.011}_{-0.010}$, where the first and the second uncertainties are statistical and systematic, respectively. The ratio is smaller than unity more than $5σ$, which signifies the existence of the $ΔI = 3/2$ transition in $Λ$ for the first time. Beside, we test for CP violation in $Ξ^- \to Λπ^-$ and in $Λ\to n π^{0}$ with the best precision to date.
△ Less
Submitted 8 January, 2024; v1 submitted 26 September, 2023;
originally announced September 2023.
-
Measurement of the $e^{+}e^{-} \to K_{S}^{0} K_{L}^{0} π^{0}$ cross sections from $\sqrt{s}=$ 2.000 to 3.080 GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (604 additional authors not shown)
Abstract:
Based on $e^{+}e^{-}$ collision data collected at center-of-mass energies from 2.000 to 3.080 GeV by the BESIII detector at the BEPCII collider, a partial wave analysis is performed for the process $e^{+}e^{-}\to K_{S}^{0} K_{L}^{0} π^{0}$. The results allow the Born cross sections of the process $e^{+}e^{-}\to K_{S}^{0} K_{L}^{0} π^{0}$, as well as its subprocesses…
▽ More
Based on $e^{+}e^{-}$ collision data collected at center-of-mass energies from 2.000 to 3.080 GeV by the BESIII detector at the BEPCII collider, a partial wave analysis is performed for the process $e^{+}e^{-}\to K_{S}^{0} K_{L}^{0} π^{0}$. The results allow the Born cross sections of the process $e^{+}e^{-}\to K_{S}^{0} K_{L}^{0} π^{0}$, as well as its subprocesses $e^{+}e^{-}\to K^{*}(892)^{0}\bar{K}^{0}$ and $K^{*}_{2}(1430)^{0}\bar{K}^{0}$ to be measured. The Born cross sections for $e^{+}e^{-}\to K_{S}^{0}K_{L}^{0}π^{0}$ are consistent with previous measurements by BaBar, but with substantially improved precision. The Born cross section lineshape of the process $e^{+}e^{-}\to K^{*}(892)^{0}\bar{K}^{0}$ is consistent with a vector meson state around 2.2 GeV with a significance of 3.2$σ$. A Breit-Wigner fit determines its mass as $M_Y=(2164.7\pm9.1\pm3.1)~{\rm{MeV}}/c^{2}$ and its width as $Γ_{Y}=(32.4\pm21.0\pm1.8)~\rm{MeV}$.
△ Less
Submitted 26 February, 2024; v1 submitted 25 September, 2023;
originally announced September 2023.
-
Towards Universal Speech Discrete Tokens: A Case Study for ASR and TTS
Authors:
Yifan Yang,
Feiyu Shen,
Chenpeng Du,
Ziyang Ma,
Kai Yu,
Daniel Povey,
Xie Chen
Abstract:
Self-supervised learning (SSL) proficiency in speech-related tasks has driven research into utilizing discrete tokens for speech tasks like recognition and translation, which offer lower storage requirements and great potential to employ natural language processing techniques. However, these studies, mainly single-task focused, faced challenges like overfitting and performance degradation in speec…
▽ More
Self-supervised learning (SSL) proficiency in speech-related tasks has driven research into utilizing discrete tokens for speech tasks like recognition and translation, which offer lower storage requirements and great potential to employ natural language processing techniques. However, these studies, mainly single-task focused, faced challenges like overfitting and performance degradation in speech recognition tasks, often at the cost of sacrificing performance in multi-task scenarios. This study presents a comprehensive comparison and optimization of discrete tokens generated by various leading SSL models in speech recognition and synthesis tasks. We aim to explore the universality of speech discrete tokens across multiple speech tasks. Experimental results demonstrate that discrete tokens achieve comparable results against systems trained on FBank features in speech recognition tasks and outperform mel-spectrogram features in speech synthesis in subjective and objective metrics. These findings suggest that universal discrete tokens have enormous potential in various speech-related tasks. Our work is open-source and publicly available at https://github.com/k2-fsa/icefall.
△ Less
Submitted 14 December, 2023; v1 submitted 13 September, 2023;
originally announced September 2023.
-
Measurements of the absolute branching fractions of $Ω^-$ decays and test of the $ΔI = 1/2$ rule
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (599 additional authors not shown)
Abstract:
Based on a data set of $(27.12\pm0.10)\times 10^8$ $ψ(3686)$ events collected at the BESIII experiment, the absolute branching fractions of the three dominant $Ω^-$ decays are measured to be $\mathcal{B}_{Ω^- \to Ξ^0 π^-} = (25.03\pm0.44\pm0.53)\%$, $\mathcal{B}_{Ω^- \to Ξ^- π^0} = (8.43\pm0.52\pm0.28)\%$, and $\mathcal{B}_{Ω^- \to ΛK^-} = (66.3\pm0.8\pm2.0)\%$, where the first and second uncertai…
▽ More
Based on a data set of $(27.12\pm0.10)\times 10^8$ $ψ(3686)$ events collected at the BESIII experiment, the absolute branching fractions of the three dominant $Ω^-$ decays are measured to be $\mathcal{B}_{Ω^- \to Ξ^0 π^-} = (25.03\pm0.44\pm0.53)\%$, $\mathcal{B}_{Ω^- \to Ξ^- π^0} = (8.43\pm0.52\pm0.28)\%$, and $\mathcal{B}_{Ω^- \to ΛK^-} = (66.3\pm0.8\pm2.0)\%$, where the first and second uncertainties are statistical and systematic, respectively. The ratio between $\mathcal{B}_{Ω^- \to Ξ^0 π^-}$ and $\mathcal{B}_{Ω^- \to Ξ^- π^0}$ is determined to be $2.97\pm0.19\pm0.11$, which is in good agreement with the PDG value of $2.74\pm0.15$, but greater by more than four standard deviations than the theoretical prediction of 2 obtained from the $ΔI = 1/2$ rule.
△ Less
Submitted 12 September, 2023;
originally announced September 2023.
-
Observation of $D^{+}\to K_{S}^{0}a_{0}(980)^{+}$ in the amplitude analysis of $D^{+} \to K_{S}^{0}π^+η$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (604 additional authors not shown)
Abstract:
We perform for the first time an amplitude analysis of the decay $D^{+}\to K_{S}^{0}π^+η$ and report the observation of the decay $D^{+}\to K_{S}^{0}a_{0}(980)^{+}$ using 2.93 fb$^{-1}$ of $e^+e^-$ collision data taken at a center-of-mass energy of 3.773 GeV with the BESIII detector. As the only W-annihilation free decay among $D$ to $a_{0}(980)$-pseudoscalar, $D^{+}\to K_{S}^{0}a_{0}(980)^{+}$ is…
▽ More
We perform for the first time an amplitude analysis of the decay $D^{+}\to K_{S}^{0}π^+η$ and report the observation of the decay $D^{+}\to K_{S}^{0}a_{0}(980)^{+}$ using 2.93 fb$^{-1}$ of $e^+e^-$ collision data taken at a center-of-mass energy of 3.773 GeV with the BESIII detector. As the only W-annihilation free decay among $D$ to $a_{0}(980)$-pseudoscalar, $D^{+}\to K_{S}^{0}a_{0}(980)^{+}$ is the ideal decay to extract the contributions of the external and internal $W$-emission amplitudes involving $a_{0}(980)$ and study the final-state interactions. The absolute branching fraction of $D^{+}\to K_{S}^{0}π^+η$ is measured to be $(1.27\pm0.04_{\rm stat.}\pm0.03_{\rm syst.})\%$. The product branching fractions of $D^{+}\to K_{S}^{0}a_{0}(980)^{+}$ with $a_{0}(980)^{+}\to π^+η$ and $D^{+}\to π^+ K_0^*(1430)^0$ with $K_0^*(1430)^0\to K_{S}^{0}η$ are measured to be $(1.33\pm0.05_{\rm stat.}\pm0.04_{\rm syst.})\%$ and $(0.14\pm0.03_{\rm stat.}\pm0.01_{\rm syst.})\%$, respectively.
△ Less
Submitted 29 March, 2024; v1 submitted 11 September, 2023;
originally announced September 2023.
-
Observation of the Singly Cabibbo-Suppressed Decay $Λ_{c}^{+}\to Σ^{-}K^{+}π^{+}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (605 additional authors not shown)
Abstract:
The singly Cabibbo-suppressed decay $Λ_{c}^{+}\to Σ^{-}K^{+}π^{+}$ is observed for the first time with a statistical significance of $6.4σ$ by using 4.5 fb$^{-1}$ of $e^+e^-$ collision data collected at center-of-mass energies between 4.600 and 4.699 GeV with the BESIII detector at BEPCII. The absolute branching fraction of $Λ_{c}^{+}\to Σ^{-}K^{+}π^{+}$ is measured to be…
▽ More
The singly Cabibbo-suppressed decay $Λ_{c}^{+}\to Σ^{-}K^{+}π^{+}$ is observed for the first time with a statistical significance of $6.4σ$ by using 4.5 fb$^{-1}$ of $e^+e^-$ collision data collected at center-of-mass energies between 4.600 and 4.699 GeV with the BESIII detector at BEPCII. The absolute branching fraction of $Λ_{c}^{+}\to Σ^{-}K^{+}π^{+}$ is measured to be $(3.8\pm1.3_{\rm stat}\pm0.2_{\rm syst})\times 10^{-4}$ in a model-independent approach. This is the first observation of a Cabibbo-suppressed $Λ_{c}^{+}$ decay involving $Σ^-$ in the final state. The ratio of branching fractions between $Λ_{c}^{+}\to Σ^{-}K^{+}π^{+}$ and the Cabibbo-favored decay $Λ_{c}^{+}\to Σ^- π^+π^+$ is calculated to be $(0.4 \pm 0.1)s_{c}^{2}$, where $s_{c} \equiv \sinθ_c = 0.2248$ with $θ_c$ the Cabibbo mixing angle. This ratio significantly deviates from $1.0s_{c}^{2}$ and provides important information for the understanding of nonfactorization contributions in $Λ_{c}^{+}$ decays.
△ Less
Submitted 8 May, 2024; v1 submitted 11 September, 2023;
originally announced September 2023.
-
VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching
Authors:
Yiwei Guo,
Chenpeng Du,
Ziyang Ma,
Xie Chen,
Kai Yu
Abstract:
Although diffusion models in text-to-speech have become a popular choice due to their strong generative ability, the intrinsic complexity of sampling from diffusion models harms their efficiency. Alternatively, we propose VoiceFlow, an acoustic model that utilizes a rectified flow matching algorithm to achieve high synthesis quality with a limited number of sampling steps. VoiceFlow formulates the…
▽ More
Although diffusion models in text-to-speech have become a popular choice due to their strong generative ability, the intrinsic complexity of sampling from diffusion models harms their efficiency. Alternatively, we propose VoiceFlow, an acoustic model that utilizes a rectified flow matching algorithm to achieve high synthesis quality with a limited number of sampling steps. VoiceFlow formulates the process of generating mel-spectrograms into an ordinary differential equation conditional on text inputs, whose vector field is then estimated. The rectified flow technique then effectively straightens its sampling trajectory for efficient synthesis. Subjective and objective evaluations on both single and multi-speaker corpora showed the superior synthesis quality of VoiceFlow compared to the diffusion counterpart. Ablation studies further verified the validity of the rectified flow technique in VoiceFlow.
△ Less
Submitted 16 January, 2024; v1 submitted 10 September, 2023;
originally announced September 2023.
-
Measurement of the cross section of $e^+e^-\rightarrowΞ^{-}\barΞ^{+}$ at center-of-mass energies between 3.510 and 4.843 GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (599 additional authors not shown)
Abstract:
Using $e^+e^-$ collision data corresponding to a total integrated luminosity of 12.9 $fb^{-1}$ collected with the BESIII detector at the BEPCII collider, the exclusive Born cross sections and the effective form factors of the reaction $e^+e^-\rightarrowΞ^{-}\barΞ^{+}$ are measured via the single baryon-tag method at 23 center-of-mass energies between 3.510 and 4.843 GeV. Evidence for the decay…
▽ More
Using $e^+e^-$ collision data corresponding to a total integrated luminosity of 12.9 $fb^{-1}$ collected with the BESIII detector at the BEPCII collider, the exclusive Born cross sections and the effective form factors of the reaction $e^+e^-\rightarrowΞ^{-}\barΞ^{+}$ are measured via the single baryon-tag method at 23 center-of-mass energies between 3.510 and 4.843 GeV. Evidence for the decay $ψ(3770)\rightarrowΞ^{-}\barΞ^{+}$ is observed with a significance of 4.5$σ$ by analyzing the measured cross sections together with earlier BESIII results. For the other charmonium(-like) states $ψ(4040)$, $ψ(4160)$, $Y(4230)$, $Y(4360)$, $ψ(4415)$, and $Y(4660)$, no significant signal of their decay to $Ξ^-\bar Ξ^+$ is found. For these states, upper limits of the products of the branching fraction and the electronic partial width at the 90% confidence level are provided.
△ Less
Submitted 30 November, 2023; v1 submitted 8 September, 2023;
originally announced September 2023.
-
Search for the semileptonic decays $D^+_s \to K_1(1270)^0 e^+ν_e$ and $D^+_s \to b_1(1235)^0 e^+ν_e$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (601 additional authors not shown)
Abstract:
By analyzing 7.33\,fb$^{-1}$ of $e^+e^-$ collision data collected at center-of-mass energies between 4.128 and 4.226 GeV with the BESIII detector, we search for the semileptonic decays $D^+_s \to K_1(1270)^0 e^+ν_e$ and $D^+_s \to b_1(1235)^0 e^+ν_e$ for the first time. No significant signals are observed for either decay mode. The upper limits on the (product) branching fractions are determined t…
▽ More
By analyzing 7.33\,fb$^{-1}$ of $e^+e^-$ collision data collected at center-of-mass energies between 4.128 and 4.226 GeV with the BESIII detector, we search for the semileptonic decays $D^+_s \to K_1(1270)^0 e^+ν_e$ and $D^+_s \to b_1(1235)^0 e^+ν_e$ for the first time. No significant signals are observed for either decay mode. The upper limits on the (product) branching fractions are determined to be ${\mathcal B}[D^+_s \to K_1(1270)^0 e^+ν_e] < 4.1\times 10^{-4}$ and ${\mathcal B}[D^+_s \to b_1(1235)^0 e^+ν_e]\cdot {\mathcal B}[b_1(1235)^0\to ωπ^0] < 6.4\times 10^{-4}$ at 90\% confidence level.
△ Less
Submitted 7 September, 2023;
originally announced September 2023.
-
First Measurement of the Decay Asymmetry in the pure W-boson-exchange Decay $Λ_{c}^{+}\toΞ^{0}K^{+}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (618 additional authors not shown)
Abstract:
Based on $4.4~\text{fb}^{-1}$ of $e^{+}e^{-}$ annihilation data collected at the center-of-mass energies between $4.60$ and $4.70~\text{GeV}$ with the BESIII detector at the BEPCII collider, the pure \textit{W}-boson-exchange decay $Λ_{c}^{+}\toΞ^{0}K^{+}$ is studied with a full angular analysis. The corresponding decay asymmetry is measured for the first time to be…
▽ More
Based on $4.4~\text{fb}^{-1}$ of $e^{+}e^{-}$ annihilation data collected at the center-of-mass energies between $4.60$ and $4.70~\text{GeV}$ with the BESIII detector at the BEPCII collider, the pure \textit{W}-boson-exchange decay $Λ_{c}^{+}\toΞ^{0}K^{+}$ is studied with a full angular analysis. The corresponding decay asymmetry is measured for the first time to be $α_{Ξ^{0}K^{+}}=0.01\pm0.16({\rm stat.})\pm0.03({\rm syst.})$. This result reflects the non-interference effect between the $S$- and $P$-wave amplitudes. The phase shift between $S$- and $P$-wave amplitudes has two solutions, which are $δ_{p}-δ_{s}=-1.55\pm0.25({\rm stat.})\pm0.05({\rm syst.})~\text{rad}$ or $1.59\pm0.25({\rm stat.})\pm0.05({\rm syst.})~\text{rad}$.
△ Less
Submitted 20 January, 2024; v1 submitted 6 September, 2023;
originally announced September 2023.
-
A coupled-channel analysis of the $X(3872)$ lineshape with BESIII data
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (600 additional authors not shown)
Abstract:
We perform a study of the $X(3872)$ lineshape using the data samples of $e^+e^-\toγX(3872)$, $X(3872)\to D^0\bar{D}^0 π^0$ and $π^+π^- J/ψ$ collected with the BESIII detector. The effects of the coupled-channels and the off-shell $D^{*0}$ are included in the parameterization of the lineshape. The lineshape mass parameter is obtained to be $M_{X}=(3871.63\pm 0.13^{+0.06}_{-0.05})$ MeV. Two poles ar…
▽ More
We perform a study of the $X(3872)$ lineshape using the data samples of $e^+e^-\toγX(3872)$, $X(3872)\to D^0\bar{D}^0 π^0$ and $π^+π^- J/ψ$ collected with the BESIII detector. The effects of the coupled-channels and the off-shell $D^{*0}$ are included in the parameterization of the lineshape. The lineshape mass parameter is obtained to be $M_{X}=(3871.63\pm 0.13^{+0.06}_{-0.05})$ MeV. Two poles are found on the first and second Riemann sheets corresponding to the $D^{*0}\bar{D}^0$ branch cut. The pole location on the first sheet is much closer to the $D^{*0}\bar{D}^0$ threshold than the other, and is determined to be $7.04\pm0.15^{+0.07}_{-0.08}$ MeV above the $D^0\bar{D}^0π^0$ threshold with an imaginary part $-0.19\pm0.08^{+0.14}_{-0.19}$ MeV.
△ Less
Submitted 4 September, 2023;
originally announced September 2023.
-
Non-Einsteinian Viscosity Reduction in Boron Nitride Nanotube Nanofluids
Authors:
André Guerra,
Adam McElligott,
Chong Yang Du,
Milan Marić,
Alejandro D. Rey,
Phillip Servio
Abstract:
(1) Introduction: Nanoparticles have multiple applications, including drug delivery systems, biosensing, and carbon capture. Non-Einstein-like viscosity reduction has been reported in nanoparticle-polymer blends at low nanoparticle concentrations. More recently, a similar non-Einsteinian viscosity reduction effect has been observed in aqueous ultra-low concentration carbon-based nanofluids. (2) Me…
▽ More
(1) Introduction: Nanoparticles have multiple applications, including drug delivery systems, biosensing, and carbon capture. Non-Einstein-like viscosity reduction has been reported in nanoparticle-polymer blends at low nanoparticle concentrations. More recently, a similar non-Einsteinian viscosity reduction effect has been observed in aqueous ultra-low concentration carbon-based nanofluids. (2) Methods: We use a boron nitride nanotube functionalized with hydrophilic groups in rheological experiments to investigate the viscosity reduction in ultra-low concentration nanofluids (0.1-10 ppm). We measure the dynamic viscosity in an air atmosphere and methane (0-5 MPag) at low temperatures (0-10 C). (3) Results: A negligible effect on the temperature dependence of viscosity was found. Ultra-low concentrations of BNNT reduced the viscosity of the nanofluid by up to 29% at 10 ppm in the presence of methane. The results presented here were compared to similar studies on O-GNF and O-MWCNT nanofluids, which also reported significant viscosity reductions. (4) Conclusions: This work identified a non-Einsteinian viscosity reduction in BNNT nanofluids, which was exacerbated by methane dissolved in the nanofluid.
△ Less
Submitted 1 September, 2023;
originally announced September 2023.
-
Layer-dependent magnetism and spin fluctuations in atomically thin van der Waals magnet CrPS4
Authors:
Mengqi Huang,
Jazmine C. Green,
**gcheng Zhou,
Violet Williams,
Senlei Li,
Hanyi Lu,
Dziga Djugba,
Hailong Wang,
Benedetta Flebus,
Ni Ni,
Chunhui Rita Du
Abstract:
van der Waals (vdW) magnets, an emerging family of two-dimensional (2D) materials, have received tremendous attention due to their rich fundamental physics and significant potential for cutting-edge technological applications. In contrast to the conventional bulk counterparts, vdW magnets exhibit significant tunability of local material properties, such as stacking engineered interlayer coupling a…
▽ More
van der Waals (vdW) magnets, an emerging family of two-dimensional (2D) materials, have received tremendous attention due to their rich fundamental physics and significant potential for cutting-edge technological applications. In contrast to the conventional bulk counterparts, vdW magnets exhibit significant tunability of local material properties, such as stacking engineered interlayer coupling and layer-number dependent magnetic and electronic interactions, which promise to deliver previously unavailable merits to develop multifunctional microelectronic devices. As a further ingredient of this emerging topic, here we report nanoscale quantum sensing and imaging of atomically thin vdW magnet chromium thiophosphate CrPS4, revealing its characteristic layer-dependent 2D static magnetism and dynamic spin fluctuations. We also show a large tunneling magnetoresistance in CrPS4-based spin filter vdW heterostructures. The excellent material stability, robust strategy against environmental degradation, in combination with tailored magnetic properties highlight the potential of CrPS4 in develo** state-of-the-art 2D spintronic devices for next-generation information technologies.
△ Less
Submitted 29 August, 2023;
originally announced August 2023.
-
Observation of a vector charmoniumlike state at 4.7 ${\rm GeV}/c^2$ and search for $Z_{cs}$ in $e^+e^-\to K^+K^-J/ψ$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (599 additional authors not shown)
Abstract:
Using data samples with an integrated luminosity of 5.85~fb$^{-1}$ collected at center-of-mass energies from 4.61 to 4.95 GeV with the BESIII detector operating at the BEPCII storage ring, we measure the cross section for the process $e^+e^-\to K^+K^-J/ψ$. A new resonance with a mass of $M = 4708_{-15}^{+17}\pm21$ MeV/$c^{2}$ and a width of $Γ= 126_{-23}^{+27}\pm30$ MeV is observed in the energy-d…
▽ More
Using data samples with an integrated luminosity of 5.85~fb$^{-1}$ collected at center-of-mass energies from 4.61 to 4.95 GeV with the BESIII detector operating at the BEPCII storage ring, we measure the cross section for the process $e^+e^-\to K^+K^-J/ψ$. A new resonance with a mass of $M = 4708_{-15}^{+17}\pm21$ MeV/$c^{2}$ and a width of $Γ= 126_{-23}^{+27}\pm30$ MeV is observed in the energy-dependent line shape of the $e^+e^-\to K^+K^-J/ψ$ cross section with a significance over $5σ$. The $K^{+}J/ψ$ system is also investigated to search for charged charmoniumlike states, but no significant $Z_{cs}^+$ states are observed. Upper limits on the Born cross sections for $e^+e^-\to K^{-} Z_{cs}(3985)^{+}/K^{-} Z_{cs}(4000)^{+} + c.c.$ with $Z_{cs}(3985)^{\pm}/Z_{cs}(4000)^{\pm}\to K^{\pm} J/ψ$ are reported at 90\% confidence levels. The ratio of branching fractions $\frac{\mathcal{B}(Z_{cs}(3985)^{+}\to K^+ J/ψ)}{\mathcal{B}(Z_{cs}(3985)^{+}\to (\bar{D}^{0}D_s^{*+} + \bar{D}^{*0}D_s^+))}$ is measured to be less than 0.03 at 90\% confidence level.
△ Less
Submitted 24 November, 2023; v1 submitted 29 August, 2023;
originally announced August 2023.
-
Causality-Based Feature Importance Quantifying Methods: PN-FI, PS-FI and PNS-FI
Authors:
Shuxian Du,
Yaxiu Sun,
Changyi Du
Abstract:
In the current ML field models are getting larger and more complex, and data used for model training are also getting larger in quantity and higher in dimensions. Therefore, in order to train better models, and save training time and computational resources, a good Feature Selection (FS) method in the preprocessing stage is necessary. Feature importance (FI) is of great importance since it is the…
▽ More
In the current ML field models are getting larger and more complex, and data used for model training are also getting larger in quantity and higher in dimensions. Therefore, in order to train better models, and save training time and computational resources, a good Feature Selection (FS) method in the preprocessing stage is necessary. Feature importance (FI) is of great importance since it is the basis of feature selection. Therefore, this paper creatively introduces the calculation of PN (the probability of Necessity), PN (the probability of Sufficiency), and PNS (the probability of Necessity and Sufficiency) of Causality into quantifying feature importance and creates 3 new FI measuring methods, PN-FI, which means how much importance a feature has in image recognition tasks, PS-FI that means how much importance a feature has in image generating tasks, and PNS-FI which measures both. The main body of this paper is three RCTs, with whose results we show how PS-FI, PN-FI, and PNS-FI of 3 features, dog nose, dog eyes, and dog mouth are calculated. The experiments show that firstly, FI values are intervals with tight upper and lower bounds. Secondly, the feature dog eyes has the most importance while the other two have almost the same. Thirdly, the bounds of PNS and PN are tighter than the bounds of PS.
△ Less
Submitted 18 September, 2023; v1 submitted 28 August, 2023;
originally announced August 2023.
-
Search for the light hadron decay $χ_{c1}(3872) \to π^{+}π^{-}η$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (600 additional authors not shown)
Abstract:
With a data sample corresponding to an integrated luminosity of 11.5~fb$^{-1}$
collected with the BESIII detector operating at the BEPCII storage ring, for the first time the light hadron decay $χ_{c1}(3872) \rightarrow π^{+}π^{-}η$
is searched for. While no significant signal is observed, the upper limits at the 90\% confidence level for…
▽ More
With a data sample corresponding to an integrated luminosity of 11.5~fb$^{-1}$
collected with the BESIII detector operating at the BEPCII storage ring, for the first time the light hadron decay $χ_{c1}(3872) \rightarrow π^{+}π^{-}η$
is searched for. While no significant signal is observed, the upper limits at the 90\% confidence level for
$σ[e^{+}e^{-} \rightarrow γχ_{c1}(3872)] \mathcal{B}[χ_{c1}(3872) \rightarrow π^{+}π^{-}η]$ at center-of-mass energies from 4.13 to 4.34 GeV are determined.
By normalizing to the $χ_{c1}(3872)\toπ^+π^- J/ψ$ decay channel, a 90\% confidence level upper limit for the branching fraction ratio
$\mathcal{R}=\mathcal{B}[χ_{c1}(3872) \rightarrowπ^{+}π^{-}η]/\mathcal{B}[χ_{c1}(3872) \rightarrow π^{+}π^{-} J/ψ] < 0.12$ is given.
These measurements provide important inputs for understanding the internal structure of the $χ_{c1}(3872)$ resonance.
△ Less
Submitted 19 January, 2024; v1 submitted 26 August, 2023;
originally announced August 2023.
-
Improved measurement of the branching fractions for $J/ψ\toγπ^0$, $γη$ and $γη^\prime$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (598 additional authors not shown)
Abstract:
Using a data sample of $(1.0087\pm 0.0044)\times 10^{10}$ $J/ψ$ events collected with the BESIII detector, the decays of $J/ψ\toγπ^{0} (η, η^\prime)\toγγγ$ are studied. Newly measured branching fractions are $\mathcal{B}$$(J/ψ\toγπ^{0})$=$(3.34\pm 0.02\pm 0.09)\times 10^{-5}$, $\mathcal{B}$$(J/ψ\toγη)$=$(1.096\pm 0.001\pm0.019)\times 10^{-3}$ and $\mathcal{B}$$(J/ψ\toγη^\prime)$=…
▽ More
Using a data sample of $(1.0087\pm 0.0044)\times 10^{10}$ $J/ψ$ events collected with the BESIII detector, the decays of $J/ψ\toγπ^{0} (η, η^\prime)\toγγγ$ are studied. Newly measured branching fractions are $\mathcal{B}$$(J/ψ\toγπ^{0})$=$(3.34\pm 0.02\pm 0.09)\times 10^{-5}$, $\mathcal{B}$$(J/ψ\toγη)$=$(1.096\pm 0.001\pm0.019)\times 10^{-3}$ and $\mathcal{B}$$(J/ψ\toγη^\prime)$=$(5.40\pm 0.01\pm0.11)\times 10^{-3}$, where the first uncertainties are statistical and the second are systematic. These results are consistent with the world average values within two standard deviations. The ratio of partial widths $Γ(J/ψ\toγη^\prime)/Γ(J/ψ\toγη)$ is measured to be $4.93 \pm 0.13$. The singlet-octet pseudoscalar mixing angle $θ_P$ is determined to be $θ_P = -(22.11 \pm0.26)^\circ$ or $-(19.34 \pm 0.34)^\circ$ with two different phenomenological models.
△ Less
Submitted 25 August, 2023;
originally announced August 2023.
-
MindDiffuser: Controlled Image Reconstruction from Human Brain Activity with Semantic and Structural Diffusion
Authors:
Yizhuo Lu,
Changde Du,
Qiongyi zhou,
Dianpeng Wang,
Huiguang He
Abstract:
Reconstructing visual stimuli from brain recordings has been a meaningful and challenging task. Especially, the achievement of precise and controllable image reconstruction bears great significance in propelling the progress and utilization of brain-computer interfaces. Despite the advancements in complex image reconstruction techniques, the challenge persists in achieving a cohesive alignment of…
▽ More
Reconstructing visual stimuli from brain recordings has been a meaningful and challenging task. Especially, the achievement of precise and controllable image reconstruction bears great significance in propelling the progress and utilization of brain-computer interfaces. Despite the advancements in complex image reconstruction techniques, the challenge persists in achieving a cohesive alignment of both semantic (concepts and objects) and structure (position, orientation, and size) with the image stimuli. To address the aforementioned issue, we propose a two-stage image reconstruction model called MindDiffuser. In Stage 1, the VQ-VAE latent representations and the CLIP text embeddings decoded from fMRI are put into Stable Diffusion, which yields a preliminary image that contains semantic information. In Stage 2, we utilize the CLIP visual feature decoded from fMRI as supervisory information, and continually adjust the two feature vectors decoded in Stage 1 through backpropagation to align the structural information. The results of both qualitative and quantitative analyses demonstrate that our model has surpassed the current state-of-the-art models on Natural Scenes Dataset (NSD). The subsequent experimental findings corroborate the neurobiological plausibility of the model, as evidenced by the interpretability of the multimodal feature employed, which align with the corresponding brain responses.
△ Less
Submitted 8 August, 2023;
originally announced August 2023.
-
Auditory Attention Decoding with Task-Related Multi-View Contrastive Learning
Authors:
Xiaoyu Chen,
Changde Du,
Qiongyi Zhou,
Huiguang He
Abstract:
The human brain can easily focus on one speaker and suppress others in scenarios such as a cocktail party. Recently, researchers found that auditory attention can be decoded from the electroencephalogram (EEG) data. However, most existing deep learning methods are difficult to use prior knowledge of different views (that is attended speech and EEG are task-related views) and extract an unsatisfact…
▽ More
The human brain can easily focus on one speaker and suppress others in scenarios such as a cocktail party. Recently, researchers found that auditory attention can be decoded from the electroencephalogram (EEG) data. However, most existing deep learning methods are difficult to use prior knowledge of different views (that is attended speech and EEG are task-related views) and extract an unsatisfactory representation. Inspired by Broadbent's filter model, we decode auditory attention in a multi-view paradigm and extract the most relevant and important information utilizing the missing view. Specifically, we propose an auditory attention decoding (AAD) method based on multi-view VAE with task-related multi-view contrastive (TMC) learning. Employing TMC learning in multi-view VAE can utilize the missing view to accumulate prior knowledge of different views into the fusion of representation, and extract the approximate task-related representation. We examine our method on two popular AAD datasets, and demonstrate the superiority of our method by comparing it to the state-of-the-art method.
△ Less
Submitted 8 August, 2023;
originally announced August 2023.
-
Robust Positive-Unlabeled Learning via Noise Negative Sample Self-correction
Authors:
Zhangchi Zhu,
Lu Wang,
Pu Zhao,
Chao Du,
Wei Zhang,
Hang Dong,
Bo Qiao,
Qingwei Lin,
Saravan Rajmohan,
Dongmei Zhang
Abstract:
Learning from positive and unlabeled data is known as positive-unlabeled (PU) learning in literature and has attracted much attention in recent years. One common approach in PU learning is to sample a set of pseudo-negatives from the unlabeled data using ad-hoc thresholds so that conventional supervised methods can be applied with both positive and negative samples. Owing to the label uncertainty…
▽ More
Learning from positive and unlabeled data is known as positive-unlabeled (PU) learning in literature and has attracted much attention in recent years. One common approach in PU learning is to sample a set of pseudo-negatives from the unlabeled data using ad-hoc thresholds so that conventional supervised methods can be applied with both positive and negative samples. Owing to the label uncertainty among the unlabeled data, errors of misclassifying unlabeled positive samples as negative samples inevitably appear and may even accumulate during the training processes. Those errors often lead to performance degradation and model instability. To mitigate the impact of label uncertainty and improve the robustness of learning with positive and unlabeled data, we propose a new robust PU learning method with a training strategy motivated by the nature of human learning: easy cases should be learned first. Similar intuition has been utilized in curriculum learning to only use easier cases in the early stage of training before introducing more complex cases. Specifically, we utilize a novel ``hardness'' measure to distinguish unlabeled samples with a high chance of being negative from unlabeled samples with large label noise. An iterative training strategy is then implemented to fine-tune the selection of negative samples during the training process in an iterative manner to include more ``easy'' samples in the early stage of training. Extensive experimental validations over a wide range of learning tasks show that this approach can effectively improve the accuracy and stability of learning with positive and unlabeled data. Our code is available at https://github.com/woriazzc/Robust-PU
△ Less
Submitted 1 August, 2023;
originally announced August 2023.
-
Discovery of the shell structure via break radii in the outer halo of the Milky Way
Authors:
Dashuang Ye,
Cuihua Du,
Jianrong Shi,
Jun Ma
Abstract:
Based on the \textit{Gaia} DR3 RR Lyrae catalog, we use two methods to fit the density profiles with an improved broken power law, and find that there are two break radii coinciding with the two apocenter pile-ups of high-eccentricity Gaia-Sausage-Enceladus (GSE) merger. Also, there is a break caused by the Sagittarius (Sgr) stream. Combining the positions of all breaks, we briefly analyze the met…
▽ More
Based on the \textit{Gaia} DR3 RR Lyrae catalog, we use two methods to fit the density profiles with an improved broken power law, and find that there are two break radii coinciding with the two apocenter pile-ups of high-eccentricity Gaia-Sausage-Enceladus (GSE) merger. Also, there is a break caused by the Sagittarius (Sgr) stream. Combining the positions of all breaks, we briefly analyze the metallicity and its dispersion as a function of $r$ as well as its distribution in cylindrical coordinates. For the clean sample, the $z\text{-to-}x$ ellipsoid axial ratio $q$ in $36\,{\rm kpc}\,\textless\,r\,\textless\,96\,{\rm kpc}$ becomes much smaller than that of the inner halo $(r\,\textless\,36\,{\rm kpc})$, while the major axis has a large uncertainty in the region of $36-66\,{\rm kpc}$ and the one in the region of $66-96\,{\rm kpc}$ is obviously different from that dominated by the Hercules-Aquila Cloud (HAC) and the Virgo Overdensity (VOD) in the inner halo, which indicates that there is an over-density structure distributed at low zenithal angles. Finally, we found that the over-density structure in the outer halo ($r\,\textgreater\,50\,{\rm kpc}$) is shell-shaped and relatively metal-rich compared to the outer background halo. We conclude that the shells could be the apocenter pile-ups of the high-eccentricity GSE merger, which is supported by previous numerical simulations.
△ Less
Submitted 30 July, 2023;
originally announced July 2023.
-
Determination of the $Σ^{+}$ Timelike Electromagnetic Form Factors
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (604 additional authors not shown)
Abstract:
Based on data samples collected with the BESIII detector at the BEPCII collider, the process $e^{+}e^{-} \to Σ^{+}\barΣ^{-}$ is studied at center-of-mass energies $\sqrt{s}$ = 2.3960, 2.6454, and 2.9000 GeV. Using a fully differential angular description of the final state particles, both the relative magnitude and phase information of the $Σ^{+}$ electromagnetic form factors in the timelike regio…
▽ More
Based on data samples collected with the BESIII detector at the BEPCII collider, the process $e^{+}e^{-} \to Σ^{+}\barΣ^{-}$ is studied at center-of-mass energies $\sqrt{s}$ = 2.3960, 2.6454, and 2.9000 GeV. Using a fully differential angular description of the final state particles, both the relative magnitude and phase information of the $Σ^{+}$ electromagnetic form factors in the timelike region are extracted. The relative phase between the electric and magnetic form factors is determined to be $\sinΔΦ$ = -0.67~$\pm$~0.29~(stat)~$\pm$~0.18~(syst) at $\sqrt{s}$ = 2.3960 GeV, $ΔΦ$ = 55$^{\circ}$~$\pm$~19$^{\circ}$~(stat) $\pm$~14$^{\circ}$~(syst) at $\sqrt{s}$ = 2.6454 GeV, and 78$^{\circ}$~$\pm$~22$^{\circ}$~(stat) $\pm$~9$^{\circ}$~(syst) at $\sqrt{s}$ = 2.9000 GeV. For the first time, the phase of the hyperon electromagnetic form factors is explored in a wide range of four-momentum transfer. The evolution of the phase along with four-momentum transfer is an important input for understanding its asymptotic behavior and the dynamics of baryons.
△ Less
Submitted 5 March, 2024; v1 submitted 29 July, 2023;
originally announced July 2023.
-
Observation of the decay $J/ψ\to e^+ e^- η(1405)$ with $η(1405) \to π^0 f_0(980)$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (601 additional authors not shown)
Abstract:
Using a data sample of $(10087\pm44)\times 10^6$ $J/ψ$ events collected by the BESIII detector in 2009, 2012, 2018 and 2019, the electromagnetic Dalitz process $J/ψ\to e^+ e^- η(1405)$ is observed via the decay $η(1405) \to π^0 f_0(980)$, $f_0(980) \to π^+ π^-$, with a significance of about $9.6σ$. The branching fraction of this decay is measured to be…
▽ More
Using a data sample of $(10087\pm44)\times 10^6$ $J/ψ$ events collected by the BESIII detector in 2009, 2012, 2018 and 2019, the electromagnetic Dalitz process $J/ψ\to e^+ e^- η(1405)$ is observed via the decay $η(1405) \to π^0 f_0(980)$, $f_0(980) \to π^+ π^-$, with a significance of about $9.6σ$. The branching fraction of this decay is measured to be ${\mathcal B}(J/ψ\to e^+ e^- π^0 η(1405) \to e^+ e^- π^0 f_0(980) \to e^+ e^- π^0 π^+ π^-)=(2.02\pm0.24(\rm{stat.})\pm0.09(\rm{syst.}))\times 10^{-7}$. The branching-fraction ratio ${\mathcal B}(J/ψ\to e^+ e^- η(1405))$/${\mathcal B}(J/ψ\to γη(1405))$ is determined to be $(1.35\pm0.19(\rm{stat.})\pm0.06(\rm{syst.}))\times10^{-2}$. Furthermore, an $e^+e^-$ invariant-mass dependent transition form factor of $J/ψ\to e^+ e^-η(1405)$ is presented for the first time. The obtained result provides input for different theoretical models, and is valuable for the improved understanding the intrinsic structure of the $η(1405)$ meson.
△ Less
Submitted 27 July, 2023;
originally announced July 2023.
-
LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition
Authors:
Chengsong Huang,
Qian Liu,
Bill Yuchen Lin,
Tianyu Pang,
Chao Du,
Min Lin
Abstract:
Low-rank adaptations (LoRA) are often employed to fine-tune large language models (LLMs) for new tasks. This paper investigates LoRA composability for cross-task generalization and introduces LoraHub, a simple framework devised for the purposive assembly of LoRA modules trained on diverse given tasks, with the objective of achieving adaptable performance on unseen tasks. With just a few examples f…
▽ More
Low-rank adaptations (LoRA) are often employed to fine-tune large language models (LLMs) for new tasks. This paper investigates LoRA composability for cross-task generalization and introduces LoraHub, a simple framework devised for the purposive assembly of LoRA modules trained on diverse given tasks, with the objective of achieving adaptable performance on unseen tasks. With just a few examples from a new task, LoraHub can fluidly combine multiple LoRA modules, eliminating the need for human expertise and assumptions. Notably, the composition requires neither additional model parameters nor gradients. Empirical results on the Big-Bench Hard benchmark suggest that LoraHub, while not surpassing the performance of in-context learning, offers a notable performance-efficiency trade-off in few-shot scenarios by employing a significantly reduced number of tokens per example during inference. Notably, LoraHub establishes a better upper bound compared to in-context learning when paired with different demonstration examples, demonstrating its potential for future development. Our vision is to establish a platform for LoRA modules, empowering users to share their trained LoRA modules. This collaborative approach facilitates the seamless application of LoRA modules to novel tasks, contributing to an adaptive ecosystem. Our code is available at https://github.com/sail-sg/lorahub, and all the pre-trained LoRA modules are released at https://huggingface.co/lorahub.
△ Less
Submitted 18 January, 2024; v1 submitted 25 July, 2023;
originally announced July 2023.
-
Measurement of $e^{+}e^{-}\toφη'$ cross sections at center-of-mass energies from 3.508 to 4.951 GeV and search for the decay $ψ(3770)\toφη'$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (600 additional authors not shown)
Abstract:
The cross sections of the $e^{+}e^{-}\toφη'$ process at center-of-mass energies from 3.508 to 4.951 GeV are measured with high precision using 26.1 fb$^{-1}$ data collected with the BESIII detector operating at the BEPCII storage ring. The cross sections are of the order of a few picobarn, and decrease as the center-of-mass energy increases as $s^{-n/2}$ with $n=4.35\pm 0.14$. This result is in ag…
▽ More
The cross sections of the $e^{+}e^{-}\toφη'$ process at center-of-mass energies from 3.508 to 4.951 GeV are measured with high precision using 26.1 fb$^{-1}$ data collected with the BESIII detector operating at the BEPCII storage ring. The cross sections are of the order of a few picobarn, and decrease as the center-of-mass energy increases as $s^{-n/2}$ with $n=4.35\pm 0.14$. This result is in agreement with the Nambu-Jona-Lasinio model prediction of $n=3.5\pm 0.9$. In addition, the charmless decay $ψ(3770)\toφη'$ is searched for by fitting the measured cross sections, yet no significant signal is observed. The upper limit of ${\cal B}(ψ(3770)\toφη')$ at the 90\% confidence level is determined to be $2.3\times 10^{-5}$.
△ Less
Submitted 11 September, 2023; v1 submitted 24 July, 2023;
originally announced July 2023.
-
Identification of the melting line in the two-dimensional complex plasmas using an unsupervised machine learning method
Authors:
Hu-Sheng Li,
He Huang,
Wei Yang,
Cheng-Ran Du
Abstract:
Machine learning methods have been widely used in the investigations of the complex plasmas. In this paper, we demonstrate that the unsupervised convolutional neural network can be applied to obtain the melting line in the two-dimensional complex plasmas based on the Langevin dynamics simulation results. The training samples do not need to be labeled. The resulting melting line coincides with thos…
▽ More
Machine learning methods have been widely used in the investigations of the complex plasmas. In this paper, we demonstrate that the unsupervised convolutional neural network can be applied to obtain the melting line in the two-dimensional complex plasmas based on the Langevin dynamics simulation results. The training samples do not need to be labeled. The resulting melting line coincides with those obtained by the analysis of hexatic order parameter and supervised machine learning method.
△ Less
Submitted 24 July, 2023;
originally announced July 2023.
-
Single entanglement connection architecture between multi-layer bipartite HEA
Authors:
Shikun Zhang,
Zheng Qin,
Yang Zhou,
Rui Li,
Chunxiao Du,
Zhisong Xiao
Abstract:
Variational quantum algorithms (VQAs) are among the most promising algorithms to achieve quantum advantages in the NISQ era. One important challenge in implementing such algorithms is to construct an effective parameterized quantum circuit (also called an ansatz). In this work, we propose a single entanglement connection architecture (SECA) for a bipartite hardware-efficient ansatz (HEA) by balanc…
▽ More
Variational quantum algorithms (VQAs) are among the most promising algorithms to achieve quantum advantages in the NISQ era. One important challenge in implementing such algorithms is to construct an effective parameterized quantum circuit (also called an ansatz). In this work, we propose a single entanglement connection architecture (SECA) for a bipartite hardware-efficient ansatz (HEA) by balancing its expressibility, entangling capability, and trainability. Numerical simulations with a one-dimensional Heisenberg model and quadratic unconstrained binary optimization (QUBO) issues were conducted. Our results indicate the superiority of SECA over the common full entanglement connection architecture (FECA) in terms of computational performance. Furthermore, combining SECA with gate-cutting technology to construct distributed quantum computation (DQC) can efficiently expand the size of NISQ devices under low overhead. We also demonstrated the effectiveness and scalability of the DQC scheme. Our study is a useful indication for understanding the characteristics associated with an effective training circuit.
△ Less
Submitted 7 March, 2024; v1 submitted 23 July, 2023;
originally announced July 2023.
-
First Observation of a Three-Resonance Structure in $e^+e^-\rightarrow$Nonopen Charm Hadrons
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (608 additional authors not shown)
Abstract:
We report the measurement of the inclusive cross sections for $e^+e^-$$\rightarrow$nOCH (where nOCH denotes non-open charm hadrons) with improved precision at center-of-mass (c.m.) energies from 3.645 to 3.871 GeV. We observe three resonances: $\mathcal R(3760)$, $\mathcal R(3780)$, and $\mathcal R(3810)$ with significances of $8.1σ$, $13.7σ$, and $8.8σ$, respectively. The $\mathcal R(3810)$ state…
▽ More
We report the measurement of the inclusive cross sections for $e^+e^-$$\rightarrow$nOCH (where nOCH denotes non-open charm hadrons) with improved precision at center-of-mass (c.m.) energies from 3.645 to 3.871 GeV. We observe three resonances: $\mathcal R(3760)$, $\mathcal R(3780)$, and $\mathcal R(3810)$ with significances of $8.1σ$, $13.7σ$, and $8.8σ$, respectively. The $\mathcal R(3810)$ state is observed for the first time, while the $\mathcal R(3760)$ and $\mathcal R(3780)$ states are observed for the first time in the nOCH cross sections. Two sets of resonance parameters describe the energy-dependent line shape of the cross sections well. In set I [set II], the $\mathcal R(3810)$ state has mass $(3805.7 \pm 1.1 \pm 2.7)$ [$(3805.7 \pm 1.1 \pm 2.7)$] MeV/$c^2$, total width $(11.6 \pm 2.9 \pm 1.9)$ [$(11.5 \pm 2.8 \pm 1.9)$] MeV, and an electronic width multiplied by the nOCH decay branching fraction of $(10.9\pm 3.8\pm 2.5)$ [$(11.0\pm 3.4\pm 2.5)$] eV. In addition, we measure the branching fractions ${\mathcal B}[{\mathcal R}(3760)$$\rightarrow$nOCH$]=(25.2 \pm 16.1 \pm 30.4)\% [(6.4 \pm 4.8 \pm 7.7)\%]$ and ${\mathcal B}[\mathcal R(3780)$$\rightarrow$nOCH$]=(12.3 \pm 6.6 \pm 8.3)\% [(10.4 \pm 4.8 \pm 7.0)\%]$ for the first time. The $\mathcal R(3760)$ state can be interpreted as an open-charm (OC) molecular state, but containing a simple four-quark state component. The $\mathcal R(3810)$ state can be interpreted as a hadrocharmonium state.
△ Less
Submitted 11 May, 2024; v1 submitted 20 July, 2023;
originally announced July 2023.
-
Applicability of Measurement-based Quantum Computation towards Physically-driven Variational Quantum Eigensolver
Authors:
Zheng Qin,
Xiufan Li,
Yang Zhou,
Shikun Zhang,
Rui Li,
Chunxiao Du,
Zhisong Xiao
Abstract:
Variational quantum algorithms are considered one of the most promising methods for obtaining near-term quantum advantages; however, most of these algorithms are only expressed in the conventional quantum circuit scheme. The roadblock to develo** quantum algorithms with the measurement-based quantum computation (MBQC) scheme is resource cost. Recently, we discovered that the realization of multi…
▽ More
Variational quantum algorithms are considered one of the most promising methods for obtaining near-term quantum advantages; however, most of these algorithms are only expressed in the conventional quantum circuit scheme. The roadblock to develo** quantum algorithms with the measurement-based quantum computation (MBQC) scheme is resource cost. Recently, we discovered that the realization of multi-qubit rotation operations requires a constant number of single-qubit measurements with the MBQC scheme, providing a potential advantage in terms of resource cost. The structure of the Hamiltonian variational ansatz (HVA) aligns well with this characteristic. Thus, we propose an efficient measurement-based quantum algorithm for quantum many-body system simulation tasks, called measurement-based Hamiltonian variational ansatz (MBHVA). We then demonstrate the effectiveness, efficiency, and advantages of the two-dimensional Heisenberg model and the Fermi-Hubbard chain. Numerical experiments show that MBHVA is expected to reduce resource overhead compared to quantum circuits, especially in the presence of large multi-qubit rotation operations. Furthermore, when compared to Measurement-based Hardware Efficient Ansatz (MBHEA), MBHVA also demonstrates superior performance. We conclude that the MBQC scheme is potentially feasible for achieving near-term quantum advantages in terms of both resource efficiency and error mitigation, particularly for photonic platforms.
△ Less
Submitted 20 December, 2023; v1 submitted 19 July, 2023;
originally announced July 2023.
-
Measurement of the Energy-Dependent Electromagnetic Form Factors of a Charmed Baryon
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (598 additional authors not shown)
Abstract:
We study the process $e^{+}e^{-}\toΛ_{c}^{+}\barΛ_c^{-}$ at twelve center-of-mass energies from $4.6119$ to $4.9509~\mathrm{GeV}$ using data samples collected by the BESIII detector at the BEPCII collider. The Born cross sections and effective form factors ($|G_{\mathrm{eff}}|$) are determined with unprecedented precision after combining the single and double-tag methods based on the decay process…
▽ More
We study the process $e^{+}e^{-}\toΛ_{c}^{+}\barΛ_c^{-}$ at twelve center-of-mass energies from $4.6119$ to $4.9509~\mathrm{GeV}$ using data samples collected by the BESIII detector at the BEPCII collider. The Born cross sections and effective form factors ($|G_{\mathrm{eff}}|$) are determined with unprecedented precision after combining the single and double-tag methods based on the decay process $Λ_{c}^{+}\to pK^{-}π^{+}$. Flat cross sections around $4.63~\mathrm{GeV}$ are obtained and no indication of the resonant structure $Y(4630)$, as reported by Belle, is found. In addition, no oscillatory behavior is discerned in the $|G_{\mathrm{eff}}|$ energy-dependence of $Λ_{c}^{+}$, in contrast to what is seen for the proton and neutron cases. Analyzing the cross section together with the polar-angle distribution of the $Λ_{c}^{+}$ baryon at each energy point, the moduli of electric and magnetic form factors ($|G_{E}|$ and $|G_{M}|$) are extracted and separated. For the first time, the energy-dependence of the form factor ratio $|G_{E}/G_{M}|$ is observed, which can be well described by an oscillatory function.
△ Less
Submitted 14 July, 2023;
originally announced July 2023.
-
Revealing intrinsic domains and fluctuations of moiré magnetism by a wide-field quantum microscope
Authors:
Mengqi Huang,
Zeliang Sun,
Gerald Yan,
Hongchao Xie,
Nishkarsh Agarwal,
Gaihua Ye,
Suk Hyun Sung,
Hanyi Lu,
**gcheng Zhou,
Shaohua Yan,
Shangjie Tian,
Hechang Lei,
Robert Hovden,
Rui He,
Hailong Wang,
Liuyan Zhao,
Chunhui Rita Du
Abstract:
Moiré magnetism featured by stacking engineered atomic registry and lattice interactions has recently emerged as an appealing quantum state of matter at the forefront condensed matter physics research. Nanoscale imaging of moiré magnets is highly desirable and serves as a prerequisite to investigate a broad range of intriguing physics underlying the interplay between topology, electronic correlati…
▽ More
Moiré magnetism featured by stacking engineered atomic registry and lattice interactions has recently emerged as an appealing quantum state of matter at the forefront condensed matter physics research. Nanoscale imaging of moiré magnets is highly desirable and serves as a prerequisite to investigate a broad range of intriguing physics underlying the interplay between topology, electronic correlations, and unconventional nanomagnetism. Here we report spin defect-based wide-field imaging of magnetic domains and spin fluctuations in twisted double trilayer (tDT) chromium triiodide CrI3. We explicitly show that intrinsic moiré domains of opposite magnetizations appear over arrays of moiré supercells in low-twist-angle tDT CrI3. In contrast, spin fluctuations measured in tDT CrI3 manifest little spatial variations on the same mesoscopic length scale due to the dominant driving force of intralayer exchange interaction. Our results enrich the current understanding of exotic magnetic phases sustained by moiré magnetism and highlight the opportunities provided by quantum spin sensors in probing microscopic spin related phenomena on two-dimensional flatland.
△ Less
Submitted 7 July, 2023;
originally announced July 2023.
-
Search for the semi-muonic charmonium decay $J/ψ\to D^{-}μ^{+}ν_μ+c.c.$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (603 additional authors not shown)
Abstract:
Using $(10087\pm44)\times10^{6}$ $J/ψ$ events collected with the BESIII detector at the BEPCII $e^+e^-$ storage ring at the center-of-mass energy of $\sqrt{s}=3.097~\rm{GeV}$, we present a search for the rare semi-muonic charmonium decay $J/ψ\to D^{-}μ^{+}ν_μ+c.c.$. Since no significant signal is observed, we set an upper limit of the branching fraction to be…
▽ More
Using $(10087\pm44)\times10^{6}$ $J/ψ$ events collected with the BESIII detector at the BEPCII $e^+e^-$ storage ring at the center-of-mass energy of $\sqrt{s}=3.097~\rm{GeV}$, we present a search for the rare semi-muonic charmonium decay $J/ψ\to D^{-}μ^{+}ν_μ+c.c.$. Since no significant signal is observed, we set an upper limit of the branching fraction to be $\mathcal{B}(J/ψ\to D^{-}μ^{+}ν_μ+c.c.)<5.6\times10^{-7}$ at $90\%$ confidence level. This is the first search for the weak decay of charmonium with a muon in the final state.
△ Less
Submitted 12 December, 2023; v1 submitted 5 July, 2023;
originally announced July 2023.
-
AdAM: Few-Shot Image Generation via Adaptation-Aware Kernel Modulation
Authors:
Yunqing Zhao,
Keshigeyan Chandrasegaran,
Milad Abdollahzadeh,
Chao Du,
Tianyu Pang,
Ruoteng Li,
Henghui Ding,
Ngai-Man Cheung
Abstract:
Few-shot image generation (FSIG) aims to learn to generate new and diverse images given few (e.g., 10) training samples. Recent work has addressed FSIG by leveraging a GAN pre-trained on a large-scale source domain and adapting it to the target domain with few target samples. Central to recent FSIG methods are knowledge preservation criteria, which select and preserve a subset of source knowledge…
▽ More
Few-shot image generation (FSIG) aims to learn to generate new and diverse images given few (e.g., 10) training samples. Recent work has addressed FSIG by leveraging a GAN pre-trained on a large-scale source domain and adapting it to the target domain with few target samples. Central to recent FSIG methods are knowledge preservation criteria, which select and preserve a subset of source knowledge to the adapted model. However, a major limitation of existing methods is that their knowledge preserving criteria consider only source domain/task and fail to consider target domain/adaptation in selecting source knowledge, casting doubt on their suitability for setups of different proximity between source and target domain. Our work makes two contributions. Firstly, we revisit recent FSIG works and their experiments. We reveal that under setups which assumption of close proximity between source and target domains is relaxed, many existing state-of-the-art (SOTA) methods which consider only source domain in knowledge preserving perform no better than a baseline method. As our second contribution, we propose Adaptation-Aware kernel Modulation (AdAM) for general FSIG of different source-target domain proximity. Extensive experiments show that AdAM consistently achieves SOTA performance in FSIG, including challenging setups where source and target domains are more apart.
△ Less
Submitted 10 November, 2023; v1 submitted 3 July, 2023;
originally announced July 2023.
-
Dynamic Viscosity of Methane Hydrate Systems from Non-Einsteinian, Plasma-Functionalized Carbon Nanotube Nanofluids
Authors:
Adam McElligott,
André Guerra,
Chong Yang Du,
Alejandro D. Rey,
Jean-Luc Meunier,
Phillip Servio
Abstract:
The viscosity of oxygen-functionalized multi-walled carbon nanotube (O-MWCNT) nanofluids was measured for concentrations from 0.1 to 10 ppm under conditions of 0 to 30 MPag pressures and 0 to 10 C temperatures. The presence of O-MWCNTs did not affect the temperature dependence of viscosity but did reduce the effective viscosity of solution due to cumulative hydrogen bond-disrupting surface effects…
▽ More
The viscosity of oxygen-functionalized multi-walled carbon nanotube (O-MWCNT) nanofluids was measured for concentrations from 0.1 to 10 ppm under conditions of 0 to 30 MPag pressures and 0 to 10 C temperatures. The presence of O-MWCNTs did not affect the temperature dependence of viscosity but did reduce the effective viscosity of solution due to cumulative hydrogen bond-disrupting surface effects, which overcame internal drag forces. O-MWCNTs added a weak pressure dependence to the viscosity of solution because of their ability to align more with the flow direction as pressure increased. In the liquid to hydrate phase transition, the times to reach the maximum viscosity were faster in O-MWCNT systems compared to the pure water baseline. However, the presence of O-MWCNTs limited the conditions at which hydrates formed as increased nanoparticle collisions in those systems inhibited the formation of critical clusters of hydrate nuclei. The times to viscosity values most relevant to technological applications were minimally 28.02 % (200 mPa s) and 21.08 % (500 mPa s) slower than the baseline, both in the 1 ppm system, even though all systems were faster to the final viscosity. This was attributed to O-MWCNT entanglement, which resulted in a hydrate slurry occurring at lower viscosity values.
△ Less
Submitted 28 June, 2023;
originally announced June 2023.
-
DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech
Authors:
Sen Liu,
Yiwei Guo,
Chenpeng Du,
Xie Chen,
Kai Yu
Abstract:
Although high-fidelity speech can be obtained for intralingual speech synthesis, cross-lingual text-to-speech (CTTS) is still far from satisfactory as it is difficult to accurately retain the speaker timbres(i.e. speaker similarity) and eliminate the accents from their first language(i.e. nativeness). In this paper, we demonstrated that vector-quantized(VQ) acoustic feature contains less speaker i…
▽ More
Although high-fidelity speech can be obtained for intralingual speech synthesis, cross-lingual text-to-speech (CTTS) is still far from satisfactory as it is difficult to accurately retain the speaker timbres(i.e. speaker similarity) and eliminate the accents from their first language(i.e. nativeness). In this paper, we demonstrated that vector-quantized(VQ) acoustic feature contains less speaker information than mel-spectrogram. Based on this finding, we propose a novel dual speaker embedding TTS (DSE-TTS) framework for CTTS with authentic speaking style. Here, one embedding is fed to the acoustic model to learn the linguistic speaking style, while the other one is integrated into the vocoder to mimic the target speaker's timbre. Experiments show that by combining both embeddings, DSE-TTS significantly outperforms the state-of-the-art SANE-TTS in cross-lingual synthesis, especially in terms of nativeness.
△ Less
Submitted 25 June, 2023;
originally announced June 2023.
-
Precise measurement of the branching fractions of $J/ψ\rightarrow\barΛπ^{+}Σ^{-}+c.c.$ and $J/ψ\rightarrow\barΛπ^{-}Σ^{+}+c.c.$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (600 additional authors not shown)
Abstract:
Based on a data sample of $(10087\pm44)\times10^6$ $J/ψ$ events collected with the BESIII detector, the branching fraction of $J/ψ\rightarrow\barΛπ^{+}Σ^{-}+c.c.$ is measured to be $(1.221\pm 0.002\pm 0.038)\times10^{-3}$, and the branching fraction of its isospin partner mode $J/ψ\rightarrow\barΛπ^{-}Σ^{+}+c.c.$ is measured to be $(1.244\pm 0.002\pm 0.045)\times10^{-3}$ with improved precision. H…
▽ More
Based on a data sample of $(10087\pm44)\times10^6$ $J/ψ$ events collected with the BESIII detector, the branching fraction of $J/ψ\rightarrow\barΛπ^{+}Σ^{-}+c.c.$ is measured to be $(1.221\pm 0.002\pm 0.038)\times10^{-3}$, and the branching fraction of its isospin partner mode $J/ψ\rightarrow\barΛπ^{-}Σ^{+}+c.c.$ is measured to be $(1.244\pm 0.002\pm 0.045)\times10^{-3}$ with improved precision. Here the first uncertainties are statistical and the second ones systematic. The isospin symmetry of the $Σ$ baryon in charmonium hadronic decay and the "$12\%$ rule" are tested, and no violation is found. The potential of using these channels as $Σ$ baryon sources for nuclear physics research is studied, and the momentum and angular distributions of these sources are provided.
△ Less
Submitted 24 December, 2023; v1 submitted 17 June, 2023;
originally announced June 2023.
-
Quantum metric nonlinear Hall effect in a topological antiferromagnetic heterostructure
Authors:
Anyuan Gao,
Yu-Fei Liu,
Jian-Xiang Qiu,
Barun Ghosh,
Thaís V. Trevisan,
Yugo Onishi,
Chaowei Hu,
Tiema Qian,
Hung-Ju Tien,
Shao-Wen Chen,
Mengqi Huang,
Damien Bérubé,
Houchen Li,
Christian Tzschaschel,
Thao Dinh,
Zhe Sun,
Sheng-Chin Ho,
Shang-Wei Lien,
Bahadur Singh,
Kenji Watanabe,
Takashi Taniguchi,
David C. Bell,
Hsin Lin,
Tay-Rong Chang,
Chunhui Rita Du
, et al. (6 additional authors not shown)
Abstract:
Quantum geometry - the geometry of electron Bloch wavefunctions - is central to modern condensed matter physics. Due to the quantum nature, quantum geometry has two parts, the real part quantum metric and the imaginary part Berry curvature. The studies of Berry curvature have led to countless breakthroughs, ranging from the quantum Hall effect in 2DEGs to the anomalous Hall effect (AHE) in ferroma…
▽ More
Quantum geometry - the geometry of electron Bloch wavefunctions - is central to modern condensed matter physics. Due to the quantum nature, quantum geometry has two parts, the real part quantum metric and the imaginary part Berry curvature. The studies of Berry curvature have led to countless breakthroughs, ranging from the quantum Hall effect in 2DEGs to the anomalous Hall effect (AHE) in ferromagnets. However, in contrast to Berry curvature, the quantum metric has rarely been explored. Here, we report a new nonlinear Hall effect induced by quantum metric by interfacing even-layered MnBi2Te4 (a PT-symmetric antiferromagnet (AFM)) with black phosphorus. This novel nonlinear Hall effect switches direction upon reversing the AFM spins and exhibits distinct scaling that suggests a non-dissipative nature. Like the AHE brought Berry curvature under the spotlight, our results open the door to discovering quantum metric responses. Moreover, we demonstrate that the AFM can harvest wireless electromagnetic energy via the new nonlinear Hall effect, therefore enabling intriguing applications that bridges nonlinear electronics with AFM spintronics.
△ Less
Submitted 23 July, 2023; v1 submitted 15 June, 2023;
originally announced June 2023.
-
Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation
Authors:
Zheng Liang,
Zheshu Song,
Ziyang Ma,
Chenpeng Du,
Kai Yu,
Xie Chen
Abstract:
Recently, end-to-end (E2E) automatic speech recognition (ASR) models have made great strides and exhibit excellent performance in general speech recognition. However, there remain several challenging scenarios that E2E models are not competent in, such as code-switching and named entity recognition (NER). Data augmentation is a common and effective practice for these two scenarios. However, the cu…
▽ More
Recently, end-to-end (E2E) automatic speech recognition (ASR) models have made great strides and exhibit excellent performance in general speech recognition. However, there remain several challenging scenarios that E2E models are not competent in, such as code-switching and named entity recognition (NER). Data augmentation is a common and effective practice for these two scenarios. However, the current data augmentation methods mainly rely on audio splicing and text-to-speech (TTS) models, which might result in discontinuous, unrealistic, and less diversified speech. To mitigate these potential issues, we propose a novel data augmentation method by applying the text-based speech editing model. The augmented speech from speech editing systems is more coherent and diversified, also more akin to real speech. The experimental results on code-switching and NER tasks show that our proposed method can significantly outperform the audio splicing and neural TTS based data augmentation systems.
△ Less
Submitted 14 June, 2023;
originally announced June 2023.
-
UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding
Authors:
Chenpeng Du,
Yiwei Guo,
Feiyu Shen,
Zhijun Liu,
Zheng Liang,
Xie Chen,
Shuai Wang,
Hui Zhang,
Kai Yu
Abstract:
The utilization of discrete speech tokens, divided into semantic tokens and acoustic tokens, has been proven superior to traditional acoustic feature mel-spectrograms in terms of naturalness and robustness for text-to-speech (TTS) synthesis. Recent popular models, such as VALL-E and SPEAR-TTS, allow zero-shot speaker adaptation through auto-regressive (AR) continuation of acoustic tokens extracted…
▽ More
The utilization of discrete speech tokens, divided into semantic tokens and acoustic tokens, has been proven superior to traditional acoustic feature mel-spectrograms in terms of naturalness and robustness for text-to-speech (TTS) synthesis. Recent popular models, such as VALL-E and SPEAR-TTS, allow zero-shot speaker adaptation through auto-regressive (AR) continuation of acoustic tokens extracted from a short speech prompt. However, these AR models are restricted to generate speech only in a left-to-right direction, making them unsuitable for speech editing where both preceding and following contexts are provided. Furthermore, these models rely on acoustic tokens, which have audio quality limitations imposed by the performance of audio codec models. In this study, we propose a unified context-aware TTS framework called UniCATS, which is capable of both speech continuation and editing. UniCATS comprises two components, an acoustic model CTX-txt2vec and a vocoder CTX-vec2wav. CTX-txt2vec employs contextual VQ-diffusion to predict semantic tokens from the input text, enabling it to incorporate the semantic context and maintain seamless concatenation with the surrounding context. Following that, CTX-vec2wav utilizes contextual vocoding to convert these semantic tokens into waveforms, taking into consideration the acoustic context. Our experimental results demonstrate that CTX-vec2wav outperforms HifiGAN and AudioLM in terms of speech resynthesis from semantic tokens. Moreover, we show that UniCATS achieves state-of-the-art performance in both speech continuation and editing.
△ Less
Submitted 28 March, 2024; v1 submitted 13 June, 2023;
originally announced June 2023.
-
Adversarial Constrained Bidding via Minimax Regret Optimization with Causality-Aware Reinforcement Learning
Authors:
Haozhe Wang,
Chao Du,
Panyan Fang,
Li He,
Liang Wang,
Bo Zheng
Abstract:
The proliferation of the Internet has led to the emergence of online advertising, driven by the mechanics of online auctions. In these repeated auctions, software agents participate on behalf of aggregated advertisers to optimize for their long-term utility. To fulfill the diverse demands, bidding strategies are employed to optimize advertising objectives subject to different spending constraints.…
▽ More
The proliferation of the Internet has led to the emergence of online advertising, driven by the mechanics of online auctions. In these repeated auctions, software agents participate on behalf of aggregated advertisers to optimize for their long-term utility. To fulfill the diverse demands, bidding strategies are employed to optimize advertising objectives subject to different spending constraints. Existing approaches on constrained bidding typically rely on i.i.d. train and test conditions, which contradicts the adversarial nature of online ad markets where different parties possess potentially conflicting objectives. In this regard, we explore the problem of constrained bidding in adversarial bidding environments, which assumes no knowledge about the adversarial factors. Instead of relying on the i.i.d. assumption, our insight is to align the train distribution of environments with the potential test distribution meanwhile minimizing policy regret. Based on this insight, we propose a practical Minimax Regret Optimization (MiRO) approach that interleaves between a teacher finding adversarial environments for tutoring and a learner meta-learning its policy over the given distribution of environments. In addition, we pioneer to incorporate expert demonstrations for learning bidding strategies. Through a causality-aware policy design, we improve upon MiRO by distilling knowledge from the experts. Extensive experiments on both industrial data and synthetic data show that our method, MiRO with Causality-aware reinforcement Learning (MiROCL), outperforms prior methods by over 30%.
△ Less
Submitted 12 June, 2023;
originally announced June 2023.
-
ChatDB: Augmenting LLMs with Databases as Their Symbolic Memory
Authors:
Chenxu Hu,
Jie Fu,
Chenzhuang Du,
Simian Luo,
Junbo Zhao,
Hang Zhao
Abstract:
Large language models (LLMs) with memory are computationally universal. However, mainstream LLMs are not taking full advantage of memory, and the designs are heavily influenced by biological brains. Due to their approximate nature and proneness to the accumulation of errors, conventional neural memory mechanisms cannot support LLMs to simulate complex reasoning. In this paper, we seek inspiration…
▽ More
Large language models (LLMs) with memory are computationally universal. However, mainstream LLMs are not taking full advantage of memory, and the designs are heavily influenced by biological brains. Due to their approximate nature and proneness to the accumulation of errors, conventional neural memory mechanisms cannot support LLMs to simulate complex reasoning. In this paper, we seek inspiration from modern computer architectures to augment LLMs with symbolic memory for complex multi-hop reasoning. Such a symbolic memory framework is instantiated as an LLM and a set of SQL databases, where the LLM generates SQL instructions to manipulate the SQL databases. We validate the effectiveness of the proposed memory framework on a synthetic dataset requiring complex reasoning. The project website is available at https://chatdatabase.github.io/ .
△ Less
Submitted 7 June, 2023; v1 submitted 6 June, 2023;
originally announced June 2023.