-
Measurement of Born cross section of $e^{+}e^{-}\rightarrowΣ^{+}\barΣ^{-}$ at center-of-mass energies between 3.510 and 4.951 GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (632 additional authors not shown)
Abstract:
Using 24.1 fb$^{-1}$ of $e^{+}e^{-}$ collision data collected with the BESIII detector at the BEPCII collider, the Born cross sections and effective form factors of the $e^{+}e^{-}\rightarrowΣ^{+}\barΣ^{-}$ reaction are measured. The measurements are performed at center-of-mass energies ranging from 3.510 to 4.951 GeV. No significant evidence for the decay of the charmonium(-like) states,…
▽ More
Using 24.1 fb$^{-1}$ of $e^{+}e^{-}$ collision data collected with the BESIII detector at the BEPCII collider, the Born cross sections and effective form factors of the $e^{+}e^{-}\rightarrowΣ^{+}\barΣ^{-}$ reaction are measured. The measurements are performed at center-of-mass energies ranging from 3.510 to 4.951 GeV. No significant evidence for the decay of the charmonium(-like) states, $ψ(3770)$, $ψ(4040)$, $ψ(4160)$, $Y(4230)$, $Y(4360)$, $ψ(4415)$, and $Y(4660)$, into a $Σ^{+}\barΣ^{-}$ final state is observed. Consequently, upper limits for the products of the branching fractions and the electronic partial widths at the 90% confidence level are reported for these decays.
△ Less
Submitted 6 May, 2024; v1 submitted 10 January, 2024;
originally announced January 2024.
-
First measurements of the absolute branching fraction of $Λ_{c}(2625)^{+}\to Λ^{+}_{c}π^+π^-$ and upper limit on $Λ_{c}(2595)^{+}\to Λ^{+}_{c}π^+π^-$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko
, et al. (603 additional authors not shown)
Abstract:
The absolute branching fraction of the decay $Λ_{c}(2625)^{+}\to Λ^{+}_{c}π^+π^-$ is measured for the first time to be $(50.7 \pm 5.0_{\rm{stat.}} \pm 4.9_{\rm{syst.}} )\%$ with 368.48 pb$^{-1}$ of $e^+e^-$ collision data collected by the BESIII detector at the center-of-mass energies of $\sqrt{s} = 4.918$ and $4.950$ GeV. This result is lower than the naive prediction of 67\%, obtained from isosp…
▽ More
The absolute branching fraction of the decay $Λ_{c}(2625)^{+}\to Λ^{+}_{c}π^+π^-$ is measured for the first time to be $(50.7 \pm 5.0_{\rm{stat.}} \pm 4.9_{\rm{syst.}} )\%$ with 368.48 pb$^{-1}$ of $e^+e^-$ collision data collected by the BESIII detector at the center-of-mass energies of $\sqrt{s} = 4.918$ and $4.950$ GeV. This result is lower than the naive prediction of 67\%, obtained from isospin symmetry, by more than $2σ$, thereby indicating that the novel mechanism referred to as the \textit{threshold effect}, proposed for the strong decays of $Λ_{c}(2595)^{+}$, also applies to $Λ_{c}(2625)^{+}$. This measurement is necessary to obtain the coupling constants for the transitions between $s$-wave and $p$-wave charmed baryons in heavy hadron chiral perturbation theory. In addition, we search for the decay $Λ_{c}(2595)^{+}\to Λ^{+}_{c}π^+π^-$. No significant signal is observed, and the upper limit on its branching fraction is determined to be 80.8\% at the 90\% confidence level.
△ Less
Submitted 17 January, 2024;
originally announced January 2024.
-
Improved measurements of the Dalitz decays $η/η'\rightarrowγe^{+}e^{-}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (618 additional authors not shown)
Abstract:
Based on a data sample of 10 billion $J/ψ$ events collected with the BESIII detector, improved measurements of the Dalitz decays $η/η'\rightarrowγe^+e^-$ are performed, where the $η$ and $η'$ are produced through the radiative decays $J/ψ\rightarrowγη/η'$. The branching fractions of $η\rightarrowγe^+e^-$ and $η'\rightarrowγe^+e^-$ are measured to be $(7.07 \pm 0.05 \pm 0.23)\times10^{-3}$ and…
▽ More
Based on a data sample of 10 billion $J/ψ$ events collected with the BESIII detector, improved measurements of the Dalitz decays $η/η'\rightarrowγe^+e^-$ are performed, where the $η$ and $η'$ are produced through the radiative decays $J/ψ\rightarrowγη/η'$. The branching fractions of $η\rightarrowγe^+e^-$ and $η'\rightarrowγe^+e^-$ are measured to be $(7.07 \pm 0.05 \pm 0.23)\times10^{-3}$ and $(4.83\pm0.07\pm0.14)\times10^{-4}$, respectively. Within the single pole model, the parameter of electromagnetic transition form factor for $η\rightarrowγe^+e^-$ is determined to be $Λ_η=(0.749 \pm 0.027 \pm 0.007)~ {\rm GeV}/c^{2}$. Within the multi-pole model, we extract the electromagnetic transition form factors for $η'\rightarrowγe^+e^-$ to be $Λ_{η'} = (0.802 \pm 0.007\pm 0.008)~ {\rm GeV}/c^{2}$ and $γ_{η'} = (0.113\pm0.010\pm0.002)~ {\rm GeV}/c^{2}$. The results are consistent with both theoretical predictions and previous measurements. The characteristic sizes of the interaction regions for the $η$ and $η'$ are calculated to be $(0.645 \pm 0.023 \pm 0.007 )~ {\rm fm}$ and $(0.596 \pm 0.005 \pm 0.006)~ {\rm fm}$, respectively. In addition, we search for the dark photon in $η/η^\prime\rightarrowγe^{+}e^{-}$, and the upper limits of the branching fractions as a function of the dark photon are given at 90\% confidence level.
△ Less
Submitted 5 April, 2024; v1 submitted 17 January, 2024;
originally announced January 2024.
-
First study of antihyperon-nucleon scattering $\barΛp\rightarrow\barΛp$ and measurement of $Λp\rightarrowΛp$ cross section
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (634 additional authors not shown)
Abstract:
Using $(10.087\pm0.044)\times10^{9}$ $J/ψ$ events collected with the BESIII detector at the BEPCII storage ring, the processes $Λp\rightarrowΛp$ and $\barΛp\rightarrow\barΛp$ are studied, where the $Λ/\barΛ$ baryons are produced in the process $J/ψ\rightarrowΛ\barΛ$ and the protons are the hydrogen nuclei in the cooling oil of the beam pipe. Clear signals are observed for the two reactions. The cr…
▽ More
Using $(10.087\pm0.044)\times10^{9}$ $J/ψ$ events collected with the BESIII detector at the BEPCII storage ring, the processes $Λp\rightarrowΛp$ and $\barΛp\rightarrow\barΛp$ are studied, where the $Λ/\barΛ$ baryons are produced in the process $J/ψ\rightarrowΛ\barΛ$ and the protons are the hydrogen nuclei in the cooling oil of the beam pipe. Clear signals are observed for the two reactions. The cross sections in $-0.9\leq\rm{cos}θ_{Λ/\barΛ}\leq0.9$ are measured to be $σ(Λp\rightarrowΛp)=(12.2\pm1.6_{\rm{stat}}\pm1.1_{\rm{sys}})$ mb and $σ(\barΛ p\rightarrow\barΛ p)=(17.5\pm2.1_{\rm{stat}}\pm1.6_{\rm{sys}})$ mb at the $Λ/\barΛ$ momentum of $1.074$ GeV/$c$ within a range of $\pm0.017$ GeV/$c$, where the $θ_{Λ/\barΛ}$ are the scattering angles of the $Λ/\barΛ$ in the $Λp/\barΛp$ rest frames. Furthermore, the differential cross sections of the two reactions are also measured, where there is a slight tendency of forward scattering for $Λp\rightarrowΛp$, and a strong forward peak for $\barΛp\rightarrow\barΛp$. We present an approach to extract the total elastic cross sections by extrapolation. The study of $\barΛp\rightarrow\barΛp$ represents the first study of antihyperon-nucleon scattering, and these new measurements will serve as important inputs for the theoretical understanding of the (anti)hyperon-nucleon interaction.
△ Less
Submitted 18 May, 2024; v1 submitted 17 January, 2024;
originally announced January 2024.
-
Observation of $ψ(3686) \to Ω^- K^+ \barΞ^0 $+c.c
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (630 additional authors not shown)
Abstract:
Using $(27.12 \pm 0.14) \times 10^{8}$ $ψ(3686)$ events collected with the BESIII detector at BEPCII, the decay of $ψ(3686) \to Ω^- K^+ \barΞ^0 +c.c.$ is observed for the first time. The branching fraction of this decay is measured to be $\mathcal{B}_{ψ(3686) \to Ω^- K^+ \barΞ^0 +c.c.}=(2.78 \pm 0.40 \pm 0.18 ) \times 10^{-6}$, where the first uncertainty is statistical and the second is systemati…
▽ More
Using $(27.12 \pm 0.14) \times 10^{8}$ $ψ(3686)$ events collected with the BESIII detector at BEPCII, the decay of $ψ(3686) \to Ω^- K^+ \barΞ^0 +c.c.$ is observed for the first time. The branching fraction of this decay is measured to be $\mathcal{B}_{ψ(3686) \to Ω^- K^+ \barΞ^0 +c.c.}=(2.78 \pm 0.40 \pm 0.18 ) \times 10^{-6}$, where the first uncertainty is statistical and the second is systematic. Possible baryon excited states are searched for in this decay, but no evident intermediate state is observed with the current sample size.
△ Less
Submitted 15 April, 2024; v1 submitted 16 January, 2024;
originally announced January 2024.
-
Extended Main Sequences in Star Clusters
Authors:
Chengyuan Li,
Antonino P. Milone,
Weijia Sun,
Richard de Grijs
Abstract:
Extended main sequences (eMSs) and extended main-sequence turnoffs (eMSTOs) are fascinating phenomena that are routinely observed in star clusters. These phenomena strongly challenge the current canonical "simple stellar population" picture of star clusters, which postulates that star clusters are coeval and chemically homogeneous and can thus be described by a single, unique isochrone. Detections…
▽ More
Extended main sequences (eMSs) and extended main-sequence turnoffs (eMSTOs) are fascinating phenomena that are routinely observed in star clusters. These phenomena strongly challenge the current canonical "simple stellar population" picture of star clusters, which postulates that star clusters are coeval and chemically homogeneous and can thus be described by a single, unique isochrone. Detections of eMSs and eMSTOs provide valuable insights into stellar physics and the evolution of star clusters. This comprehensive review delves into the observational characteristics, underlying mechanisms, and astrophysical implications of the eMSs and eMSTOs observed in young (less than 600 million years) and intermediate-age (600 to 2000 million years) star clusters. Several scenarios or hypotheses have been proposed to explain these phenomena, including the presence of an age spread, binary interactions, variable stars, and differences in stellar rotation rates. This review discusses the advantages and limitations of current models. Among contemporary models and hypotheses, stellar rotation has been demonstrated as the most plausible mechanism to explain the occurrence of eMSs and eMSTOs. Research on stellar rotation and its connection to eMSs has opened up a myriad of fascinating avenues, such as investigations of the magnetic braking mechanism in stars, searches for tidally locked binary systems in star clusters, and investigations as to whether binary mergers can give rise to massive magnetars. These endeavors have yielded valuable insights and significantly enriched our understanding of stellar astrophysics.
△ Less
Submitted 15 January, 2024;
originally announced January 2024.
-
Measurement of Solar $pp$ Neutrino Flux using Electron Recoil Data from PandaX-4T Commissioning Run
Authors:
PandaX Collaboration,
Xiaoying Lu,
Abdusalam Abdukerim,
Zihao Bo,
Wei Chen,
Xun Chen,
Yunhua Chen,
Chen Cheng,
Zhaokan Cheng,
Xiangyi Cui,
Yingjie Fan,
Deqing Fang,
Lisheng Geng,
Karl Giboni,
Xuyuan Guo,
Chencheng Han,
Ke Han,
Changda He,
**rong He,
Di Huang,
Junting Huang,
Zhou Huang,
Ruquan Hou,
Yu Hou,
Xiangdong Ji
, et al. (67 additional authors not shown)
Abstract:
The proton-proton ($pp$) fusion chain dominates the neutrino production from the Sun. The uncertainty of the predicted $pp$ neutrino flux is at the sub-percent level, whereas that of the best measurement is $\mathcal{O}(10\%)$. In this paper, we present the first result to measure the solar $pp$ neutrinos in the electron recoil energy range from 24 to 144 keV, using the PandaX-4T commissioning dat…
▽ More
The proton-proton ($pp$) fusion chain dominates the neutrino production from the Sun. The uncertainty of the predicted $pp$ neutrino flux is at the sub-percent level, whereas that of the best measurement is $\mathcal{O}(10\%)$. In this paper, we present the first result to measure the solar $pp$ neutrinos in the electron recoil energy range from 24 to 144 keV, using the PandaX-4T commissioning data with 0.63 tonne$\times$year exposure. The $pp$ neutrino flux is determined to be $(8.0 \pm 3.9 \,{\rm{(stat)}} \pm 10.0 \,{\rm{(syst)}} )\times 10^{10}\, $$\rm{s}^{-1} \rm{cm}^{-2}$, consistent with Standard Solar Model and existing measurements, corresponding to a flux upper limit of $23.3\times 10^{10}\, $$\rm{s}^{-1} \rm{cm}^{-2}$ at 90\% C.L..
△ Less
Submitted 2 July, 2024; v1 submitted 13 January, 2024;
originally announced January 2024.
-
First observation of the decay $Λ^+_c\to nK^{0}_{S}π^+π^0$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (630 additional authors not shown)
Abstract:
Based on 4.5 fb$^{-1}$ of $e^{+}e^{-}$ collision data accumulated at center-of-mass energies between $4599.53$ MeV and $4698.82$ MeV with the BESIII detector, the decay $Λ_{c}^{+}\to nK_{S}^{0}π^+π^0$ is observed for the first time with a significance of $9.2σ$. The branching fraction is measured to be $(0.85\pm0.13\pm0.03)\%$, where the first uncertainty is statistical and the second systematic,…
▽ More
Based on 4.5 fb$^{-1}$ of $e^{+}e^{-}$ collision data accumulated at center-of-mass energies between $4599.53$ MeV and $4698.82$ MeV with the BESIII detector, the decay $Λ_{c}^{+}\to nK_{S}^{0}π^+π^0$ is observed for the first time with a significance of $9.2σ$. The branching fraction is measured to be $(0.85\pm0.13\pm0.03)\%$, where the first uncertainty is statistical and the second systematic, which differs from the theoretical prediction based on isospin by 4.4$σ$. This indicates that there may be resonant contributions or some unknown dynamics in this decay.
△ Less
Submitted 28 March, 2024; v1 submitted 11 January, 2024;
originally announced January 2024.
-
Teaching Code LLMs to Use Autocompletion Tools in Repository-Level Code Generation
Authors:
Chong Wang,
Jian Zhang,
Yebo Feng,
Tianlin Li,
Weisong Sun,
Yang Liu,
Xin Peng
Abstract:
Recent code large language models (LLMs) have shown promising performance in generating standalone functions but face limitations in repository-level code generation due to their lack of awareness of repository-level dependencies (e.g., user-defined attributes), resulting in dependency errors such as undefined-variable and no-member errors. In this work, we introduce ToolGen, an approach that inte…
▽ More
Recent code large language models (LLMs) have shown promising performance in generating standalone functions but face limitations in repository-level code generation due to their lack of awareness of repository-level dependencies (e.g., user-defined attributes), resulting in dependency errors such as undefined-variable and no-member errors. In this work, we introduce ToolGen, an approach that integrates autocompletion tools into the code LLM generation process to address these dependencies. ToolGen comprises two main phases: Trigger Insertion and Model Fine-tuning (Offline), and Tool-integrated Code Generation (Online). During the offline phase, ToolGen augments functions within a given code corpus with a special mark token, indicating positions to trigger autocompletion tools. These augmented functions, along with their corresponding docstrings, are then used to fine-tune a selected code LLM. In the online phase, ToolGen iteratively generates functions by predicting tokens step-by-step using the fine-tuned LLM. Whenever a mark token is encountered, ToolGen invokes the autocompletion tool to suggest code completions and selects the most appropriate one.
We conduct comprehensive experiments to evaluate ToolGen's effectiveness in repository-level code generation. To facilitate this evaluation, we create a benchmark comprising 680 real-world code repositories and introduce two new repository-level metrics: Dependency Coverage and Static Validity Rate. The results demonstrate that ToolGen significantly improves Dependency Coverage by 15.2% to 45.8% and Static Validity Rate by 10.9% to 42.2% across three distinct code LLMs, while maintaining competitive performance in widely-recognized similarity metrics. Furthermore, our generalizability evaluation confirms ToolGen's consistent performance when applied to diverse code LLMs, including various model architectures and scales.
△ Less
Submitted 21 January, 2024; v1 submitted 12 January, 2024;
originally announced January 2024.
-
Knowledge Translation: A New Pathway for Model Compression
Authors:
Wujie Sun,
Defang Chen,
Jiawei Chen,
Yan Feng,
Chun Chen,
Can Wang
Abstract:
Deep learning has witnessed significant advancements in recent years at the cost of increasing training, inference, and model storage overhead. While existing model compression methods strive to reduce the number of model parameters while maintaining high accuracy, they inevitably necessitate the re-training of the compressed model or impose architectural constraints. To overcome these limitations…
▽ More
Deep learning has witnessed significant advancements in recent years at the cost of increasing training, inference, and model storage overhead. While existing model compression methods strive to reduce the number of model parameters while maintaining high accuracy, they inevitably necessitate the re-training of the compressed model or impose architectural constraints. To overcome these limitations, this paper presents a novel framework, termed \textbf{K}nowledge \textbf{T}ranslation (KT), wherein a ``translation'' model is trained to receive the parameters of a larger model and generate compressed parameters. The concept of KT draws inspiration from language translation, which effectively employs neural networks to convert different languages, maintaining identical meaning. Accordingly, we explore the potential of neural networks to convert models of disparate sizes, while preserving their functionality. We propose a comprehensive framework for KT, introduce data augmentation strategies to enhance model performance despite restricted training data, and successfully demonstrate the feasibility of KT on the MNIST dataset. Code is available at \url{https://github.com/zju-SWJ/KT}.
△ Less
Submitted 11 January, 2024;
originally announced January 2024.
-
Reconstruction of the Do** Profile in Vlasov-Poisson
Authors:
Ru-Yu Lai,
Qin Li,
Weiran Sun
Abstract:
We study the inverse problem of recovering the do** profile in the stationary Vlasov-Poisson equation, given the knowledge of the incoming and outgoing measurements at the boundary of the domain. This problem arises from identifying impurities in the semiconductor manufacturing. Our result states that, under suitable assumptions, the do** profile can be uniquely determined through an asymptoti…
▽ More
We study the inverse problem of recovering the do** profile in the stationary Vlasov-Poisson equation, given the knowledge of the incoming and outgoing measurements at the boundary of the domain. This problem arises from identifying impurities in the semiconductor manufacturing. Our result states that, under suitable assumptions, the do** profile can be uniquely determined through an asymptotic formula of the electric field that it generates.
△ Less
Submitted 9 January, 2024;
originally announced January 2024.
-
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models
Authors:
Zhen Qin,
Weigao Sun,
Dong Li,
Xuyang Shen,
Weixuan Sun,
Yiran Zhong
Abstract:
Linear attention is an efficient attention mechanism that has recently emerged as a promising alternative to conventional softmax attention. With its ability to process tokens in linear computational complexities, linear attention, in theory, can handle sequences of unlimited length without sacrificing speed, i.e., maintaining a constant training speed for various sequence lengths with a fixed mem…
▽ More
Linear attention is an efficient attention mechanism that has recently emerged as a promising alternative to conventional softmax attention. With its ability to process tokens in linear computational complexities, linear attention, in theory, can handle sequences of unlimited length without sacrificing speed, i.e., maintaining a constant training speed for various sequence lengths with a fixed memory consumption. However, due to the issue with cumulative summation (cumsum), current linear attention algorithms cannot demonstrate their theoretical advantage in a causal setting. In this paper, we present Lightning Attention-2, the first linear attention implementation that enables linear attention to realize its theoretical computational benefits. To achieve this, we leverage the thought of tiling, separately handling the intra-block and inter-block components in linear attention calculation. Specifically, we utilize the conventional attention computation mechanism for the intra-blocks and apply linear attention kernel tricks for the inter-blocks. A tiling technique is adopted through both forward and backward procedures to take full advantage of the GPU hardware. We implement our algorithm in Triton to make it IO-aware and hardware-friendly. Various experiments are conducted on different model sizes and sequence lengths. Lightning Attention-2 retains consistent training and inference speed regardless of input sequence length and is significantly faster than other attention mechanisms. The source code is available at https://github.com/OpenNLPLab/lightning-attention.
△ Less
Submitted 15 January, 2024; v1 submitted 9 January, 2024;
originally announced January 2024.
-
Nonuniform Sobolev Spaces
Authors:
Ting Chen,
Loukas Grafakos,
Wenchang Sun
Abstract:
We study nonuniform Sobolev spaces, i.e., spaces of functions whose partial derivatives lie in possibly different Lebesgue spaces. Although standard proofs do not apply, we show that nonuniform Sobolev spaces share similar properties as the classical ones. These spaces arise naturally in the study of certain PDEs. For instance, we illustrate that nonuniform fractional Sobolev spaces are useful in…
▽ More
We study nonuniform Sobolev spaces, i.e., spaces of functions whose partial derivatives lie in possibly different Lebesgue spaces. Although standard proofs do not apply, we show that nonuniform Sobolev spaces share similar properties as the classical ones. These spaces arise naturally in the study of certain PDEs. For instance, we illustrate that nonuniform fractional Sobolev spaces are useful in the study of local estimates for solutions of heat equations and the convergence of Schrödinger operators. In this work we extend recent advances on local energy estimates for solutions of heat equations and the convergence of Schrödinger operators to nonuniform fractional Sobolev spaces.
△ Less
Submitted 23 January, 2024; v1 submitted 5 January, 2024;
originally announced January 2024.
-
Rotation in Stellar Evolution: Probing the Influence on Population Synthesis in High-Redshift Galaxies
Authors:
Weijia Sun
Abstract:
Stellar population synthesis (SPS) is essential for understanding galaxy formation and evolution. However, the recent discovery of rotation-driven phenomena in star clusters warrants a review of uncertainties in SPS models caused by overlooked factors, including stellar rotation. In this study, we investigate the impact of rotation on SPS specifically using the PARSEC V2.0 rotation model and its i…
▽ More
Stellar population synthesis (SPS) is essential for understanding galaxy formation and evolution. However, the recent discovery of rotation-driven phenomena in star clusters warrants a review of uncertainties in SPS models caused by overlooked factors, including stellar rotation. In this study, we investigate the impact of rotation on SPS specifically using the PARSEC V2.0 rotation model and its implications for high redshift galaxies with the JWST. Rotation enhances the ultraviolet (UV) flux for up to $\sim 400$ Myr after the starburst, with the slope of UV increasing as the population gets faster rotating and more metal-poor. Using the Prospector tool, we construct simulated galaxies and deduce their properties associated with dust and star formation. Our results suggest that rapid rotation models result in a gradual UV slope up to 0.1 dex higher and an approximately 50\% increase in dust attenuation for identical wide-band spectral energy distributions. Furthermore, we investigate biases if the stellar population should be characterized by rapid rotation and demonstrate that accurate estimation can be achieved for rotation rates up to $ω_\text{i}=0.6$. Accounting for the bias in the case of rapid rotation aligns specific star formation rates more closely with predictions from theoretical models. Notably, this also implies a slightly higher level of dust attenuation than previously anticipated, while still allowing for a `dust-free' interpretation of the galaxy. The impact of rapid rotation SPS models on the rest-UV luminosity function is found to be minimal. Overall, our findings have potentially important implications for comprehending dust attenuation and mass assembly history in the high-redshift Universe.
△ Less
Submitted 5 January, 2024;
originally announced January 2024.
-
Production of Higgs Boson in ultra-peripheral heavy ion collisions with two-photon processes
Authors:
Gongming Yu,
Wenlong Sun
Abstract:
We calculated the production of the Higgs boson (H) by two-photon interaction with the equivalent photon approximation in nucleus-nucleus collision, proton-nucleus collision, and proton-proton collision. The numerical results show that the experimental study of the Higgs boson in ultra-peripheral collisions is feasible at the energies of the relativistic heavy ion collider (RHIC) and the large had…
▽ More
We calculated the production of the Higgs boson (H) by two-photon interaction with the equivalent photon approximation in nucleus-nucleus collision, proton-nucleus collision, and proton-proton collision. The numerical results show that the experimental study of the Higgs boson in ultra-peripheral collisions is feasible at the energies of the relativistic heavy ion collider (RHIC) and the large hadron collider (LHC).
△ Less
Submitted 3 January, 2024;
originally announced January 2024.
-
Spectral engineering of optical microresonators in anisotropic lithium niobate crystal
Authors:
Ke Zhang,
Yikun Chen,
Wenzhao Sun,
Zhaoxi Chen,
Hanke Feng,
Cheng Wang
Abstract:
On-chip optical microresonators are essential building blocks in integrated optics. The ability to arbitrarily engineer their resonant frequencies is crucial for exploring novel physics in synthetic frequency dimensions and practical applications like nonlinear optical parametric processes and dispersion-engineered frequency comb generation. Photonic crystal ring (PhCR) resonators are a versatile…
▽ More
On-chip optical microresonators are essential building blocks in integrated optics. The ability to arbitrarily engineer their resonant frequencies is crucial for exploring novel physics in synthetic frequency dimensions and practical applications like nonlinear optical parametric processes and dispersion-engineered frequency comb generation. Photonic crystal ring (PhCR) resonators are a versatile tool for such arbitrary frequency engineering, by controllably creating mode splitting at selected resonances. To date, these PhCRs have mostly been demonstrated in isotropic photonic materials, while such engineering could be significantly more complicated in anisotropic platforms that often offer more fruitful optical properties. Here, we realize the spectral engineering of chip-scale optical microresonators in the anisotropic lithium niobate (LN) crystal by a gradient design that precisely compensates for variations in both refractive index and perturbation strength. We experimentally demonstrate controllable frequency splitting at single and multiple selected resonances in LN PhCR resonators with different sizes, while maintaining high Q-factors up to 1 million. Moreover, we experimentally construct a sharp boundary in the synthetic frequency dimension based on an actively modulated x-cut LN gradient-PhCR, opening up new paths toward the arbitrary control of electro-optic comb spectral shapes and exploration of novel physics in the frequency degree of freedom.
△ Less
Submitted 3 January, 2024;
originally announced January 2024.
-
Partial Wave Analysis of $J/ψ\rightarrow γγφ$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (603 additional authors not shown)
Abstract:
Using a sample of $(10087\pm44)\times10^{6}$ $J/ψ$ events collected with the BESIII detector at the BEPCII collider, a partial wave analysis on the decay $γγφ$ is performed to investigate the intermediate resonances in $J/ψ\rightarrowγX, X\rightarrowγφ$. The resonances $f_{1}(1285)$, $η(1405)$, $f_{1}(1420)$, $f_{1}(1510)$, $f_{2}(1525)$, $X(1835)$, $f_{2}(1950)$, $f_{2}(2010)$, $f_{0}(2200)$ and…
▽ More
Using a sample of $(10087\pm44)\times10^{6}$ $J/ψ$ events collected with the BESIII detector at the BEPCII collider, a partial wave analysis on the decay $γγφ$ is performed to investigate the intermediate resonances in $J/ψ\rightarrowγX, X\rightarrowγφ$. The resonances $f_{1}(1285)$, $η(1405)$, $f_{1}(1420)$, $f_{1}(1510)$, $f_{2}(1525)$, $X(1835)$, $f_{2}(1950)$, $f_{2}(2010)$, $f_{0}(2200)$ and $η_{c}$ are observed with statistical significance greater than 5$σ$. The product branching fractions $\mathcal{B}(J/ψ\rightarrowγX, X\rightarrow γφ)$ are reported. The resonance parameters of $η(1405)$ and $X(1835)$ are also measured.
△ Less
Submitted 1 January, 2024;
originally announced January 2024.
-
Observation of $\mathcal R(3810)$ in $e^+e^-\rightarrow {\rm hadrons}$ and Improved Measurements of the Resonance Parameters of $\mathcal R(3760)$ and $\mathcal R(3780)$
Authors:
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
J. Bloms,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (596 additional authors not shown)
Abstract:
We report the measurement of the cross sections for $e^+e^-\rightarrow {\rm hadrons}$ at center-of-mass (c.m.) energies from 3.645 to 3.871 GeV. We observe a new resonance $\mathcal R(3810)$ in the cross sections for the first time, and observe the $\mathcal R(3760)$ resonance with high significance in the cross sections. The $\mathcal R(3810)$ has a mass of $(3804.5 \pm 0.9 \pm 0.9)$ ~MeV/$c^2$,…
▽ More
We report the measurement of the cross sections for $e^+e^-\rightarrow {\rm hadrons}$ at center-of-mass (c.m.) energies from 3.645 to 3.871 GeV. We observe a new resonance $\mathcal R(3810)$ in the cross sections for the first time, and observe the $\mathcal R(3760)$ resonance with high significance in the cross sections. The $\mathcal R(3810)$ has a mass of $(3804.5 \pm 0.9 \pm 0.9)$ ~MeV/$c^2$, a total width of $(5.4 \pm 3.5 \pm 3.2)$~MeV, and an electronic partial width of $(19.4 \pm 7.4 \pm 12.1)$~eV. Its significance is $7.7σ$. The $\mathcal R(3810)$ could be interpreted as a hadro-charmonium resonance predicted by Quantum Chromodynamics (QCD). In addition, we measure the mass $(3751.9\pm 3.8\pm 2.8)$ ~MeV/$c^2$, the total width $(32.8 \pm 5.8 \pm 8.7)$~MeV, and the electronic partial width $(184\pm 75\pm 86)$~eV with improved precision for the $\mathcal R(3760)$. Furthermore, for the $\mathcal R(3780)$ we measure the mass $(3778.7\pm 0.5\pm 0.3)$ ~MeV/$c^2$ and total width $(20.3 \pm 0.8 \pm 1.7)$~MeV with improved precision, and the electronic partial width $(265\pm 69\pm 83)$~eV. The $\mathcal R(3780)$ can be interpreted as the $1^3D_1$ state of charmonium. Its mass and total width differ significantly from the corresponding fitted values given by the Particle Data Group in 2022 by 7.1 and 3.2 times the uncertainties for $ψ(3770)$, respectively. $ψ(3770)$ has been interpreted as the $1^3D_1$ state for 45 years.
△ Less
Submitted 30 December, 2023;
originally announced January 2024.
-
Distillation is All You Need for Practically Using Different Pre-trained Recommendation Models
Authors:
Wenqi Sun,
Ruobing Xie,
Junjie Zhang,
Wayne Xin Zhao,
Leyu Lin,
Ji-Rong Wen
Abstract:
Pre-trained recommendation models (PRMs) have attracted widespread attention recently. However, their totally different model structure, huge model size and computation cost hinder their application in practical recommender systems. Hence, it is highly essential to explore how to practically utilize PRMs in real-world recommendations. In this paper, we propose a novel joint knowledge distillation…
▽ More
Pre-trained recommendation models (PRMs) have attracted widespread attention recently. However, their totally different model structure, huge model size and computation cost hinder their application in practical recommender systems. Hence, it is highly essential to explore how to practically utilize PRMs in real-world recommendations. In this paper, we propose a novel joint knowledge distillation from different pre-trained recommendation models named PRM-KD for recommendation, which takes full advantages of diverse PRMs as teacher models for enhancing student models efficiently. Specifically, PRM-KD jointly distills diverse informative knowledge from multiple representative PRMs such as UniSRec, Recformer, and UniM^2Rec. The knowledge from the above PRMs are then smartly integrated into the student recommendation model considering their confidence and consistency. We further verify the universality of PRM-KD with various types of student models, including sequential recommendation, feature interaction, and graph-based models. Extensive experiments on five real-world datasets demonstrate the effectiveness and efficacy of PRM-KD, which could be viewed as an economical shortcut in practically and conveniently making full use of different PRMs in online systems.
△ Less
Submitted 1 January, 2024;
originally announced January 2024.
-
Machine Translation Testing via Syntactic Tree Pruning
Authors:
Quanjun Zhang,
Juan Zhai,
Chunrong Fang,
Jiawei Liu,
Weisong Sun,
Haichuan Hu,
Qingyu Wang
Abstract:
Machine translation systems have been widely adopted in our daily life, making life easier and more convenient. Unfortunately, erroneous translations may result in severe consequences, such as financial losses. This requires to improve the accuracy and the reliability of machine translation systems. However, it is challenging to test machine translation systems because of the complexity and intrac…
▽ More
Machine translation systems have been widely adopted in our daily life, making life easier and more convenient. Unfortunately, erroneous translations may result in severe consequences, such as financial losses. This requires to improve the accuracy and the reliability of machine translation systems. However, it is challenging to test machine translation systems because of the complexity and intractability of the underlying neural models. To tackle these challenges, we propose a novel metamorphic testing approach by syntactic tree pruning (STP) to validate machine translation systems. Our key insight is that a pruned sentence should have similar crucial semantics compared with the original sentence. Specifically, STP (1) proposes a core semantics-preserving pruning strategy by basic sentence structure and dependency relations on the level of syntactic tree representation; (2) generates source sentence pairs based on the metamorphic relation; (3) reports suspicious issues whose translations break the consistency property by a bag-of-words model. We further evaluate STP on two state-of-the-art machine translation systems (i.e., Google Translate and Bing Microsoft Translator) with 1,200 source sentences as inputs. The results show that STP can accurately find 5,073 unique erroneous translations in Google Translate and 5,100 unique erroneous translations in Bing Microsoft Translator (400% more than state-of-the-art techniques), with 64.5% and 65.4% precision, respectively. The reported erroneous translations vary in types and more than 90% of them cannot be found by state-of-the-art techniques. There are 9,393 erroneous translations unique to STP, which is 711.9% more than state-of-the-art techniques. Moreover, STP is quite effective to detect translation errors for the original sentences with a recall reaching 74.0%, improving state-of-the-art techniques by 55.1% on average.
△ Less
Submitted 1 January, 2024;
originally announced January 2024.
-
Online Tensor Inference
Authors:
Xin Wen,
Will Wei Sun,
Yichen Zhang
Abstract:
Recent technological advances have led to contemporary applications that demand real-time processing and analysis of sequentially arriving tensor data. Traditional offline learning, involving the storage and utilization of all data in each computational iteration, becomes impractical for high-dimensional tensor data due to its voluminous size. Furthermore, existing low-rank tensor methods lack the…
▽ More
Recent technological advances have led to contemporary applications that demand real-time processing and analysis of sequentially arriving tensor data. Traditional offline learning, involving the storage and utilization of all data in each computational iteration, becomes impractical for high-dimensional tensor data due to its voluminous size. Furthermore, existing low-rank tensor methods lack the capability for statistical inference in an online fashion, which is essential for real-time predictions and informed decision-making. This paper addresses these challenges by introducing a novel online inference framework for low-rank tensor learning. Our approach employs Stochastic Gradient Descent (SGD) to enable efficient real-time data processing without extensive memory requirements, thereby significantly reducing computational demands. We establish a non-asymptotic convergence result for the online low-rank SGD estimator, nearly matches the minimax optimal rate of estimation error in offline models that store all historical data. Building upon this foundation, we propose a simple yet powerful online debiasing approach for sequential statistical inference in low-rank tensor learning. The entire online procedure, covering both estimation and inference, eliminates the need for data splitting or storing historical data, making it suitable for on-the-fly hypothesis testing. Given the sequential nature of our data collection, traditional analyses relying on offline methods and sample splitting are inadequate. In our analysis, we control the sum of constructed super-martingales to ensure estimates along the entire solution path remain within the benign region. Additionally, a novel spectral representation tool is employed to address statistical dependencies among iterative estimates, establishing the desired asymptotic normality.
△ Less
Submitted 28 December, 2023;
originally announced December 2023.
-
Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-Defined Levels
Authors:
Haoning Wu,
Zicheng Zhang,
Weixia Zhang,
Chaofeng Chen,
Liang Liao,
Chunyi Li,
Yixuan Gao,
Annan Wang,
Erli Zhang,
Wenxiu Sun,
Qiong Yan,
Xiongkuo Min,
Guangtao Zhai,
Weisi Lin
Abstract:
The explosion of visual content available online underscores the requirement for an accurate machine assessor to robustly evaluate scores across diverse types of visual contents. While recent studies have demonstrated the exceptional potentials of large multi-modality models (LMMs) on a wide range of related fields, in this work, we explore how to teach them for visual rating aligned with human op…
▽ More
The explosion of visual content available online underscores the requirement for an accurate machine assessor to robustly evaluate scores across diverse types of visual contents. While recent studies have demonstrated the exceptional potentials of large multi-modality models (LMMs) on a wide range of related fields, in this work, we explore how to teach them for visual rating aligned with human opinions. Observing that human raters only learn and judge discrete text-defined levels in subjective studies, we propose to emulate this subjective process and teach LMMs with text-defined rating levels instead of scores. The proposed Q-Align achieves state-of-the-art performance on image quality assessment (IQA), image aesthetic assessment (IAA), as well as video quality assessment (VQA) tasks under the original LMM structure. With the syllabus, we further unify the three tasks into one model, termed the OneAlign. In our experiments, we demonstrate the advantage of the discrete-level-based syllabus over direct-score-based variants for LMMs. Our code and the pre-trained weights are released at https://github.com/Q-Future/Q-Align.
△ Less
Submitted 28 December, 2023;
originally announced December 2023.
-
Search for a massless particle beyond the Standard Model in the $Σ^+\rightarrow p+{\rm invisible}$ decay
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (634 additional authors not shown)
Abstract:
A massless particle beyond the Standard Model is searched for in the two-body decay $Σ^+\rightarrow p+{\rm invisible}$ using $(1.0087\pm0.0044)\times10^{10}$ $J/ψ$ events collected at a center-of-mass energy of $\sqrt{s}=3.097$ GeV with the BESIII detector at the BEPCII collider. No significant signal is observed, and the upper limit on the branching fraction $B(Σ^+\rightarrow p+{\rm invisible})$…
▽ More
A massless particle beyond the Standard Model is searched for in the two-body decay $Σ^+\rightarrow p+{\rm invisible}$ using $(1.0087\pm0.0044)\times10^{10}$ $J/ψ$ events collected at a center-of-mass energy of $\sqrt{s}=3.097$ GeV with the BESIII detector at the BEPCII collider. No significant signal is observed, and the upper limit on the branching fraction $B(Σ^+\rightarrow p+{\rm invisible})$ is determined to be $3.2\times10^{-5}$ at the 90% confidence level. This is the first search for a flavor-changing neutral current process with missing energy in hyperon decays which plays an important role in constraining new physics models.
△ Less
Submitted 5 April, 2024; v1 submitted 28 December, 2023;
originally announced December 2023.
-
Observation of $χ_{cJ}\to 3(K^+K^-)$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (632 additional authors not shown)
Abstract:
By analyzing $(27.12\pm0.14)\times10^8$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII collider, the decay processes $χ_{cJ} \to 3(K^+K^-)$ ($J=0,1,2$) are observed for the first time with statistical significances of 8.2$σ$, 8.1$σ$, and 12.4$σ$, respectively. The product branching fractions of $ψ(3686)\toγχ_{cJ}$, $χ_{cJ}\to 3(K^+K^-)$ are presented and the branching…
▽ More
By analyzing $(27.12\pm0.14)\times10^8$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII collider, the decay processes $χ_{cJ} \to 3(K^+K^-)$ ($J=0,1,2$) are observed for the first time with statistical significances of 8.2$σ$, 8.1$σ$, and 12.4$σ$, respectively. The product branching fractions of $ψ(3686)\toγχ_{cJ}$, $χ_{cJ}\to 3(K^+K^-)$ are presented and the branching fractions of $χ_{cJ}\to 3(K^+K^-)$ decays are determined to be
$\mathcal{B}_{χ_{c0}\to 3(K^+K^-)}$=$(10.7\pm1.8\pm1.1)$$\times10^{-6}$,
$\mathcal{B}_{χ_{c1}\to 3(K^+K^-)}$=$(4.2\pm0.9\pm0.5)$$\times10^{-6}$, and
$\mathcal{B}_{χ_{c2}\to 3(K^+K^-)}$=$(7.2\pm1.1\pm0.8)$$\times10^{-6}$,
where the first uncertainties are statistical and the second are systematic.
△ Less
Submitted 26 December, 2023;
originally announced December 2023.
-
A Prompt Learning Framework for Source Code Summarization
Authors:
Weisong Sun,
Chunrong Fang,
Yudu You,
Yuchen Chen,
Yi Liu,
Chong Wang,
Jian Zhang,
Quanjun Zhang,
Hanwei Qian,
Wei Zhao,
Yang Liu,
Zhenyu Chen
Abstract:
(Source) code summarization is the task of automatically generating natural language summaries for given code snippets. Such summaries play a key role in hel** developers understand and maintain source code. Recently, with the successful application of large language models (LLMs) in numerous fields, software engineering researchers have also attempted to adapt LLMs to solve code summarization t…
▽ More
(Source) code summarization is the task of automatically generating natural language summaries for given code snippets. Such summaries play a key role in hel** developers understand and maintain source code. Recently, with the successful application of large language models (LLMs) in numerous fields, software engineering researchers have also attempted to adapt LLMs to solve code summarization tasks. The main adaptation schemes include instruction prompting and task-oriented fine-tuning. However, instruction prompting involves designing crafted prompts for zero-shot learning or selecting appropriate samples for few-shot learning and requires users to have professional domain knowledge, while task-oriented fine-tuning requires high training costs. In this paper, we propose a novel prompt learning framework for code summarization called PromptCS. PromptCS trains a prompt agent that can generate continuous prompts to unleash the potential for LLMs in code summarization. Compared to the human-written discrete prompt, the continuous prompts are produced under the guidance of LLMs and are therefore easier to understand by LLMs. PromptCS freezes the parameters of LLMs when training the prompt agent, which can greatly reduce the requirements for training resources. We evaluate PromptCS on the CodeSearchNet dataset involving multiple programming languages. The results show that PromptCS significantly outperforms instruction prompting schemes on all four widely used metrics. In some base LLMs, e.g., CodeGen-Multi-2B and StarCoderBase-1B and -3B, PromptCS even outperforms the task-oriented fine-tuning scheme. More importantly, the training efficiency of PromptCS is faster than the task-oriented fine-tuning scheme, with a more pronounced advantage on larger LLMs. The results of the human evaluation demonstrate that PromptCS can generate more good summaries compared to baselines.
△ Less
Submitted 26 December, 2023;
originally announced December 2023.
-
Searching for Two-Neutrino and Neutrinoless Double Beta Decay of $^{134}$Xe with the PandaX-4T Experiment
Authors:
PandaX Collaboration,
Xiyu Yan,
Zhaokan Cheng,
Abdusalam Abdukerim,
Zihao Bo,
Wei Chen,
Xun Chen,
Chen Cheng,
Xiangyi Cui,
Yingjie Fan,
Deqing Fang,
Changbo Fu,
Mengting Fu,
Lisheng Geng,
Karl Giboni,
Linhui Gu,
Xuyuan Guo,
Chencheng Han,
Ke Han,
Changda He,
**rong He,
Di Huang,
Yanlin Huang,
Junting Huang,
Zhou Huang
, et al. (72 additional authors not shown)
Abstract:
$^{134}$Xe is a candidate isotope for neutrinoless double beta decay~($0νββ$) search. In addition, the two-neutrino case ($2νββ$) allowed by the Standard Model of particle physics has not yet been observed. Utilizing the 10.4% of $^{134}$Xe in the natural xenon in the PandaX-4T detector and its first 94.9-day exposure, we have established the most stringent constraints on $2νββ$ and $0νββ$ of $^{1…
▽ More
$^{134}$Xe is a candidate isotope for neutrinoless double beta decay~($0νββ$) search. In addition, the two-neutrino case ($2νββ$) allowed by the Standard Model of particle physics has not yet been observed. Utilizing the 10.4% of $^{134}$Xe in the natural xenon in the PandaX-4T detector and its first 94.9-day exposure, we have established the most stringent constraints on $2νββ$ and $0νββ$ of $^{134}$Xe half-lives, with limits of $2.8\times10^{22}$ yr and $3.0\times10^{23}$ yr at 90% confidence level, respectively. The $2νββ$ ($0νββ$) limit surpasses the previously reported best result by a factor of 32 (2.7), highlighting the potential of large monolithic natural xenon detectors.
△ Less
Submitted 28 April, 2024; v1 submitted 25 December, 2023;
originally announced December 2023.
-
Q-Boost: On Visual Quality Assessment Ability of Low-level Multi-Modality Foundation Models
Authors:
Zicheng Zhang,
Haoning Wu,
Zhongpeng Ji,
Chunyi Li,
Erli Zhang,
Wei Sun,
Xiaohong Liu,
Xiongkuo Min,
Fengyu Sun,
Shangling Jui,
Weisi Lin,
Guangtao Zhai
Abstract:
Recent advancements in Multi-modality Large Language Models (MLLMs) have demonstrated remarkable capabilities in complex high-level vision tasks. However, the exploration of MLLM potential in visual quality assessment, a vital aspect of low-level vision, remains limited. To address this gap, we introduce Q-Boost, a novel strategy designed to enhance low-level MLLMs in image quality assessment (IQA…
▽ More
Recent advancements in Multi-modality Large Language Models (MLLMs) have demonstrated remarkable capabilities in complex high-level vision tasks. However, the exploration of MLLM potential in visual quality assessment, a vital aspect of low-level vision, remains limited. To address this gap, we introduce Q-Boost, a novel strategy designed to enhance low-level MLLMs in image quality assessment (IQA) and video quality assessment (VQA) tasks, which is structured around two pivotal components: 1) Triadic-Tone Integration: Ordinary prompt design simply oscillates between the binary extremes of $positive$ and $negative$. Q-Boost innovates by incorporating a `middle ground' approach through $neutral$ prompts, allowing for a more balanced and detailed assessment. 2) Multi-Prompt Ensemble: Multiple quality-centric prompts are used to mitigate bias and acquire more accurate evaluation. The experimental results show that the low-level MLLMs exhibit outstanding zeros-shot performance on the IQA/VQA tasks equipped with the Q-Boost strategy.
△ Less
Submitted 23 December, 2023;
originally announced December 2023.
-
A Survey on Large Language Models for Software Engineering
Authors:
Quanjun Zhang,
Chunrong Fang,
Yang Xie,
Yaxin Zhang,
Yun Yang,
Weisong Sun,
Shengcheng Yu,
Zhenyu Chen
Abstract:
Software Engineering (SE) is the systematic design, development, and maintenance of software applications, underpinning the digital infrastructure of our modern mainworld. Very recently, the SE community has seen a rapidly increasing number of techniques employing Large Language Models (LLMs) to automate a broad range of SE tasks. Nevertheless, existing information of the applications, effects, an…
▽ More
Software Engineering (SE) is the systematic design, development, and maintenance of software applications, underpinning the digital infrastructure of our modern mainworld. Very recently, the SE community has seen a rapidly increasing number of techniques employing Large Language Models (LLMs) to automate a broad range of SE tasks. Nevertheless, existing information of the applications, effects, and possible limitations of LLMs within SE is still not well-studied.
In this paper, we provide a systematic survey to summarize the current state-of-the-art research in the LLM-based SE community. We summarize 30 representative LLMs of Source Code across three model architectures, 15 pre-training objectives across four categories, and 16 downstream tasks across five categories. We then present a detailed summarization of the recent SE studies for which LLMs are commonly utilized, including 155 studies for 43 specific code-related tasks across four crucial phases within the SE workflow. Besides, we summarize existing attempts to empirically evaluate LLMs in SE, such as benchmarks, empirical studies, and exploration of SE education. We also discuss several critical aspects of optimization and applications of LLMs in SE, such as security attacks, model tuning, and model compression. Finally, we highlight several challenges and potential opportunities on applying LLMs for future SE studies, such as exploring domain LLMs and constructing clean evaluation datasets. Overall, our work can help researchers gain a comprehensive understanding about the achievements of the existing LLM-based SE studies and promote the practical application of these techniques. Our artifacts are publicly available and will continuously updated at the living repository: \url{https://github.com/iSEngLab/AwesomeLLM4SE}.
△ Less
Submitted 23 December, 2023;
originally announced December 2023.
-
Search for the decay $χ_{c1}(3872)\toπ^{+}π^{-}χ_{c1}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (608 additional authors not shown)
Abstract:
Using a data sample corresponding to an integrated luminosity of 10.9 fb$^{-1}$ collected at center-of-mass energies from 4.16 to 4.34 GeV with the BESIII detector, we search for the decay $χ_{c1}(3872) \to π^{+}π^{-}χ_{c1}$ in the radiative production $e^{+}e^{-} \to γχ_{c1}(3872)$. No significant signal is observed, and the ratio for the branching fraction of $χ_{c1}(3872) \to π^{+}π^{-}χ_{c1}$…
▽ More
Using a data sample corresponding to an integrated luminosity of 10.9 fb$^{-1}$ collected at center-of-mass energies from 4.16 to 4.34 GeV with the BESIII detector, we search for the decay $χ_{c1}(3872) \to π^{+}π^{-}χ_{c1}$ in the radiative production $e^{+}e^{-} \to γχ_{c1}(3872)$. No significant signal is observed, and the ratio for the branching fraction of $χ_{c1}(3872) \to π^{+}π^{-}χ_{c1}$ to $χ_{c1}(3872) \to π^{+}π^{-}J/ψ$ is measured as $\mathcal{R}\equiv\frac{\mathcal{B}[χ_{c1}(3872) \to π^{+}π^{-}χ_{c1}]}{\mathcal{B}[χ_{c1}(3872) \to π^{+}π^{-} J/ψ]}<0.18$ at 90$\%$ confidence level. The upper limit on the product of the cross section $σ[e^{+}e^{-}\toγχ_{c1}(3872)]$ and the branching fraction $\mathcal{B}[χ_{c1}(3872)\toπ^{+}π^{-}χ_{c1}]$ at each center-of-mass energy is also given. These measurements favor the non-conventional charmonium nature of the $χ_{c1}(3872)$ state.
△ Less
Submitted 21 December, 2023;
originally announced December 2023.
-
IOPS: An Unified SpMM Accelerator Based on Inner-Outer-Hybrid Product
Authors:
Wenhao Sun,
Wendi Sun,
Song Chen,
Yi Kang
Abstract:
Sparse matrix multiplication (SpMM) is widely applied to numerous domains, such as graph processing, machine learning, and data analytics. However, inner product based SpMM induces redundant zero-element computing for mismatched nonzero operands, while outer product based approach lacks input reuse across Process Elements (PEs) and poor output locality for accumulating partial sum (psum) matrices.…
▽ More
Sparse matrix multiplication (SpMM) is widely applied to numerous domains, such as graph processing, machine learning, and data analytics. However, inner product based SpMM induces redundant zero-element computing for mismatched nonzero operands, while outer product based approach lacks input reuse across Process Elements (PEs) and poor output locality for accumulating partial sum (psum) matrices. Besides, current works only focus on sparse-sparse matrix multiplication (SSMM) or sparse-dense matrix multiplication (SDMM), rarely performing efficiently for both. To address these problems, this paper proposes an unified SpMM accelerator, called IOPS, hybridizing inner with outer products. It reuses the input matrix among PEs with inner product dataflow, and removes zero-element calculations with outer product approach in each PE, which can efficiently process SSMM and SDMM. Moreover, an address map** method is designed to accumulate the irregular sparse psum matrices, reducing the latency and DRAM access of psum accumulating. Furthermore, an adaptive partition strategy is proposed to tile the input matrices based on their sparsity ratios, effectively utilizing the storage of architecture and reducing DRAM access. Compared with the SSMM accelerator, SpArch, we achieve 1.7x~6.3x energy efficiency and 1.2x~4.4x resource efficiency, with 1.4x~2.1x DRAM access saving.
△ Less
Submitted 20 December, 2023;
originally announced December 2023.
-
An equivalent inequality for the Riemann hypothesis
Authors:
Wei Sun
Abstract:
We present a purely analytical inequality which is equivalent to the Riemann hypothesis (RH). The proof of the equivalence is based on a representation of the modulus of the Riemann $ξ$ function. As the first step to analyze the inequality, we consider polynomial approximations. We also show that the RH is equivalent to the statement that some wave functions constructed using the Brownian motion n…
▽ More
We present a purely analytical inequality which is equivalent to the Riemann hypothesis (RH). The proof of the equivalence is based on a representation of the modulus of the Riemann $ξ$ function. As the first step to analyze the inequality, we consider polynomial approximations. We also show that the RH is equivalent to the statement that some wave functions constructed using the Brownian motion never evolve into perfectly distinguishable states.
△ Less
Submitted 31 March, 2024; v1 submitted 19 December, 2023;
originally announced December 2023.
-
Measurements of $Σ$ electromagnetic form factors in the time-like region using the untagged initial-state radiation technique
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (626 additional authors not shown)
Abstract:
The process $e^{+}e^{-}\toΣ^{+}\barΣ^{-}$ is studied from threshold up to 3.04 GeV/$c^2$ via the initial-state radiation technique using data with an integrated luminosity of 12.0 fb$^{-1}$, collected at center-of-mass energies between 3.773 and 4.258 GeV with the BESIII detector at the BEPCII collider. The pair production cross sections and the effective form factors of $Σ$ are measured in eleven…
▽ More
The process $e^{+}e^{-}\toΣ^{+}\barΣ^{-}$ is studied from threshold up to 3.04 GeV/$c^2$ via the initial-state radiation technique using data with an integrated luminosity of 12.0 fb$^{-1}$, collected at center-of-mass energies between 3.773 and 4.258 GeV with the BESIII detector at the BEPCII collider. The pair production cross sections and the effective form factors of $Σ$ are measured in eleven $Σ^{+}\barΣ^{-}$ invariant mass intervals from threshold to 3.04 GeV/$c^2$. The results are consistent with the previous results from Belle and BESIII. Furthermore, the branching fractions of the decays $J/ψ\toΣ^{+}\barΣ^{-}$ and $ψ(3686)\toΣ^{+}\barΣ^{-}$ are determined and the obtained results are consistent with the previous results of BESIII.
△ Less
Submitted 19 December, 2023;
originally announced December 2023.
-
Waveform Simulation in PandaX-4T
Authors:
Jiafu Li,
Abdusalam Abdukerim,
Chen Cheng,
Zihao Bo,
Wei Chen,
Xun Chen,
Yunhua Chen,
Zhaokan Cheng,
Xiangyi Cui,
Yingjie Fan,
Deqing Fang,
Changbo Fu,
Mengting Fu,
Lisheng Geng,
Karl Giboni,
Linhui Gu,
Xuyuan Guo,
Chencheng Han,
Ke Han,
Changda He,
**rong He,
Di Huang,
Yanlin Huang,
Zhou Huang,
Ruquan Hou
, et al. (66 additional authors not shown)
Abstract:
Signal reconstruction through software processing is a crucial component of the background and signal models in the PandaX-4T experiment, which is a multi-tonne dark matter direct search experiment. The accuracy of signal reconstruction is influenced by various detector artifacts, including noise, dark count of photomultiplier, impurity photoionization in the detector, and other relevant considera…
▽ More
Signal reconstruction through software processing is a crucial component of the background and signal models in the PandaX-4T experiment, which is a multi-tonne dark matter direct search experiment. The accuracy of signal reconstruction is influenced by various detector artifacts, including noise, dark count of photomultiplier, impurity photoionization in the detector, and other relevant considerations. In this study, we present a detailed description of a semi-data-driven approach designed to simulate the signal waveform. This work provides a reliable model for the efficiency and bias of the signal reconstruction in the data analysis of PandaX-4T. By comparing critical variables which relate to the temporal shape and hit pattern of the signals, we demonstrate a good agreement between the simulation and data.
△ Less
Submitted 21 May, 2024; v1 submitted 18 December, 2023;
originally announced December 2023.
-
Observation of significant flavor-SU(3) breaking in the kaon wave function at $12~{\rm GeV}^2<Q^2<25~{\rm GeV}^2$ and discovery of the charmless decay $ψ(3770)\to K_S^0K_L^0$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (607 additional authors not shown)
Abstract:
We present cross sections for the reaction $e^+e^-\to K_S^0K_L^0$ at center-of-mass energies ranging from 3.51 GeV to 4.95 GeV using data samples collected in the BESIII experiment, corresponding to a total integrated luminosity of 26.5 fb$^{-1}$. The ratio of neutral-to-charged kaon form factors at large momentum transfers ($12~{\rm GeV}^2<Q^2<25~{\rm GeV}^2$) is determined to be $0.21\pm 0.01$,…
▽ More
We present cross sections for the reaction $e^+e^-\to K_S^0K_L^0$ at center-of-mass energies ranging from 3.51 GeV to 4.95 GeV using data samples collected in the BESIII experiment, corresponding to a total integrated luminosity of 26.5 fb$^{-1}$. The ratio of neutral-to-charged kaon form factors at large momentum transfers ($12~{\rm GeV}^2<Q^2<25~{\rm GeV}^2$) is determined to be $0.21\pm 0.01$, which indicates a small but significant effect of flavor-SU(3) breaking in the kaon wave function, and consequently excludes the possibility that flavor-SU(3) breaking is the primary reason for the strong experimental violation of the pQCD prediction $|F(π^{\pm})|/|F(K^{\pm})|=f^2_π/f^2_{K}$, where $F(π^{\pm})$ and $F(K^{\pm})$ are the form factors, and $f_π$ and $f_{K}$ are the decay constants of charged pions and kaons, respectively. We also observe a significant signal for the charmless decay $ψ(3770)\to K_S^0K_L^0$ for the first time. Within a $1σ$ contour of the likelihood value, the the branching fraction for $ψ(3770)\to K_S^0K_L^0$ is determined to be ${\cal B}=(2.63_{-1.59}^{+1.40})\times 10^{-5}$, and the relative phase between the continuum and $ψ(3770)$ amplitudes is $φ=(-0.39_{-0.10}^{+0.05})π$. The branching fraction is in good agreement with the $\mathcal{S}$- and $\mathcal{D}$-wave charmonia mixing scheme proposed in the interpretation of the "$ρπ$ puzzle" between $J/ψ$ and $ψ(3686)$ decays.
△ Less
Submitted 18 December, 2023;
originally announced December 2023.
-
Advancing large-scale thin-film PPLN nonlinear photonics with segmented tunable micro-heaters
Authors:
Xiaoting Li,
Haochuan Li,
Zhenzheng Wang,
Zhaoxi Chen,
Fei Ma,
Ke Zhang,
Wenzhao Sun,
Cheng Wang
Abstract:
Thin-film periodically poled lithium niobate (TF-PPLN) devices have recently gained prominence for efficient wavelength conversion processes in both classical and quantum applications. However, the patterning and poling of TF-PPLN devices today are mostly performed at chip scales, presenting a significant bottleneck for future large-scale nonlinear photonic systems that require the integration of…
▽ More
Thin-film periodically poled lithium niobate (TF-PPLN) devices have recently gained prominence for efficient wavelength conversion processes in both classical and quantum applications. However, the patterning and poling of TF-PPLN devices today are mostly performed at chip scales, presenting a significant bottleneck for future large-scale nonlinear photonic systems that require the integration of multiple nonlinear components with consistent performance and low cost. Here, we take a pivotal step towards this goal by develo** a wafer-scale TF-PPLN nonlinear photonic platform, leveraging ultraviolet stepper lithography and an automated poling process. To address the inhomogeneous broadening of the quasi-phase matching (QPM) spectrum induced by film thickness variations across the wafer, we propose and demonstrate segmented thermal optic tuning modules that can precisely adjust and align the QPM peak wavelengths in each section. \hl{Using the segmented micro-heaters, we show the successful realignment of inhomogeneously broadened multi-peak QPM spectra with up to 57$\%$ enhancement of conversion efficiency. We achieve a high normalized conversion efficiency of 3802$\%$W$^{-1}$cm$^{-2}$ in a 6 mm long PPLN waveguide, recovering 84$\%$ of the theoretically predicted efficiency in this device.} The advanced fabrication techniques and segmented tuning architectures presented herein pave the way for wafer-scale integration of complex functional nonlinear photonic circuits with applications in quantum information processing, precision sensing and metrology, and low-noise-figure optical signal amplification.
△ Less
Submitted 15 March, 2024; v1 submitted 15 December, 2023;
originally announced December 2023.
-
Measurements of Born Cross Sections for $e^+e^-\to Λ_{c}^+ \barΛ_{c}(2595)^- + {\rm c.c.}$ and $e^+e^-\to Λ_{c}^+ \barΛ_{c}(2625)^- + {\rm c.c.}$ at $\sqrt{s}=$4918.0 and 4950.9 MeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (620 additional authors not shown)
Abstract:
Using $e^+e^-$ collision data collected with the BESIII detector operating at the BEPCII collider, the Born cross sections of $e^+e^-\to Λ_{c}^+ \barΛ_{c}(2595)^- + \rm{c.c.}$ and $e^+e^-\to Λ_{c}^+ \barΛ_{c}(2625)^- + \rm{c.c.}$ are measured for the first time at center-of-mass energies of $\sqrt{s}=4918.0$ and 4950.9 MeV. Non-zero cross sections are observed very close to the production threshol…
▽ More
Using $e^+e^-$ collision data collected with the BESIII detector operating at the BEPCII collider, the Born cross sections of $e^+e^-\to Λ_{c}^+ \barΛ_{c}(2595)^- + \rm{c.c.}$ and $e^+e^-\to Λ_{c}^+ \barΛ_{c}(2625)^- + \rm{c.c.}$ are measured for the first time at center-of-mass energies of $\sqrt{s}=4918.0$ and 4950.9 MeV. Non-zero cross sections are observed very close to the production threshold. The measured Born cross sections of $e^+e^-\to Λ_{c}^+ \barΛ_{c}(2625)^- + \rm{c.c.}$ are about $2\sim3$ times greater than those of $e^+e^-\to Λ_{c}^+ \barΛ_{c}(2595)^- + \rm{c.c.}$, thereby indicating that the exotic structure potentially exists in the excited charmed baryons. The Born cross sections are $15.6\pm3.1\pm0.9$ pb and $29.4\pm3.7\pm2.7$ pb for $e^+e^-\to Λ_{c}^+ \barΛ_{c}(2595)^- + \rm{c.c.}$, and are $43.4\pm4.0\pm4.1$ pb and $76.8\pm6.5\pm4.2$ pb for $e^+e^-\to Λ_{c}^+ \barΛ_{c}(2625)^- +\rm{c.c.}$ at $\sqrt s=4918.0$ and 4950.9 MeV, respectively. Based on the polar angle distributions of the $\barΛ_{c}(2625)^-$ and $Λ_{c}(2625)^+$, the form-factor ratios $\sqrt{|G_{E}|^2 + 3|G_{M}|^2}/|G_{C}|$ are determined for $e^+e^-\to Λ_{c}^+ \barΛ_{c}(2625)^- + \rm{c.c.}$ for the first time, which are $5.95\pm4.07\pm0.15$ and $0.94\pm0.32\pm0.02$ at $\sqrt s=4918.0$ and 4950.9 MeV, respectively. All of these first uncertainties are statistical and second systematic.
△ Less
Submitted 8 May, 2024; v1 submitted 13 December, 2023;
originally announced December 2023.
-
Search for $D^{0}\to K_{S}^{0} K^{-} e^{+}ν_{e}$, $D^{+}\to K_{S}^{0} K_{S}^{0} e^{+}ν_{e}$, and $D^{+}\to K^{+}K^{-} e^{+}ν_{e}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko
, et al. (604 additional authors not shown)
Abstract:
A search has been performed for the semileptonic decays $D^{0}\to K_{S}^{0} K^{-} e^{+}ν_{e}$, $D^{+}\to K_{S}^{0} K_{S}^{0} e^{+}ν_{e}$ and $D^{+}\to K^{+}K^{-} e^{+}ν_{e}$, using $7.9~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data collected at the center-of-mass energy $\sqrt{s}=3.773$ GeV by the BESIII detector operating at the BEPCII collider. No significant signals are observed, and upper li…
▽ More
A search has been performed for the semileptonic decays $D^{0}\to K_{S}^{0} K^{-} e^{+}ν_{e}$, $D^{+}\to K_{S}^{0} K_{S}^{0} e^{+}ν_{e}$ and $D^{+}\to K^{+}K^{-} e^{+}ν_{e}$, using $7.9~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data collected at the center-of-mass energy $\sqrt{s}=3.773$ GeV by the BESIII detector operating at the BEPCII collider. No significant signals are observed, and upper limits are set at the 90\% confidence level of $2.13\times10^{-5}$, $1.54\times10^{-5}$ and $2.10\times10^{-5}$ for the branching fractions of $D^{0}\to K_{S}^{0} K^{-} e^{+}ν_{e}$, $D^{+}\to K_{S}^{0} K_{S}^{0} e^{+}ν_{e}$ and $D^{+}\to K^{+}K^{-} e^{+}ν_{e}$, respectively.
△ Less
Submitted 10 December, 2023;
originally announced December 2023.
-
Densify Your Labels: Unsupervised Clustering with Bipartite Matching for Weakly Supervised Point Cloud Segmentation
Authors:
Shaobo Xia,
Jun Yue,
Kacper Kania,
Leyuan Fang,
Andrea Tagliasacchi,
Kwang Moo Yi,
Weiwei Sun
Abstract:
We propose a weakly supervised semantic segmentation method for point clouds that predicts "per-point" labels from just "whole-scene" annotations while achieving the performance of recent fully supervised approaches. Our core idea is to propagate the scene-level labels to each point in the point cloud by creating pseudo labels in a conservative way. Specifically, we over-segment point cloud featur…
▽ More
We propose a weakly supervised semantic segmentation method for point clouds that predicts "per-point" labels from just "whole-scene" annotations while achieving the performance of recent fully supervised approaches. Our core idea is to propagate the scene-level labels to each point in the point cloud by creating pseudo labels in a conservative way. Specifically, we over-segment point cloud features via unsupervised clustering and associate scene-level labels with clusters through bipartite matching, thus propagating scene labels only to the most relevant clusters, leaving the rest to be guided solely via unsupervised clustering. We empirically demonstrate that over-segmentation and bipartite assignment plays a crucial role. We evaluate our method on ScanNet and S3DIS datasets, outperforming state of the art, and demonstrate that we can achieve results comparable to fully supervised methods.
△ Less
Submitted 11 December, 2023;
originally announced December 2023.
-
Spherical higher order Fourier analysis over finite fields I: equidistribution for nilsequences
Authors:
Wenbo Sun
Abstract:
This paper is the first part of the series "Spherical higher order Fourier analysis over finite fields", aiming to develop the higher order Fourier analysis method along spheres over finite fields, and to solve the geometric Ramsey conjecture in the finite field setting.
In this paper, we prove a quantitative equidistribution theorem for polynomial sequences in a nilmanifold, where the average i…
▽ More
This paper is the first part of the series "Spherical higher order Fourier analysis over finite fields", aiming to develop the higher order Fourier analysis method along spheres over finite fields, and to solve the geometric Ramsey conjecture in the finite field setting.
In this paper, we prove a quantitative equidistribution theorem for polynomial sequences in a nilmanifold, where the average is taken along spheres instead of cubes. To be more precise, let $Ω\subseteq\mathbb{F}_{p}^{d}$ be a sphere. We showed that if a polynomial sequence $(g(n)Γ)_{n\inΩ}$ which is $p$-periodic along $Ω$ is not equidistributed on a nilmanifold $G/Γ$, then there exists a nontrivial horizontal character $η$ of $G/Γ$ such that $η\circ g \mod \mathbb{Z}$ vanishes on $Ω$. This result will serve as a fundamental tool in later parts of the series to proof the spherical Gowers inverse theorem and the geometric Ramsey conjecture.
△ Less
Submitted 11 December, 2023; v1 submitted 11 December, 2023;
originally announced December 2023.
-
Spherical higher order Fourier analysis over finite fields II: additive combinatorics for shifted ideals
Authors:
Wenbo Sun
Abstract:
This paper is the second part of the series "Spherical higher order Fourier analysis over finite fields", aiming to develop the higher order Fourier analysis method along spheres over finite fields, and to solve the geometric Ramsey conjecture in the finite field setting.
In this paper, we study additive combinatorial properties for shifted ideals, i.e. the structure of sets of the form…
▽ More
This paper is the second part of the series "Spherical higher order Fourier analysis over finite fields", aiming to develop the higher order Fourier analysis method along spheres over finite fields, and to solve the geometric Ramsey conjecture in the finite field setting.
In this paper, we study additive combinatorial properties for shifted ideals, i.e. the structure of sets of the form $E\pm E$, where $E$ is a collection of shifted ideals of the polynomial ring $\mathbb{F}_{p}[x_{1},\dots,x_{d}]$ and we identify two ideals if their difference contains the zero polynomial. We show that under appropriate definitions, the set $E\pm E$ enjoys properties similar to the conventional setting where $E$ is a subset of an abelian group. In particular, among other results, we prove the Balog-Gowers-Szemerédi theorem, the Rusza's quasi triangle inequality and a weak form of the Plünnecke-Rusza theorem in the setting of shifted ideals. We also show that for a special class of maps $ξ$ from $\mathbb{F}_{p}^{d}$ to the collection of all shifted ideals of $\mathbb{F}_{p}[x_{1},\dots,x_{d}]$, if the set $ξ(\mathbb{F}_{p}^{d})+ξ(\mathbb{F}_{p}^{d})$ has large additive energy, then $ξ$ is an almost linear Freiman homomorphism. This result is the crucial additive combinatorial input we need to prove the spherical Gowers inverse theorem in later parts of the series.
△ Less
Submitted 11 December, 2023; v1 submitted 11 December, 2023;
originally announced December 2023.
-
Spherical higher order Fourier analysis over finite fields IV: an application to the Geometric Ramsey Conjecture
Authors:
Wenbo Sun
Abstract:
This paper is the fourth and the last part of the series "Spherical higher order Fourier analysis over finite fields", aiming to develop the higher order Fourier analysis method along spheres over finite fields, and to solve the Geometric Ramsey Conjecture in the finite field setting.
In this paper, we proof a conjecture of Graham on the Remsey properties for spherical configurations in the fini…
▽ More
This paper is the fourth and the last part of the series "Spherical higher order Fourier analysis over finite fields", aiming to develop the higher order Fourier analysis method along spheres over finite fields, and to solve the Geometric Ramsey Conjecture in the finite field setting.
In this paper, we proof a conjecture of Graham on the Remsey properties for spherical configurations in the finite field setting. To be more precise, we show that for any spherical configuration $X$ of $\mathbb{F}_{p}^{d}$ of complexity at most $C$ with $d$ being sufficiently large with respect to $C$ and $\vert X\vert$, and for some prime $p$ being sufficiently large with respect to $C$, $\vert X\vert$ and $ε>0$, any set $E\subseteq \mathbb{F}_{p}^{d}$ with $\vert E\vert>εp^{d}$ contains at least $\gg_{C,ε,\vert X\vert}p^{(k+1)d-(k+1)k/2}$ congruent copies of $X$, where $k$ is the dimension of $\text{span}_{\mathbb{F}_{p}}(X-X)$. The novelty of our approach is that we avoid the use of harmonic analysis, and replace it by the theory of spherical higher order Fourier analysis developed in previous parts of the series.
△ Less
Submitted 11 December, 2023; v1 submitted 11 December, 2023;
originally announced December 2023.
-
Spherical higher order Fourier analysis over finite fields III: a spherical Gowers inverse theorem
Authors:
Wenbo Sun
Abstract:
This paper is the third part of the series "Spherical higher order Fourier analysis over finite fields", aiming to develop the higher order Fourier analysis method along spheres over finite fields, and to solve the geometric Ramsey conjecture in the finite field setting.
In this paper, we prove an inverse theorem over finite field for spherical Gowers norms, i.e. a local Gowers norm supported on…
▽ More
This paper is the third part of the series "Spherical higher order Fourier analysis over finite fields", aiming to develop the higher order Fourier analysis method along spheres over finite fields, and to solve the geometric Ramsey conjecture in the finite field setting.
In this paper, we prove an inverse theorem over finite field for spherical Gowers norms, i.e. a local Gowers norm supported on a sphere. We show that if the $(s+1)$-th spherical Gowers norm of a 1-bounded function $f\colon\mathbb{F}_{p}^{d}\to \mathbb{C}$ is at least $ε$ and if $d$ is sufficiently large depending only on $s$, then $f$ correlates on the sphere with a $p$-periodic $s$-step nilsequence, where the bounds for the complexity and correlation depend only on $d$ and $ε$. This result will be used in later parts of the series to prove the geometric Ramsey conjecture in the finite field setting.
△ Less
Submitted 11 December, 2023; v1 submitted 11 December, 2023;
originally announced December 2023.
-
Uncovering high-dimensional phase space and the application of Mixture of Experts (MoE) on building the Large CALPHAD Model (LCM)
Authors:
Zhengdi Liu,
Wenwen Sun
Abstract:
This study presents a novel approach for analyzing and establishing Large CALPHAD model (LCM) in complex alloy systems. Through the introduction of "composition space volume", a multi-dimensional metric which allows to quatitatively define alloy composition variations. Utilizing stochastic methods, the study quantifies phase space complexity through phase density, and model training costs through…
▽ More
This study presents a novel approach for analyzing and establishing Large CALPHAD model (LCM) in complex alloy systems. Through the introduction of "composition space volume", a multi-dimensional metric which allows to quatitatively define alloy composition variations. Utilizing stochastic methods, the study quantifies phase space complexity through phase density, and model training costs through data density. This leads to a strategic segmentation of the entire composition space, tailored to the complexity of each segment, thereby reducing computational efforts in model training. A significant advancement is the integration of segmented models using a Mixture of Experts (MoE) approach, ensuring accurate portrayal of phase behaviors across diverse composition spaces. This technique is demonstrated in establishing a high-dimensional phase diagram for the FeCoNiTi system, highlighting its efficiency and accuracy. The study's methodologies offer a systematic and cost-effective framework for modeling complex alloy systems, marking a step forward in the field of alloy design and analysis.
△ Less
Submitted 11 December, 2023;
originally announced December 2023.
-
The boundary case for the supercritical deformed Hermitian-Yang-Mills equation
Authors:
Wei Sun
Abstract:
In this paper, we shall study the weak solution to the supercritical deformed Hermitian-Yang-Mills equation in the boundary case.
In this paper, we shall study the weak solution to the supercritical deformed Hermitian-Yang-Mills equation in the boundary case.
△ Less
Submitted 11 December, 2023;
originally announced December 2023.
-
Iterative Token Evaluation and Refinement for Real-World Super-Resolution
Authors:
Chaofeng Chen,
Shangchen Zhou,
Liang Liao,
Haoning Wu,
Wenxiu Sun,
Qiong Yan,
Weisi Lin
Abstract:
Real-world image super-resolution (RWSR) is a long-standing problem as low-quality (LQ) images often have complex and unidentified degradations. Existing methods such as Generative Adversarial Networks (GANs) or continuous diffusion models present their own issues including GANs being difficult to train while continuous diffusion models requiring numerous inference steps. In this paper, we propose…
▽ More
Real-world image super-resolution (RWSR) is a long-standing problem as low-quality (LQ) images often have complex and unidentified degradations. Existing methods such as Generative Adversarial Networks (GANs) or continuous diffusion models present their own issues including GANs being difficult to train while continuous diffusion models requiring numerous inference steps. In this paper, we propose an Iterative Token Evaluation and Refinement (ITER) framework for RWSR, which utilizes a discrete diffusion model operating in the discrete token representation space, i.e., indexes of features extracted from a VQGAN codebook pre-trained with high-quality (HQ) images. We show that ITER is easier to train than GANs and more efficient than continuous diffusion models. Specifically, we divide RWSR into two sub-tasks, i.e., distortion removal and texture generation. Distortion removal involves simple HQ token prediction with LQ images, while texture generation uses a discrete diffusion model to iteratively refine the distortion removal output with a token refinement network. In particular, we propose to include a token evaluation network in the discrete diffusion process. It learns to evaluate which tokens are good restorations and helps to improve the iterative refinement results. Moreover, the evaluation network can first check status of the distortion removal output and then adaptively select total refinement steps needed, thereby maintaining a good balance between distortion removal and texture generation. Extensive experimental results show that ITER is easy to train and performs well within just 8 iterative steps. Our codes will be available publicly.
△ Less
Submitted 9 December, 2023;
originally announced December 2023.
-
Exploring the Naturalness of AI-Generated Images
Authors:
Zijian Chen,
Wei Sun,
Haoning Wu,
Zicheng Zhang,
Jun Jia,
Zhongpeng Ji,
Fengyu Sun,
Shangling Jui,
Xiongkuo Min,
Guangtao Zhai,
Wenjun Zhang
Abstract:
The proliferation of Artificial Intelligence-Generated Images (AGIs) has greatly expanded the Image Naturalness Assessment (INA) problem. Different from early definitions that mainly focus on tone-mapped images with limited distortions (e.g., exposure, contrast, and color reproduction), INA on AI-generated images is especially challenging as it has more diverse contents and could be affected by fa…
▽ More
The proliferation of Artificial Intelligence-Generated Images (AGIs) has greatly expanded the Image Naturalness Assessment (INA) problem. Different from early definitions that mainly focus on tone-mapped images with limited distortions (e.g., exposure, contrast, and color reproduction), INA on AI-generated images is especially challenging as it has more diverse contents and could be affected by factors from multiple perspectives, including low-level technical distortions and high-level rationality distortions. In this paper, we take the first step to benchmark and assess the visual naturalness of AI-generated images. First, we construct the AI-Generated Image Naturalness (AGIN) database by conducting a large-scale subjective study to collect human opinions on the overall naturalness as well as perceptions from technical and rationality perspectives. AGIN verifies that naturalness is universally and disparately affected by technical and rationality distortions. Second, we propose the Joint Objective Image Naturalness evaluaTor (JOINT), to automatically predict the naturalness of AGIs that aligns human ratings. Specifically, JOINT imitates human reasoning in naturalness evaluation by jointly learning both technical and rationality features. We demonstrate that JOINT significantly outperforms baselines for providing more subjectively consistent results on naturalness assessment.
△ Less
Submitted 4 March, 2024; v1 submitted 9 December, 2023;
originally announced December 2023.
-
Spectroscopy-Guided Discovery of Three-Dimensional Structures of Disordered Materials with Diffusion Models
Authors:
Hyuna Kwon,
Tim Hsu,
Wenyu Sun,
Wonseok Jeong,
Fikret Aydin,
James Chapman,
Xiao Chen,
Matthew R. Carbone,
Deyu Lu,
Fei Zhou,
Tuan Anh Pham
Abstract:
The ability to rapidly develop materials with desired properties has a transformative impact on a broad range of emerging technologies. In this work, we introduce a new framework based on the diffusion model, a recent generative machine learning method to predict 3D structures of disordered materials from a target property. For demonstration, we apply the model to identify the atomic structures of…
▽ More
The ability to rapidly develop materials with desired properties has a transformative impact on a broad range of emerging technologies. In this work, we introduce a new framework based on the diffusion model, a recent generative machine learning method to predict 3D structures of disordered materials from a target property. For demonstration, we apply the model to identify the atomic structures of amorphous carbons ($a$-C) as a representative material system from the target X-ray absorption near edge structure (XANES) spectra--a common experimental technique to probe atomic structures of materials. We show that conditional generation guided by XANES spectra reproduces key features of the target structures. Furthermore, we show that our model can steer the generative process to tailor atomic arrangements for a specific XANES spectrum. Finally, our generative model exhibits a remarkable scale-agnostic property, thereby enabling generation of realistic, large-scale structures through learning from a small-scale dataset (i.e., with small unit cells). Our work represents a significant stride in bridging the gap between materials characterization and atomic structure determination; in addition, it can be leveraged for materials discovery in exploring various material properties as targeted.
△ Less
Submitted 9 December, 2023;
originally announced December 2023.
-
Determination of spin-parity quantum numbers of X(2370) as $0^{-+}$ from $J/ψ\rightarrowγK^{0}_{S}K^{0}_{S}η^{\prime}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (605 additional authors not shown)
Abstract:
Based on $(10087\pm44)\times10^{6}$ $J/ψ$ events collected with the BESIII detector, a partial wave analysis of the decay $J/ψ\rightarrowγK^{0}_{S}K^{0}_{S}η^{\prime}$ is performed. The mass and width of the $X(2370)$ are measured to be $2395 \pm 11 ({\rm stat})^{+26}_{-94}({\rm syst})\ \mathrm{MeV}/c^{2}$ and $188^{+18}_{-17}({\rm stat})^{+124}_{-33}({\rm syst})~\mathrm{MeV}$, respectively. The c…
▽ More
Based on $(10087\pm44)\times10^{6}$ $J/ψ$ events collected with the BESIII detector, a partial wave analysis of the decay $J/ψ\rightarrowγK^{0}_{S}K^{0}_{S}η^{\prime}$ is performed. The mass and width of the $X(2370)$ are measured to be $2395 \pm 11 ({\rm stat})^{+26}_{-94}({\rm syst})\ \mathrm{MeV}/c^{2}$ and $188^{+18}_{-17}({\rm stat})^{+124}_{-33}({\rm syst})~\mathrm{MeV}$, respectively. The corresponding product branching fraction is $\mathcal{B}[J/ψ\rightarrowγX(2370)] \times \mathcal{B}[X(2370) \rightarrow f_{0}(980)η^{\prime}] \times \mathcal{B}[f_{0}(980) \rightarrow K^{0}_{S}K^{0}_{S}] = \left( 1.31 \pm 0.22 ({\rm stat})^{+2.85}_{-0.84}({\rm syst}) \right) \times 10^{-5}$. The statistical significance of the $X(2370)$ is greater than $11.7σ$ and the spin-parity is determined to be $0^{-+}$ for the first time. The measured mass and spin-parity of the $X(2370)$ are consistent with the predictions of the lightest pseudoscalar glueball.
△ Less
Submitted 6 May, 2024; v1 submitted 8 December, 2023;
originally announced December 2023.
-
Rethinking and Simplifying Bootstrapped Graph Latents
Authors:
Wangbin Sun,
**tang Li,
Liang Chen,
Bingzhe Wu,
Yatao Bian,
Zibin Zheng
Abstract:
Graph contrastive learning (GCL) has emerged as a representative paradigm in graph self-supervised learning, where negative samples are commonly regarded as the key to preventing model collapse and producing distinguishable representations. Recent studies have shown that GCL without negative samples can achieve state-of-the-art performance as well as scalability improvement, with bootstrapped grap…
▽ More
Graph contrastive learning (GCL) has emerged as a representative paradigm in graph self-supervised learning, where negative samples are commonly regarded as the key to preventing model collapse and producing distinguishable representations. Recent studies have shown that GCL without negative samples can achieve state-of-the-art performance as well as scalability improvement, with bootstrapped graph latent (BGRL) as a prominent step forward. However, BGRL relies on a complex architecture to maintain the ability to scatter representations, and the underlying mechanisms enabling the success remain largely unexplored. In this paper, we introduce an instance-level decorrelation perspective to tackle the aforementioned issue and leverage it as a springboard to reveal the potential unnecessary model complexity within BGRL. Based on our findings, we present SGCL, a simple yet effective GCL framework that utilizes the outputs from two consecutive iterations as positive pairs, eliminating the negative samples. SGCL only requires a single graph augmentation and a single graph encoder without additional parameters. Extensive experiments conducted on various graph benchmarks demonstrate that SGCL can achieve competitive performance with fewer parameters, lower time and space costs, and significant convergence speedup.
△ Less
Submitted 5 December, 2023;
originally announced December 2023.
-
Amplitude Analysis of the Decays $D^0\toπ^+π^-π^+π^-$ and $π^+π^-π^0π^0$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (620 additional authors not shown)
Abstract:
Using $e^+e^-$ annihilation data corresponding to an integrated luminosity of 2.93 $\rm fb^{-1}$ taken at the center-of-mass energy $\sqrt{s}=3.773$~GeV with the BESIII detector, a joint amplitude analysis is performed on the decays $D^0\toπ^+π^-π^+π^-$ and $D^0\toπ^+π^-π^0π^0$(non-$η$). The fit fractions of individual components are obtained, and large interferences among the dominant components…
▽ More
Using $e^+e^-$ annihilation data corresponding to an integrated luminosity of 2.93 $\rm fb^{-1}$ taken at the center-of-mass energy $\sqrt{s}=3.773$~GeV with the BESIII detector, a joint amplitude analysis is performed on the decays $D^0\toπ^+π^-π^+π^-$ and $D^0\toπ^+π^-π^0π^0$(non-$η$). The fit fractions of individual components are obtained, and large interferences among the dominant components of $D^{0}\to a_{1}(1260)π$, $D^{0}\toπ(1300)π$, $D^{0}\toρ(770)ρ(770)$ and $D^{0}\to2(ππ)_{S}$ are found in both channels. With the obtained amplitude model, the $CP$-even fractions of $D^0\to π^+π^-π^+π^-$ and $D^0\toπ^+π^-π^0π^0$(non-$η$) are determined to be $(75.2\pm1.1_{\rm stat.}\pm1.5_{\rm syst.})\%$ and $(68.9\pm1.5_{\rm stat.}\pm 2.4_{\rm syst.})\%$, respectively. The branching fractions of $D^0\to π^+π^-π^+π^-$ and $D^0\toπ^+π^-π^0π^0$(non-$η$) are measured to be $(0.688\pm0.010_{\rm stat.}\pm 0.010_{\rm syst.})\%$ and $(0.951\pm0.025_{\rm stat.}\pm 0.021_{\rm syst.})\%$, respectively. The amplitude analysis provides an important model for binning strategy in the measurements of the strong phase parameters of $D^0 \to 4π$ when used to determine the CKM angle $γ(φ_{3})$ via the $B^{-}\to D K^{-}$ decay.
△ Less
Submitted 3 April, 2024; v1 submitted 5 December, 2023;
originally announced December 2023.