-
Measurement of the $e^{+}e^{-} \to K_{S}^{0} K_{L}^{0} π^{0}$ cross sections from $\sqrt{s}=$ 2.000 to 3.080 GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (604 additional authors not shown)
Abstract:
Based on $e^{+}e^{-}$ collision data collected at center-of-mass energies from 2.000 to 3.080 GeV by the BESIII detector at the BEPCII collider, a partial wave analysis is performed for the process $e^{+}e^{-}\to K_{S}^{0} K_{L}^{0} π^{0}$. The results allow the Born cross sections of the process $e^{+}e^{-}\to K_{S}^{0} K_{L}^{0} π^{0}$, as well as its subprocesses $e^{+}e^{-}\to K^{*}(892)^{0}\bar{K}^{0}$ and $K^{*}_{2}(1430)^{0}\bar{K}^{0}$ to be measured. The Born cross sections for $e^{+}e^{-}\to K_{S}^{0}K_{L}^{0}π^{0}$ are consistent with previous measurements by BaBar, but with substantially improved precision. The Born cross section lineshape of the process $e^{+}e^{-}\to K^{*}(892)^{0}\bar{K}^{0}$ is consistent with a vector meson state around 2.2 GeV with a significance of 3.2$σ$. A Breit-Wigner fit determines its mass as $M_Y=(2164.7\pm9.1\pm3.1)~{\rm{MeV}}/c^{2}$ and its width as $Γ_{Y}=(32.4\pm21.0\pm1.8)~\rm{MeV}$.
Submitted 26 February, 2024; v1 submitted 25 September, 2023;
originally announced September 2023.
-
What is the Best Automated Metric for Text to Motion Generation?
Authors:
Jordan Voas,
Yili Wang,
Qixing Huang,
Raymond Mooney
Abstract:
There is growing interest in generating skeleton-based human motions from natural language descriptions. While most efforts have focused on developing better neural architectures for this task, there has been no significant work on determining the proper evaluation metric. Human evaluation is the ultimate accuracy measure for this task, and automated metrics should correlate well with human quality judgments. Since descriptions are compatible with many motions, determining the right metric is critical for evaluating and designing effective generative models. This paper systematically studies which metrics best align with human evaluations and proposes new metrics that align even better. Our findings indicate that none of the metrics currently used for this task show even a moderate correlation with human judgments on a sample level. However, for assessing average model performance, commonly used metrics such as R-Precision and less-used coordinate errors show strong correlations. Additionally, several recently developed metrics are not recommended due to their low correlation compared to alternatives. We also introduce a novel metric based on a multimodal BERT-like model, MoBERT, which offers strongly human-correlated sample-level evaluations while maintaining near-perfect model-level correlation. Our results demonstrate that this new metric exhibits extensive benefits over all current alternatives.
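The distinction this abstract draws between sample-level and model-level correlation can be illustrated with a small sketch. The scores below are invented for illustration, the model names are hypothetical, and the choice of the Pearson coefficient is an assumption; the abstract does not say which correlation measure the authors use.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Hypothetical (metric score, human rating) pairs, grouped by model.
scores = {
    "model_a": [(0.2, 3.0), (0.6, 1.0), (0.4, 2.0)],
    "model_b": [(0.5, 4.0), (0.9, 2.0), (0.7, 3.0)],
    "model_c": [(0.8, 5.0), (1.2, 3.0), (1.0, 4.0)],
}

# Sample level: correlate metric and human rating over every single sample.
all_pairs = [pair for pairs in scores.values() for pair in pairs]
sample_r = pearson([m for m, _ in all_pairs], [h for _, h in all_pairs])

# Model level: average both quantities per model, then correlate the averages.
model_metric = [sum(m for m, _ in pairs) / len(pairs) for pairs in scores.values()]
model_human = [sum(h for _, h in pairs) / len(pairs) for pairs in scores.values()]
model_r = pearson(model_metric, model_human)

print(f"sample-level r = {sample_r:.3f}")
print(f"model-level  r = {model_r:.3f}")
```

With this toy data the metric ranks the three models in the same order as the human raters while being only weakly correlated with the ratings of individual samples, which is the kind of gap the abstract describes.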
Submitted 18 September, 2023;
originally announced September 2023.
-
GAME: Generalized deep learning model towards multimodal data integration for early screening of adolescent mental disorders
Authors:
Zhicheng Du,
Chenyao Jiang,
Xi Yuan,
Shiyao Zhai,
Zhengyang Lei,
Shuyue Ma,
Yang Liu,
Qihui Ye,
Chufan Xiao,
Qiming Huang,
Ming Xu,
Dongmei Yu,
Peiwu Qin
Abstract:
The timely identification of mental disorders in adolescents is a global public health challenge. A single factor can hardly detect an abnormality, given its complex and subtle nature. Additionally, generalized multimodal Computer-Aided Screening (CAS) systems with interactive robots for adolescent mental disorders are not available. Here, we design an Android application with mini-games and chat recording, deployed in a portable robot, to screen 3,783 middle school students and construct a multimodal screening dataset that includes facial images, physiological signs, voice recordings, and textual transcripts. We develop a model called GAME (Generalized Model with Attention and Multimodal EmbraceNet) with a novel attention mechanism that integrates cross-modal features into the model. GAME evaluates adolescent mental conditions with high accuracy (73.34%-92.77%) and F1-Score (71.32%-91.06%). We find that each modality contributes dynamically to mental disorder screening and to comorbidities among various mental disorders, indicating the feasibility of an explainable model. This study provides a system capable of acquiring multimodal information and constructs a generalized multimodal integration algorithm with novel attention mechanisms for the early screening of adolescent mental disorders.
Submitted 18 September, 2023;
originally announced September 2023.
-
MindAgent: Emergent Gaming Interaction
Authors:
Ran Gong,
Qiuyuan Huang,
Xiaojian Ma,
Hoi Vo,
Zane Durante,
Yusuke Noda,
Zilong Zheng,
Song-Chun Zhu,
Demetri Terzopoulos,
Li Fei-Fei,
Jianfeng Gao
Abstract:
Large Language Models (LLMs) have the capacity to perform complex scheduling in a multi-agent system and can coordinate these agents to complete sophisticated tasks that require extensive collaboration. However, despite the introduction of numerous gaming frameworks, the community has insufficient benchmarks for building general multi-agent collaboration infrastructure that encompasses both LLM and human-NPC collaboration. In this work, we propose a novel infrastructure - MindAgent - to evaluate emergent planning and coordination capabilities for gaming interaction. In particular, our infrastructure leverages an existing gaming framework to i) require understanding of the coordinator for a multi-agent system, ii) collaborate with human players via un-finetuned proper instructions, and iii) establish in-context learning on few-shot prompts with feedback. Furthermore, we introduce CUISINEWORLD, a new gaming scenario and related benchmark that dispatches a multi-agent collaboration efficiency and supervises multiple agents playing the game simultaneously. We conduct comprehensive evaluations with a new auto-metric, CoS, for calculating the collaboration efficiency. Finally, our infrastructure can be deployed into real-world gaming scenarios in a customized VR version of CUISINEWORLD and adapted to the existing, broader Minecraft gaming domain. We hope our findings on LLMs and the new infrastructure for general-purpose scheduling and coordination can help shed light on how such skills can be obtained by learning from large language corpora.
Submitted 19 September, 2023; v1 submitted 18 September, 2023;
originally announced September 2023.
-
Architecture-Aware Synthesis of Stabilizer Circuits from Clifford Tableaus
Authors:
David Winderl,
Qunsheng Huang,
Arianne Meijer-van de Griend,
Richie Yeung
Abstract:
Since quantum computing is currently in the NISQ era, compilation strategies that reduce the number of gates executed on specific hardware are required. In this work, we utilize the concept of synthesis of a data structure called Clifford tableaus, focusing on applying CNOTs within the respective connectivity graph of the quantum device. We hence contribute to the field of compilation or, more precisely, synthesis by reducing the number of CNOTs in the synthesized quantum circuit. Upon convergence, our method is shown to outperform other state-of-the-art synthesis techniques when executed with respect to a specific hardware. Upon executing the resulting circuits on real hardware, our synthesized circuits tend to increase the final fidelity and reduce the overall execution times.
Submitted 19 September, 2023; v1 submitted 16 September, 2023;
originally announced September 2023.
-
Retrieval-Augmented Text-to-Audio Generation
Authors:
Yi Yuan,
Haohe Liu,
Xubo Liu,
Qiushi Huang,
Mark D. Plumbley,
Wenwu Wang
Abstract:
Despite recent progress in text-to-audio (TTA) generation, we show that the state-of-the-art models, such as AudioLDM, trained on datasets with an imbalanced class distribution, such as AudioCaps, are biased in their generation performance. Specifically, they excel in generating common audio classes while underperforming in the rare ones, thus degrading the overall generation performance. We refer to this problem as long-tailed text-to-audio generation. To address this issue, we propose a simple retrieval-augmented approach for TTA models. Specifically, given an input text prompt, we first leverage a Contrastive Language Audio Pretraining (CLAP) model to retrieve relevant text-audio pairs. The features of the retrieved audio-text data are then used as additional conditions to guide the learning of TTA models. We enhance AudioLDM with our proposed approach and denote the resulting augmented system as Re-AudioLDM. On the AudioCaps dataset, Re-AudioLDM achieves a state-of-the-art Frechet Audio Distance (FAD) of 1.37, outperforming the existing approaches by a large margin. Furthermore, we show that Re-AudioLDM can generate realistic audio for complex scenes, rare audio classes, and even unseen audio types, indicating its potential in TTA tasks.
Submitted 5 January, 2024; v1 submitted 14 September, 2023;
originally announced September 2023.
-
UnifiedGesture: A Unified Gesture Synthesis Model for Multiple Skeletons
Authors:
Sicheng Yang,
Zilin Wang,
Zhiyong Wu,
Minglei Li,
Zhensong Zhang,
Qiaochu Huang,
Lei Hao,
Songcen Xu,
Xiaofei Wu,
Changpeng Yang,
Zonghong Dai
Abstract:
The automatic co-speech gesture generation draws much attention in computer animation. Previous works designed network structures on individual datasets, which resulted in a lack of data volume and generalizability across different motion capture standards. In addition, it is a challenging task due to the weak correlation between speech and gestures. To address these problems, we present UnifiedGesture, a novel diffusion model-based speech-driven gesture synthesis approach, trained on multiple gesture datasets with different skeletons. Specifically, we first present a retargeting network to learn latent homeomorphic graphs for different motion capture standards, unifying the representations of various gestures while extending the dataset. We then capture the correlation between speech and gestures based on a diffusion model architecture using cross-local attention and self-attention to generate better speech-matched and realistic gestures. To further align speech and gesture and increase diversity, we incorporate reinforcement learning on the discrete gesture units with a learned reward function. Extensive experiments show that UnifiedGesture outperforms recent approaches on speech-driven gesture generation in terms of CCA, FGD, and human-likeness. All code, pre-trained models, databases, and demos are available to the public at https://github.com/YoungSeng/UnifiedGesture.
Submitted 13 September, 2023;
originally announced September 2023.
-
Measurements of the absolute branching fractions of $Ω^-$ decays and test of the $ΔI = 1/2$ rule
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (599 additional authors not shown)
Abstract:
Based on a data set of $(27.12\pm0.10)\times 10^8$ $ψ(3686)$ events collected at the BESIII experiment, the absolute branching fractions of the three dominant $Ω^-$ decays are measured to be $\mathcal{B}_{Ω^- \to Ξ^0 π^-} = (25.03\pm0.44\pm0.53)\%$, $\mathcal{B}_{Ω^- \to Ξ^- π^0} = (8.43\pm0.52\pm0.28)\%$, and $\mathcal{B}_{Ω^- \to ΛK^-} = (66.3\pm0.8\pm2.0)\%$, where the first and second uncertainties are statistical and systematic, respectively. The ratio between $\mathcal{B}_{Ω^- \to Ξ^0 π^-}$ and $\mathcal{B}_{Ω^- \to Ξ^- π^0}$ is determined to be $2.97\pm0.19\pm0.11$, which is in good agreement with the PDG value of $2.74\pm0.15$, but greater by more than four standard deviations than the theoretical prediction of 2 obtained from the $ΔI = 1/2$ rule.
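As a rough cross-check of the numbers quoted in this abstract, the sketch below recomputes the central value of the ratio and its distance from the $ΔI = 1/2$ prediction. It assumes the statistical and systematic uncertainties on the ratio are independent and can be added in quadrature; the abstract does not state how the collaboration derives its significance, so this is an illustration rather than the paper's procedure.

```python
from math import sqrt

# Branching fractions quoted in the abstract, in percent.
b_xi0_pim = 25.03  # B(Omega- -> Xi0 pi-)
b_xim_pi0 = 8.43   # B(Omega- -> Xi- pi0)

# Central value of the ratio from the two quoted branching fractions.
ratio = b_xi0_pim / b_xim_pi0

# Quoted ratio and its uncertainties: 2.97 +/- 0.19 (stat.) +/- 0.11 (syst.).
quoted_ratio, stat, syst = 2.97, 0.19, 0.11
total = sqrt(stat**2 + syst**2)  # assumption: independent, added in quadrature

# Distance from the Delta I = 1/2 rule prediction of 2, in units of total.
pull_theory = (quoted_ratio - 2.0) / total

# Distance from the PDG value 2.74 +/- 0.15, combining both uncertainties.
pdg, pdg_err = 2.74, 0.15
pull_pdg = (quoted_ratio - pdg) / sqrt(total**2 + pdg_err**2)

print(f"ratio from branching fractions: {ratio:.2f}")
print(f"deviation from Delta I = 1/2 rule: {pull_theory:.1f} sigma")
print(f"deviation from PDG value: {pull_pdg:.1f} sigma")
```

Under these assumptions the ratio sits a little over four standard deviations above 2 and within one standard deviation of the PDG value, in line with the wording of the abstract.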
Submitted 12 September, 2023;
originally announced September 2023.
-
Integration of Quantum Accelerators with High Performance Computing -- A Review of Quantum Programming Tools
Authors:
Amr Elsharkawy,
Xiao-Ting Michelle To,
Philipp Seitz,
Yanbin Chen,
Yannick Stade,
Manuel Geiger,
Qunsheng Huang,
Xiaorang Guo,
Muhammad Arslan Ansari,
Christian B. Mendl,
Dieter Kranzlmüller,
Martin Schulz
Abstract:
Quantum computing (QC) introduces a novel mode of computation with the possibility of greater computational power that remains to be exploited - presenting exciting opportunities for high performance computing (HPC) applications. However, recent advancements in the field have made clear that QC does not supplant conventional HPC, but can rather be incorporated into current heterogeneous HPC infrastructures as an additional accelerator, thereby enabling the optimal utilization of both paradigms. The desire for such integration significantly affects the development of software for quantum computers, which in turn influences the necessary software infrastructure. To date, previous review papers have investigated various quantum programming tools (QPTs) (such as languages, libraries, frameworks) in their ability to program, compile, and execute quantum circuits. However, the integration effort with classical HPC frameworks or systems has not been addressed. This study aims to characterize existing QPTs from an HPC perspective, investigating if existing QPTs have the potential to be efficiently integrated with classical computing models and determining where work is still required. This work structures a set of criteria into an analysis blueprint that enables HPC scientists to assess whether a QPT is suitable for the quantum-accelerated classical application at hand.
Submitted 18 September, 2023; v1 submitted 12 September, 2023;
originally announced September 2023.
-
Observation of $D^{+}\to K_{S}^{0}a_{0}(980)^{+}$ in the amplitude analysis of $D^{+} \to K_{S}^{0}π^+η$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (604 additional authors not shown)
Abstract:
We perform for the first time an amplitude analysis of the decay $D^{+}\to K_{S}^{0}π^+η$ and report the observation of the decay $D^{+}\to K_{S}^{0}a_{0}(980)^{+}$ using 2.93 fb$^{-1}$ of $e^+e^-$ collision data taken at a center-of-mass energy of 3.773 GeV with the BESIII detector. As the only W-annihilation free decay among $D$ to $a_{0}(980)$-pseudoscalar, $D^{+}\to K_{S}^{0}a_{0}(980)^{+}$ is the ideal decay to extract the contributions of the external and internal $W$-emission amplitudes involving $a_{0}(980)$ and study the final-state interactions. The absolute branching fraction of $D^{+}\to K_{S}^{0}π^+η$ is measured to be $(1.27\pm0.04_{\rm stat.}\pm0.03_{\rm syst.})\%$. The product branching fractions of $D^{+}\to K_{S}^{0}a_{0}(980)^{+}$ with $a_{0}(980)^{+}\to π^+η$ and $D^{+}\to π^+ K_0^*(1430)^0$ with $K_0^*(1430)^0\to K_{S}^{0}η$ are measured to be $(1.33\pm0.05_{\rm stat.}\pm0.04_{\rm syst.})\%$ and $(0.14\pm0.03_{\rm stat.}\pm0.01_{\rm syst.})\%$, respectively.
Submitted 29 March, 2024; v1 submitted 11 September, 2023;
originally announced September 2023.
-
Observation of the Singly Cabibbo-Suppressed Decay $Λ_{c}^{+}\to Σ^{-}K^{+}π^{+}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (605 additional authors not shown)
Abstract:
The singly Cabibbo-suppressed decay $Λ_{c}^{+}\to Σ^{-}K^{+}π^{+}$ is observed for the first time with a statistical significance of $6.4σ$ by using 4.5 fb$^{-1}$ of $e^+e^-$ collision data collected at center-of-mass energies between 4.600 and 4.699 GeV with the BESIII detector at BEPCII. The absolute branching fraction of $Λ_{c}^{+}\to Σ^{-}K^{+}π^{+}$ is measured to be $(3.8\pm1.3_{\rm stat}\pm0.2_{\rm syst})\times 10^{-4}$ in a model-independent approach. This is the first observation of a Cabibbo-suppressed $Λ_{c}^{+}$ decay involving $Σ^-$ in the final state. The ratio of branching fractions between $Λ_{c}^{+}\to Σ^{-}K^{+}π^{+}$ and the Cabibbo-favored decay $Λ_{c}^{+}\to Σ^- π^+π^+$ is calculated to be $(0.4 \pm 0.1)s_{c}^{2}$, where $s_{c} \equiv \sinθ_c = 0.2248$ with $θ_c$ the Cabibbo mixing angle. This ratio significantly deviates from $1.0s_{c}^{2}$ and provides important information for the understanding of nonfactorization contributions in $Λ_{c}^{+}$ decays.
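To put the quoted ratio $(0.4 \pm 0.1)s_{c}^{2}$ into absolute terms, the sketch below evaluates it with $s_{c} = 0.2248$ and compares it with the $1.0s_{c}^{2}$ reference mentioned in the abstract. It treats the quoted $\pm 0.1$ as a single Gaussian uncertainty, which is an assumption made here for illustration; the coefficient is also rounded to one digit in the abstract, so the result is only indicative.

```python
# Sine of the Cabibbo angle as quoted in the abstract.
s_c = 0.2248
s_c2 = s_c**2

# Measured ratio of branching fractions in units of s_c^2.
coeff, coeff_err = 0.4, 0.1

ratio = coeff * s_c2          # absolute value of the ratio
ratio_err = coeff_err * s_c2  # absolute uncertainty on the ratio
reference = 1.0 * s_c2        # the 1.0 * s_c^2 reference value

# Distance between measurement and reference in units of the quoted
# uncertainty (assumption: the +/- 0.1 is a Gaussian total uncertainty).
deviation = (1.0 - coeff) / coeff_err

print(f"s_c^2     = {s_c2:.4f}")
print(f"ratio     = {ratio:.4f} +/- {ratio_err:.4f}")
print(f"reference = {reference:.4f}")
print(f"deviation = {deviation:.1f} sigma")
```

On this reading the measured ratio is about 0.020, against a reference of about 0.051.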
Submitted 8 May, 2024; v1 submitted 11 September, 2023;
originally announced September 2023.
-
Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization
Authors:
Yang Jin,
Kun Xu,
Kun Xu,
Liwei Chen,
Chao Liao,
Jianchao Tan,
Quzhe Huang,
Bin Chen,
Chenyi Lei,
An Liu,
Chengru Song,
Xiaoqiang Lei,
Di Zhang,
Wenwu Ou,
Kun Gai,
Yadong Mu
Abstract:
Recently, the remarkable advance of the Large Language Model (LLM) has inspired researchers to transfer its extraordinary reasoning capability to both vision and language data. However, the prevailing approaches primarily regard the visual input as a prompt and focus exclusively on optimizing the text generation process conditioned upon vision content by a frozen LLM. Such an inequitable treatment of vision and language heavily constrains the model's potential. In this paper, we break through this limitation by representing both vision and language in a unified form. Specifically, we introduce a well-designed visual tokenizer to translate the non-linguistic image into a sequence of discrete tokens like a foreign language that the LLM can read. The resulting visual tokens encompass high-level semantics worthy of a word and also support dynamic sequence length varying from the image. Coupled with this tokenizer, the presented foundation model called LaVIT can handle both image and text indiscriminately under the same generative learning paradigm. This unification empowers LaVIT to serve as an impressive generalist interface to understand and generate multi-modal content simultaneously. Extensive experiments further showcase that it outperforms the existing models by a large margin on massive vision-language tasks. Our code and models are available at https://github.com/jy0205/LaVIT.
Submitted 22 March, 2024; v1 submitted 8 September, 2023;
originally announced September 2023.
-
Measurement of the cross section of $e^+e^-\rightarrowΞ^{-}\barΞ^{+}$ at center-of-mass energies between 3.510 and 4.843 GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (599 additional authors not shown)
Abstract:
Using $e^+e^-$ collision data corresponding to a total integrated luminosity of 12.9 fb$^{-1}$ collected with the BESIII detector at the BEPCII collider, the exclusive Born cross sections and the effective form factors of the reaction $e^+e^-\rightarrowΞ^{-}\barΞ^{+}$ are measured via the single baryon-tag method at 23 center-of-mass energies between 3.510 and 4.843 GeV. Evidence for the decay $ψ(3770)\rightarrowΞ^{-}\barΞ^{+}$ is observed with a significance of 4.5$σ$ by analyzing the measured cross sections together with earlier BESIII results. For the other charmonium(-like) states $ψ(4040)$, $ψ(4160)$, $Y(4230)$, $Y(4360)$, $ψ(4415)$, and $Y(4660)$, no significant signal of their decay to $Ξ^-\bar Ξ^+$ is found. For these states, upper limits of the products of the branching fraction and the electronic partial width at the 90% confidence level are provided.
Submitted 30 November, 2023; v1 submitted 8 September, 2023;
originally announced September 2023.
-
Novel method to extract the femtometer structure of strange baryons using the vacuum polarization effect
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
M. Albrecht,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
R. Baldini Ferroli,
I. Balossino,
Y. Ban,
V. Batozskaya,
D. Becker,
K. Begzsuren,
N. Berger,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
J. Bloms,
A. Bortone,
I. Boyko
, et al. (560 additional authors not shown)
Abstract:
One of the fundamental goals of particle physics is to gain microscopic understanding of the strong interaction. Electromagnetic form factors quantify the structure of hadrons in terms of charge and magnetization distributions. While the nucleon structure has been investigated extensively, data on hyperons is still scarce. It has recently been demonstrated that electron-positron annihilations into hyperon-antihyperon pairs provide a powerful tool to investigate their inner structure. We present a novel method useful for hyperon-antihyperon pairs of different types which exploits the cross section enhancement due to the vacuum polarization effect at the $J/ψ$ resonance. Using the 10 billion $J/ψ$ events collected with the BESIII detector, this allows a thorough determination of the hyperon structure. The result is essentially a precise snapshot of a $\barΛΣ^0$~($Λ\barΣ^0$) pair in the making, encoded in the form factor ratio and the phase. Their values are measured to be $R = 0.860\pm0.029({\rm stat.})\pm0.010({\rm syst.})$, $ΔΦ_1=(1.011\pm0.094({\rm stat.})\pm0.010({\rm syst.}))~\rm rad$ for $\barΛΣ^0$ and $ΔΦ_2=(2.128\pm0.094({\rm stat.})\pm0.010({\rm syst.}))~\rm rad$ for $Λ\barΣ^0$, respectively. Furthermore, charge-parity (CP) breaking is investigated for the first time in this reaction and found to be consistent with CP symmetry.
Submitted 8 September, 2023;
originally announced September 2023.
-
Search for the semileptonic decays $D^+_s \to K_1(1270)^0 e^+ν_e$ and $D^+_s \to b_1(1235)^0 e^+ν_e$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (601 additional authors not shown)
Abstract:
By analyzing 7.33\,fb$^{-1}$ of $e^+e^-$ collision data collected at center-of-mass energies between 4.128 and 4.226 GeV with the BESIII detector, we search for the semileptonic decays $D^+_s \to K_1(1270)^0 e^+ν_e$ and $D^+_s \to b_1(1235)^0 e^+ν_e$ for the first time. No significant signals are observed for either decay mode. The upper limits on the (product) branching fractions are determined to be ${\mathcal B}[D^+_s \to K_1(1270)^0 e^+ν_e] < 4.1\times 10^{-4}$ and ${\mathcal B}[D^+_s \to b_1(1235)^0 e^+ν_e]\cdot {\mathcal B}[b_1(1235)^0\to ωπ^0] < 6.4\times 10^{-4}$ at 90\% confidence level.
Submitted 7 September, 2023;
originally announced September 2023.
-
TSGBench: Time Series Generation Benchmark
Authors:
Yihao Ang,
Qiang Huang,
Yifan Bao,
Anthony K. H. Tung,
Zhiyong Huang
Abstract:
Synthetic Time Series Generation (TSG) is crucial in a range of applications, including data augmentation, anomaly detection, and privacy preservation. Although significant strides have been made in this field, existing methods exhibit three key limitations: (1) They often benchmark against similar model types, constraining a holistic view of performance capabilities. (2) The use of specialized synthetic and private datasets introduces biases and hampers generalizability. (3) Ambiguous evaluation measures, often tied to custom networks or downstream tasks, hinder consistent and fair comparison.
To overcome these limitations, we introduce \textsf{TSGBench}, the inaugural Time Series Generation Benchmark, designed for a unified and comprehensive assessment of TSG methods. It comprises three modules: (1) a curated collection of publicly available, real-world datasets tailored for TSG, together with a standardized preprocessing pipeline; (2) a comprehensive evaluation measures suite including vanilla measures, new distance-based assessments, and visualization tools; (3) a pioneering generalization test rooted in Domain Adaptation (DA), compatible with all methods. We have conducted comprehensive experiments using \textsf{TSGBench} across a spectrum of ten real-world datasets from diverse domains, utilizing ten advanced TSG methods and twelve evaluation measures. The results highlight the reliability and efficacy of \textsf{TSGBench} in evaluating TSG methods. Crucially, \textsf{TSGBench} delivers a statistical analysis of the performance rankings of these methods, illuminating their varying performance across different datasets and measures and offering nuanced insights into the effectiveness of each method.
Submitted 7 December, 2023; v1 submitted 7 September, 2023;
originally announced September 2023.
-
First Measurement of the Decay Asymmetry in the pure W-boson-exchange Decay $Λ_{c}^{+}\toΞ^{0}K^{+}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (618 additional authors not shown)
Abstract:
Based on $4.4~\text{fb}^{-1}$ of $e^{+}e^{-}$ annihilation data collected at the center-of-mass energies between $4.60$ and $4.70~\text{GeV}$ with the BESIII detector at the BEPCII collider, the pure \textit{W}-boson-exchange decay $Λ_{c}^{+}\toΞ^{0}K^{+}$ is studied with a full angular analysis. The corresponding decay asymmetry is measured for the first time to be $α_{Ξ^{0}K^{+}}=0.01\pm0.16({\rm stat.})\pm0.03({\rm syst.})$. This result reflects the non-interference effect between the $S$- and $P$-wave amplitudes. The phase shift between $S$- and $P$-wave amplitudes has two solutions, which are $δ_{p}-δ_{s}=-1.55\pm0.25({\rm stat.})\pm0.05({\rm syst.})~\text{rad}$ or $1.59\pm0.25({\rm stat.})\pm0.05({\rm syst.})~\text{rad}$.
Submitted 20 January, 2024; v1 submitted 6 September, 2023;
originally announced September 2023.
-
A coupled-channel analysis of the $X(3872)$ lineshape with BESIII data
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (600 additional authors not shown)
Abstract:
We perform a study of the $X(3872)$ lineshape using the data samples of $e^+e^-\toγX(3872)$, $X(3872)\to D^0\bar{D}^0 π^0$ and $π^+π^- J/ψ$ collected with the BESIII detector. The effects of the coupled-channels and the off-shell $D^{*0}$ are included in the parameterization of the lineshape. The lineshape mass parameter is obtained to be $M_{X}=(3871.63\pm 0.13^{+0.06}_{-0.05})$ MeV. Two poles are found on the first and second Riemann sheets corresponding to the $D^{*0}\bar{D}^0$ branch cut. The pole location on the first sheet is much closer to the $D^{*0}\bar{D}^0$ threshold than the other, and is determined to be $7.04\pm0.15^{+0.07}_{-0.08}$ MeV above the $D^0\bar{D}^0π^0$ threshold with an imaginary part $-0.19\pm0.08^{+0.14}_{-0.19}$ MeV.
Submitted 4 September, 2023;
originally announced September 2023.
-
When Measures are Unreliable: Imperceptible Adversarial Perturbations toward Top-$k$ Multi-Label Learning
Authors:
Yuchen Sun,
Qianqian Xu,
Zitai Wang,
Qingming Huang
Abstract:
With the great success of deep neural networks, adversarial learning has received widespread attention in various studies, ranging from multi-class learning to multi-label learning. However, existing adversarial attacks toward multi-label learning only pursue the traditional visual imperceptibility but ignore the new perceptibility problem arising from measures such as Precision@$k$ and mAP@$k$. Specifically, when a well-trained multi-label classifier performs far below expectation on some samples, the victim can easily realize that this performance degradation stems from an attack rather than from the model itself. Therefore, an ideal multi-label adversarial attack should manage not only to deceive visual perception but also to evade monitoring by measures. To this end, this paper first proposes the concept of measure imperceptibility. Then, a novel loss function is devised to generate adversarial perturbations that achieve both visual and measure imperceptibility. Furthermore, an efficient algorithm, which enjoys a convex objective, is established to optimize this objective. Finally, extensive experiments on large-scale benchmark datasets, such as PASCAL VOC 2012, MS COCO, and NUS WIDE, demonstrate the superiority of our proposed method in attacking the top-$k$ multi-label systems.
Submitted 5 September, 2023; v1 submitted 27 July, 2023;
originally announced September 2023.
-
FedDD: Toward Communication-efficient Federated Learning with Differential Parameter Dropout
Authors:
Zhiying Feng,
Xu Chen,
Qiong Wu,
Wen Wu,
Xiaoxi Zhang,
Qianyi Huang
Abstract:
Federated Learning (FL) requires frequent exchange of model parameters, which leads to long communication delays, especially when the network environments of clients vary greatly. Moreover, the parameter server needs to wait for the slowest client (i.e., the straggler, which may have the largest model size, lowest computing capability or worst network condition) to upload parameters, which may significantly degrade the communication efficiency. Commonly used client selection methods such as partial client selection waste computing resources and weaken the generalization of the global model. To tackle this problem along a different line, in this paper we advocate model parameter dropout instead of client selection, and accordingly propose a novel framework, the Federated learning scheme with Differential parameter Dropout (FedDD). FedDD consists of two key modules: dropout rate allocation and uploaded parameter selection, which optimize the model parameter uploading ratios tailored to different clients' heterogeneous conditions and select the proper set of important model parameters for uploading, subject to the clients' dropout rate constraints. Specifically, the dropout rate allocation is formulated as a convex optimization problem, taking system heterogeneity, data heterogeneity, and model heterogeneity among clients into consideration. The uploaded parameter selection strategy prioritizes important parameters for uploading to speed up convergence. Furthermore, we theoretically analyze the convergence of the proposed FedDD scheme. Extensive performance evaluations demonstrate that the proposed FedDD scheme achieves outstanding performance in both communication efficiency and model convergence, and also possesses a strong generalization capability to data of rare classes.
Submitted 1 September, 2023; v1 submitted 31 August, 2023;
originally announced August 2023.
-
Towards Spontaneous Style Modeling with Semi-supervised Pre-training for Conversational Text-to-Speech Synthesis
Authors:
Weiqin Li,
Shun Lei,
Qiaochu Huang,
Yixuan Zhou,
Zhiyong Wu,
Shiyin Kang,
Helen Meng
Abstract:
The spontaneous behavior that often occurs in conversations makes speech more human-like than reading-style speech. However, synthesizing spontaneous-style speech is challenging due to the lack of high-quality spontaneous datasets and the high cost of labeling spontaneous behavior. In this paper, we propose a semi-supervised pre-training method to increase the amount of spontaneous-style speech and spontaneous behavior labels. In the process of semi-supervised learning, both text and speech information are considered for detecting spontaneous behavior labels in speech. Moreover, a linguistic-aware encoder is used to model the relationship between each sentence in the conversation. Experimental results indicate that our proposed method achieves superior expressive speech synthesis performance, with the ability to model spontaneous behavior in spontaneous-style speech and predict reasonable spontaneous behavior from text.
Submitted 31 August, 2023;
originally announced August 2023.
-
Prompt-enhanced Hierarchical Transformer Elevating Cardiopulmonary Resuscitation Instruction via Temporal Action Segmentation
Authors:
Yang Liu,
Xiaoyun Zhong,
Shiyao Zhai,
Zhicheng Du,
Zhenyuan Gao,
Qiming Huang,
Canyang Zhang,
Bin Jiang,
Vijay Kumar Pandey,
Sanyang Han,
Runming Wang,
Yuxing Han,
Peiwu Qin
Abstract:
The vast majority of people who suffer unexpected cardiac arrest receive cardiopulmonary resuscitation (CPR) from passersby in a desperate attempt to restore life, but these efforts turn out to be fruitless because the rescuers are not properly qualified. Fortunately, many studies show that disciplined training helps raise the success rate of resuscitation, and such training continually calls for a seamless combination of novel techniques to yield further advancement. To this end, we collect a custom CPR video dataset in which trainees independently perform resuscitation on mannequins in adherence to approved guidelines, and we devise an auxiliary toolbox to assist supervision and correction of potential intermediate issues via modern deep learning methodologies. Our research treats this problem as a temporal action segmentation (TAS) task in computer vision, which aims to segment an untrimmed video at a frame-wise level. Here, we propose a Prompt-enhanced hierarchical Transformer (PhiTrans) that integrates three indispensable modules: a textual prompt-based Video Features Extractor (VFE), a transformer-based Action Segmentation Executor (ASE), and a regression-based Prediction Refinement Calibrator (PRC). The backbone of the model is first developed on three established public datasets (GTEA, 50Salads, and Breakfast) collected for TAS tasks, which lays the groundwork for the segmentation pipeline on the CPR dataset. Overall, we make a first exploration of a feasible pipeline that genuinely improves CPR instruction via action segmentation in conjunction with cutting-edge deep learning techniques. Associated experiments support our implementation, with multiple metrics surpassing 91.0%.
Submitted 31 August, 2023;
originally announced August 2023.
-
BenchTemp: A General Benchmark for Evaluating Temporal Graph Neural Networks
Authors:
Qiang Huang,
Jiawei Jiang,
Xi Susie Rao,
Ce Zhang,
Zhichao Han,
Zitao Zhang,
Xin Wang,
Yongjun He,
Quanqing Xu,
Yang Zhao,
Chuang Hu,
Shuo Shang,
Bo Du
Abstract:
To handle graphs in which features or connectivities evolve over time, a series of temporal graph neural networks (TGNNs) have been proposed. Despite the success of these TGNNs, previous TGNN evaluations reveal limitations regarding four critical issues: 1) inconsistent datasets, 2) inconsistent evaluation pipelines, 3) lack of workload diversity, and 4) lack of efficiency comparison. Overall, an empirical study that puts TGNN models on the same ground and compares them comprehensively is still lacking. To this end, we propose BenchTemp, a general benchmark for evaluating TGNN models on various workloads. BenchTemp provides a set of benchmark datasets so that different TGNN models can be fairly compared. Further, BenchTemp engineers a standard pipeline that unifies the TGNN evaluation. With BenchTemp, we extensively compare the representative TGNN models on different tasks (e.g., link prediction and node classification) and settings (transductive and inductive), w.r.t. both effectiveness and efficiency metrics. We have made BenchTemp publicly available at https://github.com/qianghuangwhu/benchtemp.
Submitted 30 August, 2023;
originally announced August 2023.
-
Observation of a vector charmoniumlike state at 4.7 ${\rm GeV}/c^2$ and search for $Z_{cs}$ in $e^+e^-\to K^+K^-J/ψ$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (599 additional authors not shown)
Abstract:
Using data samples with an integrated luminosity of 5.85~fb$^{-1}$ collected at center-of-mass energies from 4.61 to 4.95 GeV with the BESIII detector operating at the BEPCII storage ring, we measure the cross section for the process $e^+e^-\to K^+K^-J/ψ$. A new resonance with a mass of $M = 4708_{-15}^{+17}\pm21$ MeV/$c^{2}$ and a width of $Γ= 126_{-23}^{+27}\pm30$ MeV is observed in the energy-dependent line shape of the $e^+e^-\to K^+K^-J/ψ$ cross section with a significance over $5σ$. The $K^{+}J/ψ$ system is also investigated to search for charged charmoniumlike states, but no significant $Z_{cs}^+$ states are observed. Upper limits on the Born cross sections for $e^+e^-\to K^{-} Z_{cs}(3985)^{+}/K^{-} Z_{cs}(4000)^{+} + c.c.$ with $Z_{cs}(3985)^{\pm}/Z_{cs}(4000)^{\pm}\to K^{\pm} J/ψ$ are reported at 90\% confidence levels. The ratio of branching fractions $\frac{\mathcal{B}(Z_{cs}(3985)^{+}\to K^+ J/ψ)}{\mathcal{B}(Z_{cs}(3985)^{+}\to (\bar{D}^{0}D_s^{*+} + \bar{D}^{*0}D_s^+))}$ is measured to be less than 0.03 at 90\% confidence level.
Submitted 24 November, 2023; v1 submitted 29 August, 2023;
originally announced August 2023.
-
Study of excited $Ξ$ states in $ψ(3686)\rightarrow{}K^{-}Λ\overlineΞ^{+}+c.c.$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
J. Bloms,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (587 additional authors not shown)
Abstract:
Based on a sample of $(448.1\pm2.9)\times10^{6}$ $ψ(3686)$ events collected with the BESIII detector at BEPCII, the decays of $ψ(3686)\to{}K^{-}Λ\overlineΞ^{+} + c.c.$ with $\overlineΞ^+ \to \overlineΛ π^+$, $\overlineΛ\to \overline{p} π^+$ are studied. Two excited hyperons, $Ξ(1690)^-$ and $Ξ(1820)^-$, are observed with large significance ($ \gg 10 σ$) in the $K^{-}Λ$ invariant mass distributions. A partial wave analysis is performed, and the spin-parities of $Ξ(1690)^-$ and $Ξ(1820)^-$ are determined to be $\frac{1}{2}^{-}$ and $\frac{3}{2}^{-}$, respectively. The masses, widths, and product branching fractions of $Ξ(1690)^-$ and $Ξ(1820)^-$ are also measured.
Submitted 28 April, 2024; v1 submitted 29 August, 2023;
originally announced August 2023.
-
Search for the light hadron decay $χ_{c1}(3872) \to π^{+}π^{-}η$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (600 additional authors not shown)
Abstract:
With a data sample corresponding to an integrated luminosity of 11.5~fb$^{-1}$ collected with the BESIII detector operating at the BEPCII storage ring, for the first time the light hadron decay $χ_{c1}(3872) \rightarrow π^{+}π^{-}η$ is searched for. While no significant signal is observed, the upper limits at the 90\% confidence level for $σ[e^{+}e^{-} \rightarrow γχ_{c1}(3872)] \mathcal{B}[χ_{c1}(3872) \rightarrow π^{+}π^{-}η]$ at center-of-mass energies from 4.13 to 4.34 GeV are determined. By normalizing to the $χ_{c1}(3872)\toπ^+π^- J/ψ$ decay channel, a 90\% confidence level upper limit for the branching fraction ratio $\mathcal{R}=\mathcal{B}[χ_{c1}(3872) \rightarrowπ^{+}π^{-}η]/\mathcal{B}[χ_{c1}(3872) \rightarrow π^{+}π^{-} J/ψ] < 0.12$ is given. These measurements provide important inputs for understanding the internal structure of the $χ_{c1}(3872)$ resonance.
Submitted 19 January, 2024; v1 submitted 26 August, 2023;
originally announced August 2023.
-
Improved measurement of the branching fractions for $J/ψ\toγπ^0$, $γη$ and $γη^\prime$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (598 additional authors not shown)
Abstract:
Using a data sample of $(1.0087\pm 0.0044)\times 10^{10}$ $J/ψ$ events collected with the BESIII detector, the decays of $J/ψ\toγπ^{0} (η, η^\prime)\toγγγ$ are studied. Newly measured branching fractions are $\mathcal{B}$$(J/ψ\toγπ^{0})$=$(3.34\pm 0.02\pm 0.09)\times 10^{-5}$, $\mathcal{B}$$(J/ψ\toγη)$=$(1.096\pm 0.001\pm0.019)\times 10^{-3}$ and $\mathcal{B}$$(J/ψ\toγη^\prime)$=$(5.40\pm 0.01\pm0.11)\times 10^{-3}$, where the first uncertainties are statistical and the second are systematic. These results are consistent with the world average values within two standard deviations. The ratio of partial widths $Γ(J/ψ\toγη^\prime)/Γ(J/ψ\toγη)$ is measured to be $4.93 \pm 0.13$. The singlet-octet pseudoscalar mixing angle $θ_P$ is determined to be $θ_P = -(22.11 \pm0.26)^\circ$ or $-(19.34 \pm 0.34)^\circ$ with two different phenomenological models.
Submitted 25 August, 2023;
originally announced August 2023.
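As a rough cross-check of the $J/ψ\toγη^\prime$ to $J/ψ\toγη$ partial-width ratio quoted in the abstract above, the sketch below recomputes it from the two quoted branching fractions. Because both decays share the same parent, the ratio of partial widths equals the ratio of branching fractions. The error treatment is an assumption made here for illustration, not the collaboration's procedure: statistical and systematic uncertainties are added in quadrature and the two measurements are treated as uncorrelated, whereas in practice common systematics would partly cancel. The helper names `combine` and `ratio_with_uncertainty` are hypothetical.

```python
import math


def combine(stat, syst):
    """Add statistical and systematic uncertainties in quadrature (assumed independent)."""
    return math.hypot(stat, syst)


def ratio_with_uncertainty(num, num_err, den, den_err):
    """Return num/den and its uncertainty, assuming uncorrelated inputs."""
    ratio = num / den
    relative_err = math.hypot(num_err / num, den_err / den)
    return ratio, ratio * relative_err


# Branching fractions as quoted in the abstract (value, stat, syst).
b_etap, b_etap_err = 5.40e-3, combine(0.01e-3, 0.11e-3)
b_eta, b_eta_err = 1.096e-3, combine(0.001e-3, 0.019e-3)

ratio, ratio_err = ratio_with_uncertainty(b_etap, b_etap_err, b_eta, b_eta_err)
print(f"{ratio:.2f} +/- {ratio_err:.2f}")  # prints "4.93 +/- 0.13"
```

Under these simplifying assumptions the result agrees with the quoted $4.93 \pm 0.13$.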
-
DD-GCN: Directed Diffusion Graph Convolutional Network for Skeleton-based Human Action Recognition
Authors:
Chang Li,
Qian Huang,
Yingchi Mao
Abstract:
Graph Convolutional Networks (GCNs) have been widely used in skeleton-based human action recognition. In GCN-based methods, the spatio-temporal graph is fundamental for capturing motion patterns. However, existing approaches ignore the physical dependency and synchronized spatio-temporal correlations between joints, which limits the representation capability of GCNs. To solve these problems, we construct the directed diffusion graph for action modeling and introduce the activity partition strategy to optimize the weight sharing mechanism of graph convolution kernels. In addition, we present the spatio-temporal synchronization encoder to embed synchronized spatio-temporal semantics. Finally, we propose Directed Diffusion Graph Convolutional Network (DD-GCN) for action recognition, and the experiments on three public datasets: NTU-RGB+D, NTU-RGB+D 120, and NW-UCLA, demonstrate the state-of-the-art performance of our method.
Submitted 23 August, 2023;
originally announced August 2023.
-
Tailoring magnetism of nanographenes via tip-controlled dehydrogenation
Authors:
Chenxiao Zhao,
Qiang Huang,
Leoš Valenta,
Kristjan Eimre,
Lin Yang,
Aliaksandr V. Yakutovich,
Wangwei Xu,
Ji Ma,
Xinliang Feng,
Michal Juríček,
Roman Fasel,
Pascal Ruffieux,
Carlo A. Pignedoli
Abstract:
Atomically precise graphene nanoflakes, called nanographenes, have emerged as a promising platform to realize carbon magnetism. Their ground state spin configuration can be anticipated by Ovchinnikov-Lieb rules based on the mismatch of π-electrons from two sublattices. While rational geometrical design achieves specific spin configurations, further direct control over the π-electrons offers a desirable extension for efficient spin manipulations and potential quantum device operations. To this end, we apply a site-specific dehydrogenation using a scanning tunneling microscope tip to nanographenes deposited on a Au(111) substrate, which shows the capability of precisely tailoring the underlying π-electron system and therefore efficiently manipulating their magnetism. Through first-principles calculations and tight-binding mean-field-Hubbard modelling, we demonstrate that the dehydrogenation-induced Au-C bond formation along with the resulting hybridization between frontier π-orbitals and Au substrate states effectively eliminate the unpaired π-electron. Our results establish an efficient technique for controlling the magnetism of nanographenes.
Submitted 23 August, 2023;
originally announced August 2023.
-
Does Misclassifying Non-confounding Covariates as Confounders Affect the Causal Inference within the Potential Outcomes Framework?
Authors:
Yonghe Zhao,
Qiang Huang,
Shuai Fu,
Huiyan Sun
Abstract:
The Potential Outcome Framework (POF) plays a prominent role in the field of causal inference. Most causal inference models based on the POF (CIMs-POF) are designed for eliminating confounding bias and default to an underlying assumption of Confounding Covariates. This assumption posits that the covariates consist solely of confounders. However, the assumption of Confounding Covariates is challenging to maintain in practice, particularly when dealing with high-dimensional covariates. While certain methods have been proposed to differentiate the distinct components of covariates prior to conducting causal inference, the consequences of treating non-confounding covariates as confounders remain unclear. This ambiguity poses a potential risk when conducting causal inference in practical scenarios. In this paper, we present a unified graphical framework for the CIMs-POF, which greatly enhances the comprehension of these models' underlying principles. Using this graphical framework, we quantitatively analyze the extent to which the inference performance of CIMs-POF is influenced when incorporating various types of non-confounding covariates, such as instrumental variables, mediators, colliders, and adjustment variables. The key findings are: in the task of eliminating confounding bias, the optimal scenario is for the covariates to exclusively encompass confounders; in the subsequent task of inferring counterfactual outcomes, the adjustment variables contribute to more accurate inferences. Furthermore, extensive experiments conducted on synthetic datasets consistently validate these theoretical conclusions.
Submitted 4 September, 2023; v1 submitted 22 August, 2023;
originally announced August 2023.
-
Improving Adversarial Robustness of Masked Autoencoders via Test-time Frequency-domain Prompting
Authors:
Qidong Huang,
Xiaoyi Dong,
Dongdong Chen,
Yinpeng Chen,
Lu Yuan,
Gang Hua,
Weiming Zhang,
Nenghai Yu
Abstract:
In this paper, we investigate the adversarial robustness of vision transformers that are equipped with BERT pretraining (e.g., BEiT, MAE). A surprising observation is that MAE has significantly worse adversarial robustness than other BERT pretraining methods. This observation drives us to rethink the basic differences between these BERT pretraining methods and how these differences affect the robustness against adversarial perturbations. Our empirical analysis reveals that the adversarial robustness of BERT pretraining is highly related to the reconstruction target, i.e., predicting the raw pixels of masked image patches degrades the model's adversarial robustness more than predicting the semantic context, since it guides the model to concentrate more on medium-/high-frequency components of images. Based on our analysis, we provide a simple yet effective way to boost the adversarial robustness of MAE. The basic idea is to use dataset-extracted domain knowledge to occupy the medium-/high-frequency components of images, thus narrowing the optimization space of adversarial perturbations. Specifically, we cluster the distribution of pretraining data and optimize a set of cluster-specific visual prompts in the frequency domain. These prompts are incorporated into input images through prototype-based prompt selection at test time. Extensive evaluation shows that our method clearly boosts MAE's adversarial robustness while maintaining its clean performance on ImageNet-1k classification. Our code is available at: https://github.com/shikiw/RobustMAE.
Submitted 22 August, 2023; v1 submitted 20 August, 2023;
originally announced August 2023.
-
Poisson quadrature method of moments for 2D kinetic equations with velocity of constant magnitude
Authors:
Yihong Chen,
Qian Huang,
Wen-An Yong,
Ruixi Zhang
Abstract:
This work is concerned with kinetic equations with velocity of constant magnitude. We propose a quadrature method of moments based on the Poisson kernel, called Poisson-EQMOM. The derived moment closure systems are well defined for all physically relevant moments and the resultant approximations of the distribution function converge as the number of moments goes to infinity. The convergence makes our method stand out from most existing moment methods. Moreover, we devise a delicate moment inversion algorithm. As an application, the Vicsek model is studied for overdamped active particles. Then the Poisson-EQMOM is validated with a series of numerical tests including spatially homogeneous, one-dimensional and two-dimensional problems.
Submitted 19 August, 2023;
originally announced August 2023.
-
Observation of Non-Hermitian Skin Effect in Thermal Diffusion
Authors:
Yun-Kai Liu,
Pei-Chao Cao,
Minghong Qi,
Qiang-Kai-Lai Huang,
Yu-Gui Peng,
Ying Li,
Xue-Feng Zhu
Abstract:
The paradigm shift from Hermitian systems into the non-Hermitian regime profoundly modifies the inherent topological properties, leading to various unprecedented effects such as the non-Hermitian skin effect (NHSE). In the past decade, the NHSE has been demonstrated in quantum, optical and acoustic systems. Beyond those non-Hermitian wave systems, however, the NHSE in diffusive systems has not yet been explicitly demonstrated, despite abundant recent advances in the study of topological thermal diffusion. Here we first design a thermal diffusion lattice based on a modified Su-Schrieffer-Heeger model, which enables the observation of the diffusive NHSE. In the proposed model, the periodic heat exchange rate among adjacent unit cells and the asymmetric temperature field coupling inside unit cells can be judiciously realized by appropriate configurations of the structural parameters of the unit cells. The transient concentration of the temperature field on the boundary, regardless of initial excitation conditions, can be clearly observed, indicating the occurrence of a transient thermal skin effect. We then experimentally demonstrate the NHSE and verify its remarkable robustness against various defects. Our work provides a platform for exploring non-Hermitian physics in diffusive systems, which has important applications in efficient heat collection, highly sensitive thermal sensing and others.
Submitted 17 August, 2023;
originally announced August 2023.
-
Study of $e^+e^-\toηφ$ at center-of-mass energies from 3.773 to 4.600 GeV
Authors:
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
J. Bloms,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (600 additional authors not shown)
Abstract:
We present a study of the process $e^{+}e^{-}\toηφ$ using data samples collected with the BESIII detector corresponding to an integrated luminosity of 15.03 fb$^{-1}$ at 23 center-of-mass energies from 3.773 to 4.600 GeV. The Born cross sections are measured at each energy and a coherent fit to cross-section lineshape is performed using a Breit-Wigner parametrization to search for charmonium-like vector states. No significant signals of the $Y(4230)$ and $Y(4360)$ resonances are observed.
Submitted 24 October, 2023; v1 submitted 16 August, 2023;
originally announced August 2023.
-
Full analysis of the scalar-induced gravitational waves for the curvature perturbation with local-type non-Gaussianities
Authors:
Chen Yuan,
De-Shuang Meng,
Qing-Guo Huang
Abstract:
Primordial black holes (PBHs) are supposed to form through the gravitational collapse of regions with large density fluctuations. The formation of PBHs inevitably leads to the emission of scalar-induced gravitational wave (SIGW) signals, offering a unique opportunity to test the hypothesis of PBHs as a constituent of dark matter (DM). Previous studies have calculated the energy spectrum of SIGWs in local-type non-Gaussian models, primarily considering the contributions from the $F_{\mathrm{NL}}$-order or the $G_{\mathrm{NL}}$-order while neglecting connected diagrams. In this study, we extend the previous work by (i) considering the full contribution of non-Gaussian diagrams up to the $G_{\mathrm{NL}}$-order; (ii) deriving the generic scaling of the SIGW energy spectrum in the infrared region. We derive semi-analytical results applicable to arbitrary primordial power spectra and numerically evaluate the energy spectrum of SIGWs for a log-normal power spectrum.
Submitted 17 February, 2024; v1 submitted 14 August, 2023;
originally announced August 2023.
-
Search for the lepton number violation decay $φ\to π^+ π^+ e^- e^-$ via $J/ψ\to φη$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
J. Bloms,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (584 additional authors not shown)
Abstract:
Using $(1.0087\pm0.0044)\times10^{10}$ $J/ψ$ events collected by the BESIII detector at the BEPCII collider, we search for the lepton number violation decay $φ\to π^+ π^+ e^- e^-$ via $J/ψ\to φη$. No signal is found and the upper limit on the branching fraction of $φ\to π^+ π^+ e^- e^-$ is set to be $9.7\times10^{-6}$ at the 90\% confidence level.
Submitted 10 August, 2023;
originally announced August 2023.
-
Adaptive Intellect Unleashed: The Feasibility of Knowledge Transfer in Large Language Models
Authors:
Qing Huang,
Yishun Wu,
Zhenchang Xing,
He Jiang,
Yu Cheng,
Huan Jin
Abstract:
We conduct the first empirical study on using knowledge transfer to improve the generalization ability of large language models (LLMs) in software engineering tasks, which often require LLMs to generalize beyond their training data. Our proposed general knowledge transfer approach guides the LLM towards a similar and familiar API or code snippet it has encountered before, improving the model's generalization ability for unseen knowledge. We apply this approach to three software engineering tasks: API inference, code example generation, and FQN inference, and find transfer span, transfer strategy, and transfer architecture as key factors affecting the method. Our findings demonstrate the feasibility of knowledge transfer and its potential to enhance LLMs' performance in various software engineering tasks. The effectiveness of knowledge transfer varies depending on the target domain and task, with the hierarchical strategy being more effective than direct transfer, and AI-Chain outperforming CoT in prompt design. The implications of these findings extend beyond software engineering tasks and suggest that knowledge transfer can enhance LLMs' ability to handle unknowns in any natural language task.
Submitted 9 August, 2023;
originally announced August 2023.
-
Deep Learning-Based Knowledge Injection for Metaphor Detection: A Comprehensive Review
Authors:
Cheng Yang,
Zheng Li,
Zhiyue Liu,
Qingbao Huang
Abstract:
Metaphor as an advanced cognitive modality works by extracting familiar concepts in the target domain in order to understand vague and abstract concepts in the source domain. This helps humans to quickly understand and master new domains and thus adapt to changing environments. With the continuous development of metaphor research in the natural language community, many studies using knowledge-assisted models to detect textual metaphors have emerged in recent years. Compared to not using knowledge, systems that introduce various kinds of knowledge achieve greater performance gains and reach SOTA in a recent study. Based on this, the goal of this paper is to provide a comprehensive review of research advances in the application of deep learning for knowledge injection in metaphor detection tasks. We will first systematically summarize and generalize the mainstream knowledge and knowledge injection principles. Then, the datasets, evaluation metrics, and benchmark models used in metaphor detection tasks are examined. Finally, we explore the current issues facing knowledge injection methods and provide an outlook on future research directions.
Submitted 8 January, 2024; v1 submitted 8 August, 2023;
originally announced August 2023.
-
Cloth2Tex: A Customized Cloth Texture Generation Pipeline for 3D Virtual Try-On
Authors:
Daiheng Gao,
Xu Chen,
Xindi Zhang,
Qi Wang,
Ke Sun,
Bang Zhang,
Liefeng Bo,
Qixing Huang
Abstract:
Fabricating and designing 3D garments has become extremely demanding with the increasing need for synthesizing realistic dressed persons for a variety of applications, e.g. 3D virtual try-on, digitalization of 2D clothes into 3D apparel, and cloth animation. It thus necessitates a simple and straightforward pipeline to obtain high-quality texture from simple input, such as 2D reference images. Traditional warping-based texture generation methods require a significant number of control points to be manually selected for each type of garment, which can be a time-consuming and tedious process. We propose a novel method, called Cloth2Tex, which eliminates the human burden in this process. Cloth2Tex is a self-supervised method that generates texture maps with reasonable layout and structural consistency. Another key feature of Cloth2Tex is that it can be used to support high-fidelity texture inpainting. This is done by combining Cloth2Tex with a prevailing latent diffusion model. We evaluate our approach both qualitatively and quantitatively and demonstrate that Cloth2Tex can generate high-quality texture maps and achieve the best visual effects in comparison to other methods. Project page: tomguluson92.github.io/projects/cloth2tex/
Submitted 8 August, 2023;
originally announced August 2023.
-
PARTNER: Level up the Polar Representation for LiDAR 3D Object Detection
Authors:
Ming Nie,
Yujing Xue,
Chunwei Wang,
Chaoqiang Ye,
Hang Xu,
Xinge Zhu,
Qingqiu Huang,
Michael Bi Mi,
Xinchao Wang,
Li Zhang
Abstract:
Recently, polar-based representation has shown promising properties in perceptual tasks. In addition to Cartesian-based approaches, which separate point clouds unevenly, representing point clouds as polar grids has been recognized as an alternative due to (1) its advantage in robust performance under different resolutions and (2) its superiority in streaming-based approaches. However, state-of-the-art polar-based detection methods inevitably suffer from the feature distortion problem because of the non-uniform division of polar representation, resulting in a non-negligible performance gap compared to Cartesian-based approaches. To tackle this issue, we present PARTNER, a novel 3D object detector in the polar coordinate. PARTNER alleviates the dilemma of feature distortion with global representation re-alignment and facilitates the regression by introducing instance-level geometric information into the detection head. Extensive experiments show overwhelming advantages in streaming-based detection and different resolutions. Furthermore, our method outperforms the previous polar-based works with remarkable margins of 3.68% and 9.15% on Waymo and ONCE validation set, thus achieving competitive results over the state-of-the-art methods.
Submitted 2 December, 2023; v1 submitted 7 August, 2023;
originally announced August 2023.
-
Enhancing Nucleus Segmentation with HARU-Net: A Hybrid Attention Based Residual U-Blocks Network
Authors:
Junzhou Chen,
Qian Huang,
Yulin Chen,
Linyi Qian,
Chengyuan Yu
Abstract:
Nucleus image segmentation is a crucial step in the analysis, pathological diagnosis, and classification, which heavily relies on the quality of nucleus segmentation. However, the complexity of issues such as variations in nucleus size, blurred nucleus contours, uneven staining, cell clustering, and overlapping cells poses significant challenges. Current methods for nucleus segmentation primarily rely on nuclear morphology or contour-based approaches. Nuclear morphology-based methods exhibit limited generalization ability and struggle to effectively predict irregular-shaped nuclei, while contour-based extraction methods face challenges in accurately segmenting overlapping nuclei. To address the aforementioned issues, we propose a dual-branch network using hybrid attention based residual U-blocks for nucleus instance segmentation. The network simultaneously predicts target information and target contours. Additionally, we introduce a post-processing method that combines the target information and target contours to distinguish overlapping nuclei and generate an instance segmentation image. Within the network, we propose a context fusion block (CF-block) that effectively extracts and merges contextual information from the network. Extensive quantitative evaluations are conducted to assess the performance of our method. Experimental results demonstrate the superior performance of the proposed method compared to state-of-the-art approaches on the BNS, MoNuSeg, CoNSeg, and CPM-17 datasets.
Submitted 10 August, 2023; v1 submitted 7 August, 2023;
originally announced August 2023.
-
Measurement of the $e^+e^- \to Λ\barΣ^0 + c.c.$ cross sections at $\sqrt{s}$ from 2.3094 to 3.0800 GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
J. Bloms,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (601 additional authors not shown)
Abstract:
The Born cross sections and effective form factors of the process $e^+e^-\toΛ\barΣ^0 + c.c.$ are measured at 14 center-of-mass energy points from 2.3094 to 3.0800 GeV, based on data corresponding to an integrated luminosity of $(478.5 \pm 4.8)\ \text{pb}^{-1}$ collected with the BESIII detector. A non-zero Born cross section is observed at the center-of-mass energy of 2.3094 GeV with a statistical significance of more than five standard deviations, and the cross sections at other energies are obtained with improved precision compared to earlier measurements from the BaBar Collaboration. The Born cross-section lineshape is described better by a shape with a plateau near the threshold than by a pQCD motivated functional form.
Submitted 7 August, 2023;
originally announced August 2023.
-
A Voting-Stacking Ensemble of Inception Networks for Cervical Cytology Classification
Authors:
Linyi Qian,
Qian Huang,
Yulin Chen,
Junzhou Chen
Abstract:
Cervical cancer is one of the most severe diseases threatening women's health. Early detection and diagnosis can significantly reduce cancer risk, in which cervical cytology classification is indispensable. Researchers have recently designed many networks for automated cervical cancer diagnosis, but the limited accuracy and bulky size of these individual models cannot meet practical application needs. To address this issue, we propose a Voting-Stacking ensemble strategy, which employs three Inception networks as base learners and integrates their outputs through a voting ensemble. The samples misclassified by the ensemble model generate a new training set on which a linear classification model is trained as the meta-learner and performs the final predictions. In addition, a multi-level Stacking ensemble framework is designed to improve performance further. The method is evaluated on the SIPakMed, Herlev, and Mendeley datasets, achieving accuracies of 100%, 100%, and 100%, respectively. The experimental results outperform the current state-of-the-art (SOTA) methods, demonstrating its potential for reducing screening workload and helping pathologists detect cervical cancer.
Submitted 8 August, 2023; v1 submitted 4 August, 2023;
originally announced August 2023.
-
VLUCI: Variational Learning of Unobserved Confounders for Counterfactual Inference
Authors:
Yonghe Zhao,
Qiang Huang,
Siwei Wu,
Yun Peng,
Huiyan Sun
Abstract:
Causal inference plays a vital role in diverse domains like epidemiology, healthcare, and economics. De-confounding and counterfactual prediction in observational data has emerged as a prominent concern in causal inference research. While existing models tackle observed confounders, the presence of unobserved confounders remains a significant challenge, distorting causal inference and impacting counterfactual outcome accuracy. To address this, we propose a novel variational learning model of unobserved confounders for counterfactual inference (VLUCI), which generates the posterior distribution of unobserved confounders. VLUCI relaxes the unconfoundedness assumption often overlooked by most causal inference methods. By disentangling observed and unobserved confounders, VLUCI constructs a doubly variational inference model to approximate the distribution of unobserved confounders, which are used for inferring more accurate counterfactual outcomes. Extensive experiments on synthetic and semi-synthetic datasets demonstrate VLUCI's superior performance in inferring unobserved confounders. It is compatible with state-of-the-art counterfactual inference models, significantly improving inference accuracy at both group and individual levels. Additionally, VLUCI provides confidence intervals for counterfactual outcomes, aiding decision-making in risk-sensitive domains. We further clarify the considerations when applying VLUCI to cases where unobserved confounders don't strictly conform to our model assumptions using the public IHDP dataset as an example, highlighting the practical advantages of VLUCI.
Submitted 7 September, 2023; v1 submitted 1 August, 2023;
originally announced August 2023.
-
Select2Col: Leveraging Spatial-Temporal Importance of Semantic Information for Efficient Collaborative Perception
Authors:
Yuntao Liu,
Qian Huang,
Rongpeng Li,
Xianfu Chen,
Zhifeng Zhao,
Shuyuan Zhao,
Yongdong Zhu,
Honggang Zhang
Abstract:
Collaborative perception by leveraging the shared semantic information plays a crucial role in overcoming the individual limitations of isolated agents. However, existing collaborative perception methods tend to focus solely on the spatial features of semantic information, while neglecting the importance of the temporal dimension. Consequently, the potential benefits of collaboration remain underutilized. In this article, we propose Select2Col, a novel collaborative perception framework that takes into account the \underline{s}patial-t\underline{e}mpora\underline{l} importanc\underline{e} of semanti\underline{c} informa\underline{t}ion. Within the Select2Col, we develop a collaborator selection method that utilizes a lightweight graph neural network (GNN) to estimate the importance of semantic information (IoSI) of each collaborator in enhancing perception performance, thereby identifying contributive collaborators while excluding those that potentially bring negative impact. Moreover, we present a semantic information fusion algorithm called HPHA (historical prior hybrid attention), which integrates multi-scale attention and short-term attention modules to capture the IoSI in feature representation from the spatial and temporal dimensions respectively, and assigns IoSI-consistent weights for efficient fusion of information from selected collaborators. Extensive experiments on three open datasets demonstrate that our proposed Select2Col significantly improves the perception performance compared to state-of-the-art approaches. The code associated with this research is publicly available at https://github.com/huangqzj/Select2Col/.
Submitted 6 February, 2024; v1 submitted 31 July, 2023;
originally announced July 2023.
-
Rapid Flood Inundation Forecast Using Fourier Neural Operator
Authors:
Alexander Y. Sun,
Zhi Li,
Wonhyun Lee,
Qixing Huang,
Bridget R. Scanlon,
Clint Dawson
Abstract:
Flood inundation forecast provides critical information for emergency planning before and during flood events. Real time flood inundation forecast tools are still lacking. High-resolution hydrodynamic modeling has become more accessible in recent years, however, predicting flood extents at the street and building levels in real-time is still computationally demanding. Here we present a hybrid process-based and data-driven machine learning (ML) approach for flood extent and inundation depth prediction. We used the Fourier neural operator (FNO), a highly efficient ML method, for surrogate modeling. The FNO model is demonstrated over an urban area in Houston (Texas, U.S.) by training using simulated water depths (in 15-min intervals) from six historical storm events and then tested over two holdout events. Results show FNO outperforms the baseline U-Net model. It maintains high predictability at all lead times tested (up to 3 hrs) and performs well when applying to new sites, suggesting strong generalization skill.
Submitted 29 July, 2023;
originally announced July 2023.
-
Determination of the $Σ^{+}$ Timelike Electromagnetic Form Factors
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (604 additional authors not shown)
Abstract:
Based on data samples collected with the BESIII detector at the BEPCII collider, the process $e^{+}e^{-} \to Σ^{+}\barΣ^{-}$ is studied at center-of-mass energies $\sqrt{s}$ = 2.3960, 2.6454, and 2.9000 GeV. Using a fully differential angular description of the final state particles, both the relative magnitude and phase information of the $Σ^{+}$ electromagnetic form factors in the timelike region are extracted. The relative phase between the electric and magnetic form factors is determined to be $\sinΔΦ$ = -0.67~$\pm$~0.29~(stat)~$\pm$~0.18~(syst) at $\sqrt{s}$ = 2.3960 GeV, $ΔΦ$ = 55$^{\circ}$~$\pm$~19$^{\circ}$~(stat) $\pm$~14$^{\circ}$~(syst) at $\sqrt{s}$ = 2.6454 GeV, and 78$^{\circ}$~$\pm$~22$^{\circ}$~(stat) $\pm$~9$^{\circ}$~(syst) at $\sqrt{s}$ = 2.9000 GeV. For the first time, the phase of the hyperon electromagnetic form factors is explored in a wide range of four-momentum transfer. The evolution of the phase along with four-momentum transfer is an important input for understanding its asymptotic behavior and the dynamics of baryons.
Submitted 5 March, 2024; v1 submitted 29 July, 2023;
originally announced July 2023.
-
A search for new physics in low-energy electron recoils from the first LZ exposure
Authors:
The LZ Collaboration,
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
F. Alder,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
J. W. Bargemann,
A. Baxter,
K. Beattie,
P. Beltrame,
T. Benson,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
G. M. Blockinger
, et al. (178 additional authors not shown)
Abstract:
The LUX-ZEPLIN (LZ) experiment is a dark matter detector centered on a dual-phase xenon time projection chamber. We report searches for new physics appearing through few-keV-scale electron recoils, using the experiment's first exposure of 60 live days and a fiducial mass of 5.5t. The data are found to be consistent with a background-only hypothesis, and limits are set on models for new physics including solar axion electron coupling, solar neutrino magnetic moment and millicharge, and electron couplings to galactic axion-like particles and hidden photons. Similar limits are set on weakly interacting massive particle (WIMP) dark matter producing signals through ionized atomic states from the Migdal effect.
Submitted 9 September, 2023; v1 submitted 28 July, 2023;
originally announced July 2023.
-
Disproof of a conjecture on the minimum spectral radius and the domination number
Authors:
Yarong Hu,
Zhenzhen Lou,
Qiongxiang Huang
Abstract:
Let $G_{n,γ}$ be the set of all connected graphs on $n$ vertices with domination number $γ$. A graph is called a minimizer graph if it attains the minimum spectral radius among $G_{n,γ}$. Very recently, Liu, Li and Xie [Linear Algebra and its Applications 673 (2023) 233--258] proved that the minimizer graph over all graphs in $\mathbb{G}_{n,γ}$ must be a tree. Moreover, they determined the minimizer graph among $G_{n,\lfloor\frac{n}{2}\rfloor}$ for even $n$, and posed the conjecture on the minimizer graph among $G_{n,\lfloor\frac{n}{2}\rfloor}$ for odd $n$. In this paper, we disprove the conjecture and completely determine the unique minimizer graph among $G_{n,\lfloor\frac{n}{2}\rfloor}$ for odd $n$.
Submitted 28 July, 2023;
originally announced July 2023.
-
Med-Flamingo: a Multimodal Medical Few-shot Learner
Authors:
Michael Moor,
Qian Huang,
Shirley Wu,
Michihiro Yasunaga,
Cyril Zakka,
Yash Dalmia,
Eduardo Pontes Reis,
Pranav Rajpurkar,
Jure Leskovec
Abstract:
Medicine, by its nature, is a multifaceted domain that requires the synthesis of information across various modalities. Medical generative vision-language models (VLMs) make a first step in this direction and promise many exciting clinical applications. However, existing models typically have to be fine-tuned on sizeable down-stream datasets, which poses a significant limitation as in many medical applications data is scarce, necessitating models that are capable of learning from few examples in real-time. Here we propose Med-Flamingo, a multimodal few-shot learner adapted to the medical domain. Based on OpenFlamingo-9B, we continue pre-training on paired and interleaved medical image-text data from publications and textbooks. Med-Flamingo unlocks few-shot generative medical visual question answering (VQA) abilities, which we evaluate on several datasets including a novel challenging open-ended VQA dataset of visual USMLE-style problems. Furthermore, we conduct the first human evaluation for generative medical VQA where physicians review the problems and blinded generations in an interactive app. Med-Flamingo improves performance in generative medical VQA by up to 20\% in clinician's rating and firstly enables multimodal medical few-shot adaptations, such as rationale generation. We release our model, code, and evaluation app under https://github.com/snap-stanford/med-flamingo.
Submitted 27 July, 2023;
originally announced July 2023.