Search | arXiv e-print repository

MC$^2$: Towards Transparent and Culturally-Aware NLP for Minority Languages in China

Authors: Chen Zhang, Mingxu Tao, Quzhe Huang, Jiuheng Lin, Zhibin Chen, Yansong Feng

Abstract: Current large language models demonstrate deficiencies in understanding low-resource languages, particularly the minority languages in China. This limitation stems from the scarcity of available pre-training data. To address this accessibility challenge, we present MC$^2$, a Multilingual Corpus of Minority Languages in China, which is the largest open-source corpus of its kind so far. MC$^2$ includes four underrepresented languages: Tibetan, Uyghur, Kazakh, and Mongolian. Notably, we focus on the less common writing systems of Kazakh and Mongolian, i.e., Kazakh Arabic script and traditional Mongolian script, respectively, which have been long neglected in previous corpus construction efforts. Recognizing the prevalence of language contamination within existing corpora, we adopt a quality-centric solution for collecting MC$^2$, prioritizing accuracy while enhancing diversity. Furthermore, we underscore the importance of attending to the multiplicity of writing systems, which is closely related to the cultural awareness of the resulting models. The MC$^2$ corpus and related models are made public to the community.

Submitted 13 June, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

Comments: ACL 2024 https://github.com/luciusssss/mc2_corpus

arXiv:2311.07043 [pdf, ps, other]

Study of the decay $J/ψ\to φπ^{0}η$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (604 additional authors not shown)

Abstract: Based on $(10.09 \pm 0.04) \times 10^9$ $J/ψ$ events collected with the BESIII detector operating at the BEPCII collider, a partial wave analysis of the decay $J/ψ\to φπ^{0}η$ is performed. We observe for the first time two new structures on the $φη$ invariant mass distribution, with statistical significances of $24.0σ$ and $16.9σ$; the first with $J^{\rm PC}$ = $1^{+-}$, mass M = (1911 $\pm$ 6 (stat.) $\pm$ 14 (sys.)) MeV/$c^{2}$, and width $Γ$ = (149 $\pm$ 12 (stat.) $\pm$ 23 (sys.)) MeV, the second with $J^{\rm PC}$ = $1^{--}$, mass M = (1996 $\pm$ 11 (stat.) $\pm$ 30 (sys.)) MeV/$c^{2}$, and width $Γ$ = (148 $\pm$ 16 (stat.) $\pm$ 66 (sys.)) MeV. These measurements provide important input for the strangeonium spectrum. In addition, the $f_0(980)-a_0(980)^0$ mixing signal in $J/ψ\to φf_0(980) \to φa_0(980)^0$ and the corresponding electromagnetic decay $J/ψ\to φa_0(980)^0$ are measured with improved precision, providing crucial information to understand the nature of $a_0(980)^0$ and $f_0(980)$.

Submitted 14 November, 2023; v1 submitted 12 November, 2023; originally announced November 2023.

arXiv:2311.06883 [pdf, other]

doi 10.1103/PhysRevD.109.L091101

Evidence of the Singly Cabibbo Suppressed decay $Λ_c^+\to pπ^0$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (600 additional authors not shown)

Abstract: Evidence for the singly Cabibbo suppressed decay $Λ_c^+\to pπ^0$ is reported for the first time with a statistical significance of $3.7σ$ based on 6.0 $\mathrm{fb}^{-1}$ of $e^+e^-$ collision data collected at center-of-mass energies between 4.600 and 4.843 GeV with the BESIII detector at the BEPCII collider. The absolute branching fraction of $Λ_c^+\to pπ^0$ is measured to be $(1.56^{+0.72}_{-0.58}\pm0.20)\times 10^{-4}$. Combining with the branching fraction of $Λ_c^+\to nπ^+$, $(6.6\pm1.3)\times10^{-4}$, the ratio of the branching fractions of $Λ_c^+\to nπ^+$ and $Λ_c^+\to pπ^0$ is calculated to be $3.2^{+2.2}_{-1.2}$. As an important input for the theoretical models describing the decay mechanisms of charmed baryons, our result indicates that the non-factorizable contributions play an essential role and their interference with the factorizable contributions should not be significant. In addition, the absolute branching fraction of $Λ_c^+\to pη$ is measured to be $(1.63\pm0.31_{\rm stat}\pm0.11_{\rm syst}) \times10^{-3}$.

Submitted 3 June, 2024; v1 submitted 12 November, 2023; originally announced November 2023.

Comments: 9 pages, 3 figures

Journal ref: Phys. Rev. D 109, L091101 (2024)

arXiv:2311.05955 [pdf, other]

Observation and branching fraction measurement of the decay $J\!/\!ψ\rightarrow \bar{p} Σ^{+} K_{S}^{0} + c.c.$

Authors: M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, J. Bloms, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (602 additional authors not shown)

Abstract: The first observation of the decays $J\!/\!ψ\rightarrow \bar{p} Σ^{+} K_{S}^{0}$ and $J\!/\!ψ\rightarrow p \bar{Σ}^{-} K_{S}^{0}$ is reported using $(10087\pm44)\times10^{6}$ $J\!/\!ψ$ events recorded by the BESIII detector at the BEPCII storage ring. The branching fractions of each channel are determined to be $\mathcal{B}(J\!/\!ψ\rightarrow \bar{p} Σ^{+} K_{S}^{0})=(1.361 \pm 0.006 \pm 0.025) \times 10^{-4}$ and $\mathcal{B}(J\!/\!ψ\rightarrow p \bar{Σ}^{-} K_{S}^{0})=(1.352 \pm 0.006 \pm 0.025) \times 10^{-4}$. The combined result is $\mathcal{B}(J\!/\!ψ\rightarrow \bar{p} Σ^{+} K_{S}^{0} +c.c.)=(2.725 \pm 0.009 \pm 0.050) \times 10^{-4}$, where the first uncertainty is statistical and the second systematic. The results presented are in good agreement with the branching fractions of the isospin partner decay $J\!/\!ψ\rightarrow p K^- \bar{Σ}^0 + c.c.$.

Submitted 14 November, 2023; v1 submitted 10 November, 2023; originally announced November 2023.

arXiv:2311.05391 [pdf, other]

doi 10.1103/PhysRevD.109.014021

Three-body unitary coupled-channel approach to radiative $J/ψ$ decays and $η(1405/1475)$

Authors: S. X. Nakamura, Q. Huang, J. -J. Wu, H. P. Peng, Y. Zhang, Y. C. Zhu

Abstract: Recent BESIII data on radiative $J/ψ$ decays from $\sim 10^{10}$ $J/ψ$ samples should significantly advance our understanding of the controversial nature of $η(1405/1475)$. This motivates us to develop a three-body unitary coupled-channel model for radiative $J/ψ$ decays to three-meson final states of any partial wave ($J^{PC}$). Basic building blocks of the model are bare resonance states such as $η(1405/1475)$ and $f_1(1420)$, and $πK$, $K\bar{K}$, and $πη$ two-body interactions that generate resonances such as $K^*(892)$, $K^*_0(700)$, and $a_0(980)$. This model reasonably fits $K_SK_Sπ^0$ Dalitz plot pseudo data generated from BESIII's $J^{PC}=0^{-+}$ amplitude for $J/ψ\toγK_SK_Sπ^0$. The experimental branching ratios of $η(1405/1475)\toηππ$ and $η(1405/1475)\toγρ$ relative to that of $η(1405/1475)\to K\bar{K}π$ are simultaneously fitted. Our $0^{-+}$ amplitude is analytically continued to find three poles, two of which correspond to $η(1405)$ on different Riemann sheets of the $K^*\bar{K}$ channel, and the third to $η(1475)$. This is the first pole determination of $η(1405/1475)$ and, furthermore, the first-ever pole determination from analyzing experimental Dalitz plot distributions with a manifestly three-body unitary coupled-channel framework. Process-dependent $ηππ$, $γπ^+π^-$, and $πππ$ lineshapes of $J/ψ\toγ(0^{-+})\to γ(ηππ)$, $γ(γρ)$, and $γ(πππ)$ are predicted, and are in reasonable agreement with data. A triangle singularity is shown to play a crucial role in causing the large isospin violation of $J/ψ\toγ(πππ)$.

Submitted 19 January, 2024; v1 submitted 9 November, 2023; originally announced November 2023.

Comments: 23 pages, 19 figures; (v2) minor changes, published version

Journal ref: Phys. Rev. D 109, 014021 (2024)

arXiv:2311.03055 [pdf, other]

DRAUC: An Instance-wise Distributionally Robust AUC Optimization Framework

Authors: Siran Dai, Qianqian Xu, Zhiyong Yang, Xiaochun Cao, Qingming Huang

Abstract: The Area Under the ROC Curve (AUC) is a widely employed metric in long-tailed classification scenarios. Nevertheless, most existing methods primarily assume that training and testing examples are drawn i.i.d. from the same distribution, which is often unachievable in practice. Distributionally Robust Optimization (DRO) enhances model performance by optimizing it for the local worst-case scenario, but directly integrating AUC optimization with DRO results in an intractable optimization problem. Methodologically, we tackle this challenge by proposing an instance-wise surrogate loss of Distributionally Robust AUC (DRAUC) and building our optimization framework on top of it. Moreover, we highlight that conventional DRAUC may induce label bias, and hence introduce distribution-aware DRAUC as a more suitable metric for robust AUC learning. Theoretically, we affirm that the generalization gap between the training loss and testing error diminishes if the training set is sufficiently large. Empirically, experiments on corrupted benchmark datasets demonstrate the effectiveness of our proposed method. Code is available at: https://github.com/EldercatSAM/DRAUC.
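For readers unfamiliar with the metric named in this abstract, the following is a minimal sketch of the standard pairwise definition of AUC: the fraction of (positive, negative) pairs in which the positive example receives the higher score. It is not the DRAUC surrogate loss from the paper, and the function name `pairwise_auc` is an illustrative assumption.

```python
def pairwise_auc(scores, labels):
    """Empirical AUC: share of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a correct pair.
    Labels are assumed to be 1 (positive) or 0 (negative).
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")

    correct = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                correct += 1.0
            elif p == n:
                correct += 0.5
    return correct / (len(pos) * len(neg))


# Three of the four (positive, negative) pairs are ordered correctly.
example_auc = pairwise_auc([0.9, 0.8, 0.3, 0.1], [1, 0, 1, 0])
```

Because the metric is defined over pairs rather than single instances, it does not decompose into a per-example sum, which is consistent with the abstract's interest in an instance-wise surrogate.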

Submitted 6 November, 2023; originally announced November 2023.

arXiv:2311.02347 [pdf, other]

doi 10.1103/PhysRevD.109.052001

Measurement of the absolute branching fraction of the three-body decay $Λ_{c}^+ \to Ξ^{0}K^{+}π^{0}$ and search for $Λ_{c}^+ \to nK^+π^0$, $Σ^{0}K^{+}π^{0}$ and $ΛK^{+}π^{0}$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (600 additional authors not shown)

Abstract: The Cabibbo-favored decay $Λ_{c}^+ \to Ξ^{0}K^{+}π^{0}$ is studied for the first time using 6.1 fb$^{-1}$ of $e^+e^-$ collision data at center-of-mass energies between 4.600 and 4.840 GeV, collected with the BESIII detector at the BEPCII collider. With a double-tag method, the branching fraction of the three-body decay $Λ_{c}^+ \to Ξ^{0}K^{+}π^{0}$ is measured to be $(7.79 \pm 1.46 \pm 0.71) \times 10^{-3}$, where the first and second uncertainties are statistical and systematic, respectively. The branching fraction of the two-body decay $Λ_{c}^+ \to Ξ(1530)^{0}K^+$ is $(5.99\pm1.04\pm0.29)\times10^{-3}$, which is consistent with the previous result of $(5.02\pm0.99\pm0.31)\times 10^{-3}$. In addition, the upper limit on the branching fraction of the doubly Cabibbo-suppressed decay $Λ_{c}^+ \to nK^+π^0$ is $7.1 \times 10^{-4}$ at the 90$\%$ confidence level. The upper limits on the branching fractions of $Λ_{c}^+ \to Σ^{0}K^{+}π^{0}$ and $ΛK^{+}π^{0}$ are also determined to be $1.8\times 10^{-3}$ and $2.0 \times 10^{-3}$, respectively.

Submitted 8 May, 2024; v1 submitted 4 November, 2023; originally announced November 2023.

Comments: 15 pages, 20 figures

Journal ref: Phys. Rev. D 109, 052001 (2024)

arXiv:2311.01782 [pdf, other]

Generating Unbiased Pseudo-labels via a Theoretically Guaranteed Chebyshev Constraint to Unify Semi-supervised Classification and Regression

Authors: Jiaqi Wu, Junbiao Pang, Qingming Huang

Abstract: Both semi-supervised classification and regression are practically challenging tasks for computer vision. However, semi-supervised classification methods are rarely applied to regression tasks, because the threshold-to-pseudo-label process (T2L) in classification uses confidence to determine label quality. This is successful for classification tasks but inefficient for regression tasks. By nature, regression also requires unbiased methods to generate high-quality labels. On the other hand, T2L for classification often fails if the confidence is generated by a biased method. To address this issue, we propose a theoretically guaranteed constraint for generating unbiased labels based on Chebyshev's inequality, combining multiple predictions to generate superior quality labels from several inferior ones. In terms of high-quality labels, the unbiased method naturally avoids the drawback of T2L. Specifically, we propose an Unbiased Pseudo-labels network (UBPL network) with multiple branches to combine multiple predictions as pseudo-labels, where a Feature Decorrelation loss (FD loss) is proposed based on the Chebyshev constraint. In principle, our method can be used for both classification and regression and can be easily extended to any semi-supervised framework, e.g. Mean Teacher, FixMatch, DualPose. Our approach achieves superior performance over state-of-the-art methods on the pose estimation datasets Mouse, FLIC and LSP, as well as the classification datasets CIFAR10/100 and SVHN.
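As background for the Chebyshev-based argument in this abstract, the textbook inequality for an average of several predictions can be written as follows. This is a sketch under stated assumptions (each of the $K$ branch predictions $\hat{y}_k$ is unbiased for the true label $y$, has variance $σ^2$, and the predictions are pairwise uncorrelated); the symbols are illustrative and the exact constraint used in the paper may differ.

```latex
% Assumptions (not from the abstract): E[\hat{y}_k] = y, Var(\hat{y}_k) = \sigma^2,
% and Cov(\hat{y}_j, \hat{y}_k) = 0 for j \neq k.
\bar{y} = \frac{1}{K} \sum_{k=1}^{K} \hat{y}_k ,
\qquad
\mathrm{Var}(\bar{y}) = \frac{\sigma^2}{K} ,
\qquad
P\bigl( \lvert \bar{y} - y \rvert \ge \epsilon \bigr)
  \le \frac{\sigma^2}{K \epsilon^2}
  \quad \text{for any } \epsilon > 0 .
```

Under these assumptions the bound tightens as $K$ grows, which is one way to read the abstract's claim that several inferior predictions can be combined into a better pseudo-label. The variance reduction relies on the predictions being uncorrelated, which plausibly motivates a decorrelation loss between branches.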

Submitted 3 November, 2023; originally announced November 2023.

arXiv:2311.01770 [pdf, other]

Modeling the Uncertainty with Maximum Discrepant Students for Semi-supervised 2D Pose Estimation

Authors: Jiaqi Wu, Junbiao Pang, Qingming Huang

Abstract: Semi-supervised pose estimation is a practically challenging task for computer vision. Although numerous excellent semi-supervised classification methods have emerged, these methods typically use confidence to evaluate the quality of pseudo-labels, which is difficult to achieve in pose estimation tasks. For example, in pose estimation, confidence represents only the possibility that a position of the heatmap is a keypoint, not the quality of that prediction. In this paper, we propose a simple yet efficient framework to estimate the quality of pseudo-labels in semi-supervised pose estimation tasks from the perspective of modeling the uncertainty of the pseudo-labels. Concretely, under the dual mean-teacher framework, we construct the two maximum discrepant students (MDSs) to effectively push two teachers to generate different decision boundaries for the same sample. Moreover, we create multiple uncertainties to assess the quality of the pseudo-labels. Experimental results demonstrate that our method improves the performance of semi-supervised pose estimation on three datasets.

Submitted 3 November, 2023; originally announced November 2023.

arXiv:2311.01266 [pdf, other]

Let's Discover More API Relations: A Large Language Model-based AI Chain for Unsupervised API Relation Inference

Authors: Qing Huang, Yanbang Sun, Zhenchang Xing, Yuanlong Cao, Jieshan Chen, Xiwei Xu, Huan Jin, Jiaxing Lu

Abstract: APIs have intricate relations that can be described in text and represented as knowledge graphs to aid software engineering tasks. Existing relation extraction methods have limitations, such as a limited API text corpus and sensitivity to the characteristics of the input text. To address these limitations, we propose utilizing large language models (LLMs) (e.g., GPT-3.5) as a neural knowledge base for API relation inference. This approach leverages the entire Web used to pre-train LLMs as a knowledge base and is insensitive to the context and complexity of input texts. To ensure accurate inference, we design our analytic flow as an AI Chain with three AI modules: API FQN Parser, API Knowledge Extractor, and API Relation Decider. The accuracies of the API FQN Parser and API Relation Decider modules are 0.81 and 0.83, respectively. Using the generative capacity of the LLM and our approach's inference capability, we achieve an average F1 value of 0.76 under the three datasets, significantly higher than the state-of-the-art method's average F1 value of 0.40. Compared to a CoT-based method, our AI Chain design improves the inference reliability by 67%, and the AI-crowd-intelligence strategy enhances the robustness of our approach by 26%.

Submitted 2 November, 2023; originally announced November 2023.

arXiv:2311.01076 [pdf, other]

Search for a muonphilic scalar $X_{0}$ or vector $X_{1}$ via $J/ψ\toμ^+μ^-+\rm{invisible}$ decays at BESIII

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (608 additional authors not shown)

Abstract: A light scalar $X_{0}$ or vector $X_{1}$ particle has been introduced as a possible explanation for the $(g-2)_μ$ anomaly and dark matter phenomena. Using $(8.998\pm 0.039)\times10^9$ $J/ψ$ events collected by the BESIII detector, we search for a light muonphilic scalar $X_{0}$ or vector $X_{1}$ in the processes $J/ψ\toμ^+μ^- X_{0,1}$ with $X_{0,1}$ invisible decays. No obvious signal is found, and the upper limits on the coupling $g_{0,1}'$ between the muon and the $X_{0,1}$ particles are set to be between $1.1\times10^{-3}$ and $1.0\times10^{-2}$ for the $X_{0,1}$ mass in the range of $1<M(X_{0,1})<1000$ MeV$/c^2$ at 90$\%$ confidence level.

Submitted 18 February, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

Comments: 9 pages, 7 figures

arXiv:2310.18021 [pdf, other]

FormalGeo: An Extensible Formalized Framework for Olympiad Geometric Problem Solving

Authors: Xiaokai Zhang, Na Zhu, Yiming He, Jia Zou, Qike Huang, Xiaoxiao Jin, Yanjun Guo, Chenyang Mao, Yang Li, Zhe Zhu, Dengfeng Yue, Fangzhen Zhu, Yifan Wang, Yiwen Huang, Runan Wang, Cheng Qin, Zhenbing Zeng, Shaorong Xie, Xiangfeng Luo, Tuo Leng

Abstract: This is the first paper in a series of works we have accomplished over the past three years. In this paper, we have constructed a consistent formal plane geometry system. This will serve as a crucial bridge between IMO-level plane geometry challenges and readable AI automated reasoning. Within this formal framework, we have been able to seamlessly integrate modern AI models with our formal system. AI is now capable of providing deductive reasoning solutions to IMO-level plane geometry problems, just like handling other natural languages, and these proofs are readable, traceable, and verifiable. We propose the geometry formalization theory (GFT) to guide the development of the geometry formal system. Based on the GFT, we have established the FormalGeo, which consists of 88 geometric predicates and 196 theorems. It can represent, validate, and solve IMO-level geometry problems. We have also crafted the FGPS (formal geometry problem solver) in Python. It serves as both an interactive assistant for verifying problem-solving processes and an automated problem solver. We've annotated the formalgeo7k and formalgeo-imo datasets. The former contains 6,981 (expanded to 133,818 through data augmentation) geometry problems, while the latter includes 18 (expanded to 2,627 and continuously increasing) IMO-level challenging geometry problems. All annotated problems include detailed formal language descriptions and solutions. Implementation of the formal system and experiments validate the correctness and utility of the GFT. The backward depth-first search method only yields a 2.42% problem-solving failure rate, and we can incorporate deep learning techniques to achieve a lower one. The source code of FGPS and datasets are available at https://github.com/BitSecret/FGPS.

Submitted 14 February, 2024; v1 submitted 27 October, 2023; originally announced October 2023.

Comments: 44 pages

arXiv:2310.17937 [pdf, ps, other]

doi 10.1103/PhysRevLett.132.151901

Observation of the Anomalous Shape of $X(1840)$ in $J/ψ\rightarrow γ3(π^+ π^-)$ Indicating a Second Resonance Near $p\bar{p}$ Threshold

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (604 additional authors not shown)

Abstract: Using a sample of $(10087\pm44)\times 10^6$ $J/ψ$ events, which is about 45 times larger than that previously analyzed, a further investigation on the $J/ψ\rightarrow γ3(π^+π^-)$ decay is performed. A significant distortion at 1.84 GeV/$c^2$ in the line-shape of the $3(π^+π^-)$ invariant mass spectrum is observed for the first time, which could be resolved by two overlapping resonant structures, $X(1840)$ and $X(1880)$. The new state $X(1880)$ is observed with a statistical significance larger than $10σ$. The mass and width of $X(1880)$ are determined to be $1882.1\pm1.7\pm0.7$ MeV/$c^2$ and $30.7\pm5.5 \pm2.4$ MeV, respectively, which indicates the existence of a $p\bar{p}$ bound state.

Submitted 15 April, 2024; v1 submitted 27 October, 2023; originally announced October 2023.

Journal ref: Phys. Rev. Lett. 132, 151901 (2024)

arXiv:2310.17331 [pdf]

A novel solution for seepage problems using physics-informed neural networks

Authors: Tianfu Luo, Yelin Feng, Qingfu Huang, Zongliang Zhang, Mingjiao Yan, Zaihong Yang, Dawei Zheng, Yang Yang

Abstract: A Physics-Informed Neural Network (PINN) provides a distinct advantage by synergizing neural networks' capabilities with the problem's governing physical laws. In this study, we introduce an innovative approach for solving seepage problems by utilizing the PINN, harnessing the capabilities of Deep Neural Networks (DNNs) to approximate hydraulic head distributions in seepage analysis. To effectively train the PINN model, we introduce a comprehensive loss function comprising three components: one for evaluating differential operators, another for assessing boundary conditions, and a third for appraising initial conditions. The validation of the PINN involves solving four benchmark seepage problems. The results unequivocally demonstrate the exceptional accuracy of the PINN in solving seepage problems, surpassing the accuracy of FEM in addressing both steady-state and free-surface seepage problems. Hence, the presented approach highlights the robustness of the PINN and underscores its precision in effectively addressing a spectrum of seepage challenges. This amalgamation enables the derivation of accurate solutions, overcoming limitations inherent in conventional methods such as mesh generation and adaptability to complex geometries.
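The three-component loss described in this abstract follows the usual PINN pattern, which can be sketched generically as below. The notation is an assumption made for illustration: $h_θ$ is the network's hydraulic-head approximation, $\mathcal{N}$ a generic governing differential operator, $g$ and $h_0$ the boundary and initial data, and $λ_r$, $λ_b$, $λ_0$ hypothetical weights. The abstract does not specify the operator, the sampling, or any weighting.

```latex
% Generic PINN loss; all symbols are illustrative assumptions.
\mathcal{L}(\theta) =
    \lambda_r \, \frac{1}{N_r} \sum_{i=1}^{N_r}
        \bigl| \mathcal{N}[h_\theta](x_i, t_i) \bigr|^2
  + \lambda_b \, \frac{1}{N_b} \sum_{j=1}^{N_b}
        \bigl| h_\theta(x_j, t_j) - g(x_j, t_j) \bigr|^2
  + \lambda_0 \, \frac{1}{N_0} \sum_{k=1}^{N_0}
        \bigl| h_\theta(x_k, 0) - h_0(x_k) \bigr|^2
```

The first term penalizes the residual of the governing equation at interior collocation points, the second the mismatch on the boundary, and the third the mismatch at the initial time. For a steady-state problem the third term would simply be absent.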

Submitted 25 November, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

arXiv:2310.17082 [pdf, ps, other]

Does or did the supernova remnant Cassiopeia A operate as a PeVatron?

Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

Abstract: For decades, supernova remnants (SNRs) have been considered the prime sources of Galactic Cosmic rays (CRs). But whether SNRs can accelerate CR protons to PeV energies and thus dominate CR flux up to the knee is currently under intensive theoretical and phenomenological debate. The direct test of the ability of SNRs to operate as CR PeVatrons can be provided by ultrahigh-energy (UHE; $E_γ\geq 100$ TeV) $γ$-rays. In this context, the historical SNR Cassiopeia A (Cas A) is considered one of the most promising targets for UHE observations. This paper presents the observation of Cas A and its vicinity by the LHAASO KM2A detector. The exceptional sensitivity of LHAASO KM2A in the UHE band, combined with the young age of Cas A, enabled us to derive stringent model-independent limits on the energy budget of UHE protons and nuclei accelerated by Cas A at any epoch after the explosion. The results challenge the prevailing paradigm that Cas A-type SNRs are major suppliers of PeV CRs in the Milky Way.

Submitted 25 October, 2023; originally announced October 2023.

Comments: 11 pages, 3 figures, accepted by ApJL

arXiv:2310.16512 [pdf, other]

doi 10.3847/1538-4357/acee6e

Joint Constraints on the Hubble Constant, Spatial Curvature, and Sound Horizon from the Late-time Universe with Cosmography

Authors: Kaituo Zhang, Tianyao Zhou, Bing Xu, Qihong Huang, Yangsheng Yuan

Abstract: In this paper, using the latest Pantheon+ sample of Type Ia supernovae (SNe Ia), Baryon Acoustic Oscillation (BAO) measurements, and observational Hubble data (OHD), we carry out a joint constraint on the Hubble constant $H_0$, the spatial curvature $Ω_{\rm K}$, and the sound horizon at the end of drag epoch $r_{\rm d}$. To be model-independent, four cosmography models, i.e., the Taylor series in terms of redshift $y_1=z/(1+z)$, $y_2=\arctan(z)$, $y_3=\ln(1+z)$, and the Padé approximants, are used without the assumption of flat Universe. The results show that $H_0$ is anti-correlated with $Ω_{\rm K}$ and $r_{\rm d}$, indicating smaller $Ω_{\rm K}$ or $r_{\rm d}$ would be helpful in alleviating the Hubble tension. The values of $H_0$ and $r_{\rm d}$ are consistent with the estimate derived from the Planck Cosmic Microwave Background (CMB) data based on the flat $Λ$CDM model, but $H_0$ is in 2.3$\sim$3.0$σ$ tension with that obtained by Riess et al. (2022) in all these cosmographic approaches. Meanwhile, a flat Universe is preferred by the present observations under all approximations except the third order of $y_1$ and $y_2$ of the Taylor series. Furthermore, according to the values of the Bayesian evidence, we found that the flat $Λ$CDM remains the most favored model by the joint datasets, and the Padé approximant of order (2,2), the third order of $y_3$ and $y_1$ are the top three cosmographic expansions that fit the datasets best, while the Taylor series in terms of $y_2$ are essentially ruled out.
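The three redshift reparametrizations named in this abstract are simple closed-form maps, and a short sketch makes their behavior concrete. The function names below are illustrative assumptions; only the formulas $y_1=z/(1+z)$, $y_2=\arctan(z)$, and $y_3=\ln(1+z)$ come from the abstract.

```python
import math


def y1(z):
    """y1 = z / (1 + z); approaches 1 as z grows."""
    return z / (1.0 + z)


def y2(z):
    """y2 = arctan(z); approaches pi/2 as z grows."""
    return math.atan(z)


def y3(z):
    """y3 = ln(1 + z); unbounded, but grows only logarithmically."""
    return math.log(1.0 + z)


# Tabulate the three variables at a few redshifts.
table = {z: (y1(z), y2(z), y3(z)) for z in (0.0, 0.5, 1.0, 2.0)}
```

All three variables vanish at $z=0$ and grow more slowly than $z$ itself, which is the usual motivation for expanding in them: a Taylor series in $z$ tends to behave poorly once $z$ exceeds 1, whereas these variables stay small over the redshift range of the data.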

Submitted 23 November, 2023; v1 submitted 25 October, 2023; originally announced October 2023.

Comments: 18 pages, 4 figures

Journal ref: ApJ, 957, 5 (2023)
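
The abstract above names three auxiliary redshift variables used for the Taylor-series expansions. As a rough illustration only, the sketch below evaluates them at a few redshifts; the function names and the sample redshift values are assumptions chosen for this example and are not taken from the paper.

```python
import math

# Auxiliary redshift variables named in the abstract.
# Function names are illustrative, not the authors' code.

def y1(z: float) -> float:
    """y1 = z / (1 + z); stays below 1 for all z >= 0."""
    return z / (1.0 + z)

def y2(z: float) -> float:
    """y2 = arctan(z); stays below pi/2 for all z >= 0."""
    return math.atan(z)

def y3(z: float) -> float:
    """y3 = ln(1 + z); grows without bound, but only logarithmically."""
    return math.log1p(z)

# Sample redshifts picked arbitrarily for the demonstration.
for z in (0.1, 1.0, 2.3):
    print(f"z={z:>4}: y1={y1(z):.4f}  y2={y2(z):.4f}  y3={y3(z):.4f}")
```

All three variables grow more slowly than $z$ itself at high redshift, which is the usual motivation for expanding in them rather than in $z$ directly.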

arXiv:2310.16358 [pdf, other]

From Simple to Complex: A Progressive Framework for Document-level Informative Argument Extraction

Authors: Quzhe Huang, Yanxi Zhang, Dongyan Zhao

Abstract: Document-level Event Argument Extraction (EAE) requires the model to extract arguments of multiple events from a single document. Considering the underlying dependencies between these events, recent efforts leverage the idea of "memory", where the results of already predicted events are cached and can be retrieved to help the prediction of upcoming events. These methods extract events in their order of appearance in the document; however, an event that appears in the first sentence is not necessarily the easiest to extract. Existing methods might introduce noise to the extraction of upcoming events if they rely on an incorrect prediction of previous events. In order to provide more reliable memory, we propose a simple-to-complex progressive framework for document-level EAE. Specifically, we first calculate the difficulty of each event and then conduct the extraction following a simple-to-complex order. In this way, the memory will store the most certain results, and the model could use these reliable sources to help the prediction of more difficult events. Experiments on WikiEvents show that our model outperforms SOTA by 1.4% in F1, indicating the proposed simple-to-complex framework is useful in the EAE task.

Submitted 25 October, 2023; originally announced October 2023.

Comments: Accepted to the Findings of EMNLP 2023 (Long Paper)

arXiv:2310.15601 [pdf, ps, other]

doi 10.1103/PhysRevD.109.032011

Study of the doubly Cabibbo-suppressed decays $D^+_s\to K^+K^+π^-$ and $D^+_s\to K^+K^+π^-π^0$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko , et al. (604 additional authors not shown)

Abstract: Based on 7.33 fb$^{-1}$ of $e^+e^-$ collision data collected at center-of-mass energies between 4.128 and 4.226 GeV with the BESIII detector, the experimental studies of the doubly Cabibbo-suppressed decays $D^+_s\to K^+K^+π^-$ and $D^+_s\to K^+K^+π^-π^0$ are reported. We determine the absolute branching fraction of $D^+_s\to K^+K^+π^-$ to be (${1.23^{+0.28}_{-0.25}}({\rm stat})\pm0.06({\rm syst})$) $\times 10^{-4}$. No significant signal of $D^+_s\to K^+K^+π^-π^0$ is observed and the upper limit on its decay branching fraction at 90\% confidence level is set to be $1.7\times10^{-4}$.

Submitted 24 October, 2023; originally announced October 2023.

Comments: 10 pages, 4 figures, 4 tables

Report number: BAM-00695

Journal ref: Phys. Rev. D 109, 032011 (2024)

arXiv:2310.14585 [pdf, other]

doi 10.1103/PhysRevD.108.092011

Observation of the $ψ(3686)$ decays into $Σ^{+}\barΣ^{-}ω$ and $Σ^{+}\barΣ^{-}φ$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (604 additional authors not shown)

Abstract: Based on $(27.08\pm 0.14)\times10^{8}$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII collider, the $ψ(3686)\toΣ^{+}\barΣ^{-}ω$ and $Σ^{+}\barΣ^{-}φ$ decays are observed for the first time with statistical significances of 13.8$σ$ and 7.6$σ$, respectively. The corresponding branching fractions are measured to be $\mathcal{B}(ψ(3686)\toΣ^{+}\barΣ^{-}ω)=(1.90 \pm 0.18 \pm 0.21) \times 10^{-5}$ and $\mathcal{B}(ψ(3686)\toΣ^{+}\barΣ^{-}φ)=(2.96 \pm 0.54 \pm 0.41) \times 10^{-6}$, where the first uncertainties are statistical and the second systematic.

Submitted 23 October, 2023; originally announced October 2023.

Comments: 10 pages

Journal ref: Phys. Rev. D 108, 092011 (2023)

arXiv:2310.14016 [pdf, other]

SwG-former: A Sliding-Window Graph Convolutional Network for Simultaneous Spatial-Temporal Information Extraction in Sound Event Localization and Detection

Authors: Weiming Huang, Qinghua Huang, Liyan Ma, Chuan Wang

Abstract: Sound event localization and detection (SELD) involves sound event detection (SED) and direction of arrival (DoA) estimation tasks. SED mainly relies on temporal dependencies to distinguish different sound classes, while DoA estimation depends on spatial correlations to estimate source directions. This paper addresses the need to simultaneously extract spatial-temporal information in audio signals to improve SELD performance. A novel block, the sliding-window graph-former (SwG-former), is designed to learn temporal context information of sound events based on their spatial correlations. The SwG-former block transforms audio signals into a graph representation and constructs graph vertices to capture higher abstraction levels for spatial correlations. It uses different-sized sliding windows to adapt to various sound event durations and aggregates temporal features with similar spatial information while incorporating multi-head self-attention (MHSA) to model global information. Furthermore, as the cornerstone of message passing, a robust Conv2dAgg function is proposed and embedded into the block to aggregate the features of neighbor vertices. As a result, a SwG-former model, which stacks the SwG-former blocks, demonstrates superior performance compared to recent advanced SELD models. The SwG-former block is also integrated into the event-independent network version 2 (EINV2), called SwG-EINV2, which surpasses the state-of-the-art (SOTA) methods under the same acoustic environment.

Submitted 20 March, 2024; v1 submitted 21 October, 2023; originally announced October 2023.

arXiv:2310.13375 [pdf, other]

An Improved Artificial Fish Swarm Algorithm for Solving the Problem of Investigation Path Planning

Authors: Qian Huang, Weiwen Qian, Chang Li, Xuan Ding

Abstract: Informationization is a prevailing trend in today's world. The increasing demand for information in decision-making processes poses significant challenges for investigation activities, particularly in terms of effectively allocating limited resources to plan investigation programs. This paper addresses the investigation path planning problem by formulating it as a multi-traveling salesman problem (MTSP). Our objective is to minimize costs, and to achieve this, we propose a chaotic artificial fish swarm algorithm based on multiple population differential evolution (DE-CAFSA). To overcome the limitations of the artificial fish swarm algorithm, such as low optimization accuracy and the inability to consider global and local information, we incorporate adaptive field of view and step size adjustments, replace random behavior with the 2-opt operation, and introduce chaos theory and sub-optimal solutions to enhance optimization accuracy and search performance. Additionally, we integrate the differential evolution algorithm to create a hybrid algorithm that leverages the complementary advantages of both approaches. Experimental results demonstrate that DE-CAFSA outperforms other algorithms on various public datasets of different sizes and also performs well on the examples proposed in this study.

Submitted 20 October, 2023; originally announced October 2023.

Comments: 25 pages

MSC Class: 68W50 ACM Class: F.2.2; I.m
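
The abstract above mentions replacing random behavior with the 2-opt operation. For readers unfamiliar with it, the following is a minimal sketch of generic 2-opt local search on a single closed tour. It is not the authors' DE-CAFSA and does not handle the multi-salesman case; the function names and the four-point example are assumptions made purely for illustration.

```python
import math

def tour_length(tour, points):
    """Length of the closed tour that visits points in the given order."""
    return sum(
        math.dist(points[tour[i]], points[tour[(i + 1) % len(tour)]])
        for i in range(len(tour))
    )

def two_opt(tour, points):
    """Reverse tour segments for as long as a reversal shortens the tour."""
    best = list(tour)
    best_len = tour_length(best, points)
    improved = True
    while improved:
        improved = False
        for i in range(1, len(best) - 1):
            for k in range(i + 1, len(best)):
                # A 2-opt move: reverse the segment best[i..k].
                candidate = best[:i] + best[i:k + 1][::-1] + best[k + 1:]
                cand_len = tour_length(candidate, points)
                if cand_len < best_len - 1e-12:
                    best, best_len = candidate, cand_len
                    improved = True
    return best

# Hypothetical example: corners of a unit square visited in a crossing order.
points = [(0, 0), (1, 0), (1, 1), (0, 1)]
crossed = [0, 2, 1, 3]
untangled = two_opt(crossed, points)
print(untangled, tour_length(untangled, points))
```

Each accepted move removes two edges and reconnects the tour with two shorter ones, so the search stops at a tour that no single segment reversal can improve.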

arXiv:2310.12733 [pdf, other]

Multiscale Motion-Aware and Spatial-Temporal-Channel Contextual Coding Network for Learned Video Compression

Authors: Yiming Wang, Qian Huang, Bin Tang, Huashan Sun, Xing Li

Abstract: Recently, learned video compression has achieved exciting performance. Following the traditional hybrid prediction coding framework, most learned methods generally adopt the motion estimation motion compensation (MEMC) method to remove inter-frame redundancy. However, inaccurate motion vectors (MVs) usually lead to distortion of the reconstructed frame. In addition, most approaches ignore the spatial and channel redundancy. To solve the above problems, we propose a motion-aware and spatial-temporal-channel contextual coding based video compression network (MASTC-VC), which learns the latent representation and uses variational autoencoders (VAEs) to capture the characteristics of intra-frame pixels and inter-frame motion. Specifically, we design a multiscale motion-aware module (MS-MAM) to estimate spatial-temporal-channel consistent motion vectors by utilizing the multiscale motion prediction information in a coarse-to-fine way. On top of it, we further propose a spatial-temporal-channel contextual module (STCCM), which explores the correlation of latent representation to reduce the bit consumption from spatial, temporal and channel aspects respectively. Comprehensive experiments show that our proposed MASTC-VC is superior to previous state-of-the-art (SOTA) methods on three public benchmark datasets. More specifically, our method brings average 10.15\% BD-rate savings against H.265/HEVC (HM-16.20) in PSNR metric and average 23.93\% BD-rate savings against H.266/VVC (VTM-13.2) in MS-SSIM metric.

Submitted 19 October, 2023; originally announced October 2023.

Comments: 12 pages, 12 figures

arXiv:2310.11238 [pdf, other]

doi 10.1103/PhysRevD.109.084045

Search for Non-Tensorial Gravitational-Wave Backgrounds in the NANOGrav 15-Year Data Set

Authors: Zu-Cheng Chen, Yu-Mei Wu, Yan-Chen Bi, Qing-Guo Huang

Abstract: The recent detection of a stochastic signal in the NANOGrav 15-year data set has aroused great interest in uncovering its origin. However, the evidence for the Hellings-Downs correlations, a key signature of the gravitational-wave background (GWB) predicted by general relativity, remains inconclusive. In this letter, we search for an isotropic non-tensorial GWB, allowed by general metric theories of gravity, in the NANOGrav 15-year data set. Our analysis reveals a Bayes factor of approximately 2.5, comparing the quadrupolar (tensor transverse, TT) correlations to the scalar transverse (ST) correlations, suggesting that the ST correlations provide a comparable explanation for the observed stochastic signal in the NANOGrav data. We obtain the median and the $90\%$ equal-tail amplitudes as $\mathcal{A}_\mathrm{ST} = 7.8^{+5.1}_{-3.5} \times 10^{-15}$ at the frequency of 1/year. Furthermore, we find that the vector longitudinal (VL) and scalar longitudinal (SL) correlations are weakly and strongly disfavoured by data, respectively, yielding upper limits on the amplitudes: $\mathcal{A}_\mathrm{VL}^{95\%} \lesssim 1.7 \times 10^{-15}$ and $\mathcal{A}_\mathrm{SL}^{95\%} \lesssim 7.4 \times 10^{-17}$. Lastly, we fit the NANOGrav data with the general transverse (GT) correlations parameterized by a free parameter $α$. Our analysis yields $α=1.74^{+1.18}_{-1.41}$, thus excluding both the TT ($α=3$) and ST ($α=0$) models at the $90\%$ confidence level.

Submitted 6 May, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

Comments: 9 pages, 4 figures, 3 tables; match the published version

Journal ref: Phys.Rev.D 109 (2024) 8, 084045

arXiv:2310.10815 [pdf, ps, other]

Streaming Algorithms for Graph k-Matching with Optimal or Near-Optimal Update Time

Authors: Jianer Chen, Qin Huang, Iyad Kanj, Qian Li, Ge Xia

Abstract: We present streaming algorithms for the graph $k$-matching problem in both the insert-only and dynamic models. Our algorithms, with space complexity matching the best upper bounds, have optimal or near-optimal update time, significantly improving on previous results. More specifically, for the insert-only streaming model, we present a one-pass algorithm with optimal space complexity $O(k^2)$ and optimal update time $O(1)$, that with high probability computes a maximum weighted $k$-matching of a given weighted graph. The update time of our algorithm significantly improves the previous upper bound of $O(\log k)$, which was derived only for $k$-matching on unweighted graphs. For the dynamic streaming model, we present a one-pass algorithm that with high probability computes a maximum weighted $k$-matching in $O(Wk^2 \cdot \mbox{polylog}(n))$ space and with $O(\mbox{polylog}(n))$ update time, where $W$ is the number of distinct edge weights. Again the update time of our algorithm improves the previous upper bound of $O(k^2 \cdot \mbox{polylog}(n))$. This algorithm, when applied to unweighted graphs, gives a streaming algorithm on the dynamic model whose space and update time complexities are both near-optimal. Our results also imply a streaming approximation algorithm for maximum weighted $k$-matching whose space complexity matches the best known upper bound with a significantly improved update time.

Submitted 16 October, 2023; originally announced October 2023.

arXiv:2310.10547 [pdf, other]

InfoGCN++: Learning Representation by Predicting the Future for Online Human Skeleton-based Action Recognition

Authors: Seunggeun Chi, Hyung-gun Chi, Qixing Huang, Karthik Ramani

Abstract: Skeleton-based action recognition has made significant advancements recently, with models like InfoGCN showcasing remarkable accuracy. However, these models exhibit a key limitation: they necessitate complete action observation prior to classification, which constrains their applicability in real-time situations such as surveillance and robotic systems. To overcome this barrier, we introduce InfoGCN++, an innovative extension of InfoGCN, explicitly developed for online skeleton-based action recognition. InfoGCN++ augments the abilities of the original InfoGCN model by allowing real-time categorization of action types, independent of the observation sequence's length. It transcends conventional approaches by learning from current and anticipated future movements, thereby creating a more thorough representation of the entire sequence. We treat prediction as an extrapolation problem grounded in observed actions. To enable this, InfoGCN++ incorporates Neural Ordinary Differential Equations, a concept that lets it effectively model the continuous evolution of hidden states. Following rigorous evaluations on three skeleton-based action recognition benchmarks, InfoGCN++ demonstrates exceptional performance in online action recognition. It consistently equals or exceeds existing techniques, highlighting its significant potential to reshape the landscape of real-time action recognition applications. Consequently, this work represents a major leap forward from InfoGCN, pushing the limits of what's possible in online, skeleton-based action recognition. The code for InfoGCN++ is publicly available at https://github.com/stnoah1/infogcn2 for further exploration and validation.

Submitted 16 October, 2023; originally announced October 2023.

Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

arXiv:2310.10452 [pdf, other]

doi 10.1103/PhysRevD.108.L111101

Measurement of the cross sections for $e^+e^-\toηπ^+π^-$ at center-of-mass energies between 2.00 and 3.08 GeV

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (605 additional authors not shown)

Abstract: Using data samples collected at center-of-mass energies between 2.000 and 3.080 GeV with the BESIII detector operating at the BEPCII collider, a partial-wave analysis is performed on the process $e^+e^-\toηπ^+π^-$. In addition to the dominant $e^+e^-\toρη$ component, the $e^+e^-\to a_2(1320)π$ process is also sizeable, contributing up to 24% of the total reaction. The measured cross sections of the process $e^+e^-\toηπ^+π^-$ are systematically higher than those of BaBar by more than $3σ$ at center-of-mass energies between 2.000 and 2.300 GeV. In the cross section lineshape for $e^+e^-\to a_2(1320)π$, a resonant structure is observed with a significance of $5.5σ$, with $M=(2044\pm31\pm4)$ MeV/$c^2$, $Γ=(163\pm69\pm24)$ MeV and $\mathcal{B_{R}}\cdotΓ_{e^+e^-}^{R}=(34.6\pm17.1\pm6.0)$ eV or $(137.1\pm73.3\pm2.1)$ eV. In the cross section lineshape for $e^+e^-\toρη$, evidence of a dip structure around 2180 MeV/$c^2$ is observed with a statistical significance of $3.0σ$.

Submitted 28 November, 2023; v1 submitted 16 October, 2023; originally announced October 2023.

Journal ref: PHYS. REV. D 108, L111101 (2023)

arXiv:2310.08872 [pdf, other]

R&B: Region and Boundary Aware Zero-shot Grounded Text-to-image Generation

Authors: Jiayu Xiao, Henglei Lv, Liang Li, Shuhui Wang, Qingming Huang

Abstract: Recent text-to-image (T2I) diffusion models have achieved remarkable progress in generating high-quality images given text-prompts as input. However, these models fail to convey appropriate spatial composition specified by a layout instruction. In this work, we probe into zero-shot grounded T2I generation with diffusion models, that is, generating images corresponding to the input layout information without training auxiliary modules or finetuning diffusion models. We propose a Region and Boundary (R&B) aware cross-attention guidance approach that gradually modulates the attention maps of the diffusion model during the generative process, and assists the model to synthesize images (1) with high fidelity, (2) highly compatible with textual input, and (3) interpreting layout instructions accurately. Specifically, we leverage the discrete sampling to bridge the gap between consecutive attention maps and discrete layout constraints, and design a region-aware loss to refine the generative layout during the diffusion process. We further propose a boundary-aware loss to strengthen object discriminability within the corresponding regions. Experimental results show that our method outperforms existing state-of-the-art zero-shot grounded T2I generation methods by a large margin both qualitatively and quantitatively on several benchmarks.

Submitted 27 November, 2023; v1 submitted 13 October, 2023; originally announced October 2023.

Comments: Preprint. Under review. Project page: https://sagileo.github.io/Region-and-Boundary

arXiv:2310.08845 [pdf, other]

doi 10.1126/sciadv.adj2778

Very high energy gamma-ray emission beyond 10 TeV from GRB 221009A

Authors: Zhen Cao, F. Aharonian, Q. An, A. Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

Abstract: The highest energy gamma-rays from gamma-ray bursts (GRBs) have important implications for their radiation mechanism. Here we report for the first time the detection of gamma-rays up to 13 TeV from the brightest GRB 221009A by the Large High Altitude Air-shower Observatory (LHAASO). The LHAASO-KM2A detector registered more than 140 gamma-rays with energies above 3 TeV during 230$-$900 s after the trigger. The intrinsic energy spectrum of gamma-rays can be described by a power-law after correcting for extragalactic background light (EBL) absorption. Such a hard spectrum challenges the synchrotron self-Compton (SSC) scenario of relativistic electrons for the afterglow emission above several TeV. Observations of gamma-rays up to 13 TeV from a source with a measured redshift of z=0.151 hint at more transparency in intergalactic space than previously expected. Alternatively, one may invoke new physics such as Lorentz Invariance Violation (LIV) or an axion origin of very high energy (VHE) signals.

Submitted 22 November, 2023; v1 submitted 13 October, 2023; originally announced October 2023.

Comments: 49 pages, 11 figures

Journal ref: Science Advances, 9, eadj2778 (2023) 15 November 2023

arXiv:2310.08366 [pdf, other]

Constraints on the velocity of gravitational waves from NANOGrav 15-year data set

Authors: Yan-Chen Bi, Yu-Mei Wu, Zu-Cheng Chen, Qing-Guo Huang

Abstract: General relativity predicts that gravitational waves propagate at the speed of light. Although ground-based gravitational-wave detectors have successfully constrained the velocity of gravitational waves in the high-frequency range, extending this constraint to the lower frequency range remains a challenge. In this work, we utilize the deviations in the overlap reduction function for a gravitational-wave background within pulsar timing arrays to investigate the velocity of gravitational waves in the nanohertz frequency band. By analyzing the NANOGrav 15-year data set, we obtain a well-constrained lower bound on the velocity of gravitational waves of $v \gtrsim 0.87\,c$, where $c$ is the speed of light.

Submitted 12 October, 2023; originally announced October 2023.

Comments: 6 pages, 1 figure, 1 table

arXiv:2310.08358 [pdf, other]

Towards Demystifying the Generalization Behaviors When Neural Collapse Emerges

Authors: Peifeng Gao, Qianqian Xu, Yibo Yang, Peisong Wen, Huiyang Shao, Zhiyong Yang, Bernard Ghanem, Qingming Huang

Abstract: Neural Collapse (NC) is a well-known phenomenon of deep neural networks in the terminal phase of training (TPT). It is characterized by the collapse of features and classifier into a symmetrical structure, known as simplex equiangular tight frame (ETF). While there have been extensive studies on optimization characteristics showing the global optimality of neural collapse, little research has been done on the generalization behaviors during the occurrence of NC. In particular, the important phenomenon of generalization improvement during TPT has remained an empirical observation lacking a rigorous theoretical explanation. In this paper, we establish the connection between the minimization of CE and a multi-class SVM during TPT, and then derive a multi-class margin generalization bound, which provides a theoretical explanation for why continuing training can still lead to accuracy improvement on the test set, even after the training accuracy has reached 100%. Additionally, our further theoretical results indicate that different alignment between labels and features in a simplex ETF can result in varying degrees of generalization improvement, despite all models reaching NC and demonstrating similar optimization performance on the training set. We refer to this newly discovered property as "non-conservative generalization". In experiments, we also provide empirical observations to verify the indications suggested by our theoretical results.

Submitted 12 October, 2023; originally announced October 2023.

Comments: 20 pages, 6 figures. arXiv admin note: substantial text overlap with arXiv:2304.08914

arXiv:2310.08148 [pdf, other]

Open-Set Knowledge-Based Visual Question Answering with Inference Paths

Authors: Jingru Gan, Xinzhe Han, Shuhui Wang, Qingming Huang

Abstract: Given an image and an associated textual question, the purpose of Knowledge-Based Visual Question Answering (KB-VQA) is to provide a correct answer to the question with the aid of external knowledge bases. Prior KB-VQA models are usually formulated as a retriever-classifier framework, where a pre-trained retriever extracts textual or visual information from knowledge graphs and then makes a prediction among the candidates. Despite promising progress, there are two drawbacks with existing models. Firstly, modeling question-answering as multi-class classification limits the answer space to a preset corpus and lacks the ability of flexible reasoning. Secondly, the classifier merely considers "what is the answer" without "how to get the answer", which cannot ground the answer to explicit reasoning paths. In this paper, we confront the challenge of \emph{explainable open-set} KB-VQA, where the system is required to answer questions with entities in the wild and retain an explainable reasoning path. To resolve the aforementioned issues, we propose a new retriever-ranker paradigm of KB-VQA, Graph pATH rankER (GATHER for brevity). Specifically, it contains graph constructing, pruning, and path-level ranking, which not only retrieves accurate answers but also provides inference paths that explain the reasoning process. To comprehensively evaluate our model, we reformulate the benchmark dataset OK-VQA with manually corrected entity-level annotations and release it as ConceptVQA. Extensive experiments on real-world questions demonstrate that our framework is not only able to perform open-set question answering across the whole knowledge base but also provides explicit reasoning paths.

Submitted 12 October, 2023; originally announced October 2023.

arXiv:2310.08095 [pdf, other]

Multi-Satellite Cooperative Networks: Joint Hybrid Beamforming and User Scheduling Design

Authors: Xuan Zhang, Shu Sun, Meixia Tao, Qin Huang, Xiaohu Tang

Abstract: In this paper, we consider a cooperative communication network where multiple low-Earth-orbit (LEO) satellites provide services to multiple ground users (GUs) cooperatively at the same time and on the same frequency. The multi-satellite cooperation has great potential in extending communication coverage and increasing spectral efficiency. Considering that the on-board radio-frequency circuit resources and computation resources on each satellite are restricted, we aim to propose a low-complexity yet efficient multi-satellite cooperative transmission framework. Specifically, we first propose a hybrid beamforming method consisting of analog beamforming for beam alignment and digital beamforming for interference mitigation. Then, to establish appropriate connections between the satellites and GUs, we propose a heuristic user scheduling algorithm which determines the connections according to the total spectral efficiency increment of the multi-satellite cooperative network. Next, considering the intrinsic connection between beamforming and user scheduling, a joint hybrid beamforming and user scheduling (JHU) scheme is proposed to dramatically improve the performance of the multi-satellite cooperative network. In addition to the single-connection scenario, we also consider the multi-connection case using the JHU scheme. Extensive simulations conducted over different LEO satellite constellations and across various GU locations demonstrate the superiority of the proposed schemes in both overall and per-user spectral efficiencies.

Submitted 27 December, 2023; v1 submitted 12 October, 2023; originally announced October 2023.

Comments: 14 pages, 13 figures. arXiv admin note: substantial text overlap with arXiv:2301.03888

arXiv:2310.07469 [pdf, other]

Constraining the Graviton Mass with the NANOGrav 15-Year Data Set

Authors: Yu-Mei Wu, Zu-Cheng Chen, Yan-Chen Bi, Qing-Guo Huang

Abstract: The stochastic signal recently detected by several pulsar timing array collaborations offers an opportunity to scrutinize the fundamental properties of gravity, including the potential mass of the graviton. In this study, we analyze the NANOGrav 15-year data set to search for a stochastic gravitational wave background with modified Hellings-Downs correlations predicted by massive gravity. While the Bayesian analysis comparing massive gravity to massless gravity within the effective searchable mass range of $m_g\in [3\times 10^{-25}, 8 \times 10^{-24}]\,\rm{eV}/c^2$ does not yield an explicit upper bound as all the Bayes factors are smaller than $3$, the combined consideration of the minimum frequency inherent in massive gravity and the observed spectrum leads to an upper limit of $m_g<8.2\times 10^{-24}\,\rm{eV}/c^2$.

Submitted 11 October, 2023; originally announced October 2023.

Comments: 6 pages, 1 figure, 1 table
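
The "minimum frequency inherent in massive gravity" mentioned in the abstract can be illustrated with a short calculation. This is a minimal sketch, assuming the standard massive-graviton dispersion relation $ω^2 = k^2c^2 + (m_g c^2/\hbar)^2$, which implies waves cannot propagate below $f_{\min} = m_g c^2/h$. The function name is hypothetical, and the abstract does not state that the limit was derived this way.

```python
# Planck constant h in eV*s (exact value in the 2019 SI).
PLANCK_EV_S = 4.135667696e-15
# Julian year in seconds, used only to express the period in years.
SECONDS_PER_YEAR = 365.25 * 24 * 3600


def min_frequency_hz(mass_ev):
    """Minimum propagating frequency f_min = m_g c^2 / h for a graviton
    whose rest energy m_g c^2 is given in eV (assumed dispersion relation)."""
    return mass_ev / PLANCK_EV_S


# Upper limit quoted in the abstract: m_g < 8.2e-24 eV/c^2.
f_min = min_frequency_hz(8.2e-24)
period_years = 1.0 / f_min / SECONDS_PER_YEAR
print(f"f_min = {f_min:.2e} Hz, period = {period_years:.1f} yr")
```

Under this assumption the quoted mass limit corresponds to a frequency of about 2 nHz, that is, a period of roughly 16 years.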

arXiv:2310.07277 [pdf, other]

Search for $J/ψ$ weak decays containing $D$ meson

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (600 additional authors not shown)

Abstract: Using a sample of about 10 billion $J/ψ$ events with the BESIII detector, we search for the weak decays of $J/ψ\to \bar{D}^0π^0 + c.c.$, $J/ψ\to \bar{D}^0η+ c.c.$, $J/ψ\to \bar{D}^0ρ^0 + c.c.$, $J/ψ\to D^-π^+ + c.c.$, and $J/ψ\to D^-ρ^+ + c.c.$. Since no significant signal is observed, we set the upper limits of the branching fractions of these decays to be $\mathcal{B}(J/ψ\to \bar{D}^0π^0 + c.c.) < 4.7 \times 10^{-7}$, $\mathcal{B}(J/ψ\to \bar{D}^0η+ c.c.) < 6.8 \times 10^{-7}$, $\mathcal{B}(J/ψ\to \bar{D}^0ρ^0 + c.c.) < 5.2 \times 10^{-7}$, $\mathcal{B}(J/ψ\to D^-π^+ + c.c.) < 7.0 \times 10^{-8}$, and $\mathcal{B}(J/ψ\to D^-ρ^+ + c.c.) < 6.0 \times 10^{-7}$ at the 90% confidence level.

Submitted 11 October, 2023; originally announced October 2023.

Comments: 10 pages, 17 figures

arXiv:2310.05021 [pdf, other]

Toward Intelligent Emergency Control for Large-scale Power Systems: Convergence of Learning, Physics, Computing and Control

Authors: Qiuhua Huang, Renke Huang, Tianzhixi Yin, Sohom Datta, Xueqing Sun, Jason Hou, Jie Tan, Wenhao Yu, Yuan Liu, Xinya Li, Bruce Palmer, Ang Li, Xinda Ke, Marianna Vaiman, Song Wang, Yousu Chen

Abstract: This paper has delved into the pressing need for intelligent emergency control in large-scale power systems, which are experiencing significant transformations and are operating closer to their limits with more uncertainties. Learning-based control methods are promising and have shown effectiveness for intelligent power system control. However, when they are applied to large-scale power systems, there are multifaceted challenges such as scalability, adaptiveness, and security posed by the complex power system landscape, which demand comprehensive solutions. The paper first proposes and instantiates a convergence framework for integrating power systems physics, machine learning, advanced computing, and grid control to realize intelligent grid control at a large scale. Our developed methods and platform based on the convergence framework have been applied to a large (more than 3000 buses) Texas power system, and tested with 56000 scenarios. Our work achieved a 26% reduction in load shedding on average and outperformed existing rule-based control in 99.7% of the test scenarios. The results demonstrated the potential of the proposed convergence framework and DRL-based intelligent control for the future grid.

Submitted 8 October, 2023; originally announced October 2023.

Comments: submitted to PSCC 2024

arXiv:2310.04752 [pdf, other]

A Unified Generalization Analysis of Re-Weighting and Logit-Adjustment for Imbalanced Learning

Authors: Zitai Wang, Qianqian Xu, Zhiyong Yang, Yuan He, Xiaochun Cao, Qingming Huang

Abstract: Real-world datasets are typically imbalanced in the sense that only a few classes have numerous samples, while many classes are associated with only a few samples. As a result, a naïve ERM learning process will be biased towards the majority classes, making it difficult to generalize to the minority classes. To address this issue, one simple but effective approach is to modify the loss function to emphasize the learning on minority classes, such as re-weighting the losses or adjusting the logits via class-dependent terms. However, existing generalization analysis of such losses is still coarse-grained and fragmented, failing to explain some empirical results. To bridge this gap, we propose a novel technique named data-dependent contraction to capture how these modified losses handle different classes. On top of this technique, a fine-grained generalization bound is established for imbalanced learning, which helps reveal the mystery of re-weighting and logit-adjustment in a unified manner. Furthermore, a principled learning algorithm is developed based on the theoretical insights. Finally, the empirical results on benchmark datasets not only validate the theoretical results but also demonstrate the effectiveness of the proposed method.

Submitted 7 October, 2023; originally announced October 2023.
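
The two loss modifications named in the abstract, re-weighting and logit adjustment, can be sketched in a few lines. This is an illustrative sketch of commonly used forms (inverse-prior weights and an additive `tau * log(prior)` term on the logits), not the algorithm proposed in the paper. All function names and the `tau` parameter are assumptions introduced here.

```python
import math


def cross_entropy(logits, label):
    """Softmax cross-entropy for one sample, using a stable log-sum-exp."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_z - logits[label]


def reweighted_loss(logits, label, priors):
    """Re-weighting: scale the loss by the inverse class prior
    (normalised so a uniform prior gives weight 1)."""
    weight = 1.0 / (len(priors) * priors[label])
    return weight * cross_entropy(logits, label)


def logit_adjusted_loss(logits, label, priors, tau=1.0):
    """Logit adjustment: add a class-dependent term tau * log(prior)
    to each logit before taking the cross-entropy."""
    adjusted = [z + tau * math.log(p) for z, p in zip(logits, priors)]
    return cross_entropy(adjusted, label)


# Hypothetical two-class problem: class 0 is the majority, class 1 the minority.
priors = [0.9, 0.1]
logits = [0.0, 0.0]
for label in (0, 1):
    print(label,
          round(reweighted_loss(logits, label, priors), 3),
          round(logit_adjusted_loss(logits, label, priors), 3))
```

In both variants an uninformative prediction is penalised more heavily on the minority class than on the majority class, which is the emphasis on minority classes that the abstract describes.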

arXiv:2310.04145 [pdf, other]

From Zero to Hero: Detecting Leaked Data through Synthetic Data Injection and Model Querying

Authors: Biao Wu, Qiang Huang, Anthony K. H. Tung

Abstract: Safeguarding the Intellectual Property (IP) of data has become critically important as machine learning applications continue to proliferate, and their success heavily relies on the quality of training data. While various mechanisms exist to secure data during storage, transmission, and consumption, fewer studies have been developed to detect whether they are already leaked for model training without authorization. This issue is particularly challenging due to the absence of information and control over the training process conducted by potential attackers. In this paper, we concentrate on the domain of tabular data and introduce a novel methodology, Local Distribution Shifting Synthesis (LDSS), to detect leaked data that are used to train classification models. The core concept behind LDSS involves injecting a small volume of synthetic data, characterized by local shifts in class distribution, into the owner's dataset. This enables the effective identification of models trained on leaked data through model querying alone, as the synthetic data injection results in a pronounced disparity in the predictions of models trained on leaked and modified datasets. LDSS is model-oblivious and hence compatible with a diverse range of classification models. We have conducted extensive experiments on seven types of classification models across five real-world datasets. The comprehensive results affirm the reliability, robustness, fidelity, security, and efficiency of LDSS. Extending LDSS to regression tasks further highlights its versatility and efficacy compared with baseline methods.

Submitted 17 April, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

Comments: Accepted and To Appear in VLDB 2024

arXiv:2310.03361 [pdf, other]

doi 10.1103/PhysRevD.109.092012

Measurement of $e^{+}e^{-}\rightarrowηJ/ψ$ Cross Section from $\sqrt{s}=$ 3.808 GeV to 4.951 GeV

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (608 additional authors not shown)

Abstract: Using data samples with an integrated luminosity of 22.42 fb$^{-1}$ collected by the BESIII detector operating at the BEPCII storage ring, we measure the cross sections of the $e^{+}e^{-}\rightarrow ηJ/ψ$ process at center-of-mass energies from 3.808 to 4.951 GeV. Three structures are observed in the line shape of the measured cross sections. A maximum-likelihood fit with $ψ(4040)$, two additional resonances, and a non-resonant component is performed. The mass and width of the first additional state are $(4219.7\pm2.5\pm4.5) \rm{MeV}/\rm{c}^2$ and $(80.7\pm4.4\pm1.4) \rm{MeV}$, respectively, consistent with the $ψ(4230)$. For the second state, the mass and width are $(4386\pm13\pm17) \rm{MeV}/\rm{c}^2$ and $(177\pm32\pm13) \rm{MeV}$, respectively, consistent with the $ψ(4360)$. The first uncertainties are statistical and the second ones are systematic. The statistical significance of $ψ(4040)$ is $8.0σ$ and those for $ψ(4230)$ and $ψ(4360)$ are more than $10.0σ$.

Submitted 5 October, 2023; originally announced October 2023.

arXiv:2310.03302 [pdf, other]

MLAgentBench: Evaluating Language Agents on Machine Learning Experimentation

Authors: Qian Huang, Jian Vora, Percy Liang, Jure Leskovec

Abstract: A central aspect of machine learning research is experimentation, the process of designing and running experiments, analyzing the results, and iterating towards some positive outcome (e.g., improving accuracy). Could agents driven by powerful language models perform machine learning experimentation effectively? To answer this question, we introduce MLAgentBench, a suite of 13 tasks ranging from improving model performance on CIFAR-10 to recent research problems like BabyLM. For each task, an agent can perform actions like reading/writing files, executing code, and inspecting outputs. We then construct an agent that can perform ML experimentation based on the ReAct framework. We benchmark agents based on Claude v1.0, Claude v2.1, Claude v3 Opus, GPT-4, GPT-4-turbo, Gemini-Pro, and Mixtral and find that a Claude v3 Opus agent is the best in terms of success rate. It can build compelling ML models over many tasks in MLAgentBench with a 37.5% average success rate. Our agents also display highly interpretable plans and actions. However, the success rates vary considerably; they span from 100% on well-established older datasets to as low as 0% on recent Kaggle challenges created potentially after the underlying LM was trained. Finally, we identify several key challenges for LM-based agents such as long-term planning and reducing hallucination. Our code is released at https://github.com/snap-stanford/MLAgentBench.

Submitted 14 April, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

arXiv:2310.01410 [pdf, other]

LEAP: Liberate Sparse-view 3D Modeling from Camera Poses

Authors: Hanwen Jiang, Zhenyu Jiang, Yue Zhao, Qixing Huang

Abstract: Are camera poses necessary for multi-view 3D modeling? Existing approaches predominantly assume access to accurate camera poses. While this assumption might hold for dense views, accurately estimating camera poses for sparse views is often elusive. Our analysis reveals that noisy estimated poses lead to degraded performance for existing sparse-view 3D modeling methods. To address this issue, we present LEAP, a novel pose-free approach, thereby challenging the prevailing notion that camera poses are indispensable. LEAP discards pose-based operations and learns geometric knowledge from data. LEAP is equipped with a neural volume, which is shared across scenes and is parameterized to encode geometry and texture priors. For each incoming scene, we update the neural volume by aggregating 2D image features in a feature-similarity-driven manner. The updated neural volume is decoded into the radiance field, enabling novel view synthesis from any viewpoint. On both object-centric and scene-level datasets, we show that LEAP significantly outperforms prior methods when they employ predicted poses from state-of-the-art pose estimators. Notably, LEAP performs on par with prior approaches that use ground-truth poses while running $400\times$ faster than PixelNeRF. We show LEAP generalizes to novel object categories and scenes, and learns knowledge that closely resembles epipolar geometry. Project page: https://hwjiang1510.github.io/LEAP/

Submitted 2 October, 2023; originally announced October 2023.

Comments: Project page https://hwjiang1510.github.io/LEAP/

arXiv:2310.00720 [pdf, other]

doi 10.1103/PhysRevC.109.L052201

First measurement of $ΛN$ inelastic scattering with $Λ$ from $e^{+} e^{-} \rightarrow J/ψ\to Λ\barΛ$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (626 additional authors not shown)

Abstract: Using an $e^+ e^-$ collision data sample of $(10087 \pm 44)\times10^6 ~J/ψ$ events taken at the center-of-mass energy of $3.097~\rm{GeV}$ by the BESIII detector at the BEPCII collider, the process $Λ+N \rightarrow Σ^+ + X$ is studied for the first time employing a novel method. The $Σ^{+}$ hyperons are produced by the collisions of $Λ$ hyperons from $J/ψ$ decays with nuclei in the material of the BESIII detector. The total cross section of $Λ+ ^{9}{\rm Be} \rightarrow Σ^+ + X$ is measured to be $σ= (37.3 \pm 4.7 \pm 3.5)~{\rm mb}$ at $Λ$ beam momenta within $[1.057, 1.091]~{\rm GeV}/c$, where the uncertainties are statistical and systematic, respectively. This analysis is the first study of $Λ$-nucleon interactions at an $e^+ e^-$ collider, providing information and constraints relevant for the strong-interaction potential, the origin of color confinement, the unified model for baryon-baryon interactions, and the internal structure of neutron stars.

Submitted 1 October, 2023; originally announced October 2023.

arXiv:2310.00411 [pdf, other]

doi 10.1088/1475-7516/2024/03/022

Prospects for Taiji to detect a gravitational-wave background from cosmic strings

Authors: Zu-Cheng Chen, Qing-Guo Huang, Chang Liu, Lang Liu, Xiao-Jin Liu, You Wu, Yu-Mei Wu, Zhu Yi, Zhi-Qiang You

Abstract: Recently, multiple pulsar timing array collaborations have presented compelling evidence for a stochastic signal at nanohertz frequencies, potentially originating from cosmic strings. Cosmic strings are linear topological defects that can arise during phase transitions in the early Universe or as fundamental strings in superstring theory. This paper focuses on investigating the detection capabilities of Taiji, a planned space-based gravitational wave detector, for the gravitational wave background generated by cosmic strings. By analyzing simulated Taiji data and utilizing comprehensive Bayesian parameter estimation techniques, we demonstrate a significant improvement in precision compared to the NANOGrav 15-year data, surpassing it by an order of magnitude. This highlights the enhanced measurement capabilities of Taiji. Consequently, Taiji can serve as a valuable complementary tool to pulsar timing arrays in validating and exploring the physics of cosmic strings in the early Universe.

Submitted 13 March, 2024; v1 submitted 30 September, 2023; originally announced October 2023.

Comments: 24 pages, 5 figures

arXiv:2309.16283 [pdf, other]

Self-supervised Cross-view Representation Reconstruction for Change Captioning

Authors: Yunbin Tu, Liang Li, Li Su, Zheng-Jun Zha, Chenggang Yan, Qingming Huang

Abstract: Change captioning aims to describe the difference between a pair of similar images. Its key challenge is how to learn a stable difference representation under pseudo changes caused by viewpoint change. In this paper, we address this by proposing a self-supervised cross-view representation reconstruction (SCORER) network. Concretely, we first design a multi-head token-wise matching to model relationships between cross-view features from similar/dissimilar images. Then, by maximizing cross-view contrastive alignment of two similar images, SCORER learns two view-invariant image representations in a self-supervised way. Based on these, we reconstruct the representations of unchanged objects by cross-attention, thus learning a stable difference representation for caption generation. Further, we devise a cross-modal backward reasoning to improve the quality of the caption. This module reversely models a "hallucination" representation with the caption and "before" representation. By pushing it closer to the "after" representation, we enforce the caption to be informative about the difference in a self-supervised manner. Extensive experiments show our method achieves state-of-the-art results on four datasets. The code is available at https://github.com/tuyunbin/SCORER.

Submitted 28 September, 2023; originally announced September 2023.

Comments: Accepted by ICCV 2023

arXiv:2309.16134 [pdf, other]

Let's Chat to Find the APIs: Connecting Human, LLM and Knowledge Graph through AI Chain

Authors: Qing Huang, Zhenyu Wan, Zhenchang Xing, Changjing Wang, Jieshan Chen, Xiwei Xu, Qinghua Lu

Abstract: API recommendation methods have evolved from literal and semantic keyword matching to query expansion and query clarification. The latest query clarification method is knowledge graph (KG)-based, but limitations include out-of-vocabulary (OOV) failures and rigid question templates. To address these limitations, we propose a novel knowledge-guided query clarification approach for API recommendation that leverages a large language model (LLM) guided by KG. We utilize the LLM as a neural knowledge base to overcome OOV failures, generating fluent and appropriate clarification questions and options. We also leverage the structured API knowledge and entity relationships stored in the KG to filter out noise, and transfer the optimal clarification path from KG to the LLM, increasing the efficiency of the clarification process. Our approach is designed as an AI chain that consists of five steps, each handled by a separate LLM call, to improve accuracy, efficiency, and fluency for query clarification in API recommendation. We verify the usefulness of each unit in our AI chain, which all received high scores close to a perfect 5. When compared to the baselines, our approach shows a significant improvement in MRR, with a maximum increase of 63.9% when the query statement is covered in KG and 37.2% when it is not. Ablation experiments reveal that the guidance of knowledge in the KG and the knowledge-guided pathfinding strategy are crucial for our approach's performance, resulting in a 19.0% and 22.2% increase in MAP, respectively. Our approach demonstrates a way to bridge the gap between KG and LLM, effectively compensating for the strengths and weaknesses of both.

Submitted 27 September, 2023; originally announced September 2023.

Comments: Accepted on ASE'2023

arXiv:2309.14689 [pdf, ps, other]

Updated measurements of the M1 transition $ψ(3686) \to γη_{c}(2S)$ with $η_{c}(2S) \to K \bar{K} π$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (609 additional authors not shown)

Abstract: Based on a data sample of $(27.08 \pm 0.14 ) \times 10^8~ψ(3686)$ events collected with the BESIII detector at the BEPCII collider, the M1 transition $ψ(3686) \to γη_{c}(2S)$ with $η_{c}(2S) \to K\bar{K}π$ is studied, where $K\bar{K}π$ is $K^{+} K^{-} π^{0}$ or $K_{S}^{0}K^{\pm}π^{\mp}$. The mass and width of the $η_{c}(2S)$ are measured to be $(3637.8 \pm 0.8 (\rm {stat}) \pm 0.2 (\rm {syst}))$ MeV/$c^{2}$ and $(10.5 \pm 1.7 (\rm {stat}) \pm 3.5 (\rm {syst}))$ MeV, respectively. The product branching fraction $\mathcal{B}\left(ψ(3686) \rightarrow γη_{c}(2 S)\right) \times \mathcal{B}(η_{c}(2 S) \rightarrow K \bar{K} π)$ is determined to be $(0.97 \pm 0.06 (\rm {stat}) \pm 0.09 (\rm {syst})) \times 10^{-5}$. Using $\mathcal{BR}(η_{c}(2S)\to K\bar{K}π)=(1.86^{+0.68}_{-0.49})\%$, we obtain the branching fraction of the radiative transition to be $\mathcal{BR}(ψ(3686) \to γη_{c}(2S)) = (5.2 \pm 0.3 (\rm {stat}) \pm 0.5 (\rm {syst}) ^{+1.9}_{-1.4} (extr)) \times 10^{-4}$, where the third uncertainty is due to the quoted $\mathcal{BR}(η_{c}(2S) \to K\bar{K}π)$.

Submitted 26 September, 2023; originally announced September 2023.
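
The arithmetic linking the product branching fraction to the radiative branching fraction in the abstract above can be checked directly. This is a minimal sketch with hypothetical variable names. It assumes the third ("extr") uncertainty is obtained by shifting the quoted $η_{c}(2S) \to K\bar{K}π$ branching fraction to the edges of its uncertainty interval, which may differ from the collaboration's actual procedure.

```python
# Inputs quoted in the abstract.
product_bf = 0.97e-5     # B(psi(3686) -> gamma eta_c(2S)) * B(eta_c(2S) -> K Kbar pi)
product_stat = 0.06e-5   # statistical uncertainty on the product
product_syst = 0.09e-5   # systematic uncertainty on the product
bf_kkpi = 1.86e-2        # quoted B(eta_c(2S) -> K Kbar pi)
bf_kkpi_up = 0.68e-2     # its upward uncertainty
bf_kkpi_down = 0.49e-2   # its downward uncertainty

# Divide out the quoted branching fraction to isolate the radiative transition.
central = product_bf / bf_kkpi
stat = product_stat / bf_kkpi
syst = product_syst / bf_kkpi

# Assumed propagation: a smaller denominator raises the result, a larger one lowers it.
extr_up = product_bf / (bf_kkpi - bf_kkpi_down) - central
extr_down = central - product_bf / (bf_kkpi + bf_kkpi_up)

print(f"({central * 1e4:.1f} +/- {stat * 1e4:.1f} (stat) +/- {syst * 1e4:.1f} (syst) "
      f"+{extr_up * 1e4:.1f} -{extr_down * 1e4:.1f} (extr)) x 10^-4")
```

Under these assumptions the sketch reproduces the quoted $(5.2 \pm 0.3 \pm 0.5 ^{+1.9}_{-1.4}) \times 10^{-4}$ after rounding.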

arXiv:2309.14667 [pdf, ps, other]

Investigation of the $ΔI = 1/2$ rule and test of CP violation through the measurement of decay asymmetry parameters in $Ξ^-$ decays

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (604 additional authors not shown)

Abstract: Using $(10087\pm44)\times 10^{6}$ $J/ψ$ events collected with the BESIII detector, numerous $Ξ^-$ and $Λ$ decay asymmetry parameters are simultaneously determined from the process $J/ψ\to Ξ^- \barΞ^+ \to Λ(pπ^-) π^- \barΛ(\bar{n} π^0) π^+$ and its charge-conjugate channel. The precisions of $α_0$ for $Λ\to nπ^0$ and $\barα_0$ for $\barΛ \to \bar{n}π^0$ compared to world averages are improved by factors of 4 and 1.7, respectively. The ratio of decay asymmetry parameters of $Λ\to nπ^0$ to that of $Λ\to pπ^-$, $\langle α_0 \rangle/ \langle α_{Λ-} \rangle $, is determined to be $ 0.873 \pm 0.012^{+0.011}_{-0.010}$, where the first and the second uncertainties are statistical and systematic, respectively. The ratio is smaller than unity by more than $5σ$, which signifies the existence of the $ΔI = 3/2$ transition in $Λ$ for the first time. Besides, we test for CP violation in $Ξ^- \to Λπ^-$ and in $Λ\to n π^{0}$ with the best precision to date.

Submitted 8 January, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

Comments: 8 pages, 2 figures, 1 table

arXiv:2309.14045 [pdf, other]

Impacts of Gravitational-Wave Background from Supermassive Black Hole Binaries on the Detection of Compact Binaries by LISA

Authors: Fan Huang, Yan-Chen Bi, Zhoujian Cao, Qing-Guo Huang

Abstract: In the frequency band of the Laser Interferometer Space Antenna (LISA), extensive research has been conducted on the impact of foreground confusion noise generated by galactic binaries within the Milky Way galaxy. Additionally, the recent evidence for a stochastic signal, announced by NANOGrav, EPTA, PPTA, CPTA and InPTA, indicates that the stochastic gravitational-wave background (SGWB) generated by supermassive black hole binaries (SMBHBs) can contribute a strong background noise within the LISA band. Given the presence of such strong noise, it is expected to have a considerable impact on LISA's scientific missions. In this work, we investigate the impacts of the SGWB generated by SMBHBs on the detection of massive black hole binaries (MBHBs), verified galactic binaries (VGBs) and extreme mass ratio inspirals (EMRIs) in the context of LISA, and find it crucial to resolve and eliminate the excess noise from the SGWB to ensure the success of LISA's missions.

Submitted 25 September, 2023; originally announced September 2023.

Comments: 6 pages, 3 figures

arXiv:2309.13883 [pdf, other]

doi 10.1007/JHEP01(2024)180

Measurement of the $e^{+}e^{-} \to K_{S}^{0} K_{L}^{0} π^{0}$ cross sections from $\sqrt{s}=$ 2.000 to 3.080 GeV

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (604 additional authors not shown)

Abstract: Based on $e^{+}e^{-}$ collision data collected at center-of-mass energies from 2.000 to 3.080 GeV by the BESIII detector at the BEPCII collider, a partial wave analysis is performed for the process $e^{+}e^{-}\to K_{S}^{0} K_{L}^{0} π^{0}$. The results allow the Born cross sections of the process $e^{+}e^{-}\to K_{S}^{0} K_{L}^{0} π^{0}$, as well as its subprocesses $e^{+}e^{-}\to K^{*}(892)^{0}\bar{K}^{0}$ and $K^{*}_{2}(1430)^{0}\bar{K}^{0}$ to be measured. The Born cross sections for $e^{+}e^{-}\to K_{S}^{0}K_{L}^{0}π^{0}$ are consistent with previous measurements by BaBar, but with substantially improved precision. The Born cross section lineshape of the process $e^{+}e^{-}\to K^{*}(892)^{0}\bar{K}^{0}$ is consistent with a vector meson state around 2.2 GeV with a significance of 3.2$σ$. A Breit-Wigner fit determines its mass as $M_Y=(2164.7\pm9.1\pm3.1)~{\rm{MeV}}/c^{2}$ and its width as $Γ_{Y}=(32.4\pm21.0\pm1.8)~\rm{MeV}$.

Submitted 26 February, 2024; v1 submitted 25 September, 2023; originally announced September 2023.

Journal ref: JHEP01(2024)180

arXiv:2309.10248 [pdf, other]

doi 10.1145/3588432.3591550

What is the Best Automated Metric for Text to Motion Generation?

Authors: Jordan Voas, Yili Wang, Qixing Huang, Raymond Mooney

Abstract: There is growing interest in generating skeleton-based human motions from natural language descriptions. While most efforts have focused on developing better neural architectures for this task, there has been no significant work on determining the proper evaluation metric. Human evaluation is the ultimate accuracy measure for this task, and automated metrics should correlate well with human quality judgments. Since descriptions are compatible with many motions, determining the right metric is critical for evaluating and designing effective generative models. This paper systematically studies which metrics best align with human evaluations and proposes new metrics that align even better. Our findings indicate that none of the metrics currently used for this task show even a moderate correlation with human judgments on a sample level. However, for assessing average model performance, commonly used metrics such as R-Precision and less-used coordinate errors show strong correlations. Additionally, several recently developed metrics are not recommended due to their low correlation compared to alternatives. We also introduce a novel metric based on a multimodal BERT-like model, MoBERT, which offers strongly human-correlated sample-level evaluations while maintaining near-perfect model-level correlation. Our results demonstrate that this new metric exhibits extensive benefits over all current alternatives.

Submitted 18 September, 2023; originally announced September 2023.

Comments: 8 pages, SIGGRAPH Asia 2023 Conference

arXiv:2309.10077 [pdf]

GAME: Generalized deep learning model towards multimodal data integration for early screening of adolescent mental disorders

Authors: Zhicheng Du, Chenyao Jiang, Xi Yuan, Shiyao Zhai, Zhengyang Lei, Shuyue Ma, Yang Liu, Qihui Ye, Chufan Xiao, Qiming Huang, Ming Xu, Dongmei Yu, Peiwu Qin

Abstract: The timely identification of mental disorders in adolescents is a global public health challenge. It is difficult to detect abnormalities from a single factor because of their complex and subtle nature. Additionally, generalized multimodal Computer-Aided Screening (CAS) systems with interactive robots for adolescent mental disorders are not available. Here, we design an Android application with mini-games and chat recording, deployed in a portable robot, to screen 3,783 middle school students and construct a multimodal screening dataset including facial images, physiological signs, voice recordings, and textual transcripts. We develop a model called GAME (Generalized Model with Attention and Multimodal EmbraceNet) with a novel attention mechanism that integrates cross-modal features into the model. GAME evaluates adolescent mental conditions with high accuracy (73.34%-92.77%) and F1-Score (71.32%-91.06%). We find that each modality contributes dynamically to mental disorder screening and to comorbidities among various mental disorders, indicating the feasibility of an explainable model. This study provides a system capable of acquiring multimodal information and constructs a generalized multimodal integration algorithm with novel attention mechanisms for the early screening of adolescent mental disorders.

Submitted 18 September, 2023; originally announced September 2023.

Showing 201–250 of 1,633 results for author: Huang, Q