Search | arXiv e-print repository

DK-SLAM: Monocular Visual SLAM with Deep Keypoint Learning, Tracking and Loop-Closing

Authors: Hao Qu, Lilian Zhang, Jun Mao, Junbo Tie, Xiaofeng He, Xiaoping Hu, Yifei Shi, Changhao Chen

Abstract: The performance of visual SLAM in complex, real-world scenarios is often compromised by unreliable feature extraction and matching when using handcrafted features. Although deep learning-based local features excel at capturing high-level information and perform well on matching benchmarks, they struggle with generalization in continuous motion scenes, adversely affecting loop detection accuracy. Our system employs a Model-Agnostic Meta-Learning (MAML) strategy to optimize the training of keypoint extraction networks, enhancing their adaptability to diverse environments. Additionally, we introduce a coarse-to-fine feature tracking mechanism for learned keypoints. It begins with a direct method to approximate the relative pose between consecutive frames, followed by a feature matching method for refined pose estimation. To mitigate cumulative positioning errors, DK-SLAM incorporates a novel online learning module that utilizes binary features for loop closure detection. This module dynamically identifies loop nodes within a sequence, ensuring accurate and efficient localization. Experimental evaluations on publicly available datasets demonstrate that DK-SLAM outperforms leading traditional and learning based SLAM systems, such as ORB-SLAM3 and LIFT-SLAM. These results underscore the efficacy and robustness of our DK-SLAM in varied and challenging real-world environments.

Submitted 25 June, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

Comments: In submission

arXiv:2401.09136 [pdf, other]

doi 10.1103/PhysRevD.109.072001

Improved measurements of the Dalitz decays $η/η'\rightarrowγe^{+}e^{-}$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (618 additional authors not shown)

Abstract: Based on a data sample of 10 billion $J/ψ$ events collected with the BESIII detector, improved measurements of the Dalitz decays $η/η'\rightarrowγe^+e^-$ are performed, where the $η$ and $η'$ are produced through the radiative decays $J/ψ\rightarrowγη/η'$. The branching fractions of $η\rightarrowγe^+e^-$ and $η'\rightarrowγe^+e^-$ are measured to be $(7.07 \pm 0.05 \pm 0.23)\times10^{-3}$ and $(4.83\pm0.07\pm0.14)\times10^{-4}$, respectively. Within the single pole model, the parameter of electromagnetic transition form factor for $η\rightarrowγe^+e^-$ is determined to be $Λ_η=(0.749 \pm 0.027 \pm 0.007)~ {\rm GeV}/c^{2}$. Within the multi-pole model, we extract the electromagnetic transition form factors for $η'\rightarrowγe^+e^-$ to be $Λ_{η'} = (0.802 \pm 0.007\pm 0.008)~ {\rm GeV}/c^{2}$ and $γ_{η'} = (0.113\pm0.010\pm0.002)~ {\rm GeV}/c^{2}$. The results are consistent with both theoretical predictions and previous measurements. The characteristic sizes of the interaction regions for the $η$ and $η'$ are calculated to be $(0.645 \pm 0.023 \pm 0.007 )~ {\rm fm}$ and $(0.596 \pm 0.005 \pm 0.006)~ {\rm fm}$, respectively. In addition, we search for the dark photon in $η/η^\prime\rightarrowγe^{+}e^{-}$, and the upper limits of the branching fractions as a function of the dark photon are given at 90\% confidence level.

Submitted 5 April, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

Journal ref: Phys.Rev.D 109 (2024) 7, 072001
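
As a rough cross-check of the interaction-region sizes quoted in the abstract above, the following sketch assumes the common relation $R=\sqrt{6b}\,\hbar c$ between the form-factor slope $b$ and the characteristic size, with $b=1/Λ^2$ for the single-pole model and $b=1/(Λ^2+γ^2)$ for the multi-pole model. These relations are not stated in the abstract and are an assumption here; the function names are illustrative only, and uncertainties are not propagated.

```python
import math

HBAR_C = 0.1973269804  # conversion constant hbar*c in GeV*fm

def size_single_pole(lam):
    """Size in fm, assuming slope b = 1/lam^2 (lam in GeV)."""
    return math.sqrt(6.0) * HBAR_C / lam

def size_multi_pole(lam, gamma):
    """Size in fm, assuming slope b = 1/(lam^2 + gamma^2) (both in GeV)."""
    return math.sqrt(6.0 / (lam**2 + gamma**2)) * HBAR_C

# Central values taken from the abstract.
r_eta = size_single_pole(0.749)
r_etap = size_multi_pole(0.802, 0.113)
print(f"eta: {r_eta:.3f} fm, eta': {r_etap:.3f} fm")
```

Under these assumptions both results land within about 0.001 fm of the central values quoted in the abstract.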

arXiv:2401.09012 [pdf, other]

First study of antihyperon-nucleon scattering $\barΛp\rightarrow\barΛp$ and measurement of $Λp\rightarrowΛp$ cross section

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (634 additional authors not shown)

Abstract: Using $(10.087\pm0.044)\times10^{9}$ $J/ψ$ events collected with the BESIII detector at the BEPCII storage ring, the processes $Λp\rightarrowΛp$ and $\barΛp\rightarrow\barΛp$ are studied, where the $Λ/\barΛ$ baryons are produced in the process $J/ψ\rightarrowΛ\barΛ$ and the protons are the hydrogen nuclei in the cooling oil of the beam pipe. Clear signals are observed for the two reactions. The cross sections in $-0.9\leq\rm{cos}θ_{Λ/\barΛ}\leq0.9$ are measured to be $σ(Λp\rightarrowΛp)=(12.2\pm1.6_{\rm{stat}}\pm1.1_{\rm{sys}})$ mb and $σ(\barΛ p\rightarrow\barΛ p)=(17.5\pm2.1_{\rm{stat}}\pm1.6_{\rm{sys}})$ mb at the $Λ/\barΛ$ momentum of $1.074$ GeV/$c$ within a range of $\pm0.017$ GeV/$c$, where the $θ_{Λ/\barΛ}$ are the scattering angles of the $Λ/\barΛ$ in the $Λp/\barΛp$ rest frames. Furthermore, the differential cross sections of the two reactions are also measured, where there is a slight tendency of forward scattering for $Λp\rightarrowΛp$, and a strong forward peak for $\barΛp\rightarrow\barΛp$. We present an approach to extract the total elastic cross sections by extrapolation. The study of $\barΛp\rightarrow\barΛp$ represents the first study of antihyperon-nucleon scattering, and these new measurements will serve as important inputs for the theoretical understanding of the (anti)hyperon-nucleon interaction.

Submitted 18 May, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

Comments: 9 pages, 5 figures

arXiv:2401.08252 [pdf, other]

Observation of $ψ(3686) \to Ω^- K^+ \barΞ^0 $+c.c

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (630 additional authors not shown)

Abstract: Using $(27.12 \pm 0.14) \times 10^{8}$ $ψ(3686)$ events collected with the BESIII detector at BEPCII, the decay of $ψ(3686) \to Ω^- K^+ \barΞ^0 +c.c.$ is observed for the first time. The branching fraction of this decay is measured to be $\mathcal{B}_{ψ(3686) \to Ω^- K^+ \barΞ^0 +c.c.}=(2.78 \pm 0.40 \pm 0.18 ) \times 10^{-6}$, where the first uncertainty is statistical and the second is systematic. Possible baryon excited states are searched for in this decay, but no evident intermediate state is observed with the current sample size.

Submitted 15 April, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

arXiv:2401.08066 [pdf, other]

doi 10.1016/j.media.2024.103188

Achieve Fairness without Demographics for Dermatological Disease Diagnosis

Authors: Ching-Hao Chiu, Yu-Jen Chen, Yawen Wu, Yiyu Shi, Tsung-Yi Ho

Abstract: In medical image diagnosis, fairness has become increasingly crucial. Without bias mitigation, deploying unfair AI would harm the interests of the underprivileged population and potentially tear society apart. Recent research addresses prediction biases in deep learning models concerning demographic groups (e.g., gender, age, and race) by utilizing demographic (sensitive attribute) information during training. However, many sensitive attributes naturally exist in dermatological disease images. If the trained model only targets fairness for a specific attribute, it remains unfair for other attributes. Moreover, training a model that can accommodate multiple sensitive attributes is impractical due to privacy concerns. To overcome this, we propose a method enabling fair predictions for sensitive attributes during the testing phase without using such information during training. Inspired by prior work highlighting the impact of feature entanglement on fairness, we enhance the model features by capturing the features related to the sensitive and target attributes and regularizing the feature entanglement between corresponding classes. This ensures that the model can only classify based on the features related to the target attribute without relying on features associated with sensitive attributes, thereby improving fairness and accuracy. Additionally, we use disease masks from the Segment Anything Model (SAM) to enhance the quality of the learned feature. Experimental results demonstrate that the proposed method can improve fairness in classification compared to state-of-the-art methods in two dermatological disease datasets.

Submitted 15 January, 2024; originally announced January 2024.

arXiv:2401.07862 [pdf, other]

Adaptive Neural-Operator Backstepping Control of a Benchmark Hyperbolic PDE

Authors: Maxence Lamarque, Luke Bhan, Yuanyuan Shi, Miroslav Krstic

Abstract: To stabilize PDEs, feedback controllers require gain kernel functions, which are themselves governed by PDEs. Furthermore, these gain-kernel PDEs depend on the PDE plants' functional coefficients. The functional coefficients in PDE plants are often unknown. This requires an adaptive approach to PDE control, i.e., an estimation of the plant coefficients conducted concurrently with control, where a separate PDE for the gain kernel must be solved at each timestep upon the update in the plant coefficient function estimate. Solving a PDE at each timestep is computationally expensive and a barrier to the implementation of real-time adaptive control of PDEs. Recently, results in neural operator (NO) approximations of functional mappings have been introduced into PDE control, for replacing the computation of the gain kernel with a neural network that is trained, once offline, and reused in real-time for rapid solution of the PDEs. In this paper, we present the first result on applying NOs in adaptive PDE control, presented for a benchmark 1-D hyperbolic PDE with recirculation. We establish global stabilization via Lyapunov analysis, in the plant and parameter error states, and also present an alternative approach, via passive identifiers, which avoids the strong assumptions on kernel differentiability. We then present numerical simulations demonstrating stability and observe speedups up to three orders of magnitude, highlighting the real-time efficacy of neural operators in adaptive control. Our code (Github) is made publicly available for future researchers.

Submitted 15 January, 2024; originally announced January 2024.

Comments: 16.5 pages, 3 figures

arXiv:2401.07320 [pdf, other]

doi 10.1093/mnras/stae156

Massive Red Spiral Galaxies in SDSS-IV MaNGA Survey

Authors: Jiantong Cui, Qiusheng Gu, Yong Shi

Abstract: Massive red spiral galaxies (MRSGs) are supposed to be the possible progenitors of lenticular galaxies (S0s). We select a large sample of MRSGs ($M_*>10^{10.5}\rm M_{\odot}$) from MaNGA DR17 using the $g-r$ color vs. stellar mass diagram, along with control samples of blue spirals and S0s. Our main results are as follows: (1) After comparing the S$\rm \acute{e}$rsic index, concentration parameter, asymmetry parameter distribution, size-mass relation and $Σ_1$ (stellar mass surface density within the central 1 kpc)-mass relation, we find MRSGs are similar to S0s and have more compact and symmetric structures than blue spirals. MRSGs also resemble S0s in Dn4000, metallicity, Mgb/$\rm \left \langle Fe \right \rangle$ and $V/σ$ radial profile. (2) By using MaNGA 2D spectra data, we separate the spatial regions into inner (R < 0.8$R_{\rm e}$) and outer (0.8$R_{\rm e}$ < R < 1.5$R_{\rm e}$) regions, and detect residual star formation in the outer regions of MRSGs. (3) When we select a sub-sample of MRSGs with NUV$-r$ > 5, we find that they are completely star-formation quenched in both inner and outer regions. Compared to optically selected MRSGs, NUV$-r$ selected MRSGs appear to be more concentrated and have more massive dark matter halos. The similarities between S0s and MRSGs suggest the possible evolutionary trend between MRSGs and S0s.

Submitted 14 January, 2024; originally announced January 2024.

Comments: accepted for publication in MNRAS; 17 pages, 16 figures, 1 table

arXiv:2401.06813 [pdf, other]

doi 10.1103/PhysRevD.109.053005

First observation of the decay $Λ^+_c\to nK^{0}_{S}π^+π^0$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (630 additional authors not shown)

Abstract: Based on 4.5 fb$^{-1}$ of $e^{+}e^{-}$ collision data accumulated at center-of-mass energies between $4599.53$ MeV and $4698.82$ MeV with the BESIII detector, the decay $Λ_{c}^{+}\to nK_{S}^{0}π^+π^0$ is observed for the first time with a significance of $9.2σ$. The branching fraction is measured to be $(0.85\pm0.13\pm0.03)\%$, where the first uncertainty is statistical and the second systematic, which differs from the theoretical prediction based on isospin by 4.4$σ$. This indicates that there may be resonant contributions or some unknown dynamics in this decay.

Submitted 28 March, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

Journal ref: Phys.Rev.D,109,053005 (2024)

arXiv:2401.06461 [pdf, other]

Between Lines of Code: Unraveling the Distinct Patterns of Machine and Human Programmers

Authors: Yuling Shi, Hongyu Zhang, Chengcheng Wan, Xiaodong Gu

Abstract: Large language models have catalyzed an unprecedented wave in code generation. While achieving significant advances, they blur the distinctions between machine- and human-authored source code, causing integrity and authenticity issues of software artifacts. Previous methods such as DetectGPT have proven effective in discerning machine-generated texts, but they do not identify and harness the unique patterns of machine-generated code. Thus, their applicability falters when applied to code. In this paper, we carefully study the specific patterns that characterize machine- and human-authored code. Through a rigorous analysis of code attributes such as lexical diversity, conciseness, and naturalness, we expose unique patterns inherent to each source. We particularly notice that the syntactic segmentation of code is a critical factor in identifying its provenance. Based on our findings, we propose DetectCodeGPT, a novel method for detecting machine-generated code, which improves DetectGPT by capturing the distinct stylized patterns of code. Diverging from conventional techniques that depend on external LLMs for perturbations, DetectCodeGPT perturbs the code corpus by strategically inserting spaces and newlines, ensuring both efficacy and efficiency. Experiment results show that our approach significantly outperforms state-of-the-art techniques in detecting machine-generated code.

Submitted 23 March, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

Comments: code available at https://github.com/YerbaPage/DetectCodeGPT
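
The whitespace-perturbation idea mentioned in the abstract above can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `perturb_whitespace`, its parameters, and the uniform random placement are all hypothetical choices made for illustration, and the sketch makes no attempt to preserve program semantics (for example inside string literals or indentation-sensitive code).

```python
import random

def perturb_whitespace(code: str, n_spaces: int = 3, n_newlines: int = 2, seed: int = 0) -> str:
    """Return a copy of `code` with extra spaces and blank lines inserted at random."""
    rng = random.Random(seed)

    # Insert empty lines at randomly chosen line boundaries.
    lines = code.split("\n")
    for _ in range(n_newlines):
        lines.insert(rng.randint(0, len(lines)), "")
    text = "\n".join(lines)

    # Widen randomly chosen existing spaces by one extra space each.
    space_positions = [i for i, ch in enumerate(text) if ch == " "]
    chosen = rng.sample(space_positions, min(n_spaces, len(space_positions)))
    for i in sorted(chosen, reverse=True):  # right-to-left keeps earlier indices valid
        text = text[:i] + " " + text[i:]
    return text

original = "def add(a, b):\n    return a + b"
perturbed = perturb_whitespace(original)
print(perturbed)
```

A detector in the DetectGPT family would then compare a model's score on the original against its scores on many such perturbed copies; that scoring step is outside the scope of this sketch.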

arXiv:2401.06270 [pdf, other]

SCARIF: Towards Carbon Modeling of Cloud Servers with Accelerators

Authors: Shixin Ji, Zhuoping Yang, Xingzhen Chen, Stephen Cahoon, Jingtong Hu, Yiyu Shi, Alex K. Jones, Peipei Zhou

Abstract: Embodied carbon has been widely reported as a significant component in the full system lifecycle of various computing systems' greenhouse gas emissions. Many efforts have been undertaken to quantify the elements that comprise this embodied carbon, from tools that evaluate semiconductor manufacturing to those that can quantify different elements of the computing system from commercial and academic sources. However, these tools cannot easily reproduce results reported by server vendors' product carbon reports and the accuracy can vary substantially due to various assumptions. Furthermore, attempts to determine greenhouse gas contributions using bottom-up methodologies often do not agree with system-level studies and are hard to rectify. Nonetheless, given there is a need to consider all contributions to greenhouse gas emissions in datacenters, we propose SCARIF, the Server Carbon including Accelerator Reporter with Intelligence-based Formulation tool. SCARIF has three main contributions: (1) We first collect reported carbon cost data from server vendors and design statistical models to predict the embodied carbon cost so that users can get the embodied carbon cost for their server configurations. (2) We provide embodied carbon cost if users configure servers with accelerators including GPUs and FPGAs. (3) By using case studies, we show that certain design choices in data center management might flip in light of the insights and observations from using SCARIF. Thus, SCARIF provides an opportunity for large-scale datacenter and hyperscaler design. We release SCARIF as an open-source tool at https://github.com/arc-research-lab/SCARIF.

Submitted 22 May, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

Comments: 6 pages; 6 figures; 3 tables. Accepted by ISVLSI' 24

arXiv:2401.05752 [pdf, other]

Learning Generalizable Models via Disentangling Spurious and Enhancing Potential Correlations

Authors: Na Wang, Lei Qi, Jintao Guo, Yinghuan Shi, Yang Gao

Abstract: Domain generalization (DG) intends to train a model on multiple source domains to ensure that it can generalize well to an arbitrary unseen target domain. The acquisition of domain-invariant representations is pivotal for DG as they possess the ability to capture the inherent semantic information of the data, mitigate the influence of domain shift, and enhance the generalization capability of the model. Adopting multiple perspectives, such as the sample and the feature, proves to be effective. The sample perspective facilitates data augmentation through data manipulation techniques, whereas the feature perspective enables the extraction of meaningful generalization features. In this paper, we focus on improving the generalization ability of the model by compelling it to acquire domain-invariant representations from both the sample and feature perspectives by disentangling spurious correlations and enhancing potential correlations. 1) From the sample perspective, we develop a frequency restriction module, guiding the model to focus on the relevant correlations between object features and labels, thereby disentangling spurious correlations. 2) From the feature perspective, the simple Tail Interaction module implicitly enhances potential correlations among all samples from all source domains, facilitating the acquisition of domain-invariant representations across multiple domains for the model. The experimental results show that Convolutional Neural Networks (CNNs) or Multi-Layer Perceptrons (MLPs) with a strong baseline embedded with these two modules can achieve superior results, e.g., an average accuracy of 92.30% on Digits-DG.

Submitted 11 January, 2024; originally announced January 2024.

arXiv:2401.05357 [pdf, other]

U-SWIM: Universal Selective Write-Verify for Computing-in-Memory Neural Accelerators

Authors: Zheyu Yan, Xiaobo Sharon Hu, Yiyu Shi

Abstract: Architectures that incorporate Computing-in-Memory (CiM) using emerging non-volatile memory (NVM) devices have become strong contenders for deep neural network (DNN) acceleration due to their impressive energy efficiency. Yet, a significant challenge arises when using these emerging devices: they can show substantial variations during the weight-mapping process. This can severely impact DNN accuracy if not mitigated. A widely accepted remedy for imperfect weight mapping is the iterative write-verify approach, which involves verifying conductance values and adjusting devices if needed. In all existing publications, this procedure is applied to every individual device, resulting in a significant programming time overhead. In our research, we illustrate that only a small fraction of weights need this write-verify treatment for the corresponding devices and the DNN accuracy can be preserved, yielding a notable programming acceleration. Building on this, we introduce USWIM, a novel method based on the second derivative. It leverages a single iteration of forward and backpropagation to pinpoint the weights demanding write-verify. Through extensive tests on diverse DNN designs and datasets, USWIM manifests up to a 10x programming acceleration against the traditional exhaustive write-verify method, all while maintaining a similar accuracy level. Furthermore, compared to our earlier SWIM technique, USWIM excels, showing a 7x speedup when dealing with devices exhibiting non-uniform variations.

Submitted 11 December, 2023; originally announced January 2024.

arXiv:2401.04283 [pdf, ps, other]

FADI-AEC: Fast Score Based Diffusion Model Guided by Far-end Signal for Acoustic Echo Cancellation

Authors: Yang Liu, Li Wan, Yun Li, Yiteng Huang, Ming Sun, James Luan, Yangyang Shi, Xin Lei

Abstract: Despite the potential of diffusion models in speech enhancement, their deployment in Acoustic Echo Cancellation (AEC) has been restricted. In this paper, we propose DI-AEC, pioneering a diffusion-based stochastic regeneration approach dedicated to AEC. Further, we propose FADI-AEC, a fast score-based diffusion AEC framework that reduces computational demands, making it favorable for edge devices. It stands out by running the score model once per frame, achieving a significant surge in processing efficiency. Apart from that, we introduce a novel noise generation technique where far-end signals are utilized, incorporating both far-end and near-end signals to refine the score model's accuracy. We test our proposed method on the ICASSP2023 Microsoft deep echo cancellation challenge evaluation dataset, where our method outperforms some of the end-to-end methods and other diffusion based echo cancellation methods.

Submitted 8 January, 2024; originally announced January 2024.

arXiv:2401.03221 [pdf, other]

MirrorDiffusion: Stabilizing Diffusion Process in Zero-shot Image Translation by Prompts Redescription and Beyond

Authors: Yupei Lin, Xiaoyu Xian, Yukai Shi, Liang Lin

Abstract: Recently, text-to-image diffusion models have become a new paradigm in image processing fields, including content generation, image restoration and image-to-image translation. Given a target prompt, Denoising Diffusion Probabilistic Models (DDPM) are able to generate realistic yet eligible images. With this appealing property, the image translation task has the potential to be free from target image samples for supervision. By using a target text prompt for domain adaption, the diffusion model is able to implement zero-shot image-to-image translation advantageously. However, the sampling and inversion processes of DDPM are stochastic, and thus the inversion process often fails to reconstruct the input content. Specifically, the displacement effect gradually accumulates during the diffusion and inversion processes, which leads to the reconstructed results deviating from the source domain. To make reconstruction explicit, we propose a prompt redescription strategy to realize a mirror effect between the source and reconstructed image in the diffusion model (MirrorDiffusion). More specifically, a prompt redescription mechanism is investigated to align the text prompts with latent code at each time step of the Denoising Diffusion Implicit Models (DDIM) inversion to pursue a structure-preserving reconstruction. With the revised DDIM inversion, MirrorDiffusion is able to realize accurate zero-shot image translation by editing optimized text prompts and latent code. Extensive experiments demonstrate that MirrorDiffusion achieves superior performance over the state-of-the-art methods on zero-shot image translation benchmarks by clear margins and practical model stability.

Submitted 6 January, 2024; originally announced January 2024.

Comments: A prompt re-description strategy is proposed for stabilizing the diffusion model in image-to-image translation. Code and dataset page: https://mirrordiffusion.github.io/

arXiv:2401.02740 [pdf, ps, other]

Fairness-Aware Job Scheduling for Multi-Job Federated Learning

Authors: Yuxin Shi, Han Yu

Abstract: Federated learning (FL) enables multiple data owners (a.k.a. FL clients) to collaboratively train machine learning models without disclosing sensitive private data. Existing FL research mostly focuses on the monopoly scenario in which a single FL server selects a subset of FL clients to update their local models in each round of training. In practice, there can be multiple FL servers simultaneously trying to select clients from the same pool. In this paper, we propose a first-of-its-kind Fairness-aware Federated Job Scheduling (FairFedJS) approach to bridge this gap. Based on Lyapunov optimization, it ensures fair allocation of high-demand FL client datasets to FL jobs in need of them, by jointly considering the current demand and the job payment bids, in order to prevent prolonged waiting. Extensive experiments comparing FairFedJS against four state-of-the-art approaches on two datasets demonstrate its significant advantages. It outperforms the best baseline by 31.9% and 1.0% on average in terms of scheduling fairness and convergence time, respectively, while achieving comparable test accuracy.

Submitted 7 February, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

Comments: accepted by ICASSP 2024

arXiv:2401.02655 [pdf, ps, other]

CscK metrics near the canonical class

Authors: Bin Guo, Wangjian Jian, Yalong Shi, Jian Song

Abstract: Let $X$ be a Kähler manifold with semi-ample canonical bundle $K_X$. It is proved by Jian-Shi-Song that for any Kähler class $γ$, there exists $δ>0$ such that for all $t\in (0, δ)$ there exists a unique cscK metric $g_t$ in $K_X+ t γ$. In this paper, we prove that $\{ (X, g_t) \}_{ t\in (0, δ)} $ have uniformly bounded Kähler potentials, volume forms and diameters. As a consequence, these metric spaces are pre-compact in the Gromov-Hausdorff sense.

Submitted 5 January, 2024; originally announced January 2024.

Comments: 13 pages, no figures. All comments are welcome!

MSC Class: 53C55; 35J60

arXiv:2401.02516 [pdf, other]

Moving-Horizon Estimators for Hyperbolic and Parabolic PDEs in 1-D

Authors: Luke Bhan, Yuanyuan Shi, Iasson Karafyllis, Miroslav Krstic, James B. Rawlings

Abstract: Observers for PDEs are themselves PDEs. Therefore, producing real-time estimates with such observers is computationally burdensome. For both finite-dimensional and ODE systems, moving-horizon estimators (MHE) are operators whose output is the state estimate, while their inputs are the initial state estimate at the beginning of the horizon as well as the measured output and input signals over the moving time horizon. In this paper we introduce MHEs for PDEs which remove the need for a numerical solution of an observer PDE in real time. We accomplish this using the PDE backstepping method which, for certain classes of both hyperbolic and parabolic PDEs, produces moving-horizon state estimates explicitly. Precisely, to explicitly produce the state estimates, we employ a backstepping transformation of a hard-to-solve observer PDE into a target observer PDE, which is explicitly solvable. The MHEs we propose are not new observer designs but simply the explicit MHE realizations, over a moving horizon of arbitrary length, of the existing backstepping observers. Our PDE MHEs lack the optimality of the MHEs that arose as duals of MPC, but they are given explicitly, even for PDEs. In the paper we provide explicit formulae for MHEs for both hyperbolic and parabolic PDEs, as well as simulation results that illustrate theoretically guaranteed convergence of the MHEs.

Submitted 4 January, 2024; originally announced January 2024.

Comments: 7 pages, 1 figure, submitted to ACC 2024

arXiv:2401.02087 [pdf, ps, other]

Green functions for GJMS operators on spheres, Gegenbauer polynomials and rigidity theorems

Authors: Xuezhang Chen, Yalong Shi

Abstract: We derive explicit representation formulae of Green functions for GJMS operators on $n$-spheres, including the fractional ones. These formulae not only have natural geometric interpretations concerning the extrinsic geometry of the round sphere, but also reflect the spherical rigidity among closed embedded hypersurfaces in $\mathbb{R}^{n+1}$.

Submitted 20 January, 2024; v1 submitted 4 January, 2024; originally announced January 2024.

Comments: 36 pages, no figures. Related works are mentioned, typos corrected. Theorem 1(3) now includes more cases. We also provide an alternative proof of Theorem 3(2) when n is at least 5 in the appendix, by using the asymptotic expansion formula of Green functions. Comments are welcome!

MSC Class: 35J08; 53C24

arXiv:2401.01530 [pdf, other]

Disorder-induced topological pumping on a superconducting quantum processor

Authors: Yu Liu, Yu-Ran Zhang, Yun-Hao Shi, Tao Liu, Congwei Lu, Yong-Yi Wang, Hao Li, Tian-Ming Li, Cheng-Lin Deng, Si-Yun Zhou, Tong Liu, Jia-Chi Zhang, Gui-Han Liang, Zheng-Yang Mei, Wei-Guo Ma, Hao-Tian Liu, Zheng-He Liu, Chi-Tong Chen, Kaixuan Huang, Xiaohui Song, SP Zhao, Ye Tian, Zhongcheng Xiang, Dongning Zheng, Franco Nori , et al. (2 additional authors not shown)

Abstract: Thouless pumping, a dynamical version of the integer quantum Hall effect, represents the quantized charge pumped during an adiabatic cyclic evolution. Here we report experimental observations of nontrivial topological pumping that is induced by disorder even during a topologically trivial pumping trajectory. With a 41-qubit superconducting quantum processor, we develop a Floquet engineering technique to realize cycles of adiabatic pumping by simultaneously varying the on-site potentials and the hopping couplings. We demonstrate Thouless pumping in the presence of disorder and show its breakdown as the strength of disorder increases. Moreover, we observe two types of topological pumping that are induced by on-site potential disorder and hopping disorder, respectively. Especially, an intrinsic topological pump that is induced by quasi-periodic hopping disorder has never been experimentally realized before. Our highly controllable system provides a valuable quantum simulating platform for studying various aspects of topological physics in the presence of disorder.

Submitted 2 January, 2024; originally announced January 2024.

arXiv:2401.00918 [pdf, ps, other]

Partial Wave Analysis of $J/ψ\rightarrow γγφ$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (603 additional authors not shown)

Abstract: Using a sample of $(10087\pm44)\times10^{6}$ $J/ψ$ events collected with the BESIII detector at the BEPCII collider, a partial wave analysis on the decay $γγφ$ is performed to investigate the intermediate resonances in $J/ψ\rightarrowγX, X\rightarrowγφ$. The resonances $f_{1}(1285)$, $η(1405)$, $f_{1}(1420)$, $f_{1}(1510)$, $f_{2}(1525)$, $X(1835)$, $f_{2}(1950)$, $f_{2}(2010)$, $f_{0}(2200)$ and $η_{c}$ are observed with statistical significance greater than 5$σ$. The product branching fractions $\mathcal{B}(J/ψ\rightarrowγX, X\rightarrow γφ)$ are reported. The resonance parameters of $η(1405)$ and $X(1835)$ are also measured.

Submitted 1 January, 2024; originally announced January 2024.

arXiv:2401.00878 [pdf, ps, other]

Observation of $\mathcal R(3810)$ in $e^+e^-\rightarrow {\rm hadrons}$ and Improved Measurements of the Resonance Parameters of $\mathcal R(3760)$ and $\mathcal R(3780)$

Authors: M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, J. Bloms, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (596 additional authors not shown)

Abstract: We report the measurement of the cross sections for $e^+e^-\rightarrow {\rm hadrons}$ at center-of-mass (c.m.) energies from 3.645 to 3.871 GeV. We observe a new resonance $\mathcal R(3810)$ in the cross sections for the first time, and observe the $\mathcal R(3760)$ resonance with high significance in the cross sections. The $\mathcal R(3810)$ has a mass of $(3804.5 \pm 0.9 \pm 0.9)$ MeV/$c^2$, a total width of $(5.4 \pm 3.5 \pm 3.2)$ MeV, and an electronic partial width of $(19.4 \pm 7.4 \pm 12.1)$ eV. Its significance is $7.7σ$. The $\mathcal R(3810)$ could be interpreted as a hadro-charmonium resonance predicted by Quantum Chromodynamics (QCD). In addition, we measure the mass $(3751.9\pm 3.8\pm 2.8)$ MeV/$c^2$, the total width $(32.8 \pm 5.8 \pm 8.7)$ MeV, and the electronic partial width $(184\pm 75\pm 86)$ eV with improved precision for the $\mathcal R(3760)$. Furthermore, for the $\mathcal R(3780)$ we measure the mass $(3778.7\pm 0.5\pm 0.3)$ MeV/$c^2$ and total width $(20.3 \pm 0.8 \pm 1.7)$ MeV with improved precision, and the electronic partial width $(265\pm 69\pm 83)$ eV. The $\mathcal R(3780)$ can be interpreted as the $1^3D_1$ state of charmonium. Its mass and total width differ significantly from the corresponding fitted values given by the Particle Data Group in 2022 by 7.1 and 3.2 times the uncertainties for $ψ(3770)$, respectively. $ψ(3770)$ has been interpreted as the $1^3D_1$ state for 45 years.

Submitted 30 December, 2023; originally announced January 2024.

arXiv:2401.00434 [pdf, other]

GeoGalactica: A Scientific Large Language Model in Geoscience

Authors: Zhouhan Lin, Cheng Deng, Le Zhou, Tianhang Zhang, Yi Xu, Yutong Xu, Zhongmou He, Yuanyuan Shi, Beiya Dai, Yunchong Song, Boyi Zeng, Qiyuan Chen, Yuxun Miao, Bo Xue, Shu Wang, Luoyi Fu, Weinan Zhang, Junxian He, Yunqiang Zhu, Xinbing Wang, Chenghu Zhou

Abstract: Large language models (LLMs) have achieved huge success for their general knowledge and ability to solve a wide spectrum of tasks in natural language processing (NLP). Due to their impressive abilities, LLMs have shed light on potential inter-discipline applications to foster scientific discoveries of a specific domain by using artificial intelligence (AI for science, AI4S). In the meantime, utilizing NLP techniques in geoscience research and practice is wide and convoluted, contributing from knowledge extraction and document classification to question answering and knowledge discovery. In this work, we take the initial step to leverage LLM for science, through a rather straightforward approach. We try to specialize an LLM into geoscience, by further pre-training the model with a vast amount of texts in geoscience, as well as supervised fine-tuning (SFT) the resulting model with our custom collected instruction tuning dataset. These efforts result in a model GeoGalactica consisting of 30 billion parameters. To our best knowledge, it is the largest language model for the geoscience domain. More specifically, GeoGalactica is from further pre-training of Galactica. We train GeoGalactica over a geoscience-related text corpus containing 65 billion tokens, preserving as the largest geoscience-specific text corpus. Then we fine-tune the model with 1 million pairs of instruction-tuning data consisting of questions that demand professional geoscience knowledge to answer.
In this technical report, we will illustrate in detail all aspects of GeoGalactica, including data collection, data cleaning, base model selection, pre-training, SFT, and evaluation. We open-source our data curation tools and the checkpoints of GeoGalactica during the first 3/4 of pre-training.

Submitted 13 April, 2024; v1 submitted 31 December, 2023; originally announced January 2024.

ACM Class: I.2.7; F.4.1

arXiv:2401.00271 [pdf, other]

HybridGait: A Benchmark for Spatial-Temporal Cloth-Changing Gait Recognition with Hybrid Explorations

Authors: Yilan Dong, Chunlin Yu, Ruiyang Ha, Ye Shi, Yuexin Ma, Lan Xu, Yanwei Fu, Jingya Wang

Abstract: Existing gait recognition benchmarks mostly include minor clothing variations in the laboratory environments, but lack persistent changes in appearance over time and space. In this paper, we propose the first in-the-wild benchmark CCGait for cloth-changing gait recognition, which incorporates diverse clothing changes, indoor and outdoor scenes, and multi-modal statistics over 92 days. To further address the coupling effect of clothing and viewpoint variations, we propose a hybrid approach HybridGait that exploits both temporal dynamics and the projected 2D information of 3D human meshes. Specifically, we introduce a Canonical Alignment Spatial-Temporal Transformer (CA-STT) module to encode human joint position-aware features, and fully exploit 3D dense priors via a Silhouette-guided Deformation with 3D-2D Appearance Projection (SilD) strategy. Our contributions are twofold: we provide a challenging benchmark CCGait that captures realistic appearance changes across an expanded time and space, and we propose a hybrid framework HybridGait that outperforms prior works on CCGait and Gait3D benchmarks. Our project page is available at https://github.com/HCVLab/HybridGait.

Submitted 30 December, 2023; originally announced January 2024.

arXiv:2401.00138 [pdf, other]

Absence of Weyl nodes in EuCd$_2$As$_2$ revealed by the carrier density dependence of the anomalous Hall effect

Authors: Yue Shi, Zhaoyu Liu, Logan A. Burnett, Seokhyeong Lee, Chaowei Hu, Qianni Jiang, Jiaqi Cai, Xiaodong Xu, Mo Li, Cheng-Chien Chen, Jiun-Haw Chu

Abstract: The antiferromagnetic layered compound EuCd$_2$As$_2$ is widely considered a leading candidate for an ideal Weyl semimetal, featuring a single pair of Weyl nodes in its field-induced ferromagnetic (FM) state. Nevertheless, this view has recently been challenged by an optical spectroscopy study, which suggests that it is a magnetic semiconductor. In this study, we have successfully synthesized highly insulating EuCd$_2$As$_2$ crystals with carrier density reaching as low as $2\times 10^{15}$ $\text{cm}^{-3}$. The magneto-transport measurements revealed a progressive decrease of the anomalous Hall conductivity (AHC) by several orders of magnitude as the carrier density decreases. This behavior contradicts what is expected from the intrinsic AHC generated by the Weyl points, which is independent of carrier density as the Fermi level approaches the charge neutrality point. In contrast, the scaling relationship between AHC and longitudinal conductivity aligns with the characteristics of variable range hopping insulators. Our results suggest that EuCd$_2$As$_2$ is a magnetic semiconductor rather than a topological Weyl semimetal.

Submitted 27 February, 2024; v1 submitted 29 December, 2023; originally announced January 2024.

arXiv:2312.17568 [pdf, ps, other]

Light baryon in three quark picture light front approach and its application: hyperon weak radiative decays

Authors: Zhi-Peng Xing, Yu Ji Shi, Jin Sun, Zhen-Xing Zhao

Abstract: Motivated by recent experimental data on $Σ^+\to pγ$ at BESIII, we investigate a class of hyperon weak radiative decays. To estimate these processes, in our research, we employ a new type of light-front quark model with a three-quark picture for octet baryons. In the three-quark picture, with the use of $SU(3)_f$ and spin symmetries, we present a general form of the light front wave function for each octet baryon. By including contributions from the penguin diagram and W exchange diagram, we perform a complete calculation on the branching ratios ($Br$) and the asymmetry parameter ($α$) for hyperon weak radiative decay processes. Our results are helpful for discovering additional hyperon weak radiative decay processes in experimental facilities, and our research will promote the theoretical study of baryons.

Submitted 8 January, 2024; v1 submitted 29 December, 2023; originally announced December 2023.

Comments: 15 pages, 2 figures, 5 tables

arXiv:2312.17516 [pdf, other]

Robust TOA-based Localization with Inaccurate Anchors for MANET

Authors: Xinkai Yu, Yang Zheng, Min Sheng, Yan Shi, Jiandong Li

Abstract: Accurate node localization is vital for mobile ad hoc networks (MANETs). Current methods like Time of Arrival (TOA) can estimate node positions using imprecise baseplates and achieve the Cramér-Rao lower bound (CRLB) accuracy. In multi-hop MANETs, some nodes lack direct links to base anchors, depending on neighbor nodes as dynamic anchors for chain localization. However, the dynamic nature of MANETs challenges TOA's robustness due to the availability and accuracy of base anchors, coupled with ranging errors. To address the issue of cascading positioning error divergence, we first derive the CRLB for any primary node in MANETs as a metric to tackle localization error in cascading scenarios. Second, we propose an advanced two-step TOA method based on CRLB which is able to approximate target node's CRLB with only local neighbor information. Finally, simulation results confirm the robustness of our algorithm, achieving CRLB-level accuracy for small ranging errors and maintaining precision for larger errors compared to existing TOA methods.

Submitted 29 December, 2023; originally announced December 2023.

arXiv:2312.17164 [pdf, other]

Securing NextG Systems against Poisoning Attacks on Federated Learning: A Game-Theoretic Solution

Authors: Yalin E. Sagduyu, Tugba Erpek, Yi Shi

Abstract: This paper studies the poisoning attack and defense interactions in a federated learning (FL) system, specifically in the context of wireless signal classification using deep learning for next-generation (NextG) communications. FL collectively trains a global model without the need for clients to exchange their data samples. By leveraging geographically dispersed clients, the trained global model can be used for incumbent user identification, facilitating spectrum sharing. However, in this distributed learning system, the presence of malicious clients introduces the risk of poisoning the training data to manipulate the global model through falsified local model exchanges. To address this challenge, a proactive defense mechanism is employed in this paper to make informed decisions regarding the admission or rejection of clients participating in FL systems. Consequently, the attack-defense interactions are modeled as a game, centered around the underlying admission and poisoning decisions. First, performance bounds are established, encompassing the best and worst strategies for attackers and defenders. Subsequently, the attack and defense utilities are characterized within the Nash equilibrium, where no player can unilaterally improve its performance given the fixed strategies of others. The results offer insights into novel operational modes that safeguard FL systems against poisoning attacks by quantifying the performance of both attacks and defenses in the context of NextG communications.

Submitted 28 December, 2023; originally announced December 2023.

arXiv:2312.17063 [pdf, other]

doi 10.1016/j.physletb.2024.138614

Search for a massless particle beyond the Standard Model in the $Σ^+\rightarrow p+{\rm invisible}$ decay

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (634 additional authors not shown)

Abstract: A massless particle beyond the Standard Model is searched for in the two-body decay $Σ^+\rightarrow p+{\rm invisible}$ using $(1.0087\pm0.0044)\times10^{10}$ $J/ψ$ events collected at a center-of-mass energy of $\sqrt{s}=3.097$ GeV with the BESIII detector at the BEPCII collider. No significant signal is observed, and the upper limit on the branching fraction $B(Σ^+\rightarrow p+{\rm invisible})$ is determined to be $3.2\times10^{-5}$ at the 90% confidence level. This is the first search for a flavor-changing neutral current process with missing energy in hyperon decays which plays an important role in constraining new physics models.

Submitted 5 April, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

Comments: 11 pages, 5 figures

Journal ref: Phys. Lett. B 852 (2024) 138614

arXiv:2312.16983 [pdf, other]

PG-LBO: Enhancing High-Dimensional Bayesian Optimization with Pseudo-Label and Gaussian Process Guidance

Authors: Taicai Chen, Yue Duan, Dong Li, Lei Qi, Yinghuan Shi, Yang Gao

Abstract: Variational Autoencoder based Bayesian Optimization (VAE-BO) has demonstrated its excellent performance in addressing high-dimensional structured optimization problems. However, current mainstream methods overlook the potential of utilizing a pool of unlabeled data to construct the latent space, while only concentrating on designing sophisticated models to leverage the labeled data. Despite their effective usage of labeled data, these methods often require extra network structures and additional procedures, resulting in computational inefficiency. To address this issue, we propose a novel method to effectively utilize unlabeled data with the guidance of labeled data. Specifically, we tailor the pseudo-labeling technique from semi-supervised learning to explicitly reveal the relative magnitudes of optimization objective values hidden within the unlabeled data. Based on this technique, we assign appropriate training weights to unlabeled data to enhance the construction of a discriminative latent space. Furthermore, we treat the VAE encoder and the Gaussian Process (GP) in Bayesian optimization as a unified deep kernel learning process, allowing the direct utilization of labeled data, which we term Gaussian Process guidance. This directly and effectively integrates the goal of improving GP accuracy into the VAE training, thereby guiding the construction of the latent space. The extensive experiments demonstrate that our proposed method outperforms existing VAE-BO algorithms in various optimization scenarios.
Our code will be published at https://github.com/TaicaiChen/PG-LBO.

Submitted 28 December, 2023; originally announced December 2023.

Comments: Accepted by AAAI 2024

arXiv:2312.16971 [pdf, other]

High Throughput Inter-Layer Connecting Strategy for Multi-Layer Ultra-Dense Satellite Networks

Authors: Qi Hao, Di Zhou, Min Sheng, Yan Shi, Jiandong Li

Abstract: Multi-layer ultra-dense satellite networks (MLUDSNs) have risen meteorically to provide vast throughput for globally diverse services. Differing from traditional monolayer constellations, MLUDSNs emphasize the spatial integration among layers, and their throughput may not be simply the sum of the throughput of each layer. The hop-count of cross-layer communication paths can be reduced by deploying inter-layer connections (ILCs), augmenting the MLUDSN's throughput. Therefore, it remains an open issue how to deploy ILCs to optimize the dynamic MLUDSN topology to dramatically raise throughput gains under multi-layer collaboration. This paper designs an ILC deployment strategy to enhance throughput by revealing the impacts of ILC distribution on reducing hop-count. Since deploying ILCs burdens the satellite with extra communication resource consumption, we model the ILC deployment problem as minimizing the average hop with limited ILCs, to maximize throughput. The proposed problem is a typical integer linear programming (ILP) problem, whose computational complexity is exponential as the satellite scale expands and the time evolves. Based on the symmetrical topology of each layer, we propose a two-phase deployment scheme to halve the problem scale and prioritize stable ILCs to reduce handover-count, which decreases the exponential complexity to a polynomial one, with 1% estimation error. Simulation results based on realistic megaconstellation information confirm that the optimal number of ILCs is less than $P \cdot S/2$, where $P$ and $S$ are the number of orbits and the number of satellites per orbit.
Besides, these ILCs deploy uniformly in each layer, which raises throughput by over 1.55x relative to isolated layers.

Submitted 28 December, 2023; originally announced December 2023.

arXiv:2312.16405 [pdf, ps, other]

Observation of $χ_{cJ}\to 3(K^+K^-)$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (632 additional authors not shown)

Abstract: By analyzing $(27.12\pm0.14)\times10^8$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII collider, the decay processes $χ_{cJ} \to 3(K^+K^-)$ ($J=0,1,2$) are observed for the first time with statistical significances of 8.2$σ$, 8.1$σ$, and 12.4$σ$, respectively. The product branching fractions of $ψ(3686)\toγχ_{cJ}$, $χ_{cJ}\to 3(K^+K^-)$ are presented and the branching fractions of $χ_{cJ}\to 3(K^+K^-)$ decays are determined to be $\mathcal{B}_{χ_{c0}\to 3(K^+K^-)}$=$(10.7\pm1.8\pm1.1)$$\times10^{-6}$, $\mathcal{B}_{χ_{c1}\to 3(K^+K^-)}$=$(4.2\pm0.9\pm0.5)$$\times10^{-6}$, and $\mathcal{B}_{χ_{c2}\to 3(K^+K^-)}$=$(7.2\pm1.1\pm0.8)$$\times10^{-6}$, where the first uncertainties are statistical and the second are systematic.

Submitted 26 December, 2023; originally announced December 2023.

Comments: 8 pages, 2 figures

arXiv:2312.16062 [pdf, other]

AutoTask: Executing Arbitrary Voice Commands by Exploring and Learning from Mobile GUI

Authors: Lihang Pan, Bowen Wang, Chun Yu, Yuxuan Chen, Xiangyu Zhang, Yuanchun Shi

Abstract: Voice command interfaces (VCIs) have gained increasing importance, enabling hands-free and eyes-free interaction with digital devices. However, the inherent complexity in constructing effective voice interfaces has limited the VCIs' functionalities to only a small fraction of GUI applications and tasks. This paper presents AutoTask, a VCI capable of automating any task in any mobile application without configuration or modification from developers or end users. The primary challenge for AutoTask is the lack of knowledge, as it needs to accomplish unknown tasks (e.g., user commands) within an unknown environment (e.g., GUI). To address this challenge, AutoTask employs two strategies: (1) trial and error: AutoTask explores the GUI, attempts potential operation sequences, and recovers from errors through backtracking; (2) learning from the environment: AutoTask accumulates experiences during exploration and summarizes correct knowledge from these experiences. We implemented AutoTask on Android devices and conducted an evaluation study, which proved the feasibility of AutoTask.

Submitted 26 December, 2023; originally announced December 2023.

arXiv:2312.16031 [pdf, other]

Transverse electric waves in Bandos-Lechner-Sorokin-Townsend nonlinear electrodynamics

Authors: Yang Shi, Qinyan Tan, Towe Wang

Abstract: In the generalized Born-Infeld electrodynamics discovered by Bandos, Lechner, Sorokin and Townsend, we study transverse electric waves propagating perpendicular to a constant magnetic field background in a parallel-plate waveguide. The directions of propagation and polarization of the waves are perpendicular to each other, and both of them are parallel to the perfectly conducting plates. Two specific configurations are studied, in which the background magnetic field is either normal to the plates or along the polarization direction. The dispersion relation, the velocity and the cutoff frequency of the lowest-order lowest-frequency mode are calculated in both configurations. This paves the way for a promising test of the generalized Born-Infeld electrodynamics.

Submitted 26 December, 2023; originally announced December 2023.

Comments: 10 pages, 4 figures

arXiv:2312.15445 [pdf, ps, other]

Lattice paths and Rogers--Ramanujan--Gordon type overpartitions

Authors: Diane Y. H. Shi

Abstract: In this paper, we connect the Rogers--Ramanujan--Gordon type overpartitions to lattice paths with four kinds of unitary steps. By a bijection between overpartitions and lattice paths, we prove that the theorems given by Chen, Sang and Shi have a lattice-path form. Then, inspired by Andrews' parity in partition identities and by this relation, we put parity restrictions on lattice paths and give some new results. From the parity results on lattice paths, we derive some parity results on overpartitions.

Submitted 24 December, 2023; originally announced December 2023.

arXiv:2312.15412 [pdf, other]

CARSS: Cooperative Attention-guided Reinforcement Subpath Synthesis for Solving Traveling Salesman Problem

Authors: Yuchen Shi, Congying Han, Tiande Guo

Abstract: This paper introduces CARSS (Cooperative Attention-guided Reinforcement Subpath Synthesis), a novel approach to address the Traveling Salesman Problem (TSP) by leveraging cooperative Multi-Agent Reinforcement Learning (MARL). CARSS decomposes the TSP solving process into two distinct yet synergistic steps: "subpath generation" and "subpath merging." In the former, a cooperative MARL framework is employed to iteratively generate subpaths using multiple agents. In the latter, these subpaths are progressively merged to form a complete cycle. The algorithm's primary objective is to enhance efficiency in terms of training memory consumption, testing time, and scalability, through the adoption of a multi-agent divide and conquer paradigm. Notably, attention mechanisms play a pivotal role in feature embedding and parameterization strategies within CARSS. The training of the model is facilitated by the independent REINFORCE algorithm. Empirical experiments reveal CARSS's superiority compared to single-agent alternatives: it demonstrates reduced GPU memory utilization, accommodates training graphs nearly 2.5 times larger, and exhibits the potential for scaling to even more extensive problem sizes. Furthermore, CARSS substantially reduces testing time and optimization gaps by approximately 50% for TSP instances of up to 1000 vertices, when compared to standard decoding methods.

Submitted 24 December, 2023; originally announced December 2023.

arXiv:2312.15385 [pdf, other]

Discrete-Time Mean-Variance Strategy Based on Reinforcement Learning

Authors: Xiangyu Cui, Xun Li, Yun Shi, Si Zhao

Abstract: This paper studies a discrete-time mean-variance model based on reinforcement learning. Compared with its continuous-time counterpart in \cite{zhou2020mv}, the discrete-time model makes more general assumptions about the asset's return distribution. Using entropy to measure the cost of exploration, we derive the optimal investment strategy, whose density function is also Gaussian type. Additionally, we design the corresponding reinforcement learning algorithm. Both simulation experiments and empirical analysis indicate that our discrete-time model exhibits better applicability when analyzing real-world data than the continuous-time model.

Submitted 23 December, 2023; originally announced December 2023.

Comments: arXiv admin note: text overlap with arXiv:1904.11392 by other authors

arXiv:2312.15298 [pdf, other]

10 kT axial magnetic field generated using multiple conventional laser beams

Authors: Jue Xuan Hao, Xiang Tang, Alexey Arefiev, Robert J. Kingham, ** Zhu, Yin Shi, Jian Zheng

Abstract: Strong laser-generated magnetic fields have important applications in high energy density science and laboratory astrophysics. Although the inverse Faraday effect provides a mechanism for generating strong magnetic fields by absorbing angular momentum from a high-intensity laser pulse, it is not applicable to conventional linearly polarized (LP) Gaussian laser beams. We have developed a spatial arrangement that overcomes this difficulty by using multiple laser beams arranged to have a twist in the pointing direction. Using three-dimensional kinetic particle-in-cell simulations, we show that this arrangement is the key to generating a strong magnetic field. The resulting multi-kT picosecond axial magnetic field occupies tens of thousands of cubic microns of space and can be realized under a wide range of laser parameters and plasma conditions. Our scheme is well suited for implementation at PW-class laser facilities with multiple conventional LP laser beams.

Submitted 23 December, 2023; originally announced December 2023.

Comments: 22 pages, 14 figures

arXiv:2312.14474 [pdf, other]

MonoLSS: Learnable Sample Selection For Monocular 3D Detection

Authors: Zhenjia Li, **rang Jia, Yifeng Shi

Abstract: In the field of autonomous driving, monocular 3D detection is a critical task which estimates 3D properties (depth, dimension, and orientation) of objects in a single RGB image. Previous works have used features in a heuristic way to learn 3D properties, without considering that inappropriate features could have adverse effects. In this paper, we introduce sample selection, so that only suitable samples are trained to regress the 3D properties. To select samples adaptively, we propose a Learnable Sample Selection (LSS) module, which is based on Gumbel-Softmax and a relative-distance sample divider. The LSS module works under a warm-up strategy leading to an improvement in training stability. Additionally, since the LSS module dedicated to 3D property sample selection relies on object-level features, we further develop a data augmentation method named MixUp3D to enrich 3D property samples, which conforms to imaging principles without introducing ambiguity. As two orthogonal methods, the LSS module and MixUp3D can be utilized independently or in conjunction. Extensive experiments have shown that their combined use can lead to synergistic effects, yielding improvements that transcend the mere sum of their individual applications. Leveraging the LSS module and MixUp3D, without any extra data, our method named MonoLSS ranks 1st in all three categories (Car, Cyclist, and Pedestrian) on the KITTI 3D object detection benchmark, and achieves competitive results on both the Waymo dataset and KITTI-nuScenes cross-dataset evaluation. The code is included in the supplementary material and will be released to facilitate related academic and industrial studies.

Submitted 22 May, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

arXiv:2312.14456 [pdf, other]

doi 10.1103/PhysRevX.14.011047

Spontaneous gap opening and potential excitonic states in an ideal Dirac semimetal Ta$_2$Pd$_3$Te$_5$

Authors: Peng Zhang, Yuyang Dong, Dayu Yan, Bei Jiang, Tao Yang, Jun Li, Zhaopeng Guo, Yong Huang, Bo Hao, Qing Li, Yupeng Li, Kifu Kurokawa, Rui Wang, Yuefeng Nie, Makoto Hashimoto, Donghui Lu, Wen-He Jiao, Jie Shen, Tian Qian, Zhijun Wang, Youguo Shi, Takeshi Kondo

Abstract: The opening of an energy gap in the electronic structure generally indicates the presence of interactions. In materials with low carrier density and short screening length, long-range Coulomb interaction favors the spontaneous formation of electron-hole pairs, so-called excitons, opening an excitonic gap at the Fermi level. Excitonic materials host unique phenomena associated with pair excitations. However, there is still no generally recognized single-crystal material with excitonic order, which is, therefore, awaited in condensed matter physics. Here, we show that excitonic states may exist in the quasi-one-dimensional material Ta$_2$Pd$_3$Te$_5$, which has an almost ideal Dirac-like band structure, with the Dirac point located exactly at the Fermi level. We find that an energy gap appears at 350 K, and it grows with decreasing temperature. The spontaneous gap opening is absent in a similar material Ta$_2$Ni$_3$Te$_5$. Intriguingly, the gap is destroyed by the potassium deposition on the crystal, likely due to extra-doped carriers. Furthermore, we observe a pair of in-gap flat bands, which is an analog of the impurity states in a superconducting gap. All these observations can be properly explained by an excitonic order, providing Ta$_2$Pd$_3$Te$_5$ as a new and promising candidate realizing excitonic states.

Submitted 15 March, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

Comments: 9 pages, 5 figures

Journal ref: Phys. Rev. X 14, 011047 (2024)

arXiv:2312.14455 [pdf, other]

doi 10.1103/PhysRevX.14.011046

Evidence for an Excitonic Insulator State in Ta$_2$Pd$_3$Te$_5$

Authors: Jierui Huang, Bei Jiang, **gyu Yao, Dayu Yan, Xincheng Lei, Jiacheng Gao, Zhaopeng Guo, Feng **, Yupeng Li, Zhenyu Yuan, Congcong Chai, Haohao Sheng, Mojun Pan, Famin Chen, Junde Liu, Shunye Gao, Gexing Qu, Bo Liu, Zhicheng Jiang, Zhengtai Liu, Xiaoyan Ma, Shiming Zhou, Yaobo Huang, Chenxia Yun, Qingming Zhang , et al. (8 additional authors not shown)

Abstract: The excitonic insulator (EI) is an exotic ground state of narrow-gap semiconductors and semimetals arising from spontaneous condensation of electron-hole pairs bound by attractive Coulomb interaction. Despite research on EIs dating back to half a century ago, their existence in real materials remains a subject of ongoing debate. In this study, through systematic experimental and theoretical investigations, we provide evidence for the existence of an EI ground state in a van der Waals compound Ta$_2$Pd$_3$Te$_5$. Density-functional-theory calculations suggest that it is a semimetal with a small band overlap, whereas various experiments exhibit an insulating ground state with a clear band gap. Upon incorporating electron-hole Coulomb interaction into our calculations, we obtain an EI phase where the electronic symmetry breaking opens a many-body gap. Angle-resolved photoemission spectroscopy measurements exhibit that the band gap is closed with a significant change in the dispersions as the number of thermally excited charge carriers becomes sufficiently large in both equilibrium and nonequilibrium states. Structural measurements reveal a slight breaking of crystal symmetry with exceptionally small lattice distortion in the insulating state, which cannot account for the significant gap opening. Therefore, we attribute the insulating ground state with a gap opening in Ta$_2$Pd$_3$Te$_5$ to exciton condensation, where the coupling to the symmetry-breaking electronic state induces a subtle change in the crystal structure.

Submitted 14 March, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

Comments: 10 pages, 5 figures

Journal ref: Phys. Rev. X 14, 011046, 2024

arXiv:2312.13923 [pdf, other]

Fed-CO2: Cooperation of Online and Offline Models for Severe Data Heterogeneity in Federated Learning

Authors: Zhongyi Cai, Ye Shi, Wei Huang, **gya Wang

Abstract: Federated Learning (FL) has emerged as a promising distributed learning paradigm that enables multiple clients to learn a global model collaboratively without sharing their private data. However, the effectiveness of FL is highly dependent on the quality of the data that is being used for training. In particular, data heterogeneity issues, such as label distribution skew and feature skew, can significantly impact the performance of FL. Previous studies in FL have primarily focused on addressing label distribution skew data heterogeneity, while only a few recent works have made initial progress in tackling feature skew issues. Notably, these two forms of data heterogeneity have been studied separately and have not been well explored within a unified FL framework. To address this gap, we propose Fed-CO$_{2}$, a universal FL framework that handles both label distribution skew and feature skew within a \textbf{C}ooperation mechanism between the \textbf{O}nline and \textbf{O}ffline models. Specifically, the online model learns general knowledge that is shared among all clients, while the offline model is trained locally to learn the specialized knowledge of each individual client. To further enhance model cooperation in the presence of feature shifts, we design an intra-client knowledge transfer mechanism that reinforces mutual learning between the online and offline models, and an inter-client knowledge transfer mechanism to increase the models' domain generalization ability. Extensive experiments show that our Fed-CO$_{2}$ outperforms a wide range of existing personalized federated learning algorithms in terms of handling label distribution skew and feature skew, both individually and collectively. The empirical results are supported by our convergence analyses in a simplified setting.

Submitted 26 December, 2023; v1 submitted 21 December, 2023; originally announced December 2023.

Comments: Accepted by NeurIPS 2023

arXiv:2312.13593 [pdf, ps, other]

Search for the decay $χ_{c1}(3872)\toπ^{+}π^{-}χ_{c1}$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (608 additional authors not shown)

Abstract: Using a data sample corresponding to an integrated luminosity of 10.9 fb$^{-1}$ collected at center-of-mass energies from 4.16 to 4.34 GeV with the BESIII detector, we search for the decay $χ_{c1}(3872) \to π^{+}π^{-}χ_{c1}$ in the radiative production $e^{+}e^{-} \to γχ_{c1}(3872)$. No significant signal is observed, and the ratio for the branching fraction of $χ_{c1}(3872) \to π^{+}π^{-}χ_{c1}$ to $χ_{c1}(3872) \to π^{+}π^{-}J/ψ$ is measured as $\mathcal{R}\equiv\frac{\mathcal{B}[χ_{c1}(3872) \to π^{+}π^{-}χ_{c1}]}{\mathcal{B}[χ_{c1}(3872) \to π^{+}π^{-} J/ψ]}<0.18$ at 90$\%$ confidence level. The upper limit on the product of the cross section $σ[e^{+}e^{-}\toγχ_{c1}(3872)]$ and the branching fraction $\mathcal{B}[χ_{c1}(3872)\toπ^{+}π^{-}χ_{c1}]$ at each center-of-mass energy is also given. These measurements favor the non-conventional charmonium nature of the $χ_{c1}(3872)$ state.

Submitted 21 December, 2023; originally announced December 2023.

Comments: 8 pages, 1 figure

arXiv:2312.13518 [pdf]

Revisit the phase diagram and piezoelectricity of lead zirconate titanate from first principles

Authors: Yubai Shi, Ri He, Bingwen Zhang, Zhicheng Zhong

Abstract: Lead zirconate titanate (PbZr1-xTixO3, PZT) exhibits excellent piezoelectric properties in the morphotropic phase boundary (MPB) region of its temperature-composition phase diagram. However, the microscopic origin of its high piezoelectric response remains controversial. Here, we develop a machine-learning-based deep potential (DP) model of PZT using a training dataset from first-principles density functional theory calculations. Based on DP-assisted large-scale atomic simulations, we reproduce the temperature-composition phase diagram of PZT, in good agreement with experiment except for the absence of a structural transition from R3c to R3m. We find that the rhombohedral phase maintains R3c symmetry with slight oxygen octahedral tilting as the temperature increases, rather than adopting R3m symmetry. This discrepancy can be traced back to the lack of experimental measurements able to identify such slight octahedral tilting. More importantly, we clarify the atomic-level feature of PZT at the MPB, which exhibits the competing coupling of ferroelectric nanodomains with various polarization orientations. The high piezoelectric response is driven by polarization rotation of nanodomains induced by an external electric field.

Submitted 20 December, 2023; originally announced December 2023.

Comments: 19 pages, 8 figures

arXiv:2312.13016 [pdf, other]

DiffPortrait3D: Controllable Diffusion for Zero-Shot Portrait View Synthesis

Authors: Yuming Gu, You Xie, Hongyi Xu, Guoxian Song, Yichun Shi, Di Chang, **g Yang, Linjie Luo

Abstract: We present DiffPortrait3D, a conditional diffusion model that is capable of synthesizing 3D-consistent photo-realistic novel views from as few as a single in-the-wild portrait. Specifically, given a single RGB input, we aim to synthesize plausible but consistent facial details rendered from novel camera views while retaining both identity and facial expression. In lieu of time-consuming optimization and fine-tuning, our zero-shot method generalizes well to arbitrary face portraits with unposed camera views, extreme facial expressions, and diverse artistic depictions. At its core, we leverage the generative prior of 2D diffusion models pre-trained on large-scale image datasets as our rendering backbone, while the denoising is guided with disentangled attentive control of appearance and camera pose. To achieve this, we first inject the appearance context from the reference image into the self-attention layers of the frozen UNets. The rendering view is then manipulated with a novel conditional control module that interprets the camera pose by watching a condition image of a crossed subject from the same view. Furthermore, we insert a trainable cross-view attention module to enhance view consistency, which is further strengthened with a novel 3D-aware noise generation process during inference. We demonstrate state-of-the-art results both qualitatively and quantitatively on our challenging in-the-wild and multi-view benchmarks.

Submitted 19 March, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

arXiv:2312.12719 [pdf, ps, other]

Measurements of $Σ$ electromagnetic form factors in the time-like region using the untagged initial-state radiation technique

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (626 additional authors not shown)

Abstract: The process $e^{+}e^{-}\toΣ^{+}\barΣ^{-}$ is studied from threshold up to 3.04 GeV/$c^2$ via the initial-state radiation technique using data with an integrated luminosity of 12.0 fb$^{-1}$, collected at center-of-mass energies between 3.773 and 4.258 GeV with the BESIII detector at the BEPCII collider. The pair production cross sections and the effective form factors of $Σ$ are measured in eleven $Σ^{+}\barΣ^{-}$ invariant mass intervals from threshold to 3.04 GeV/$c^2$. The results are consistent with the previous results from Belle and BESIII. Furthermore, the branching fractions of the decays $J/ψ\toΣ^{+}\barΣ^{-}$ and $ψ(3686)\toΣ^{+}\barΣ^{-}$ are determined and the obtained results are consistent with the previous results of BESIII.

Submitted 19 December, 2023; originally announced December 2023.

Comments: 13 pages, 6 figures

arXiv:2312.12237 [pdf, other]

Roll With the Punches: Expansion and Shrinkage of Soft Label Selection for Semi-supervised Fine-Grained Learning

Authors: Yue Duan, Zhen Zhao, Lei Qi, Lu** Zhou, Lei Wang, Yinghuan Shi

Abstract: While semi-supervised learning (SSL) has yielded promising results, the more realistic SSL scenario remains to be explored, in which the unlabeled data exhibits extremely high recognition difficulty, e.g., fine-grained visual classification in the context of SSL (SS-FGVC). The increased recognition difficulty on fine-grained unlabeled data spells disaster for pseudo-labeling accuracy, resulting in poor performance of the SSL model. To tackle this challenge, we propose Soft Label Selection with Confidence-Aware Clustering based on Class Transition Tracking (SoC) by reconstructing the pseudo-label selection process by jointly optimizing Expansion Objective and Shrinkage Objective, which is based on a soft label manner. Respectively, the former objective encourages soft labels to absorb more candidate classes to ensure the attendance of ground-truth class, while the latter encourages soft labels to reject more noisy classes, which is theoretically proved to be equivalent to entropy minimization. In comparisons with various state-of-the-art methods, our approach demonstrates its superior performance in SS-FGVC. Checkpoints and source code are available at https://github.com/NJUyued/SoC4SS-FGVC.

Submitted 19 December, 2023; originally announced December 2023.

Comments: Accepted by AAAI 2024

arXiv:2312.11567 [pdf, other]

Students' Perceptions and Preferences of Generative Artificial Intelligence Feedback for Programming

Authors: Zhengdong Zhang, Zihan Dong, Yang Shi, Noboru Matsuda, Thomas Price, Dongkuan Xu

Abstract: The rapid evolution of artificial intelligence (AI), specifically large language models (LLMs), has opened opportunities for various educational applications. This paper explored the feasibility of utilizing ChatGPT, one of the most popular LLMs, for automating feedback for Java programming assignments in an introductory computer science (CS1) class. Specifically, this study focused on three questions: 1) To what extent do students view LLM-generated feedback as formative? 2) How do students see the comparative affordances of feedback prompts that include their code, vs. those that exclude it? 3) What enhancements do students suggest for improving AI-generated feedback? To address these questions, we generated automated feedback using the ChatGPT API for four lab assignments in the CS1 class. The survey results revealed that students perceived the feedback as aligning well with formative feedback guidelines established by Shute. Additionally, students showed a clear preference for feedback generated by including the students' code as part of the LLM prompt, and our thematic study indicated that the preference was mainly attributed to the specificity, clarity, and corrective nature of the feedback. Moreover, this study found that students generally expected specific and corrective feedback with sufficient code examples, but had divergent opinions on the tone of the feedback. This study demonstrated that ChatGPT could generate Java programming assignment feedback that students perceived as formative. It also offered insights into the specific improvements that would make the ChatGPT-generated feedback useful for students.

Submitted 17 December, 2023; originally announced December 2023.

arXiv:2312.11464 [pdf, other]

Symmetry Enforced Fermi Surface Degeneracies Observed in Time-Reversal Symmetry-Breaking Superconductor LaNiGa$_2$

Authors: Matthew Staab, Robert Prater, Sudheer Sreedhar, Journey Byland, Eliana Mann, Davis Zackaria, Yunshu Shi, Henry J. Bowman, Andrew L. Stephens, Myung-Chul Jung, Antia S. Botana, Warren E. Pickett, Valentin Taufour, Inna Vishik

Abstract: LaNiGa$_2$ is a superconductor that breaks time-reversal symmetry in the superconducting state without any known nearby magnetism. Recently, single crystals of LaNiGa$_2$ have been synthesized, revealing a nonsymmorphic Cmcm space group. Here, we report measurements of the electronic structure of LaNiGa$_2$ throughout the three-dimensional Brillouin zone (BZ) using angle-resolved photoemission spectroscopy (ARPES). Our findings show broad consistency with density functional theory (DFT) calculations and provide evidence for degeneracies in the electronic structure that are predicted from the space group. The calculations also predict four Fermi surfaces which cross the purported nodal plane and should therefore form two degenerate pairs. We report evidence for those predicted symmetry-enforced degeneracies as well as accidental near-degeneracies throughout the BZ. These degeneracies and near-degeneracies may play a role in the pairing mechanism of LaNiGa$_2$. Our results provide insight into the interplay between structure, Fermiology, and superconductivity in unconventional superconductors with nonsymmorphic space groups.

Submitted 18 December, 2023; originally announced December 2023.

Comments: Main: 8 pages, 6 figures. SI: 6 pages, 8 figures

arXiv:2312.10962 [pdf, other]

Observation of significant flavor-SU(3) breaking in the kaon wave function at $12~{\rm GeV}^2<Q^2<25~{\rm GeV}^2$ and discovery of the charmless decay $ψ(3770)\to K_S^0K_L^0$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (607 additional authors not shown)

Abstract: We present cross sections for the reaction $e^+e^-\to K_S^0K_L^0$ at center-of-mass energies ranging from 3.51 GeV to 4.95 GeV using data samples collected in the BESIII experiment, corresponding to a total integrated luminosity of 26.5 fb$^{-1}$. The ratio of neutral-to-charged kaon form factors at large momentum transfers ($12~{\rm GeV}^2<Q^2<25~{\rm GeV}^2$) is determined to be $0.21\pm 0.01$, which indicates a small but significant effect of flavor-SU(3) breaking in the kaon wave function, and consequently excludes the possibility that flavor-SU(3) breaking is the primary reason for the strong experimental violation of the pQCD prediction $|F(π^{\pm})|/|F(K^{\pm})|=f^2_π/f^2_{K}$, where $F(π^{\pm})$ and $F(K^{\pm})$ are the form factors, and $f_π$ and $f_{K}$ are the decay constants of charged pions and kaons, respectively. We also observe a significant signal for the charmless decay $ψ(3770)\to K_S^0K_L^0$ for the first time. Within a $1σ$ contour of the likelihood value, the branching fraction for $ψ(3770)\to K_S^0K_L^0$ is determined to be ${\cal B}=(2.63_{-1.59}^{+1.40})\times 10^{-5}$, and the relative phase between the continuum and $ψ(3770)$ amplitudes is $φ=(-0.39_{-0.10}^{+0.05})π$. The branching fraction is in good agreement with the $\mathcal{S}$- and $\mathcal{D}$-wave charmonia mixing scheme proposed in the interpretation of the "$ρπ$ puzzle" between $J/ψ$ and $ψ(3686)$ decays.

Submitted 18 December, 2023; originally announced December 2023.

Comments: 18 pages, 56 figures

arXiv:2312.09812 [pdf, other]

Structural Information Guided Multimodal Pre-training for Vehicle-centric Perception

Authors: Xiao Wang, Wentao Wu, Chenglong Li, Zhicheng Zhao, Zhe Chen, Yukai Shi, ** Tang

Abstract: Understanding vehicles in images is important for various applications such as intelligent transportation and self-driving systems. Existing vehicle-centric works typically pre-train models on large-scale classification datasets and then fine-tune them for specific downstream tasks. However, they neglect the specific characteristics of vehicle perception in different tasks and might thus lead to sub-optimal performance. To address this issue, we propose a novel vehicle-centric pre-training framework called VehicleMAE, which incorporates structural information, including the spatial structure from vehicle profile information and the semantic structure from informative high-level natural language descriptions, for effective masked vehicle appearance reconstruction. Specifically, we explicitly extract the sketch lines of vehicles as a form of spatial structure to guide vehicle reconstruction. The more comprehensive knowledge distilled from the large CLIP model, based on the similarity between paired/unpaired vehicle image-text samples, is further taken into consideration to help achieve a better understanding of vehicles. A large-scale dataset, termed Autobot1M, is built to pre-train our model; it contains about 1M vehicle images and 12,693 items of text information. Extensive experiments on four vehicle-based downstream tasks fully validate the effectiveness of our VehicleMAE. The source code and pre-trained models will be released at https://github.com/Event-AHU/VehicleMAE.

Submitted 15 December, 2023; originally announced December 2023.

Comments: Accepted by AAAI-2024

Showing 301–350 of 3,048 results for author: Shi, Y