-
DK-SLAM: Monocular Visual SLAM with Deep Keypoint Learning, Tracking and Loop-Closing
Authors:
Hao Qu,
Lilian Zhang,
Jun Mao,
Junbo Tie,
Xiaofeng He,
** Hu,
Yifei Shi,
Changhao Chen
Abstract:
The performance of visual SLAM in complex, real-world scenarios is often compromised by unreliable feature extraction and matching when using handcrafted features. Although deep learning-based local features excel at capturing high-level information and perform well on matching benchmarks, they struggle with generalization in continuous motion scenes, adversely affecting loop detection accuracy. O…
▽ More
The performance of visual SLAM in complex, real-world scenarios is often compromised by unreliable feature extraction and matching when using handcrafted features. Although deep learning-based local features excel at capturing high-level information and perform well on matching benchmarks, they struggle with generalization in continuous motion scenes, adversely affecting loop detection accuracy. Our system employs a Model-Agnostic Meta-Learning (MAML) strategy to optimize the training of keypoint extraction networks, enhancing their adaptability to diverse environments. Additionally, we introduce a coarse-to-fine feature tracking mechanism for learned keypoints. It begins with a direct method to approximate the relative pose between consecutive frames, followed by a feature matching method for refined pose estimation. To mitigate cumulative positioning errors, DK-SLAM incorporates a novel online learning module that utilizes binary features for loop closure detection. This module dynamically identifies loop nodes within a sequence, ensuring accurate and efficient localization. Experimental evaluations on publicly available datasets demonstrate that DK-SLAM outperforms leading traditional and learning based SLAM systems, such as ORB-SLAM3 and LIFT-SLAM. These results underscore the efficacy and robustness of our DK-SLAM in varied and challenging real-world environments.
△ Less
Submitted 25 June, 2024; v1 submitted 17 January, 2024;
originally announced January 2024.
-
Improved measurements of the Dalitz decays $η/η'\rightarrowγe^{+}e^{-}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (618 additional authors not shown)
Abstract:
Based on a data sample of 10 billion $J/ψ$ events collected with the BESIII detector, improved measurements of the Dalitz decays $η/η'\rightarrowγe^+e^-$ are performed, where the $η$ and $η'$ are produced through the radiative decays $J/ψ\rightarrowγη/η'$. The branching fractions of $η\rightarrowγe^+e^-$ and $η'\rightarrowγe^+e^-$ are measured to be $(7.07 \pm 0.05 \pm 0.23)\times10^{-3}$ and…
▽ More
Based on a data sample of 10 billion $J/ψ$ events collected with the BESIII detector, improved measurements of the Dalitz decays $η/η'\rightarrowγe^+e^-$ are performed, where the $η$ and $η'$ are produced through the radiative decays $J/ψ\rightarrowγη/η'$. The branching fractions of $η\rightarrowγe^+e^-$ and $η'\rightarrowγe^+e^-$ are measured to be $(7.07 \pm 0.05 \pm 0.23)\times10^{-3}$ and $(4.83\pm0.07\pm0.14)\times10^{-4}$, respectively. Within the single pole model, the parameter of electromagnetic transition form factor for $η\rightarrowγe^+e^-$ is determined to be $Λ_η=(0.749 \pm 0.027 \pm 0.007)~ {\rm GeV}/c^{2}$. Within the multi-pole model, we extract the electromagnetic transition form factors for $η'\rightarrowγe^+e^-$ to be $Λ_{η'} = (0.802 \pm 0.007\pm 0.008)~ {\rm GeV}/c^{2}$ and $γ_{η'} = (0.113\pm0.010\pm0.002)~ {\rm GeV}/c^{2}$. The results are consistent with both theoretical predictions and previous measurements. The characteristic sizes of the interaction regions for the $η$ and $η'$ are calculated to be $(0.645 \pm 0.023 \pm 0.007 )~ {\rm fm}$ and $(0.596 \pm 0.005 \pm 0.006)~ {\rm fm}$, respectively. In addition, we search for the dark photon in $η/η^\prime\rightarrowγe^{+}e^{-}$, and the upper limits of the branching fractions as a function of the dark photon are given at 90\% confidence level.
△ Less
Submitted 5 April, 2024; v1 submitted 17 January, 2024;
originally announced January 2024.
-
First study of antihyperon-nucleon scattering $\barΛp\rightarrow\barΛp$ and measurement of $Λp\rightarrowΛp$ cross section
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (634 additional authors not shown)
Abstract:
Using $(10.087\pm0.044)\times10^{9}$ $J/ψ$ events collected with the BESIII detector at the BEPCII storage ring, the processes $Λp\rightarrowΛp$ and $\barΛp\rightarrow\barΛp$ are studied, where the $Λ/\barΛ$ baryons are produced in the process $J/ψ\rightarrowΛ\barΛ$ and the protons are the hydrogen nuclei in the cooling oil of the beam pipe. Clear signals are observed for the two reactions. The cr…
▽ More
Using $(10.087\pm0.044)\times10^{9}$ $J/ψ$ events collected with the BESIII detector at the BEPCII storage ring, the processes $Λp\rightarrowΛp$ and $\barΛp\rightarrow\barΛp$ are studied, where the $Λ/\barΛ$ baryons are produced in the process $J/ψ\rightarrowΛ\barΛ$ and the protons are the hydrogen nuclei in the cooling oil of the beam pipe. Clear signals are observed for the two reactions. The cross sections in $-0.9\leq\rm{cos}θ_{Λ/\barΛ}\leq0.9$ are measured to be $σ(Λp\rightarrowΛp)=(12.2\pm1.6_{\rm{stat}}\pm1.1_{\rm{sys}})$ mb and $σ(\barΛ p\rightarrow\barΛ p)=(17.5\pm2.1_{\rm{stat}}\pm1.6_{\rm{sys}})$ mb at the $Λ/\barΛ$ momentum of $1.074$ GeV/$c$ within a range of $\pm0.017$ GeV/$c$, where the $θ_{Λ/\barΛ}$ are the scattering angles of the $Λ/\barΛ$ in the $Λp/\barΛp$ rest frames. Furthermore, the differential cross sections of the two reactions are also measured, where there is a slight tendency of forward scattering for $Λp\rightarrowΛp$, and a strong forward peak for $\barΛp\rightarrow\barΛp$. We present an approach to extract the total elastic cross sections by extrapolation. The study of $\barΛp\rightarrow\barΛp$ represents the first study of antihyperon-nucleon scattering, and these new measurements will serve as important inputs for the theoretical understanding of the (anti)hyperon-nucleon interaction.
△ Less
Submitted 18 May, 2024; v1 submitted 17 January, 2024;
originally announced January 2024.
-
Observation of $ψ(3686) \to Ω^- K^+ \barΞ^0 $+c.c
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (630 additional authors not shown)
Abstract:
Using $(27.12 \pm 0.14) \times 10^{8}$ $ψ(3686)$ events collected with the BESIII detector at BEPCII, the decay of $ψ(3686) \to Ω^- K^+ \barΞ^0 +c.c.$ is observed for the first time. The branching fraction of this decay is measured to be $\mathcal{B}_{ψ(3686) \to Ω^- K^+ \barΞ^0 +c.c.}=(2.78 \pm 0.40 \pm 0.18 ) \times 10^{-6}$, where the first uncertainty is statistical and the second is systemati…
▽ More
Using $(27.12 \pm 0.14) \times 10^{8}$ $ψ(3686)$ events collected with the BESIII detector at BEPCII, the decay of $ψ(3686) \to Ω^- K^+ \barΞ^0 +c.c.$ is observed for the first time. The branching fraction of this decay is measured to be $\mathcal{B}_{ψ(3686) \to Ω^- K^+ \barΞ^0 +c.c.}=(2.78 \pm 0.40 \pm 0.18 ) \times 10^{-6}$, where the first uncertainty is statistical and the second is systematic. Possible baryon excited states are searched for in this decay, but no evident intermediate state is observed with the current sample size.
△ Less
Submitted 15 April, 2024; v1 submitted 16 January, 2024;
originally announced January 2024.
-
Achieve Fairness without Demographics for Dermatological Disease Diagnosis
Authors:
Ching-Hao Chiu,
Yu-Jen Chen,
Yawen Wu,
Yiyu Shi,
Tsung-Yi Ho
Abstract:
In medical image diagnosis, fairness has become increasingly crucial. Without bias mitigation, deploying unfair AI would harm the interests of the underprivileged population and potentially tear society apart. Recent research addresses prediction biases in deep learning models concerning demographic groups (e.g., gender, age, and race) by utilizing demographic (sensitive attribute) information dur…
▽ More
In medical image diagnosis, fairness has become increasingly crucial. Without bias mitigation, deploying unfair AI would harm the interests of the underprivileged population and potentially tear society apart. Recent research addresses prediction biases in deep learning models concerning demographic groups (e.g., gender, age, and race) by utilizing demographic (sensitive attribute) information during training. However, many sensitive attributes naturally exist in dermatological disease images. If the trained model only targets fairness for a specific attribute, it remains unfair for other attributes. Moreover, training a model that can accommodate multiple sensitive attributes is impractical due to privacy concerns. To overcome this, we propose a method enabling fair predictions for sensitive attributes during the testing phase without using such information during training. Inspired by prior work highlighting the impact of feature entanglement on fairness, we enhance the model features by capturing the features related to the sensitive and target attributes and regularizing the feature entanglement between corresponding classes. This ensures that the model can only classify based on the features related to the target attribute without relying on features associated with sensitive attributes, thereby improving fairness and accuracy. Additionally, we use disease masks from the Segment Anything Model (SAM) to enhance the quality of the learned feature. Experimental results demonstrate that the proposed method can improve fairness in classification compared to state-of-the-art methods in two dermatological disease datasets.
△ Less
Submitted 15 January, 2024;
originally announced January 2024.
-
Adaptive Neural-Operator Backstep** Control of a Benchmark Hyperbolic PDE
Authors:
Maxence Lamarque,
Luke Bhan,
Yuanyuan Shi,
Miroslav Krstic
Abstract:
To stabilize PDEs, feedback controllers require gain kernel functions, which are themselves governed by PDEs. Furthermore, these gain-kernel PDEs depend on the PDE plants' functional coefficients. The functional coefficients in PDE plants are often unknown. This requires an adaptive approach to PDE control, i.e., an estimation of the plant coefficients conducted concurrently with control, where a…
▽ More
To stabilize PDEs, feedback controllers require gain kernel functions, which are themselves governed by PDEs. Furthermore, these gain-kernel PDEs depend on the PDE plants' functional coefficients. The functional coefficients in PDE plants are often unknown. This requires an adaptive approach to PDE control, i.e., an estimation of the plant coefficients conducted concurrently with control, where a separate PDE for the gain kernel must be solved at each timestep upon the update in the plant coefficient function estimate. Solving a PDE at each timestep is computationally expensive and a barrier to the implementation of real-time adaptive control of PDEs. Recently, results in neural operator (NO) approximations of functional map**s have been introduced into PDE control, for replacing the computation of the gain kernel with a neural network that is trained, once offline, and reused in real-time for rapid solution of the PDEs. In this paper, we present the first result on applying NOs in adaptive PDE control, presented for a benchmark 1-D hyperbolic PDE with recirculation. We establish global stabilization via Lyapunov analysis, in the plant and parameter error states, and also present an alternative approach, via passive identifiers, which avoids the strong assumptions on kernel differentiability. We then present numerical simulations demonstrating stability and observe speedups up to three orders of magnitude, highlighting the real-time efficacy of neural operators in adaptive control. Our code (Github) is made publicly available for future researchers.
△ Less
Submitted 15 January, 2024;
originally announced January 2024.
-
Massive Red Spiral Galaxies in SDSS-IV MaNGA Survey
Authors:
Jiantong Cui,
Qiusheng Gu,
Yong Shi
Abstract:
Massive red spiral galaxies (MRSGs) are supposed to be the possible progenitors of lenticular galaxies (S0s). We select a large sample of MRSGs ($M_*>10^{10.5}\rm M_{\odot}$) from MaNGA DR17 using the $g-r$ color vs. stellar mass diagram, along with control samples of blue spirals and S0s. Our main results are as follows: (1) After comparing the S$\rm \acute{e}$rsic index, concentration parameter,…
▽ More
Massive red spiral galaxies (MRSGs) are supposed to be the possible progenitors of lenticular galaxies (S0s). We select a large sample of MRSGs ($M_*>10^{10.5}\rm M_{\odot}$) from MaNGA DR17 using the $g-r$ color vs. stellar mass diagram, along with control samples of blue spirals and S0s. Our main results are as follows: (1) After comparing the S$\rm \acute{e}$rsic index, concentration parameter, asymmetry parameter distribution, size-mass relation and $Σ_1$ (stellar mass surface density within the central 1 kpc)-mass relation, we find MRSGs are similar to S0s and have more compact and symmetric structures than blue spirals. MRSGs also resemble S0s in Dn4000, metallicity, Mgb/$\rm \left \langle Fe \right \rangle$ and $V/σ$ radial profile. (2) By using MaNGA 2D spectra data, we separate the spatial regions into inner (R < 0.8$R_{\rm e}$) and outer (0.8$R_{\rm e}$ < R < 1.5$R_{\rm e}$) regions, and detect residual star formation in the outer regions of MRSGs. (3) When we select a sub-sample of MRSGs with NUV$-r$ > 5, we find that they are completely star-formation quenched in both inner and outer regions. Compared to optically selected MRSGs, NUV$-r$ selected MRSGs appear to be more concentrated and have more massive dark matter halos. The similarities between S0s and MRSGs suggest the possible evolutionary trend between MRSGs and S0s.
△ Less
Submitted 14 January, 2024;
originally announced January 2024.
-
First observation of the decay $Λ^+_c\to nK^{0}_{S}π^+π^0$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (630 additional authors not shown)
Abstract:
Based on 4.5 fb$^{-1}$ of $e^{+}e^{-}$ collision data accumulated at center-of-mass energies between $4599.53$ MeV and $4698.82$ MeV with the BESIII detector, the decay $Λ_{c}^{+}\to nK_{S}^{0}π^+π^0$ is observed for the first time with a significance of $9.2σ$. The branching fraction is measured to be $(0.85\pm0.13\pm0.03)\%$, where the first uncertainty is statistical and the second systematic,…
▽ More
Based on 4.5 fb$^{-1}$ of $e^{+}e^{-}$ collision data accumulated at center-of-mass energies between $4599.53$ MeV and $4698.82$ MeV with the BESIII detector, the decay $Λ_{c}^{+}\to nK_{S}^{0}π^+π^0$ is observed for the first time with a significance of $9.2σ$. The branching fraction is measured to be $(0.85\pm0.13\pm0.03)\%$, where the first uncertainty is statistical and the second systematic, which differs from the theoretical prediction based on isospin by 4.4$σ$. This indicates that there may be resonant contributions or some unknown dynamics in this decay.
△ Less
Submitted 28 March, 2024; v1 submitted 11 January, 2024;
originally announced January 2024.
-
Between Lines of Code: Unraveling the Distinct Patterns of Machine and Human Programmers
Authors:
Yuling Shi,
Hongyu Zhang,
Chengcheng Wan,
Xiaodong Gu
Abstract:
Large language models have catalyzed an unprecedented wave in code generation. While achieving significant advances, they blur the distinctions between machine- and human-authored source code, causing integrity and authenticity issues of software artifacts. Previous methods such as DetectGPT have proven effective in discerning machine-generated texts, but they do not identify and harness the uniqu…
▽ More
Large language models have catalyzed an unprecedented wave in code generation. While achieving significant advances, they blur the distinctions between machine- and human-authored source code, causing integrity and authenticity issues of software artifacts. Previous methods such as DetectGPT have proven effective in discerning machine-generated texts, but they do not identify and harness the unique patterns of machine-generated code. Thus, its applicability falters when applied to code. In this paper, we carefully study the specific patterns that characterize machine- and human-authored code. Through a rigorous analysis of code attributes such as lexical diversity, conciseness, and naturalness, we expose unique patterns inherent to each source. We particularly notice that the syntactic segmentation of code is a critical factor in identifying its provenance. Based on our findings, we propose DetectCodeGPT, a novel method for detecting machine-generated code, which improves DetectGPT by capturing the distinct stylized patterns of code. Diverging from conventional techniques that depend on external LLMs for perturbations, DetectCodeGPT perturbs the code corpus by strategically inserting spaces and newlines, ensuring both efficacy and efficiency. Experiment results show that our approach significantly outperforms state-of-the-art techniques in detecting machine-generated code.
△ Less
Submitted 23 March, 2024; v1 submitted 12 January, 2024;
originally announced January 2024.
-
SCARIF: Towards Carbon Modeling of Cloud Servers with Accelerators
Authors:
Shixin Ji,
Zhuo** Yang,
Xingzhen Chen,
Stephen Cahoon,
**gtong Hu,
Yiyu Shi,
Alex K. Jones,
Peipei Zhou
Abstract:
Embodied carbon has been widely reported as a significant component in the full system lifecycle of various computing systems' green house gas emissions. Many efforts have been undertaken to quantify the elements that comprise this embodied carbon, from tools that evaluate semiconductor manufacturing to those that can quantify different elements of the computing system from commercial and academic…
▽ More
Embodied carbon has been widely reported as a significant component in the full system lifecycle of various computing systems' green house gas emissions. Many efforts have been undertaken to quantify the elements that comprise this embodied carbon, from tools that evaluate semiconductor manufacturing to those that can quantify different elements of the computing system from commercial and academic sources. However, these tools cannot easily reproduce results reported by server vendors' product carbon reports and the accuracy can vary substantially due to various assumptions. Furthermore, attempts to determine green house gas contributions using bottom-up methodologies often do not agree with system-level studies and are hard to rectify. Nonetheless, given there is a need to consider all contributions to green house gas emissions in datacenters, we propose SCARIF, the Server Carbon including Accelerator Reporter with Intelligence-based Formulation tool. SCARIF has three main contributions: (1) We first collect reported carbon cost data from server vendors and design statistic models to predict the embodied carbon cost so that users can get the embodied carbon cost for their server configurations. (2) We provide embodied carbon cost if users configure servers with accelerators including GPUs, and FPGAs. (3) By using case studies, we show that certain design choices of data center management might flip by the insight and observation from using SCARIF. Thus, SCARIF provides an opportunity for large-scale datacenter and hyperscaler design. We release SCARIF as an open-source tool at https://github.com/arc-research-lab/SCARIF.
△ Less
Submitted 22 May, 2024; v1 submitted 11 January, 2024;
originally announced January 2024.
-
Learning Generalizable Models via Disentangling Spurious and Enhancing Potential Correlations
Authors:
Na Wang,
Lei Qi,
**tao Guo,
Yinghuan Shi,
Yang Gao
Abstract:
Domain generalization (DG) intends to train a model on multiple source domains to ensure that it can generalize well to an arbitrary unseen target domain. The acquisition of domain-invariant representations is pivotal for DG as they possess the ability to capture the inherent semantic information of the data, mitigate the influence of domain shift, and enhance the generalization capability of the…
▽ More
Domain generalization (DG) intends to train a model on multiple source domains to ensure that it can generalize well to an arbitrary unseen target domain. The acquisition of domain-invariant representations is pivotal for DG as they possess the ability to capture the inherent semantic information of the data, mitigate the influence of domain shift, and enhance the generalization capability of the model. Adopting multiple perspectives, such as the sample and the feature, proves to be effective. The sample perspective facilitates data augmentation through data manipulation techniques, whereas the feature perspective enables the extraction of meaningful generalization features. In this paper, we focus on improving the generalization ability of the model by compelling it to acquire domain-invariant representations from both the sample and feature perspectives by disentangling spurious correlations and enhancing potential correlations. 1) From the sample perspective, we develop a frequency restriction module, guiding the model to focus on the relevant correlations between object features and labels, thereby disentangling spurious correlations. 2) From the feature perspective, the simple Tail Interaction module implicitly enhances potential correlations among all samples from all source domains, facilitating the acquisition of domain-invariant representations across multiple domains for the model. The experimental results show that Convolutional Neural Networks (CNNs) or Multi-Layer Perceptrons (MLPs) with a strong baseline embedded with these two modules can achieve superior results, e.g., an average accuracy of 92.30% on Digits-DG.
△ Less
Submitted 11 January, 2024;
originally announced January 2024.
-
U-SWIM: Universal Selective Write-Verify for Computing-in-Memory Neural Accelerators
Authors:
Zheyu Yan,
Xiaobo Sharon Hu,
Yiyu Shi
Abstract:
Architectures that incorporate Computing-in-Memory (CiM) using emerging non-volatile memory (NVM) devices have become strong contenders for deep neural network (DNN) acceleration due to their impressive energy efficiency. Yet, a significant challenge arises when using these emerging devices: they can show substantial variations during the weight-map** process. This can severely impact DNN accura…
▽ More
Architectures that incorporate Computing-in-Memory (CiM) using emerging non-volatile memory (NVM) devices have become strong contenders for deep neural network (DNN) acceleration due to their impressive energy efficiency. Yet, a significant challenge arises when using these emerging devices: they can show substantial variations during the weight-map** process. This can severely impact DNN accuracy if not mitigated. A widely accepted remedy for imperfect weight map** is the iterative write-verify approach, which involves verifying conductance values and adjusting devices if needed. In all existing publications, this procedure is applied to every individual device, resulting in a significant programming time overhead. In our research, we illustrate that only a small fraction of weights need this write-verify treatment for the corresponding devices and the DNN accuracy can be preserved, yielding a notable programming acceleration. Building on this, we introduce USWIM, a novel method based on the second derivative. It leverages a single iteration of forward and backpropagation to pinpoint the weights demanding write-verify. Through extensive tests on diverse DNN designs and datasets, USWIM manifests up to a 10x programming acceleration against the traditional exhaustive write-verify method, all while maintaining a similar accuracy level. Furthermore, compared to our earlier SWIM technique, USWIM excels, showing a 7x speedup when dealing with devices exhibiting non-uniform variations.
△ Less
Submitted 11 December, 2023;
originally announced January 2024.
-
FADI-AEC: Fast Score Based Diffusion Model Guided by Far-end Signal for Acoustic Echo Cancellation
Authors:
Yang Liu,
Li Wan,
Yun Li,
Yiteng Huang,
Ming Sun,
James Luan,
Yangyang Shi,
Xin Lei
Abstract:
Despite the potential of diffusion models in speech enhancement, their deployment in Acoustic Echo Cancellation (AEC) has been restricted. In this paper, we propose DI-AEC, pioneering a diffusion-based stochastic regeneration approach dedicated to AEC. Further, we propose FADI-AEC, fast score-based diffusion AEC framework to save computational demands, making it favorable for edge devices. It stan…
▽ More
Despite the potential of diffusion models in speech enhancement, their deployment in Acoustic Echo Cancellation (AEC) has been restricted. In this paper, we propose DI-AEC, pioneering a diffusion-based stochastic regeneration approach dedicated to AEC. Further, we propose FADI-AEC, fast score-based diffusion AEC framework to save computational demands, making it favorable for edge devices. It stands out by running the score model once per frame, achieving a significant surge in processing efficiency. Apart from that, we introduce a novel noise generation technique where far-end signals are utilized, incorporating both far-end and near-end signals to refine the score model's accuracy. We test our proposed method on the ICASSP2023 Microsoft deep echo cancellation challenge evaluation dataset, where our method outperforms some of the end-to-end methods and other diffusion based echo cancellation methods.
△ Less
Submitted 8 January, 2024;
originally announced January 2024.
-
MirrorDiffusion: Stabilizing Diffusion Process in Zero-shot Image Translation by Prompts Redescription and Beyond
Authors:
Yupei Lin,
Xiaoyu Xian,
Yukai Shi,
Liang Lin
Abstract:
Recently, text-to-image diffusion models become a new paradigm in image processing fields, including content generation, image restoration and image-to-image translation. Given a target prompt, Denoising Diffusion Probabilistic Models (DDPM) are able to generate realistic yet eligible images. With this appealing property, the image translation task has the potential to be free from target image sa…
▽ More
Recently, text-to-image diffusion models become a new paradigm in image processing fields, including content generation, image restoration and image-to-image translation. Given a target prompt, Denoising Diffusion Probabilistic Models (DDPM) are able to generate realistic yet eligible images. With this appealing property, the image translation task has the potential to be free from target image samples for supervision. By using a target text prompt for domain adaption, the diffusion model is able to implement zero-shot image-to-image translation advantageously. However, the sampling and inversion processes of DDPM are stochastic, and thus the inversion process often fail to reconstruct the input content. Specifically, the displacement effect will gradually accumulated during the diffusion and inversion processes, which led to the reconstructed results deviating from the source domain. To make reconstruction explicit, we propose a prompt redescription strategy to realize a mirror effect between the source and reconstructed image in the diffusion model (MirrorDiffusion). More specifically, a prompt redescription mechanism is investigated to align the text prompts with latent code at each time step of the Denoising Diffusion Implicit Models (DDIM) inversion to pursue a structure-preserving reconstruction. With the revised DDIM inversion, MirrorDiffusion is able to realize accurate zero-shot image translation by editing optimized text prompts and latent code. Extensive experiments demonstrate that MirrorDiffusion achieves superior performance over the state-of-the-art methods on zero-shot image translation benchmarks by clear margins and practical model stability.
△ Less
Submitted 6 January, 2024;
originally announced January 2024.
-
Fairness-Aware Job Scheduling for Multi-Job Federated Learning
Authors:
Yuxin Shi,
Han Yu
Abstract:
Federated learning (FL) enables multiple data owners (a.k.a. FL clients) to collaboratively train machine learning models without disclosing sensitive private data. Existing FL research mostly focuses on the monopoly scenario in which a single FL server selects a subset of FL clients to update their local models in each round of training. In practice, there can be multiple FL servers simultaneousl…
▽ More
Federated learning (FL) enables multiple data owners (a.k.a. FL clients) to collaboratively train machine learning models without disclosing sensitive private data. Existing FL research mostly focuses on the monopoly scenario in which a single FL server selects a subset of FL clients to update their local models in each round of training. In practice, there can be multiple FL servers simultaneously trying to select clients from the same pool. In this paper, we propose a first-of-its-kind Fairness-aware Federated Job Scheduling (FairFedJS) approach to bridge this gap. Based on Lyapunov optimization, it ensures fair allocation of high-demand FL client datasets to FL jobs in need of them, by jointly considering the current demand and the job payment bids, in order to prevent prolonged waiting. Extensive experiments comparing FairFedJS against four state-of-the-art approaches on two datasets demonstrate its significant advantages. It outperforms the best baseline by 31.9% and 1.0% on average in terms of scheduling fairness and convergence time, respectively, while achieving comparable test accuracy.
△ Less
Submitted 7 February, 2024; v1 submitted 5 January, 2024;
originally announced January 2024.
-
CscK metrics near the canonical class
Authors:
Bin Guo,
Wangjian Jian,
Yalong Shi,
Jian Song
Abstract:
Let $X$ be a Kähler manifold with semi-ample canonical bundle $K_X$. It is proved by Jian-Shi-Song that for any Kähler class $γ$, there exists $δ>0$ such that for all $t\in (0, δ)$ there exists a unique cscK metric $g_t$ in $K_X+ t γ$. In this paper, we prove that $\{ (X, g_t) \}_{ t\in (0, δ)} $ have uniformly bounded Kähler potentials, volume forms and diameters. As a consequence, these metric s…
▽ More
Let $X$ be a Kähler manifold with semi-ample canonical bundle $K_X$. It is proved by Jian-Shi-Song that for any Kähler class $γ$, there exists $δ>0$ such that for all $t\in (0, δ)$ there exists a unique cscK metric $g_t$ in $K_X+ t γ$. In this paper, we prove that $\{ (X, g_t) \}_{ t\in (0, δ)} $ have uniformly bounded Kähler potentials, volume forms and diameters. As a consequence, these metric spaces are pre-compact in the Gromov-Hausdorff sense.
△ Less
Submitted 5 January, 2024;
originally announced January 2024.
-
Moving-Horizon Estimators for Hyperbolic and Parabolic PDEs in 1-D
Authors:
Luke Bhan,
Yuanyuan Shi,
Iasson Karafyllis,
Miroslav Krstic,
James B. Rawlings
Abstract:
Observers for PDEs are themselves PDEs. Therefore, producing real time estimates with such observers is computationally burdensome. For both finite-dimensional and ODE systems, moving-horizon estimators (MHE) are operators whose output is the state estimate, while their inputs are the initial state estimate at the beginning of the horizon as well as the measured output and input signals over the m…
▽ More
Observers for PDEs are themselves PDEs. Therefore, producing real time estimates with such observers is computationally burdensome. For both finite-dimensional and ODE systems, moving-horizon estimators (MHE) are operators whose output is the state estimate, while their inputs are the initial state estimate at the beginning of the horizon as well as the measured output and input signals over the moving time horizon. In this paper we introduce MHEs for PDEs which remove the need for a numerical solution of an observer PDE in real time. We accomplish this using the PDE backstep** method which, for certain classes of both hyperbolic and parabolic PDEs, produces moving-horizon state estimates explicitly. Precisely, to explicitly produce the state estimates, we employ a backstep** transformation of a hard-to-solve observer PDE into a target observer PDE, which is explicitly solvable. The MHEs we propose are not new observer designs but simply the explicit MHE realizations, over a moving horizon of arbitrary length, of the existing backstep** observers. Our PDE MHEs lack the optimality of the MHEs that arose as duals of MPC, but they are given explicitly, even for PDEs. In the paper we provide explicit formulae for MHEs for both hyperbolic and parabolic PDEs, as well as simulation results that illustrate theoretically guaranteed convergence of the MHEs.
△ Less
Submitted 4 January, 2024;
originally announced January 2024.
-
Green functions for GJMS operators on spheres, Gegenbauer polynomials and rigidity theorems
Authors:
Xuezhang Chen,
Yalong Shi
Abstract:
We derive explicit representation formulae of Green functions for GJMS operators on $n$-spheres, including the fractional ones. These formulae not only have natural geometric interpretations concerning the extrinsic geometry of the round sphere, but also reflect the spherical rigidity among closed embedded hypersurfaces in $\mathbb{R}^{n+1}$.
We derive explicit representation formulae of Green functions for GJMS operators on $n$-spheres, including the fractional ones. These formulae not only have natural geometric interpretations concerning the extrinsic geometry of the round sphere, but also reflect the spherical rigidity among closed embedded hypersurfaces in $\mathbb{R}^{n+1}$.
△ Less
Submitted 20 January, 2024; v1 submitted 4 January, 2024;
originally announced January 2024.
-
Disorder-induced topological pum** on a superconducting quantum processor
Authors:
Yu Liu,
Yu-Ran Zhang,
Yun-Hao Shi,
Tao Liu,
Congwei Lu,
Yong-Yi Wang,
Hao Li,
Tian-Ming Li,
Cheng-Lin Deng,
Si-Yun Zhou,
Tong Liu,
Jia-Chi Zhang,
Gui-Han Liang,
Zheng-Yang Mei,
Wei-Guo Ma,
Hao-Tian Liu,
Zheng-He Liu,
Chi-Tong Chen,
Kaixuan Huang,
Xiaohui Song,
SP Zhao,
Ye Tian,
Zhongcheng Xiang,
Dongning Zheng,
Franco Nori
, et al. (2 additional authors not shown)
Abstract:
Thouless pum**, a dynamical version of the integer quantum Hall effect, represents the quantized charge pumped during an adiabatic cyclic evolution. Here we report experimental observations of nontrivial topological pum** that is induced by disorder even during a topologically trivial pum** trajectory. With a 41-qubit superconducting quantum processor, we develop a Floquet engineering techni…
▽ More
Thouless pum**, a dynamical version of the integer quantum Hall effect, represents the quantized charge pumped during an adiabatic cyclic evolution. Here we report experimental observations of nontrivial topological pum** that is induced by disorder even during a topologically trivial pum** trajectory. With a 41-qubit superconducting quantum processor, we develop a Floquet engineering technique to realize cycles of adiabatic pum** by simultaneously varying the on-site potentials and the hop** couplings. We demonstrate Thouless pum** in the presence of disorder and show its breakdown as the strength of disorder increases. Moreover, we observe two types of topological pum** that are induced by on-site potential disorder and hop** disorder, respectively. Especially, an intrinsic topological pump that is induced by quasi-periodic hop** disorder has never been experimentally realized before. Our highly controllable system provides a valuable quantum simulating platform for studying various aspects of topological physics in the presence of disorder.
△ Less
Submitted 2 January, 2024;
originally announced January 2024.
-
Partial Wave Analysis of $J/ψ\rightarrow γγφ$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (603 additional authors not shown)
Abstract:
Using a sample of $(10087\pm44)\times10^{6}$ $J/ψ$ events collected with the BESIII detector at the BEPCII collider, a partial wave analysis on the decay $γγφ$ is performed to investigate the intermediate resonances in $J/ψ\rightarrowγX, X\rightarrowγφ$. The resonances $f_{1}(1285)$, $η(1405)$, $f_{1}(1420)$, $f_{1}(1510)$, $f_{2}(1525)$, $X(1835)$, $f_{2}(1950)$, $f_{2}(2010)$, $f_{0}(2200)$ and…
▽ More
Using a sample of $(10087\pm44)\times10^{6}$ $J/ψ$ events collected with the BESIII detector at the BEPCII collider, a partial wave analysis on the decay $γγφ$ is performed to investigate the intermediate resonances in $J/ψ\rightarrowγX, X\rightarrowγφ$. The resonances $f_{1}(1285)$, $η(1405)$, $f_{1}(1420)$, $f_{1}(1510)$, $f_{2}(1525)$, $X(1835)$, $f_{2}(1950)$, $f_{2}(2010)$, $f_{0}(2200)$ and $η_{c}$ are observed with statistical significance greater than 5$σ$. The product branching fractions $\mathcal{B}(J/ψ\rightarrowγX, X\rightarrow γφ)$ are reported. The resonance parameters of $η(1405)$ and $X(1835)$ are also measured.
△ Less
Submitted 1 January, 2024;
originally announced January 2024.
-
Observation of $\mathcal R(3810)$ in $e^+e^-\rightarrow {\rm hadrons}$ and Improved Measurements of the Resonance Parameters of $\mathcal R(3760)$ and $\mathcal R(3780)$
Authors:
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
J. Bloms,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (596 additional authors not shown)
Abstract:
We report the measurement of the cross sections for $e^+e^-\rightarrow {\rm hadrons}$ at center-of-mass (c.m.) energies from 3.645 to 3.871 GeV. We observe a new resonance $\mathcal R(3810)$ in the cross sections for the first time, and observe the $\mathcal R(3760)$ resonance with high significance in the cross sections. The $\mathcal R(3810)$ has a mass of $(3804.5 \pm 0.9 \pm 0.9)$ ~MeV/$c^2$,…
▽ More
We report the measurement of the cross sections for $e^+e^-\rightarrow {\rm hadrons}$ at center-of-mass (c.m.) energies from 3.645 to 3.871 GeV. We observe a new resonance $\mathcal R(3810)$ in the cross sections for the first time, and observe the $\mathcal R(3760)$ resonance with high significance in the cross sections. The $\mathcal R(3810)$ has a mass of $(3804.5 \pm 0.9 \pm 0.9)$ ~MeV/$c^2$, a total width of $(5.4 \pm 3.5 \pm 3.2)$~MeV, and an electronic partial width of $(19.4 \pm 7.4 \pm 12.1)$~eV. Its significance is $7.7σ$. The $\mathcal R(3810)$ could be interpreted as a hadro-charmonium resonance predicted by Quantum Chromodynamics (QCD). In addition, we measure the mass $(3751.9\pm 3.8\pm 2.8)$ ~MeV/$c^2$, the total width $(32.8 \pm 5.8 \pm 8.7)$~MeV, and the electronic partial width $(184\pm 75\pm 86)$~eV with improved precision for the $\mathcal R(3760)$. Furthermore, for the $\mathcal R(3780)$ we measure the mass $(3778.7\pm 0.5\pm 0.3)$ ~MeV/$c^2$ and total width $(20.3 \pm 0.8 \pm 1.7)$~MeV with improved precision, and the electronic partial width $(265\pm 69\pm 83)$~eV. The $\mathcal R(3780)$ can be interpreted as the $1^3D_1$ state of charmonium. Its mass and total width differ significantly from the corresponding fitted values given by the Particle Data Group in 2022 by 7.1 and 3.2 times the uncertainties for $ψ(3770)$, respectively. $ψ(3770)$ has been interpreted as the $1^3D_1$ state for 45 years.
△ Less
Submitted 30 December, 2023;
originally announced January 2024.
-
GeoGalactica: A Scientific Large Language Model in Geoscience
Authors:
Zhouhan Lin,
Cheng Deng,
Le Zhou,
Tianhang Zhang,
Yi Xu,
Yutong Xu,
Zhongmou He,
Yuanyuan Shi,
Beiya Dai,
Yunchong Song,
Boyi Zeng,
Qiyuan Chen,
Yuxun Miao,
Bo Xue,
Shu Wang,
Luoyi Fu,
Weinan Zhang,
Junxian He,
Yunqiang Zhu,
Xinbing Wang,
Chenghu Zhou
Abstract:
Large language models (LLMs) have achieved huge success for their general knowledge and ability to solve a wide spectrum of tasks in natural language processing (NLP). Due to their impressive abilities, LLMs have shed light on potential inter-discipline applications to foster scientific discoveries of a specific domain by using artificial intelligence (AI for science, AI4S). In the meantime, utili…
▽ More
Large language models (LLMs) have achieved huge success for their general knowledge and ability to solve a wide spectrum of tasks in natural language processing (NLP). Due to their impressive abilities, LLMs have shed light on potential inter-discipline applications to foster scientific discoveries of a specific domain by using artificial intelligence (AI for science, AI4S). In the meantime, utilizing NLP techniques in geoscience research and practice is wide and convoluted, contributing from knowledge extraction and document classification to question answering and knowledge discovery. In this work, we take the initial step to leverage LLM for science, through a rather straightforward approach. We try to specialize an LLM into geoscience, by further pre-training the model with a vast amount of texts in geoscience, as well as supervised fine-tuning (SFT) the resulting model with our custom collected instruction tuning dataset. These efforts result in a model GeoGalactica consisting of 30 billion parameters. To our best knowledge, it is the largest language model for the geoscience domain. More specifically, GeoGalactica is from further pre-training of Galactica. We train GeoGalactica over a geoscience-related text corpus containing 65 billion tokens, preserving as the largest geoscience-specific text corpus. Then we fine-tune the model with 1 million pairs of instruction-tuning data consisting of questions that demand professional geoscience knowledge to answer. In this technical report, we will illustrate in detail all aspects of GeoGalactica, including data collection, data cleaning, base model selection, pre-training, SFT, and evaluation. We open-source our data curation tools and the checkpoints of GeoGalactica during the first 3/4 of pre-training.
△ Less
Submitted 13 April, 2024; v1 submitted 31 December, 2023;
originally announced January 2024.
-
HybridGait: A Benchmark for Spatial-Temporal Cloth-Changing Gait Recognition with Hybrid Explorations
Authors:
Yilan Dong,
Chunlin Yu,
Ruiyang Ha,
Ye Shi,
Yuexin Ma,
Lan Xu,
Yanwei Fu,
**gya Wang
Abstract:
Existing gait recognition benchmarks mostly include minor clothing variations in the laboratory environments, but lack persistent changes in appearance over time and space. In this paper, we propose the first in-the-wild benchmark CCGait for cloth-changing gait recognition, which incorporates diverse clothing changes, indoor and outdoor scenes, and multi-modal statistics over 92 days. To further a…
▽ More
Existing gait recognition benchmarks mostly include minor clothing variations in the laboratory environments, but lack persistent changes in appearance over time and space. In this paper, we propose the first in-the-wild benchmark CCGait for cloth-changing gait recognition, which incorporates diverse clothing changes, indoor and outdoor scenes, and multi-modal statistics over 92 days. To further address the coupling effect of clothing and viewpoint variations, we propose a hybrid approach HybridGait that exploits both temporal dynamics and the projected 2D information of 3D human meshes. Specifically, we introduce a Canonical Alignment Spatial-Temporal Transformer (CA-STT) module to encode human joint position-aware features, and fully exploit 3D dense priors via a Silhouette-guided Deformation with 3D-2D Appearance Projection (SilD) strategy. Our contributions are twofold: we provide a challenging benchmark CCGait that captures realistic appearance changes across an expanded and space, and we propose a hybrid framework HybridGait that outperforms prior works on CCGait and Gait3D benchmarks. Our project page is available at https://github.com/HCVLab/HybridGait.
△ Less
Submitted 30 December, 2023;
originally announced January 2024.
-
Absence of Weyl nodes in EuCd$_2$As$_2$ revealed by the carrier density dependence of the anomalous Hall effect
Authors:
Yue Shi,
Zhaoyu Liu,
Logan A. Burnett,
Seokhyeong Lee,
Chaowei Hu,
Qianni Jiang,
Jiaqi Cai,
Xiaodong Xu,
Mo Li,
Cheng-Chien Chen,
Jiun-Haw Chu
Abstract:
The antiferromagnetic layered compound EuCd$_2$As$_2$ is widely considered as a leading candidate of ideal Weyl semimetal, featuring a single pair of Weyl nodes in its field-induced ferromagnetic (FM) state. Nevertheless, this view has recently been challenged by an optical spectroscopy study, which suggests that it is a magnetic semiconductor. In this study, we have successfully synthesized highl…
▽ More
The antiferromagnetic layered compound EuCd$_2$As$_2$ is widely considered as a leading candidate of ideal Weyl semimetal, featuring a single pair of Weyl nodes in its field-induced ferromagnetic (FM) state. Nevertheless, this view has recently been challenged by an optical spectroscopy study, which suggests that it is a magnetic semiconductor. In this study, we have successfully synthesized highly insulating EuCd$_2$As$_2$ crystals with carrier density reaching as low as $2\times 10^{15}$ $\text{cm}^{-3}$. The magneto-transport measurements revealed a progressive decrease of the anomalous Hall conductivity (AHC) by several orders of magnitude as the carrier density decreases. This behavior contradicts with what is expected from the intrinsic AHC generated by the Weyl points, which is independent of carrier density as the Fermi level approaches the charge neutrality point. In contrast, the scaling relationship between AHC and longitudinal conductivity aligns with the characteristics of variable range hop** insulators. Our results suggest that EuCd$_2$As$_2$ is a magnetic semiconductor rather than a topological Weyl semimetal.
△ Less
Submitted 27 February, 2024; v1 submitted 29 December, 2023;
originally announced January 2024.
-
Light baryon in three quark picture light front approach and its application: hyperon weak radiative decays
Authors:
Zhi-Peng Xing,
Yu Ji Shi,
** Sun,
Zhen-Xing Zhao
Abstract:
Motivated by recent experimental data on $Σ^+\to pγ$ at BESIII, we investigate a class of hyperon weak radiative decays. To estimate these processes, in our research, we employ a new type of light-front quark model with a three-quark picture for octet baryons. In the three-quark picture, with the use of $SU(3)_f$ and spin symmetries, we present a general form of the light front wave function for e…
▽ More
Motivated by recent experimental data on $Σ^+\to pγ$ at BESIII, we investigate a class of hyperon weak radiative decays. To estimate these processes, in our research, we employ a new type of light-front quark model with a three-quark picture for octet baryons. In the three-quark picture, with the use of $SU(3)_f$ and spin symmetries, we present a general form of the light front wave function for each octet baryon. By including contributions from the penguin diagram and W exchange diagram, we perform a complete calculation on the branching ratios ($Br$) and the asymmetry parameter ($α$) for hyperon weak radiative decay processes. Our results are helpful for discovering additional hyperon weak radiative decay processes in experimental facilities, and our research will promote the theoretical study of baryons.
△ Less
Submitted 8 January, 2024; v1 submitted 29 December, 2023;
originally announced December 2023.
-
Robust TOA-based Localization with Inaccurate Anchors for MANET
Authors:
Xinkai Yu,
Yang Zheng,
Min Sheng,
Yan Shi,
Jiandong Li
Abstract:
Accurate node localization is vital for mobile ad hoc networks (MANETs). Current methods like Time of Arrival (TOA) can estimate node positions using imprecise baseplates and achieve the Cramér-Rao lower bound (CRLB) accuracy. In multi-hop MANETs, some nodes lack direct links to base anchors, depending on neighbor nodes as dynamic anchors for chain localization. However, the dynamic nature of MANE…
▽ More
Accurate node localization is vital for mobile ad hoc networks (MANETs). Current methods like Time of Arrival (TOA) can estimate node positions using imprecise baseplates and achieve the Cramér-Rao lower bound (CRLB) accuracy. In multi-hop MANETs, some nodes lack direct links to base anchors, depending on neighbor nodes as dynamic anchors for chain localization. However, the dynamic nature of MANETs challenges TOA's robustness due to the availability and accuracy of base anchors, coupled with ranging errors. To address the issue of cascading positioning error divergence, we first derive the CRLB for any primary node in MANETs as a metric to tackle localization error in cascading scenarios. Second, we propose an advanced two-step TOA method based on CRLB which is able to approximate target node's CRLB with only local neighbor information. Finally, simulation results confirm the robustness of our algorithm, achieving CRLB-level accuracy for small ranging errors and maintaining precision for larger errors compared to existing TOA methods.
△ Less
Submitted 29 December, 2023;
originally announced December 2023.
-
Securing NextG Systems against Poisoning Attacks on Federated Learning: A Game-Theoretic Solution
Authors:
Yalin E. Sagduyu,
Tugba Erpek,
Yi Shi
Abstract:
This paper studies the poisoning attack and defense interactions in a federated learning (FL) system, specifically in the context of wireless signal classification using deep learning for next-generation (NextG) communications. FL collectively trains a global model without the need for clients to exchange their data samples. By leveraging geographically dispersed clients, the trained global model…
▽ More
This paper studies the poisoning attack and defense interactions in a federated learning (FL) system, specifically in the context of wireless signal classification using deep learning for next-generation (NextG) communications. FL collectively trains a global model without the need for clients to exchange their data samples. By leveraging geographically dispersed clients, the trained global model can be used for incumbent user identification, facilitating spectrum sharing. However, in this distributed learning system, the presence of malicious clients introduces the risk of poisoning the training data to manipulate the global model through falsified local model exchanges. To address this challenge, a proactive defense mechanism is employed in this paper to make informed decisions regarding the admission or rejection of clients participating in FL systems. Consequently, the attack-defense interactions are modeled as a game, centered around the underlying admission and poisoning decisions. First, performance bounds are established, encompassing the best and worst strategies for attackers and defenders. Subsequently, the attack and defense utilities are characterized within the Nash equilibrium, where no player can unilaterally improve its performance given the fixed strategies of others. The results offer insights into novel operational modes that safeguard FL systems against poisoning attacks by quantifying the performance of both attacks and defenses in the context of NextG communications.
△ Less
Submitted 28 December, 2023;
originally announced December 2023.
-
Search for a massless particle beyond the Standard Model in the $Σ^+\rightarrow p+{\rm invisible}$ decay
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (634 additional authors not shown)
Abstract:
A massless particle beyond the Standard Model is searched for in the two-body decay $Σ^+\rightarrow p+{\rm invisible}$ using $(1.0087\pm0.0044)\times10^{10}$ $J/ψ$ events collected at a center-of-mass energy of $\sqrt{s}=3.097$ GeV with the BESIII detector at the BEPCII collider. No significant signal is observed, and the upper limit on the branching fraction $B(Σ^+\rightarrow p+{\rm invisible})$…
▽ More
A massless particle beyond the Standard Model is searched for in the two-body decay $Σ^+\rightarrow p+{\rm invisible}$ using $(1.0087\pm0.0044)\times10^{10}$ $J/ψ$ events collected at a center-of-mass energy of $\sqrt{s}=3.097$ GeV with the BESIII detector at the BEPCII collider. No significant signal is observed, and the upper limit on the branching fraction $B(Σ^+\rightarrow p+{\rm invisible})$ is determined to be $3.2\times10^{-5}$ at the 90% confidence level. This is the first search for a flavor-changing neutral current process with missing energy in hyperon decays which plays an important role in constraining new physics models.
△ Less
Submitted 5 April, 2024; v1 submitted 28 December, 2023;
originally announced December 2023.
-
PG-LBO: Enhancing High-Dimensional Bayesian Optimization with Pseudo-Label and Gaussian Process Guidance
Authors:
Taicai Chen,
Yue Duan,
Dong Li,
Lei Qi,
Yinghuan Shi,
Yang Gao
Abstract:
Variational Autoencoder based Bayesian Optimization (VAE-BO) has demonstrated its excellent performance in addressing high-dimensional structured optimization problems. However, current mainstream methods overlook the potential of utilizing a pool of unlabeled data to construct the latent space, while only concentrating on designing sophisticated models to leverage the labeled data. Despite their…
▽ More
Variational Autoencoder based Bayesian Optimization (VAE-BO) has demonstrated its excellent performance in addressing high-dimensional structured optimization problems. However, current mainstream methods overlook the potential of utilizing a pool of unlabeled data to construct the latent space, while only concentrating on designing sophisticated models to leverage the labeled data. Despite their effective usage of labeled data, these methods often require extra network structures, additional procedure, resulting in computational inefficiency. To address this issue, we propose a novel method to effectively utilize unlabeled data with the guidance of labeled data. Specifically, we tailor the pseudo-labeling technique from semi-supervised learning to explicitly reveal the relative magnitudes of optimization objective values hidden within the unlabeled data. Based on this technique, we assign appropriate training weights to unlabeled data to enhance the construction of a discriminative latent space. Furthermore, we treat the VAE encoder and the Gaussian Process (GP) in Bayesian optimization as a unified deep kernel learning process, allowing the direct utilization of labeled data, which we term as Gaussian Process guidance. This directly and effectively integrates the goal of improving GP accuracy into the VAE training, thereby guiding the construction of the latent space. The extensive experiments demonstrate that our proposed method outperforms existing VAE-BO algorithms in various optimization scenarios. Our code will be published at https://github.com/TaicaiChen/PG-LBO.
△ Less
Submitted 28 December, 2023;
originally announced December 2023.
-
High Throughput Inter-Layer Connecting Strategy for Multi-Layer Ultra-Dense Satellite Networks
Authors:
Qi Hao,
Di Zhou,
Min Sheng,
Yan Shi,
Jiandong Li
Abstract:
Multi-layer ultra-dense satellite networks (MLUDSNs) have soared this meteoric to provide vast throughputd for globally diverse services. Differing from traditional monolayer constellations, MLUDSNs emphasize the spatial integration among layers, and its throughput may not be simply the sum of throughput of each layer. The hop-count of cross-layer communication paths can be reduced by deploying in…
▽ More
Multi-layer ultra-dense satellite networks (MLUDSNs) have soared this meteoric to provide vast throughputd for globally diverse services. Differing from traditional monolayer constellations, MLUDSNs emphasize the spatial integration among layers, and its throughput may not be simply the sum of throughput of each layer. The hop-count of cross-layer communication paths can be reduced by deploying inter-layer connections (ILCs), augmenting MLUDSN's throughput. Therefore, it remains an open issue how to deploy ILCs to optimize the dynamic MLUDSN topology to dramatically raise throughput gains under multi-layer collaboration. This paper designs an ILC deployment strategy to enhance throughput by revealing the impacts of ILC distribution on reducing hop-count. Since deploying ILCs burdens the satellite with extra communication resource consumption, we model the ILC deployment problem as minimizing the average hop with limited ILCs, to maximize throughput. The proposed problem is a typical integer linear programming (ILP) problem, of which computational complexity is exponential as the satellite scale expands and the time evolves. Based on the symmetrical topology of each layer, we propose a two-phase deployment scheme to halve the problem scale and prioritize stable ILCs to reduce handover-count, which decreases the exponential complexity to a polynomial one, with 1% estimation error: Simulation results based on realistic megaconstellation information confirm that the optimal number of ILCs is less than P.S/2, where P and S are orbits and satellites per orbit. Besides, these ILCs deploy uniformly in each layer, which raises over 1.55x throughput than isolated layers.
△ Less
Submitted 28 December, 2023;
originally announced December 2023.
-
Observation of $χ_{cJ}\to 3(K^+K^-)$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (632 additional authors not shown)
Abstract:
By analyzing $(27.12\pm0.14)\times10^8$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII collider, the decay processes $χ_{cJ} \to 3(K^+K^-)$ ($J=0,1,2$) are observed for the first time with statistical significances of 8.2$σ$, 8.1$σ$, and 12.4$σ$, respectively. The product branching fractions of $ψ(3686)\toγχ_{cJ}$, $χ_{cJ}\to 3(K^+K^-)$ are presented and the branching…
▽ More
By analyzing $(27.12\pm0.14)\times10^8$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII collider, the decay processes $χ_{cJ} \to 3(K^+K^-)$ ($J=0,1,2$) are observed for the first time with statistical significances of 8.2$σ$, 8.1$σ$, and 12.4$σ$, respectively. The product branching fractions of $ψ(3686)\toγχ_{cJ}$, $χ_{cJ}\to 3(K^+K^-)$ are presented and the branching fractions of $χ_{cJ}\to 3(K^+K^-)$ decays are determined to be
$\mathcal{B}_{χ_{c0}\to 3(K^+K^-)}$=$(10.7\pm1.8\pm1.1)$$\times10^{-6}$,
$\mathcal{B}_{χ_{c1}\to 3(K^+K^-)}$=$(4.2\pm0.9\pm0.5)$$\times10^{-6}$, and
$\mathcal{B}_{χ_{c2}\to 3(K^+K^-)}$=$(7.2\pm1.1\pm0.8)$$\times10^{-6}$,
where the first uncertainties are statistical and the second are systematic.
△ Less
Submitted 26 December, 2023;
originally announced December 2023.
-
AutoTask: Executing Arbitrary Voice Commands by Exploring and Learning from Mobile GUI
Authors:
Lihang Pan,
Bowen Wang,
Chun Yu,
Yuxuan Chen,
Xiangyu Zhang,
Yuanchun Shi
Abstract:
Voice command interfaces (VCIs) have gained increasing importance, enabling hands-free and eyes-free interaction with digital devices. However, the inherent complexity in constructing effective voice interfaces has limited the VCIs' functionalities to only a small fraction of GUI applications and tasks. This paper presents AutoTask, a VCI capable of automating any task in any mobile application wi…
▽ More
Voice command interfaces (VCIs) have gained increasing importance, enabling hands-free and eyes-free interaction with digital devices. However, the inherent complexity in constructing effective voice interfaces has limited the VCIs' functionalities to only a small fraction of GUI applications and tasks. This paper presents AutoTask, a VCI capable of automating any task in any mobile application without configuration or modification from developers or end users. The primary challenge for AutoTask is the lack of knowledge, as it needs to accomplish unknown tasks (e.g., user commands) within an unknown environment (e.g., GUI). To address this challenge, AutoTask employs two strategies: (1) trial and error: AutoTask explores the GUI, attempts potential operation sequences, and recovers from errors through backtracking; (2) learning from the environment: AutoTask accumulates experiences during exploration and summarizes correct knowledge from these experiences. We implemented AutoTask on Android devices and conducted an evaluation study, which proved the feasibility of AutoTask.
△ Less
Submitted 26 December, 2023;
originally announced December 2023.
-
Transverse electric waves in Bandos-Lechner-Sorokin-Townsend nonlinear electrodynamics
Authors:
Yang Shi,
Qinyan Tan,
Towe Wang
Abstract:
In the generalized Born-Infeld electrodynamics discovered by Bandos, Lechner, Sorokin and Townsend, we study transverse electric waves propagating perpendicular to a constant magnetic field background in a parallel-plate waveguide. The directions of propagation and polarization of the waves are perpendicular to each other, and both of them are parallel to the perfectly conducting plates. Two speci…
▽ More
In the generalized Born-Infeld electrodynamics discovered by Bandos, Lechner, Sorokin and Townsend, we study transverse electric waves propagating perpendicular to a constant magnetic field background in a parallel-plate waveguide. The directions of propagation and polarization of the waves are perpendicular to each other, and both of them are parallel to the perfectly conducting plates. Two specific configurations are studied, in which the background magnetic field is either normal to the plates or along the polarization direction. The dispersion relation, the velocity and the cutoff frequency of the lowest-order lowest-frequency mode are calculated in both configurations. This paves the way for a promising test of the generalized Born-Infeld electrodynamics.
△ Less
Submitted 26 December, 2023;
originally announced December 2023.
-
Lattice paths and Rogers--Ramanujan--Gordon type overpartitions
Authors:
Diane Y. H. Shi
Abstract:
In this paper, we connect the Rogers--Ramanujan--Gordon type overpartitions to the lattice paths with four kinds of unitary steps. By a bijection between overpartitions and lattice paths, we prove that the theorems given by Chen, Sang and Shi have the lattice paths form. Then inspired by Andrews' parity in partition identities and this relation we put the parity restrictions on lattice paths and g…
▽ More
In this paper, we connect the Rogers--Ramanujan--Gordon type overpartitions to the lattice paths with four kinds of unitary steps. By a bijection between overpartitions and lattice paths, we prove that the theorems given by Chen, Sang and Shi have the lattice paths form. Then inspired by Andrews' parity in partition identities and this relation we put the parity restrictions on lattice paths and give some new results. By the parity results in lattice paths, we can derive some parity results on overpartitions.
△ Less
Submitted 24 December, 2023;
originally announced December 2023.
-
CARSS: Cooperative Attention-guided Reinforcement Subpath Synthesis for Solving Traveling Salesman Problem
Authors:
Yuchen Shi,
Congying Han,
Tiande Guo
Abstract:
This paper introduces CARSS (Cooperative Attention-guided Reinforcement Subpath Synthesis), a novel approach to address the Traveling Salesman Problem (TSP) by leveraging cooperative Multi-Agent Reinforcement Learning (MARL). CARSS decomposes the TSP solving process into two distinct yet synergistic steps: "subpath generation" and "subpath merging." In the former, a cooperative MARL framework is e…
▽ More
This paper introduces CARSS (Cooperative Attention-guided Reinforcement Subpath Synthesis), a novel approach to address the Traveling Salesman Problem (TSP) by leveraging cooperative Multi-Agent Reinforcement Learning (MARL). CARSS decomposes the TSP solving process into two distinct yet synergistic steps: "subpath generation" and "subpath merging." In the former, a cooperative MARL framework is employed to iteratively generate subpaths using multiple agents. In the latter, these subpaths are progressively merged to form a complete cycle. The algorithm's primary objective is to enhance efficiency in terms of training memory consumption, testing time, and scalability, through the adoption of a multi-agent divide and conquer paradigm. Notably, attention mechanisms play a pivotal role in feature embedding and parameterization strategies within CARSS. The training of the model is facilitated by the independent REINFORCE algorithm. Empirical experiments reveal CARSS's superiority compared to single-agent alternatives: it demonstrates reduced GPU memory utilization, accommodates training graphs nearly 2.5 times larger, and exhibits the potential for scaling to even more extensive problem sizes. Furthermore, CARSS substantially reduces testing time and optimization gaps by approximately 50% for TSP instances of up to 1000 vertices, when compared to standard decoding methods.
△ Less
Submitted 24 December, 2023;
originally announced December 2023.
-
Discrete-Time Mean-Variance Strategy Based on Reinforcement Learning
Authors:
Xiangyu Cui,
Xun Li,
Yun Shi,
Si Zhao
Abstract:
This paper studies a discrete-time mean-variance model based on reinforcement learning. Compared with its continuous-time counterpart in \cite{zhou2020mv}, the discrete-time model makes more general assumptions about the asset's return distribution. Using entropy to measure the cost of exploration, we derive the optimal investment strategy, whose density function is also Gaussian type. Additionall…
▽ More
This paper studies a discrete-time mean-variance model based on reinforcement learning. Compared with its continuous-time counterpart in \cite{zhou2020mv}, the discrete-time model makes more general assumptions about the asset's return distribution. Using entropy to measure the cost of exploration, we derive the optimal investment strategy, whose density function is also Gaussian type. Additionally, we design the corresponding reinforcement learning algorithm. Both simulation experiments and empirical analysis indicate that our discrete-time model exhibits better applicability when analyzing real-world data than the continuous-time model.
△ Less
Submitted 23 December, 2023;
originally announced December 2023.
-
10 kT axial magnetic field generated using multiple conventional laser beams
Authors:
Jue Xuan Hao,
Xiang Tang,
Alexey Arefiev,
Robert J. Kingham,
** Zhu,
Yin Shi,
Jian Zheng
Abstract:
Strong laser-generated magnetic fields have important applications in high energy density science and laboratory astrophysics. Although the inverse Faraday effect provides a mechanism for generating strong magnetic fields by absorbing angular momentum from a high-intensity laser pulse, it is not applicable to conventional linearly polarized (LP) Gaussian laser beams. We have dmeveloped a spatial a…
▽ More
Strong laser-generated magnetic fields have important applications in high energy density science and laboratory astrophysics. Although the inverse Faraday effect provides a mechanism for generating strong magnetic fields by absorbing angular momentum from a high-intensity laser pulse, it is not applicable to conventional linearly polarized (LP) Gaussian laser beams. We have dmeveloped a spatial arrangement that overcomes this difficulty by using multiple laser beams arranged to have a twist in the pointing direction. Using three-dimensional kinetic particle-in-cell simulations, we show that this arrangement is the key to generating a strong magnetic field. The resulting multi-kT picosecond axial magnetic field occupies tens of thousands of cubic microns of space and can be realized under a wide range of laser parameters and plasma conditions. Our scheme is well suited for implementation at PW-class laser facilities with multiple conventional LP laser beams.
△ Less
Submitted 23 December, 2023;
originally announced December 2023.
-
MonoLSS: Learnable Sample Selection For Monocular 3D Detection
Authors:
Zhenjia Li,
**rang Jia,
Yifeng Shi
Abstract:
In the field of autonomous driving, monocular 3D detection is a critical task which estimates 3D properties (depth, dimension, and orientation) of objects in a single RGB image. Previous works have used features in a heuristic way to learn 3D properties, without considering that inappropriate features could have adverse effects. In this paper, sample selection is introduced that only suitable samp…
▽ More
In the field of autonomous driving, monocular 3D detection is a critical task which estimates 3D properties (depth, dimension, and orientation) of objects in a single RGB image. Previous works have used features in a heuristic way to learn 3D properties, without considering that inappropriate features could have adverse effects. In this paper, sample selection is introduced that only suitable samples should be trained to regress the 3D properties. To select samples adaptively, we propose a Learnable Sample Selection (LSS) module, which is based on Gumbel-Softmax and a relative-distance sample divider. The LSS module works under a warm-up strategy leading to an improvement in training stability. Additionally, since the LSS module dedicated to 3D property sample selection relies on object-level features, we further develop a data augmentation method named MixUp3D to enrich 3D property samples which conforms to imaging principles without introducing ambiguity. As two orthogonal methods, the LSS module and MixUp3D can be utilized independently or in conjunction. Sufficient experiments have shown that their combined use can lead to synergistic effects, yielding improvements that transcend the mere sum of their individual applications. Leveraging the LSS module and the MixUp3D, without any extra data, our method named MonoLSS ranks 1st in all three categories (Car, Cyclist, and Pedestrian) on KITTI 3D object detection benchmark, and achieves competitive results on both the Waymo dataset and KITTI-nuScenes cross-dataset evaluation. The code is included in the supplementary material and will be released to facilitate related academic and industrial studies.
△ Less
Submitted 22 May, 2024; v1 submitted 22 December, 2023;
originally announced December 2023.
-
Spontaneous gap opening and potential excitonic states in an ideal Dirac semimetal Ta$_2$Pd$_3$Te$_5$
Authors:
Peng Zhang,
Yuyang Dong,
Dayu Yan,
Bei Jiang,
Tao Yang,
Jun Li,
Zhaopeng Guo,
Yong Huang,
Bo Hao,
Qing Li,
Yupeng Li,
Kifu Kurokawa,
Rui Wang,
Yuefeng Nie,
Makoto Hashimoto,
Donghui Lu,
Wen-He Jiao,
Jie Shen,
Tian Qian,
Zhijun Wang,
Youguo Shi,
Takeshi Kondo
Abstract:
The opening of an energy gap in the electronic structure generally indicates the presence of interactions. In materials with low carrier density and short screening length, long-range Coulomb interaction favors the spontaneous formation of electron-hole pairs, so-called excitons, opening an excitonic gap at the Fermi level. Excitonic materials host unique phenomenons associated with pair excitatio…
▽ More
The opening of an energy gap in the electronic structure generally indicates the presence of interactions. In materials with low carrier density and short screening length, long-range Coulomb interaction favors the spontaneous formation of electron-hole pairs, so-called excitons, opening an excitonic gap at the Fermi level. Excitonic materials host unique phenomenons associated with pair excitations. However, there is still no generally recognized single-crystal material with excitonic order, which is, therefore, awaited in condensed matter physics. Here, we show that excitonic states may exist in the quasi-one-dimensional material Ta$_2$Pd$_3$Te$_5$, which has an almost ideal Dirac-like band structure, with Dirac point located exactly at Fermi level. We find that an energy gap appears at 350 K, and it grows with decreasing temperature. The spontaneous gap opening is absent in a similar material Ta$_2$Ni$_3$Te$_5$. Intriguingly, the gap is destroyed by the potassium deposition on the crystal, likely due to extra-doped carriers. Furthermore, we observe a pair of in-gap flat bands, which is an analog of the impurity states in a superconducting gap. All these observations can be properly explained by an excitonic order, providing Ta$_2$Pd$_3$Te$_5$ as a new and promising candidate realizing excitonic states.
△ Less
Submitted 15 March, 2024; v1 submitted 22 December, 2023;
originally announced December 2023.
-
Evidence for an Excitonic Insulator State in Ta$_2$Pd$_3$Te$_5$
Authors:
Jierui Huang,
Bei Jiang,
**gyu Yao,
Dayu Yan,
Xincheng Lei,
Jiacheng Gao,
Zhaopeng Guo,
Feng **,
Yupeng Li,
Zhenyu Yuan,
Congcong Chai,
Haohao Sheng,
Mojun Pan,
Famin Chen,
Junde Liu,
Shunye Gao,
Gexing Qu,
Bo Liu,
Zhicheng Jiang,
Zhengtai Liu,
Xiaoyan Ma,
Shiming Zhou,
Yaobo Huang,
Chenxia Yun,
Qingming Zhang
, et al. (8 additional authors not shown)
Abstract:
The excitonic insulator (EI) is an exotic ground state of narrow-gap semiconductors and semimetals arising from spontaneous condensation of electron-hole pairs bound by attractive Coulomb interaction. Despite research on EIs dating back to half a century ago, their existence in real materials remains a subject of ongoing debate. In this study, through systematic experimental and theoretical invest…
▽ More
The excitonic insulator (EI) is an exotic ground state of narrow-gap semiconductors and semimetals arising from spontaneous condensation of electron-hole pairs bound by attractive Coulomb interaction. Despite research on EIs dating back to half a century ago, their existence in real materials remains a subject of ongoing debate. In this study, through systematic experimental and theoretical investigations, we provide evidence for the existence of an EI ground state in a van der Waals compound Ta$_2$Pd$_3$Te$_5$. Density-functional-theory calculations suggest that it is a semimetal with a small band overlap, whereas various experiments exhibit an insulating ground state with a clear band gap. Upon incorporating electron-hole Coulomb interaction into our calculations, we obtain an EI phase where the electronic symmetry breaking opens a many-body gap. Angle-resolved photoemission spectroscopy measurements exhibit that the band gap is closed with a significant change in the dispersions as the number of thermally excited charge carriers becomes sufficiently large in both equilibrium and nonequilibrium states. Structural measurements reveal a slight breaking of crystal symmetry with exceptionally small lattice distortion in the insulating state, which cannot account for the significant gap opening. Therefore, we attribute the insulating ground state with a gap opening in Ta$_2$Pd$_3$Te$_5$ to exciton condensation, where the coupling to the symmetry-breaking electronic state induces a subtle change in the crystal structure.
△ Less
Submitted 14 March, 2024; v1 submitted 22 December, 2023;
originally announced December 2023.
-
Fed-CO2: Cooperation of Online and Offline Models for Severe Data Heterogeneity in Federated Learning
Authors:
Zhongyi Cai,
Ye Shi,
Wei Huang,
**gya Wang
Abstract:
Federated Learning (FL) has emerged as a promising distributed learning paradigm that enables multiple clients to learn a global model collaboratively without sharing their private data. However, the effectiveness of FL is highly dependent on the quality of the data that is being used for training. In particular, data heterogeneity issues, such as label distribution skew and feature skew, can sign…
▽ More
Federated Learning (FL) has emerged as a promising distributed learning paradigm that enables multiple clients to learn a global model collaboratively without sharing their private data. However, the effectiveness of FL is highly dependent on the quality of the data that is being used for training. In particular, data heterogeneity issues, such as label distribution skew and feature skew, can significantly impact the performance of FL. Previous studies in FL have primarily focused on addressing label distribution skew data heterogeneity, while only a few recent works have made initial progress in tackling feature skew issues. Notably, these two forms of data heterogeneity have been studied separately and have not been well explored within a unified FL framework. To address this gap, we propose Fed-CO$_{2}$, a universal FL framework that handles both label distribution skew and feature skew within a \textbf{C}ooperation mechanism between the \textbf{O}nline and \textbf{O}ffline models. Specifically, the online model learns general knowledge that is shared among all clients, while the offline model is trained locally to learn the specialized knowledge of each individual client. To further enhance model cooperation in the presence of feature shifts, we design an intra-client knowledge transfer mechanism that reinforces mutual learning between the online and offline models, and an inter-client knowledge transfer mechanism to increase the models' domain generalization ability. Extensive experiments show that our Fed-CO$_{2}$ outperforms a wide range of existing personalized federated learning algorithms in terms of handling label distribution skew and feature skew, both individually and collectively. The empirical results are supported by our convergence analyses in a simplified setting.
△ Less
Submitted 26 December, 2023; v1 submitted 21 December, 2023;
originally announced December 2023.
-
Search for the decay $χ_{c1}(3872)\toπ^{+}π^{-}χ_{c1}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (608 additional authors not shown)
Abstract:
Using a data sample corresponding to an integrated luminosity of 10.9 fb$^{-1}$ collected at center-of-mass energies from 4.16 to 4.34 GeV with the BESIII detector, we search for the decay $χ_{c1}(3872) \to π^{+}π^{-}χ_{c1}$ in the radiative production $e^{+}e^{-} \to γχ_{c1}(3872)$. No significant signal is observed, and the ratio for the branching fraction of $χ_{c1}(3872) \to π^{+}π^{-}χ_{c1}$…
▽ More
Using a data sample corresponding to an integrated luminosity of 10.9 fb$^{-1}$ collected at center-of-mass energies from 4.16 to 4.34 GeV with the BESIII detector, we search for the decay $χ_{c1}(3872) \to π^{+}π^{-}χ_{c1}$ in the radiative production $e^{+}e^{-} \to γχ_{c1}(3872)$. No significant signal is observed, and the ratio for the branching fraction of $χ_{c1}(3872) \to π^{+}π^{-}χ_{c1}$ to $χ_{c1}(3872) \to π^{+}π^{-}J/ψ$ is measured as $\mathcal{R}\equiv\frac{\mathcal{B}[χ_{c1}(3872) \to π^{+}π^{-}χ_{c1}]}{\mathcal{B}[χ_{c1}(3872) \to π^{+}π^{-} J/ψ]}<0.18$ at 90$\%$ confidence level. The upper limit on the product of the cross section $σ[e^{+}e^{-}\toγχ_{c1}(3872)]$ and the branching fraction $\mathcal{B}[χ_{c1}(3872)\toπ^{+}π^{-}χ_{c1}]$ at each center-of-mass energy is also given. These measurements favor the non-conventional charmonium nature of the $χ_{c1}(3872)$ state.
△ Less
Submitted 21 December, 2023;
originally announced December 2023.
-
Revisit the phase diagram and piezoelectricity of lead zirconate titanate from first principles
Authors:
Yubai Shi,
Ri He,
Bingwen Zhang,
Zhicheng Zhong
Abstract:
Lead zirconate titanate (PbZr1-xTixO3, PZT) exhibits excellent piezoelectric properties in the morphotropic phase boundary (MPB) region of its temperature-composition phase diagram. However, the microscopic origin of its high piezoelectric response remains controversial. Here, we develop a machine-learning-based deep potential (DP) model of PZT using the training dataset from first principles dens…
▽ More
Lead zirconate titanate (PbZr1-xTixO3, PZT) exhibits excellent piezoelectric properties in the morphotropic phase boundary (MPB) region of its temperature-composition phase diagram. However, the microscopic origin of its high piezoelectric response remains controversial. Here, we develop a machine-learning-based deep potential (DP) model of PZT using the training dataset from first principles density functional theory calculations. Based on DP-assisted large-scale atomic simulations, we reproduce the temperature-composition phase diagram of PZT, in good agreement with the experiment except the absence of structural transition from R3c to R3m. We find that the rhombohedral phase maintains R3c symmetry with slight oxygen octahedral tilting as increase of temperature, instead of appearing R3m symmetry. This discrepancy can trace back to the lack of experimental measurements to identify such slight octahedral tilting. More importantly, we clarify the atomic-level feature of PZT at the MPB, exhibiting the competing coupling of ferroelectric nanodomains with various polarization orientations. The high piezoelectric response is driven by polarization rotation of nanodomains induced by an external electric field.
△ Less
Submitted 20 December, 2023;
originally announced December 2023.
-
DiffPortrait3D: Controllable Diffusion for Zero-Shot Portrait View Synthesis
Authors:
Yuming Gu,
You Xie,
Hongyi Xu,
Guoxian Song,
Yichun Shi,
Di Chang,
**g Yang,
Linjie Luo
Abstract:
We present DiffPortrait3D, a conditional diffusion model that is capable of synthesizing 3D-consistent photo-realistic novel views from as few as a single in-the-wild portrait. Specifically, given a single RGB input, we aim to synthesize plausible but consistent facial details rendered from novel camera views with retained both identity and facial expression. In lieu of time-consuming optimization…
▽ More
We present DiffPortrait3D, a conditional diffusion model that is capable of synthesizing 3D-consistent photo-realistic novel views from as few as a single in-the-wild portrait. Specifically, given a single RGB input, we aim to synthesize plausible but consistent facial details rendered from novel camera views with retained both identity and facial expression. In lieu of time-consuming optimization and fine-tuning, our zero-shot method generalizes well to arbitrary face portraits with unposed camera views, extreme facial expressions, and diverse artistic depictions. At its core, we leverage the generative prior of 2D diffusion models pre-trained on large-scale image datasets as our rendering backbone, while the denoising is guided with disentangled attentive control of appearance and camera pose. To achieve this, we first inject the appearance context from the reference image into the self-attention layers of the frozen UNets. The rendering view is then manipulated with a novel conditional control module that interprets the camera pose by watching a condition image of a crossed subject from the same view. Furthermore, we insert a trainable cross-view attention module to enhance view consistency, which is further strengthened with a novel 3D-aware noise generation process during inference. We demonstrate state-of-the-art results both qualitatively and quantitatively on our challenging in-the-wild and multi-view benchmarks.
△ Less
Submitted 19 March, 2024; v1 submitted 20 December, 2023;
originally announced December 2023.
-
Measurements of $Σ$ electromagnetic form factors in the time-like region using the untagged initial-state radiation technique
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (626 additional authors not shown)
Abstract:
The process $e^{+}e^{-}\toΣ^{+}\barΣ^{-}$ is studied from threshold up to 3.04 GeV/$c^2$ via the initial-state radiation technique using data with an integrated luminosity of 12.0 fb$^{-1}$, collected at center-of-mass energies between 3.773 and 4.258 GeV with the BESIII detector at the BEPCII collider. The pair production cross sections and the effective form factors of $Σ$ are measured in eleven…
▽ More
The process $e^{+}e^{-}\toΣ^{+}\barΣ^{-}$ is studied from threshold up to 3.04 GeV/$c^2$ via the initial-state radiation technique using data with an integrated luminosity of 12.0 fb$^{-1}$, collected at center-of-mass energies between 3.773 and 4.258 GeV with the BESIII detector at the BEPCII collider. The pair production cross sections and the effective form factors of $Σ$ are measured in eleven $Σ^{+}\barΣ^{-}$ invariant mass intervals from threshold to 3.04 GeV/$c^2$. The results are consistent with the previous results from Belle and BESIII. Furthermore, the branching fractions of the decays $J/ψ\toΣ^{+}\barΣ^{-}$ and $ψ(3686)\toΣ^{+}\barΣ^{-}$ are determined and the obtained results are consistent with the previous results of BESIII.
△ Less
Submitted 19 December, 2023;
originally announced December 2023.
-
Roll With the Punches: Expansion and Shrinkage of Soft Label Selection for Semi-supervised Fine-Grained Learning
Authors:
Yue Duan,
Zhen Zhao,
Lei Qi,
Lu** Zhou,
Lei Wang,
Yinghuan Shi
Abstract:
While semi-supervised learning (SSL) has yielded promising results, the more realistic SSL scenario remains to be explored, in which the unlabeled data exhibits extremely high recognition difficulty, e.g., fine-grained visual classification in the context of SSL (SS-FGVC). The increased recognition difficulty on fine-grained unlabeled data spells disaster for pseudo-labeling accuracy, resulting in…
▽ More
While semi-supervised learning (SSL) has yielded promising results, the more realistic SSL scenario remains to be explored, in which the unlabeled data exhibits extremely high recognition difficulty, e.g., fine-grained visual classification in the context of SSL (SS-FGVC). The increased recognition difficulty on fine-grained unlabeled data spells disaster for pseudo-labeling accuracy, resulting in poor performance of the SSL model. To tackle this challenge, we propose Soft Label Selection with Confidence-Aware Clustering based on Class Transition Tracking (SoC) by reconstructing the pseudo-label selection process by jointly optimizing Expansion Objective and Shrinkage Objective, which is based on a soft label manner. Respectively, the former objective encourages soft labels to absorb more candidate classes to ensure the attendance of ground-truth class, while the latter encourages soft labels to reject more noisy classes, which is theoretically proved to be equivalent to entropy minimization. In comparisons with various state-of-the-art methods, our approach demonstrates its superior performance in SS-FGVC. Checkpoints and source code are available at https://github.com/NJUyued/SoC4SS-FGVC.
△ Less
Submitted 19 December, 2023;
originally announced December 2023.
-
Students' Perceptions and Preferences of Generative Artificial Intelligence Feedback for Programming
Authors:
Zhengdong Zhang,
Zihan Dong,
Yang Shi,
Noboru Matsuda,
Thomas Price,
Dongkuan Xu
Abstract:
The rapid evolution of artificial intelligence (AI), specifically large language models (LLMs), has opened opportunities for various educational applications. This paper explored the feasibility of utilizing ChatGPT, one of the most popular LLMs, for automating feedback for Java programming assignments in an introductory computer science (CS1) class. Specifically, this study focused on three quest…
▽ More
The rapid evolution of artificial intelligence (AI), specifically large language models (LLMs), has opened opportunities for various educational applications. This paper explored the feasibility of utilizing ChatGPT, one of the most popular LLMs, for automating feedback for Java programming assignments in an introductory computer science (CS1) class. Specifically, this study focused on three questions: 1) To what extent do students view LLM-generated feedback as formative? 2) How do students see the comparative affordances of feedback prompts that include their code, vs. those that exclude it? 3) What enhancements do students suggest for improving AI-generated feedback? To address these questions, we generated automated feedback using the ChatGPT API for four lab assignments in the CS1 class. The survey results revealed that students perceived the feedback as aligning well with formative feedback guidelines established by Shute. Additionally, students showed a clear preference for feedback generated by including the students' code as part of the LLM prompt, and our thematic study indicated that the preference was mainly attributed to the specificity, clarity, and corrective nature of the feedback. Moreover, this study found that students generally expected specific and corrective feedback with sufficient code examples, but had diverged opinions on the tone of the feedback. This study demonstrated that ChatGPT could generate Java programming assignment feedback that students perceived as formative. It also offered insights into the specific improvements that would make the ChatGPT-generated feedback useful for students.
△ Less
Submitted 17 December, 2023;
originally announced December 2023.
-
Symmetry Enforced Fermi Surface Degeneracies Observed in Time-Reversal Symmetry-Breaking Superconductor LaNiGa$_2$
Authors:
Matthew Staab,
Robert Prater,
Sudheer Sreedhar,
Journey Byland,
Eliana Mann,
Davis Zackaria,
Yunshu Shi,
Henry J. Bowman,
Andrew L. Stephens,
Myung-Chul Jung,
Antia S. Botana,
Warren E. Pickett,
Valentin Taufour,
Inna Vishik
Abstract:
LaNiGa$_2$ is superconductor that breaks time-reversal symmetry in the superconducting state without any known nearby magnetism. Recently, single crystals of LaNiGa$_2$ have been synthesized, revealing a nonsymmorphic Cmcm space group. Here, we report measurements of the electronic structure of LaNiGa$_2$ throughout the three-dimensional Brillouin zone (BZ) using angle-resolved photoemission spect…
▽ More
LaNiGa$_2$ is superconductor that breaks time-reversal symmetry in the superconducting state without any known nearby magnetism. Recently, single crystals of LaNiGa$_2$ have been synthesized, revealing a nonsymmorphic Cmcm space group. Here, we report measurements of the electronic structure of LaNiGa$_2$ throughout the three-dimensional Brillouin zone (BZ) using angle-resolved photoemission spectroscopy (ARPES). Our findings show broad consistency with density functional theory (DFT) calculations and provide evidence for degeneracies in the electronic structure that are predicted from the space group. The calculations also predict four Fermi surfaces which cross the purported nodal plane and should therefore form two degenerate pairs. We report evidence for those predicted symmetry enforced degeneracies as well as accidental near degeneracies throughout the BZ. These degeneracies and near-degeneracies may play a role in the pairing mechanism of LaNiGa$_2$. Our results provide insight into the interplay between structure, Fermiology, and superconductivity in unconventional superconductors with nonsymmorphic space group.
△ Less
Submitted 18 December, 2023;
originally announced December 2023.
-
Observation of significant flavor-SU(3) breaking in the kaon wave function at $12~{\rm GeV}^2<Q^2<25~{\rm GeV}^2$ and discovery of the charmless decay $ψ(3770)\to K_S^0K_L^0$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (607 additional authors not shown)
Abstract:
We present cross sections for the reaction $e^+e^-\to K_S^0K_L^0$ at center-of-mass energies ranging from 3.51 GeV to 4.95 GeV using data samples collected in the BESIII experiment, corresponding to a total integrated luminosity of 26.5 fb$^{-1}$. The ratio of neutral-to-charged kaon form factors at large momentum transfers ($12~{\rm GeV}^2<Q^2<25~{\rm GeV}^2$) is determined to be $0.21\pm 0.01$,…
▽ More
We present cross sections for the reaction $e^+e^-\to K_S^0K_L^0$ at center-of-mass energies ranging from 3.51 GeV to 4.95 GeV using data samples collected in the BESIII experiment, corresponding to a total integrated luminosity of 26.5 fb$^{-1}$. The ratio of neutral-to-charged kaon form factors at large momentum transfers ($12~{\rm GeV}^2<Q^2<25~{\rm GeV}^2$) is determined to be $0.21\pm 0.01$, which indicates a small but significant effect of flavor-SU(3) breaking in the kaon wave function, and consequently excludes the possibility that flavor-SU(3) breaking is the primary reason for the strong experimental violation of the pQCD prediction $|F(π^{\pm})|/|F(K^{\pm})|=f^2_π/f^2_{K}$, where $F(π^{\pm})$ and $F(K^{\pm})$ are the form factors, and $f_π$ and $f_{K}$ are the decay constants of charged pions and kaons, respectively. We also observe a significant signal for the charmless decay $ψ(3770)\to K_S^0K_L^0$ for the first time. Within a $1σ$ contour of the likelihood value, the the branching fraction for $ψ(3770)\to K_S^0K_L^0$ is determined to be ${\cal B}=(2.63_{-1.59}^{+1.40})\times 10^{-5}$, and the relative phase between the continuum and $ψ(3770)$ amplitudes is $φ=(-0.39_{-0.10}^{+0.05})π$. The branching fraction is in good agreement with the $\mathcal{S}$- and $\mathcal{D}$-wave charmonia mixing scheme proposed in the interpretation of the "$ρπ$ puzzle" between $J/ψ$ and $ψ(3686)$ decays.
△ Less
Submitted 18 December, 2023;
originally announced December 2023.
-
Structural Information Guided Multimodal Pre-training for Vehicle-centric Perception
Authors:
Xiao Wang,
Wentao Wu,
Chenglong Li,
Zhicheng Zhao,
Zhe Chen,
Yukai Shi,
** Tang
Abstract:
Understanding vehicles in images is important for various applications such as intelligent transportation and self-driving system. Existing vehicle-centric works typically pre-train models on large-scale classification datasets and then fine-tune them for specific downstream tasks. However, they neglect the specific characteristics of vehicle perception in different tasks and might thus lead to su…
▽ More
Understanding vehicles in images is important for various applications such as intelligent transportation and self-driving system. Existing vehicle-centric works typically pre-train models on large-scale classification datasets and then fine-tune them for specific downstream tasks. However, they neglect the specific characteristics of vehicle perception in different tasks and might thus lead to sub-optimal performance. To address this issue, we propose a novel vehicle-centric pre-training framework called VehicleMAE, which incorporates the structural information including the spatial structure from vehicle profile information and the semantic structure from informative high-level natural language descriptions for effective masked vehicle appearance reconstruction. To be specific, we explicitly extract the sketch lines of vehicles as a form of the spatial structure to guide vehicle reconstruction. The more comprehensive knowledge distilled from the CLIP big model based on the similarity between the paired/unpaired vehicle image-text sample is further taken into consideration to help achieve a better understanding of vehicles. A large-scale dataset is built to pre-train our model, termed Autobot1M, which contains about 1M vehicle images and 12693 text information. Extensive experiments on four vehicle-based downstream tasks fully validated the effectiveness of our VehicleMAE. The source code and pre-trained models will be released at https://github.com/Event-AHU/VehicleMAE.
△ Less
Submitted 15 December, 2023;
originally announced December 2023.