-
Observation and branching fraction measurement of the decay $Ξ_b^-\toΛ_b^0π^-$
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
B. Adeva,
M. Adinolfi,
P. Adlarson,
H. Afsharnia,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey
, et al. (1082 additional authors not shown)
Abstract:
The decay $Ξ_b^-\toΛ_b^0π^-$ is observed using a proton-proton collision data sample collected at center-of-mass energy $\sqrt{s}=13$ TeV with the LHCb detector, corresponding to an integrated luminosity of 5.5 fb$^{-1}$. This process is mediated by the $s\to u\bar{u}d$ quark-level transition, where the $b$ quark in the $Ξ_b^-$ baryon is a spectator in the decay. Averaging the results obtained usi…
▽ More
The decay $Ξ_b^-\toΛ_b^0π^-$ is observed using a proton-proton collision data sample collected at center-of-mass energy $\sqrt{s}=13$ TeV with the LHCb detector, corresponding to an integrated luminosity of 5.5 fb$^{-1}$. This process is mediated by the $s\to u\bar{u}d$ quark-level transition, where the $b$ quark in the $Ξ_b^-$ baryon is a spectator in the decay. Averaging the results obtained using the two $Λ_b^0$ decay modes, $Λ_b^0\toΛ_c^+π^-$ and $Λ_b^0\toΛ_c^+π^-π^+π^-$, the relative production ratio is measured to be $(f_{Ξ_b^-}/f_{Λ_b^0}){\cal{B}}(Ξ_b^-\toΛ_b^0π^-)=(7.3\pm0.8\pm0.6)\times10^{-4}$. Here the uncertainties are statistical and systematic, respectively, and $f_{Ξ_b^-}(f_{Λ_b^0})$ is the fragmentation fraction for a $b$ quark into a $Ξ_b^-$ ($Λ_b^0$) baryon. Using an independent measurement of $f_{Ξ_b^-}/f_{Λ_b^0}$, the branching fraction ${\cal{B}}(Ξ_b^-\toΛ_b^0π^-)=(0.89\pm0.10\pm0.07\pm0.29)\%$ is obtained, where the last uncertainty is due to the assumed SU(3) flavor symmetry in the determination of $f_{Ξ_b^-}/f_{Λ_b^0}$.
△ Less
Submitted 12 October, 2023; v1 submitted 18 July, 2023;
originally announced July 2023.
-
Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition
Authors:
Shaoshi Ling,
Yuxuan Hu,
Shuangbei Qian,
Guoli Ye,
Yao Qian,
Yifan Gong,
Ed Lin,
Michael Zeng
Abstract:
Most end-to-end (E2E) speech recognition models are composed of encoder and decoder blocks that perform acoustic and language modeling functions. Pretrained large language models (LLMs) have the potential to improve the performance of E2E ASR. However, integrating a pretrained language model into an E2E speech recognition model has shown limited benefits due to the mismatches between text-based LL…
▽ More
Most end-to-end (E2E) speech recognition models are composed of encoder and decoder blocks that perform acoustic and language modeling functions. Pretrained large language models (LLMs) have the potential to improve the performance of E2E ASR. However, integrating a pretrained language model into an E2E speech recognition model has shown limited benefits due to the mismatches between text-based LLMs and those used in E2E ASR. In this paper, we explore an alternative approach by adapting a pretrained LLMs to speech. Our experiments on fully-formatted E2E ASR transcription tasks across various domains demonstrate that our approach can effectively leverage the strengths of pretrained LLMs to produce more readable ASR transcriptions. Our model, which is based on the pretrained large language models with either an encoder-decoder or decoder-only structure, surpasses strong ASR models such as Whisper, in terms of recognition error rate, considering formats like punctuation and capitalization as well.
△ Less
Submitted 2 August, 2023; v1 submitted 17 July, 2023;
originally announced July 2023.
-
SoK: Comparing Different Membership Inference Attacks with a Comprehensive Benchmark
Authors:
Jun Niu,
Xiaoyan Zhu,
Moxuan Zeng,
Ge Zhang,
Qingyang Zhao,
Chunhui Huang,
Yangming Zhang,
Suyu An,
Yangzhong Wang,
Xinghui Yue,
Zhipeng He,
Weihao Guo,
Kuo Shen,
Peng Liu,
Yulong Shen,
Xiaohong Jiang,
Jianfeng Ma,
Yuqing Zhang
Abstract:
Membership inference (MI) attacks threaten user privacy through determining if a given data example has been used to train a target model. However, it has been increasingly recognized that the "comparing different MI attacks" methodology used in the existing works has serious limitations. Due to these limitations, we found (through the experiments in this work) that some comparison results reporte…
▽ More
Membership inference (MI) attacks threaten user privacy through determining if a given data example has been used to train a target model. However, it has been increasingly recognized that the "comparing different MI attacks" methodology used in the existing works has serious limitations. Due to these limitations, we found (through the experiments in this work) that some comparison results reported in the literature are quite misleading. In this paper, we seek to develop a comprehensive benchmark for comparing different MI attacks, called MIBench, which consists not only the evaluation metrics, but also the evaluation scenarios. And we design the evaluation scenarios from four perspectives: the distance distribution of data samples in the target dataset, the distance between data samples of the target dataset, the differential distance between two datasets (i.e., the target dataset and a generated dataset with only nonmembers), and the ratio of the samples that are made no inferences by an MI attack. The evaluation metrics consist of ten typical evaluation metrics. We have identified three principles for the proposed "comparing different MI attacks" methodology, and we have designed and implemented the MIBench benchmark with 84 evaluation scenarios for each dataset. In total, we have used our benchmark to fairly and systematically compare 15 state-of-the-art MI attack algorithms across 588 evaluation scenarios, and these evaluation scenarios cover 7 widely used datasets and 7 representative types of models. All codes and evaluations of MIBench are publicly available at https://github.com/MIBench/MIBench.github.io/blob/main/README.md.
△ Less
Submitted 12 July, 2023;
originally announced July 2023.
-
Microwave conductivity due to impurity scattering in cuprate superconductors
Authors:
Minghuan Zeng,
Xiang Li,
Yongjun Wang,
Shi** Feng
Abstract:
The microwave surface impedance measurements on cuprate superconductors provide the crucial information of the effect of the impurity scattering on the quasiparticle transport, however, the full understanding of the effect of the impurity scattering on the quasiparticle transport is still a challenging issue. Here based on the microscopic octet scattering model, the effect of the impurity scatteri…
▽ More
The microwave surface impedance measurements on cuprate superconductors provide the crucial information of the effect of the impurity scattering on the quasiparticle transport, however, the full understanding of the effect of the impurity scattering on the quasiparticle transport is still a challenging issue. Here based on the microscopic octet scattering model, the effect of the impurity scattering on the low-temperature microwave conductivity in cuprate superconductors is investigated in the self-consistent $T$-matrix approach. The impurity-dressed electron propagator obtained in the Fermi-arc-tip approximation of the quasiparticle excitations and scattering processes is employed to derive the electron current-current correlation function by taking into account the impurity-induced vertex correction. It is shown that the microwave conductivity spectrum is a non-Drude-like, with a sharp cusp-like peak extending to zero-energy and a high-energy tail falling slowly with energy. Moreover, the microwave conductivity decreases with the increase of the impurity concentration or with the increase of the strength of the impurity scattering potential. In a striking contrast to the dome-like shape of the do** dependence of the superconducting transition temperature, the microwave conductivity exhibits a reverse dome-like shape of the do** dependence. The theory also show that the highly unconventional features of the microwave conductivity are generated by both the strong electron correlation and impurity-scattering effects.
△ Less
Submitted 12 July, 2023;
originally announced July 2023.
-
Robust Ranking Explanations
Authors:
Chao Chen,
Chenghua Guo,
Guixiang Ma,
Ming Zeng,
Xi Zhang,
Sihong Xie
Abstract:
Robust explanations of machine learning models are critical to establish human trust in the models. Due to limited cognition capability, most humans can only interpret the top few salient features. It is critical to make top salient features robust to adversarial attacks, especially those against the more vulnerable gradient-based explanations. Existing defense measures robustness using $\ell_p$-n…
▽ More
Robust explanations of machine learning models are critical to establish human trust in the models. Due to limited cognition capability, most humans can only interpret the top few salient features. It is critical to make top salient features robust to adversarial attacks, especially those against the more vulnerable gradient-based explanations. Existing defense measures robustness using $\ell_p$-norms, which have weaker protection power. We define explanation thickness for measuring salient features ranking stability, and derive tractable surrogate bounds of the thickness to design the \textit{R2ET} algorithm to efficiently maximize the thickness and anchor top salient features. Theoretically, we prove a connection between R2ET and adversarial training. Experiments with a wide spectrum of network architectures and data modalities, including brain networks, demonstrate that R2ET attains higher explanation robustness under stealthy attacks while retaining accuracy.
△ Less
Submitted 8 July, 2023;
originally announced July 2023.
-
An ML approach to resolution of singularities
Authors:
Gergely Bérczi,
Honglu Fan,
Mingcong Zeng
Abstract:
The solution set of a system of polynomial equations typically contains ill-behaved, singular points. Resolution is a fundamental process in geometry in which we replace singular points with smooth points, while kee** the rest of the solution set unchanged. Resolutions are not unique: the usual way to describe them involves repeatedly performing a fundamental operation known as "blowing-up", and…
▽ More
The solution set of a system of polynomial equations typically contains ill-behaved, singular points. Resolution is a fundamental process in geometry in which we replace singular points with smooth points, while kee** the rest of the solution set unchanged. Resolutions are not unique: the usual way to describe them involves repeatedly performing a fundamental operation known as "blowing-up", and the complexity of the resolution highly depends on certain choices. The process can be translated into various versions of a 2-player game, the so-called Hironaka game, and a winning strategy for the first player provides a solution to the resolution problem. In this paper we introduce a new approach to the Hironaka game that uses reinforcement learning agents to find optimal resolutions of singularities. In certain domains, the trained model outperforms state-of-the-art selection heuristics in total number of polynomial additions performed, which provides a proof-of-concept that recent developments in machine learning have the potential to improve performance of algorithms in symbolic computation.
△ Less
Submitted 22 August, 2023; v1 submitted 1 July, 2023;
originally announced July 2023.
-
Search for $CP$ violation in the phase space of $D^0 \to π^-π^+π^0$ decays with the energy test
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
B. Adeva,
M. Adinolfi,
P. Adlarson,
H. Afsharnia,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey
, et al. (1039 additional authors not shown)
Abstract:
A search for $CP$ violation in $D^0 \to π^-π^+π^0$ decays is reported, using $pp$ collision data collected by the LHCb experiment from 2015 to 2018 corresponding to an integrated luminosity of 6$fb^{-1}$. An unbinned model-independent approach provides sensitivity to local $CP$ violation within the two-dimensional phase space of the decay. The method is validated using the Cabibbo-favoured channel…
▽ More
A search for $CP$ violation in $D^0 \to π^-π^+π^0$ decays is reported, using $pp$ collision data collected by the LHCb experiment from 2015 to 2018 corresponding to an integrated luminosity of 6$fb^{-1}$. An unbinned model-independent approach provides sensitivity to local $CP$ violation within the two-dimensional phase space of the decay. The method is validated using the Cabibbo-favoured channel $\D^0 \to \K^-π^+π^0$ and background regions of the signal mode. The results are consistent with $CP$ symmetry in this decay.
△ Less
Submitted 20 March, 2024; v1 submitted 22 June, 2023;
originally announced June 2023.
-
Measurements of $CP$ asymmetries and branching fraction ratios of $B^-$ decays to two charm mesons
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
B. Adeva,
M. Adinolfi,
P. Adlarson,
H. Afsharnia,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey
, et al. (1046 additional authors not shown)
Abstract:
The $CP$ asymmetries of seven $B^-$ decays to two charm mesons are measured using data corresponding to an integrated luminosity of $9\text{ fb}^{-1}$ of proton-proton collisions collected by the LHCb experiment. Decays involving a $D^{*0}$ or $D^{*-}_s$ meson are analysed by reconstructing only the $D^0$ or $D^-_s$ decay products. This paper presents the first measurement of…
▽ More
The $CP$ asymmetries of seven $B^-$ decays to two charm mesons are measured using data corresponding to an integrated luminosity of $9\text{ fb}^{-1}$ of proton-proton collisions collected by the LHCb experiment. Decays involving a $D^{*0}$ or $D^{*-}_s$ meson are analysed by reconstructing only the $D^0$ or $D^-_s$ decay products. This paper presents the first measurement of $\mathcal{A}^{CP}(B^- \rightarrow D^{*-}_s D^0)$ and $\mathcal{A}^{CP}(B^- \rightarrow D^{-}_s D^{*0})$, and the most precise measurement of the other five $CP$ asymmetries. There is no evidence of $CP$ violation in any of the analysed decays. Additionally, two ratios between branching fractions of selected decays are measured.
△ Less
Submitted 5 October, 2023; v1 submitted 16 June, 2023;
originally announced June 2023.
-
Study of the Bose-Einstein correlations of same-sign pions in proton-lead collisions
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
B. Adeva,
M. Adinolfi,
P. Adlarson,
H. Afsharnia,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey
, et al. (1038 additional authors not shown)
Abstract:
Correlations of same-sign charged particles are analysed using proton-lead collision data collected by the LHCb experiment at a nucleon-nucleon centre-of-mass energy of 5.02 TeV, corresponding to an integrated luminosity of 1.06 nb-1. Bose-Einstein correlations are observed in the form of an enhancement of pair production for same-sign charged pions with a small four-momentum difference squared. T…
▽ More
Correlations of same-sign charged particles are analysed using proton-lead collision data collected by the LHCb experiment at a nucleon-nucleon centre-of-mass energy of 5.02 TeV, corresponding to an integrated luminosity of 1.06 nb-1. Bose-Einstein correlations are observed in the form of an enhancement of pair production for same-sign charged pions with a small four-momentum difference squared. The dependence of the correlation radius and the intercept parameter on the reconstructed charged-particle multiplicity is investigated. The measured correlation radii scale linearly with the cube root of the reconstructed charged-particle multiplicity, being compatible with predictions of hydrodynamic models on the collision system evolution.
△ Less
Submitted 11 October, 2023; v1 submitted 16 June, 2023;
originally announced June 2023.
-
Adapting Multi-Lingual ASR Models for Handling Multiple Talkers
Authors:
Chenda Li,
Yao Qian,
Zhuo Chen,
Naoyuki Kanda,
Dongmei Wang,
Takuya Yoshioka,
Yanmin Qian,
Michael Zeng
Abstract:
State-of-the-art large-scale universal speech models (USMs) show a decent automatic speech recognition (ASR) performance across multiple domains and languages. However, it remains a challenge for these models to recognize overlapped speech, which is often seen in meeting conversations. We propose an approach to adapt USMs for multi-talker ASR. We first develop an enhanced version of serialized out…
▽ More
State-of-the-art large-scale universal speech models (USMs) show a decent automatic speech recognition (ASR) performance across multiple domains and languages. However, it remains a challenge for these models to recognize overlapped speech, which is often seen in meeting conversations. We propose an approach to adapt USMs for multi-talker ASR. We first develop an enhanced version of serialized output training to jointly perform multi-talker ASR and utterance timestamp prediction. That is, we predict the ASR hypotheses for all speakers, count the speakers, and estimate the utterance timestamps at the same time. We further introduce a lightweight adapter module to maintain the multilingual property of the USMs even when we perform the adaptation with only a single language. Experimental results obtained using the AMI and AliMeeting corpora show that our proposed approach effectively transfers the USMs to a strong multilingual multi-talker ASR model with timestamp prediction capability.
△ Less
Submitted 30 May, 2023;
originally announced May 2023.
-
Associated production of prompt $J/ψ$ and $\mathitΥ$ mesons in $pp$ collisions at $\sqrt{s}=13\,\mathrm{TeV}$
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
B. Adeva,
M. Adinolfi,
P. Adlarson,
H. Afsharnia,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey
, et al. (1037 additional authors not shown)
Abstract:
The associated production of prompt $J/ψ$ and $\mathit{\mathitΥ}$ mesons in $pp$ collisions at a centre-of-mass energy of $\sqrt{s}=13\,\mathrm{TeV}$ is studied using LHCb data, corresponding to an integrated luminosity of $4\,\mathrm{fb}^{-1}$. The measurement is performed for $J/ψ$ ($\mathitΥ$) mesons with a transverse momentum $p_{\mathrm{T}}<10\,(30)\,\mathrm{GeV}/c$ in the rapidity range…
▽ More
The associated production of prompt $J/ψ$ and $\mathit{\mathitΥ}$ mesons in $pp$ collisions at a centre-of-mass energy of $\sqrt{s}=13\,\mathrm{TeV}$ is studied using LHCb data, corresponding to an integrated luminosity of $4\,\mathrm{fb}^{-1}$. The measurement is performed for $J/ψ$ ($\mathitΥ$) mesons with a transverse momentum $p_{\mathrm{T}}<10\,(30)\,\mathrm{GeV}/c$ in the rapidity range $2.0<y<4.5$. In this kinematic range, the cross-section of the associated production of prompt $J/ψ$ and $\mathitΥ(1S)$ mesons is measured to be $133 \pm 22 \pm 7 \pm 3 \, \mathrm{pb}$, with a significance of $7.9\,σ$, and that of prompt $J/ψ$ and $\mathitΥ(2S)$ mesons to be $76\pm 21 \pm 4 \pm 7 \, \mathrm{pb}$, with a significance of $4.9\,σ$. The first uncertainty is statistical, the second systematic, and the third due to uncertainties on the used branching fractions. This is the first observation of the associated production of $J/ψ$ and $\mathitΥ(1S)$ in proton-proton collisions. Differential cross-sections are measured as functions of variables that are sensitive to kinematic correlations between the $J/ψ$ and $\mathitΥ(1S)$ mesons. The effective cross-sections of the associated production of prompt $J/ψ$ and $\mathitΥ$ mesons are obtained and found to be compatible with measurements using other particle productions.
△ Less
Submitted 29 August, 2023; v1 submitted 24 May, 2023;
originally announced May 2023.
-
Measurement of the mass difference and relative production rate of the $Ω^-_b$ and $Ξ^-_b$ baryons
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
B. Adeva,
M. Adinolfi,
P. Adlarson,
H. Afsharnia,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey
, et al. (1042 additional authors not shown)
Abstract:
The mass difference between the $Ω^-_b$ and $Ξ^-_b$ baryons is measured using proton-proton collision data collected by the LHCb experiment, corresponding to an integrated luminosity of $9 \, \text{fb}^{-1}$, and is found to be \begin{equation} m(Ω^-_b)- m(Ξ^-_b) = 248.54 \pm 0.51 \text{(stat)} \pm 0.38 \text{(syst)} \, \text{MeV}/c^2. \end{equation} The mass of the $Ω^-_b$ baryon is measured to b…
▽ More
The mass difference between the $Ω^-_b$ and $Ξ^-_b$ baryons is measured using proton-proton collision data collected by the LHCb experiment, corresponding to an integrated luminosity of $9 \, \text{fb}^{-1}$, and is found to be \begin{equation} m(Ω^-_b)- m(Ξ^-_b) = 248.54 \pm 0.51 \text{(stat)} \pm 0.38 \text{(syst)} \, \text{MeV}/c^2. \end{equation} The mass of the $Ω^-_b$ baryon is measured to be \begin{equation} m(Ω^-_b)= 6045.9 \pm 0.5 \text{(stat)} \pm 0.6 \text{(syst)} \, \text{MeV}/c^2. \end{equation} This is the most precise determination of the $Ω^-_b$ mass to date. In addition, the production rate of $Ω^-_b$ baryons relative to that of $Ξ^-_b$ baryons is measured for the first time in $pp$ collisions, using an LHCb dataset collected at a center-of-mass energy of $13 \, \text{TeV}$ and corresponding to an integrated luminosity of $6\,\text{fb}^{-1}$. Reconstructing beauty baryons in the kinematic region $2 < η< 6$ and $p_T < 20\,\text{GeV}/c$ with their decays to a $J/ψ$ meson and a hyperon, the ratio \begin{equation} \frac{f_{Ω^-_b}}{f_{Ξ^-_b}}\times\frac{\mathcal{B}(Ω^-_b \to J/ψΩ^-)}{\mathcal{B}(Ξ^-_b \to J/ψΞ^-)} = 0.120 \pm 0.008 \text{(stat)} \pm 0.008 \text{(syst)}, \end{equation} is obtained, where $f_{Ω^-_b}$ and $f_{Ξ^-_b}$ are the fragmentation fractions of $b$ quarks into $Ω^-_b$ and $Ξ^-_b$ baryons, respectively, and $\mathcal{B}$ represents the branching fractions of their respective decays.
△ Less
Submitted 24 May, 2023;
originally announced May 2023.
-
ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation
Authors:
Chenyang Le,
Yao Qian,
Long Zhou,
Shujie Liu,
Yanmin Qian,
Michael Zeng,
Xuedong Huang
Abstract:
Joint speech-language training is challenging due to the large demand for training data and GPU consumption, as well as the modality gap between speech and language. We present ComSL, a speech-language model built atop a composite architecture of public pretrained speech-only and language-only models and optimized data-efficiently for spoken language tasks. Particularly, we propose to incorporate…
▽ More
Joint speech-language training is challenging due to the large demand for training data and GPU consumption, as well as the modality gap between speech and language. We present ComSL, a speech-language model built atop a composite architecture of public pretrained speech-only and language-only models and optimized data-efficiently for spoken language tasks. Particularly, we propose to incorporate cross-modality learning into transfer learning and conduct them simultaneously for downstream tasks in a multi-task learning manner. Our approach has demonstrated effectiveness in end-to-end speech-to-text translation tasks, achieving a new state-of-the-art average BLEU score of 31.5 on the multilingual speech to English text translation task for 21 languages, as measured on the public CoVoST2 evaluation set.
△ Less
Submitted 14 October, 2023; v1 submitted 24 May, 2023;
originally announced May 2023.
-
i-Code Studio: A Configurable and Composable Framework for Integrative AI
Authors:
Yuwei Fang,
Mahmoud Khademi,
Chenguang Zhu,
Ziyi Yang,
Reid Pryzant,
Yichong Xu,
Yao Qian,
Takuya Yoshioka,
Lu Yuan,
Michael Zeng,
Xuedong Huang
Abstract:
Artificial General Intelligence (AGI) requires comprehensive understanding and generation capabilities for a variety of tasks spanning different modalities and functionalities. Integrative AI is one important direction to approach AGI, through combining multiple models to tackle complex multimodal tasks. However, there is a lack of a flexible and composable platform to facilitate efficient and eff…
▽ More
Artificial General Intelligence (AGI) requires comprehensive understanding and generation capabilities for a variety of tasks spanning different modalities and functionalities. Integrative AI is one important direction to approach AGI, through combining multiple models to tackle complex multimodal tasks. However, there is a lack of a flexible and composable platform to facilitate efficient and effective model composition and coordination. In this paper, we propose the i-Code Studio, a configurable and composable framework for Integrative AI. The i-Code Studio orchestrates multiple pre-trained models in a finetuning-free fashion to conduct complex multimodal tasks. Instead of simple model composition, the i-Code Studio provides an integrative, flexible, and composable setting for developers to quickly and easily compose cutting-edge services and technologies tailored to their specific requirements. The i-Code Studio achieves impressive results on a variety of zero-shot multimodal tasks, such as video-to-text retrieval, speech-to-speech translation, and visual question answering. We also demonstrate how to quickly build a multimodal agent based on the i-Code Studio that can communicate and personalize for users.
△ Less
Submitted 23 May, 2023;
originally announced May 2023.
-
LMGQS: A Large-scale Dataset for Query-focused Summarization
Authors:
Ruochen Xu,
Song Wang,
Yang Liu,
Shuohang Wang,
Yichong Xu,
Dan Iter,
Chenguang Zhu,
Michael Zeng
Abstract:
Query-focused summarization (QFS) aims to extract or generate a summary of an input document that directly answers or is relevant to a given query. The lack of large-scale datasets in the form of documents, queries, and summaries has hindered model development in this area. In contrast, multiple large-scale high-quality datasets for generic summarization exist. We hypothesize that there is a hidde…
▽ More
Query-focused summarization (QFS) aims to extract or generate a summary of an input document that directly answers or is relevant to a given query. The lack of large-scale datasets in the form of documents, queries, and summaries has hindered model development in this area. In contrast, multiple large-scale high-quality datasets for generic summarization exist. We hypothesize that there is a hidden query for each summary sentence in a generic summarization annotation, and we utilize a large-scale pretrained language model to recover it. In this way, we convert four generic summarization benchmarks into a new QFS benchmark dataset, LMGQS, which consists of over 1 million document-query-summary samples. We thoroughly investigate the properties of our proposed dataset and establish baselines with state-of-the-art summarization models. By fine-tuning a language model on LMGQS, we achieve state-of-the-art zero-shot and supervised performance on multiple existing QFS benchmarks, demonstrating the high quality and diversity of LMGQS.
△ Less
Submitted 22 May, 2023;
originally announced May 2023.
-
InheritSumm: A General, Versatile and Compact Summarizer by Distilling from GPT
Authors:
Yichong Xu,
Ruochen Xu,
Dan Iter,
Yang Liu,
Shuohang Wang,
Chenguang Zhu,
Michael Zeng
Abstract:
While large models such as GPT-3 demonstrate exceptional performance in zeroshot and fewshot summarization tasks, their extensive serving and fine-tuning costs hinder their utilization in various applications. Conversely, previous studies have found that although automatic metrics tend to favor smaller fine-tuned models, the quality of the summaries they generate is inferior to that of larger mode…
▽ More
While large models such as GPT-3 demonstrate exceptional performance in zeroshot and fewshot summarization tasks, their extensive serving and fine-tuning costs hinder their utilization in various applications. Conversely, previous studies have found that although automatic metrics tend to favor smaller fine-tuned models, the quality of the summaries they generate is inferior to that of larger models like GPT-3 when assessed by human evaluators. To address this issue, we propose InheritSumm, a versatile and compact summarization model derived from GPT-3.5 through distillation. InheritSumm not only exhibits comparable zeroshot and fewshot summarization capabilities to GPT-3.5 but is also sufficiently compact for fine-tuning purposes. Experimental results demonstrate that InheritSumm achieves similar or superior performance to GPT-3.5 in zeroshot and fewshot settings. Furthermore, it outperforms the previously established best small models in both prefix-tuning and full-data fine-tuning scenarios.
△ Less
Submitted 22 May, 2023;
originally announced May 2023.
-
EMEF: Ensemble Multi-Exposure Image Fusion
Authors:
Renshuai Liu,
Chengyang Li,
Haitao Cao,
Yinglin Zheng,
Ming Zeng,
Xuan Cheng
Abstract:
Although remarkable progress has been made in recent years, current multi-exposure image fusion (MEF) research is still bounded by the lack of real ground truth, objective evaluation function, and robust fusion strategy. In this paper, we study the MEF problem from a new perspective. We don't utilize any synthesized ground truth, design any loss function, or develop any fusion strategy. Our propos…
▽ More
Although remarkable progress has been made in recent years, current multi-exposure image fusion (MEF) research is still bounded by the lack of real ground truth, objective evaluation function, and robust fusion strategy. In this paper, we study the MEF problem from a new perspective. We don't utilize any synthesized ground truth, design any loss function, or develop any fusion strategy. Our proposed method EMEF takes advantage of the wisdom of multiple imperfect MEF contributors including both conventional and deep learning-based methods. Specifically, EMEF consists of two main stages: pre-train an imitator network and tune the imitator in the runtime. In the first stage, we make a unified network imitate different MEF targets in a style modulation way. In the second stage, we tune the imitator network by optimizing the style code, in order to find an optimal fusion result for each input pair. In the experiment, we construct EMEF from four state-of-the-art MEF methods and then make comparisons with the individuals and several other competitive methods on the latest released MEF benchmark dataset. The promising experimental results demonstrate that our ensemble framework can "get the best of all worlds". The code is available at https://github.com/medalwill/EMEF.
△ Less
Submitted 22 May, 2023;
originally announced May 2023.
-
i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data
Authors:
Ziyi Yang,
Mahmoud Khademi,
Yichong Xu,
Reid Pryzant,
Yuwei Fang,
Chenguang Zhu,
Dongdong Chen,
Yao Qian,
Mei Gao,
Yi-Ling Chen,
Robert Gmyr,
Naoyuki Kanda,
Noel Codella,
Bin Xiao,
Yu Shi,
Lu Yuan,
Takuya Yoshioka,
Michael Zeng,
Xuedong Huang
Abstract:
The convergence of text, visual, and audio data is a key step towards human-like artificial intelligence, however the current Vision-Language-Speech landscape is dominated by encoder-only models which lack generative abilities. We propose closing this gap with i-Code V2, the first model capable of generating natural language from any combination of Vision, Language, and Speech data. i-Code V2 is a…
▽ More
The convergence of text, visual, and audio data is a key step towards human-like artificial intelligence, however the current Vision-Language-Speech landscape is dominated by encoder-only models which lack generative abilities. We propose closing this gap with i-Code V2, the first model capable of generating natural language from any combination of Vision, Language, and Speech data. i-Code V2 is an integrative system that leverages state-of-the-art single-modality encoders, combining their outputs with a new modality-fusing encoder in order to flexibly project combinations of modalities into a shared representational space. Next, language tokens are generated from these representations via an autoregressive decoder. The whole framework is pretrained end-to-end on a large collection of dual- and single-modality datasets using a novel text completion objective that can be generalized across arbitrary combinations of modalities. i-Code V2 matches or outperforms state-of-the-art single- and dual-modality baselines on 7 multimodal tasks, demonstrating the power of generative multimodal pretraining across a diversity of tasks and signals.
△ Less
Submitted 20 May, 2023;
originally announced May 2023.
-
Any-to-Any Generation via Composable Diffusion
Authors:
Zineng Tang,
Ziyi Yang,
Chenguang Zhu,
Michael Zeng,
Mohit Bansal
Abstract:
We present Composable Diffusion (CoDi), a novel generative model capable of generating any combination of output modalities, such as language, image, video, or audio, from any combination of input modalities. Unlike existing generative AI systems, CoDi can generate multiple modalities in parallel and its input is not limited to a subset of modalities like text or image. Despite the absence of trai…
▽ More
We present Composable Diffusion (CoDi), a novel generative model capable of generating any combination of output modalities, such as language, image, video, or audio, from any combination of input modalities. Unlike existing generative AI systems, CoDi can generate multiple modalities in parallel and its input is not limited to a subset of modalities like text or image. Despite the absence of training datasets for many combinations of modalities, we propose to align modalities in both the input and output space. This allows CoDi to freely condition on any input combination and generate any group of modalities, even if they are not present in the training data. CoDi employs a novel composable generation strategy which involves building a shared multimodal space by bridging alignment in the diffusion process, enabling the synchronized generation of intertwined modalities, such as temporally aligned video and audio. Highly customizable and flexible, CoDi achieves strong joint-modality generation quality, and outperforms or is on par with the unimodal state-of-the-art for single-modality synthesis. The project page with demonstrations and code is at https://codi-gen.github.io
△ Less
Submitted 19 May, 2023;
originally announced May 2023.
-
The LHCb upgrade I
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
C. Achard,
T. Ackernley,
B. Adeva,
M. Adinolfi,
P. Adlarson,
H. Afsharnia,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato
, et al. (1298 additional authors not shown)
Abstract:
The LHCb upgrade represents a major change of the experiment. The detectors have been almost completely renewed to allow running at an instantaneous luminosity five times larger than that of the previous running periods. Readout of all detectors into an all-software trigger is central to the new design, facilitating the reconstruction of events at the maximum LHC interaction rate, and their select…
▽ More
The LHCb upgrade represents a major change of the experiment. The detectors have been almost completely renewed to allow running at an instantaneous luminosity five times larger than that of the previous running periods. Readout of all detectors into an all-software trigger is central to the new design, facilitating the reconstruction of events at the maximum LHC interaction rate, and their selection in real time. The experiment's tracking system has been completely upgraded with a new pixel vertex detector, a silicon tracker upstream of the dipole magnet and three scintillating fibre tracking stations downstream of the magnet. The whole photon detection system of the RICH detectors has been renewed and the readout electronics of the calorimeter and muon systems have been fully overhauled. The first stage of the all-software trigger is implemented on a GPU farm. The output of the trigger provides a combination of totally reconstructed physics objects, such as tracks and vertices, ready for final analysis, and of entire events which need further offline reprocessing. This scheme required a complete revision of the computing model and rewriting of the experiment's software.
△ Less
Submitted 17 May, 2023;
originally announced May 2023.
-
Evolving the Digital Industrial Infrastructure for Production: Steps Taken and the Road Ahead
Authors:
Jan Pennekamp,
Anastasiia Belova,
Thomas Bergs,
Matthias Bodenbenner,
Andreas Bührig-Polaczek,
Markus Dahlmanns,
Ike Kunze,
Moritz Kröger,
Sandra Geisler,
Martin Henze,
Daniel Lütticke,
Benjamin Montavon,
Philipp Niemietz,
Lucia Ortjohann,
Maximilian Rudack,
Robert H. Schmitt,
Uwe Vroomen,
Klaus Wehrle,
Michael Zeng
Abstract:
The Internet of Production (IoP) leverages concepts such as digital shadows, data lakes, and a World Wide Lab (WWL) to advance today's production. Consequently, it requires a technical infrastructure that can support the agile deployment of these concepts and corresponding high-level applications, which, e.g., demand the processing of massive data in motion and at rest. As such, key research aspec…
▽ More
The Internet of Production (IoP) leverages concepts such as digital shadows, data lakes, and a World Wide Lab (WWL) to advance today's production. Consequently, it requires a technical infrastructure that can support the agile deployment of these concepts and corresponding high-level applications, which, e.g., demand the processing of massive data in motion and at rest. As such, key research aspects are the support for low-latency control loops, concepts on scalable data stream processing, deployable information security, and semantically rich and efficient long-term storage. In particular, such an infrastructure cannot continue to be limited to machines and sensors, but additionally needs to encompass networked environments: production cells, edge computing, and location-independent cloud infrastructures. Finally, in light of the envisioned WWL, i.e., the interconnection of production sites, the technical infrastructure must be advanced to support secure and privacy-preserving industrial collaboration. To evolve today's production sites and lay the infrastructural foundation for the IoP, we identify five broad streams of research: (1) adapting data and stream processing to heterogeneous data from distributed sources, (2) ensuring data interoperability between systems and production sites, (3) exchanging and sharing data with different stakeholders, (4) network security approaches addressing the risks of increasing interconnectivity, and (5) security architectures to enable secure and privacy-preserving industrial collaboration. With our research, we evolve the underlying infrastructure from isolated, sparsely networked production sites toward an architecture that supports high-level applications and sophisticated digital shadows while facilitating the transition toward a WWL.
△ Less
Submitted 17 May, 2023;
originally announced May 2023.
-
Conservative binary dynamics at order $O(α^5)$ in electrodynamics
Authors:
Zvi Bern,
Enrico Herrmann,
Radu Roiban,
Michael S. Ruf,
Alexander V. Smirnov,
Vladimir A. Smirnov,
Mao Zeng
Abstract:
We compute the potential-photon contributions to the classical relativistic scattering angle of two charged non-spinning bodies in electrodynamics through fifth order in the coupling. We use the scattering amplitudes framework, effective field theory, and multi-loop integration techniques based on integration by parts and differential equations. At fifth order, the result is expressed in terms of…
▽ More
We compute the potential-photon contributions to the classical relativistic scattering angle of two charged non-spinning bodies in electrodynamics through fifth order in the coupling. We use the scattering amplitudes framework, effective field theory, and multi-loop integration techniques based on integration by parts and differential equations. At fifth order, the result is expressed in terms of cyclotomic polylogarithms. Our calculation demonstrates the feasibility of the corresponding calculations in general relativity, including the evaluation of the encountered four-loop integrals.
△ Less
Submitted 15 May, 2023;
originally announced May 2023.
-
Measurement of $Ξ_{c}^{+}$ production in $p$Pb collisions at $\sqrt{s_{NN}}=8.16$ TeV at LHCb
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
B. Adeva,
M. Adinolfi,
P. Adlarson,
H. Afsharnia,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey
, et al. (1040 additional authors not shown)
Abstract:
A study of prompt $Ξ_{c}^{+}$ production in proton-lead collisions is performed with the LHCb experiment at a centre-of-mass energy per nucleon pair of 8.16 TeV in 2016 in $p$Pb and Pb$p$ collisions with an estimated integrated luminosity of approximately 12.5 and 17.4 nb$^{-1}$, respectively. The $Ξ_{c}^{+}$ production cross-section, as well as the $Ξ_{c}^{+}$ to $Λ_{c}^{+}$ production cross-sect…
▽ More
A study of prompt $Ξ_{c}^{+}$ production in proton-lead collisions is performed with the LHCb experiment at a centre-of-mass energy per nucleon pair of 8.16 TeV in 2016 in $p$Pb and Pb$p$ collisions with an estimated integrated luminosity of approximately 12.5 and 17.4 nb$^{-1}$, respectively. The $Ξ_{c}^{+}$ production cross-section, as well as the $Ξ_{c}^{+}$ to $Λ_{c}^{+}$ production cross-section ratio, are measured as a function of the transverse momentum and rapidity and compared to latest theory predictions. The forward-backward asymmetry is also measured as a function of the $Ξ_{c}^{+}$ transverse momentum.
△ Less
Submitted 2 November, 2023; v1 submitted 11 May, 2023;
originally announced May 2023.
-
Automatic Prompt Optimization with "Gradient Descent" and Beam Search
Authors:
Reid Pryzant,
Dan Iter,
Jerry Li,
Yin Tat Lee,
Chenguang Zhu,
Michael Zeng
Abstract:
Large Language Models (LLMs) have shown impressive performance as general purpose agents, but their abilities remain highly dependent on prompts which are hand written with onerous trial-and-error effort. We propose a simple and nonparametric solution to this problem, Automatic Prompt Optimization (APO), which is inspired by numerical gradient descent to automatically improve prompts, assuming acc…
▽ More
Large Language Models (LLMs) have shown impressive performance as general purpose agents, but their abilities remain highly dependent on prompts which are hand written with onerous trial-and-error effort. We propose a simple and nonparametric solution to this problem, Automatic Prompt Optimization (APO), which is inspired by numerical gradient descent to automatically improve prompts, assuming access to training data and an LLM API. The algorithm uses minibatches of data to form natural language "gradients" that criticize the current prompt. The gradients are then "propagated" into the prompt by editing the prompt in the opposite semantic direction of the gradient. These gradient descent steps are guided by a beam search and bandit selection procedure which significantly improves algorithmic efficiency. Preliminary results across three benchmark NLP tasks and the novel problem of LLM jailbreak detection suggest that Automatic Prompt Optimization can outperform prior prompt editing techniques and improve an initial prompt's performance by up to 31%, by using data to rewrite vague task descriptions into more precise annotation instructions.
△ Less
Submitted 19 October, 2023; v1 submitted 4 May, 2023;
originally announced May 2023.
-
Test of lepton flavour universality using $B^0 \to D^{*-}τ^+ν_τ$ decays with hadronic $τ$ channels
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
B. Adeva,
M. Adinolfi,
P. Adlarson,
H. Afsharnia,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey
, et al. (1043 additional authors not shown)
Abstract:
The branching fraction $\mathcal{B}(B^0 \to D^{*-}τ^+ν_τ)$ is measured relative to that of the normalization mode $B^0 \to D^{*-}π^+π^-π^+$ using hadronic $τ^+ \to π^+π^-π^+(π^0)\overlineν_τ$ decays in proton-proton collision data at a center-of-mass energy of 13 TeV collected by the LHCb experiment, corresponding to an integrated luminosity of 2 fb$^{-1}$. The measured ratio is…
▽ More
The branching fraction $\mathcal{B}(B^0 \to D^{*-}τ^+ν_τ)$ is measured relative to that of the normalization mode $B^0 \to D^{*-}π^+π^-π^+$ using hadronic $τ^+ \to π^+π^-π^+(π^0)\overlineν_τ$ decays in proton-proton collision data at a center-of-mass energy of 13 TeV collected by the LHCb experiment, corresponding to an integrated luminosity of 2 fb$^{-1}$. The measured ratio is $\mathcal{B}(B^0 \to D^{*-}τ^+ν_τ)/\mathcal{B}(B^0 \to D^{*-}π^+π^-π^+) = 1.79 \pm 0.11 \pm 0.11$, where the first uncertainty is statistical and the second is related to systematic effects. Using established branching fractions for the $B^0 \to D^{*-}π^+π^-π^+$ and $B^0 \to D^{*-}μ^+ν_μ$ modes, the lepton universality test, $\mathcal{R}(D^{*-}) \equiv \mathcal{B}(B^0 \to D^{*-}τ^+ν_τ)/\mathcal{B}(B^0 \to D^{*-}μ^+ν_μ)$ is calculated, $$ \mathcal{R}(D^{*-}) = 0.260 \pm 0.015 \pm 0.016 \pm 0.012\, , $$ where the third uncertainty is due to the uncertainties on the external branching fractions. This result is consistent with the Standard Model prediction and with previous measurements.
△ Less
Submitted 13 May, 2024; v1 submitted 2 May, 2023;
originally announced May 2023.
-
Searching for $^{76}$Ge neutrinoless double beta decay with the CDEX-1B experiment
Authors:
B. T. Zhang,
L. T. Yang,
Q. Yue,
K. J. Kang,
Y. J. Li,
H. P. An,
Greeshma C.,
J. P. Chang,
Y. H. Chen,
J. P. Cheng,
W. H. Dai,
Z. Deng,
C. H. Fang,
X. P. Geng,
H. Gong,
Q. J. Guo,
X. Y. Guo,
L. He,
S. M. He,
J. W. Hu,
H. X. Huang,
T. C. Huang,
H. T. Jia,
X. Jiang,
S. Karmakar
, et al. (59 additional authors not shown)
Abstract:
We operated a p-type point contact high purity germanium (PPCGe) detector (CDEX-1B, 1.008 kg) in the China **** Underground Laboratory (CJPL) for 500.3 days to search for neutrinoless double beta ($0νββ$) decay of $^{76}$Ge. A total of 504.3 kg $\cdot$ day effective exposure data was accumulated. The anti-coincidence and the multi/single-site event (MSE/SSE) discrimination methods were used to…
▽ More
We operated a p-type point contact high purity germanium (PPCGe) detector (CDEX-1B, 1.008 kg) in the China **** Underground Laboratory (CJPL) for 500.3 days to search for neutrinoless double beta ($0νββ$) decay of $^{76}$Ge. A total of 504.3 kg $\cdot$ day effective exposure data was accumulated. The anti-coincidence and the multi/single-site event (MSE/SSE) discrimination methods were used to suppress the background in the energy region of interest (ROI, $1989-2089$ keV for this work) with a factor of 23. A background level of 0.33 counts/(keV $\cdot$ kg $\cdot$ yr) was achieved. The lower limit on the half life of $^{76}$Ge $0νββ$ decay was constrained as $T_{1/2}^{0ν}\ > \ {2.2}\times 10^{23}\ \rm yr\ (90\% \ C.L.)$, corresponding to the upper limits on the effective Majorana neutrino mass: $\langle m_{ββ}\rangle < 2.3-5.2\ \mathrm{eV}$.
△ Less
Submitted 8 May, 2023; v1 submitted 1 May, 2023;
originally announced May 2023.
-
Study of charmonium decays to $K^0_S K π$ in the $B \to (K^0_S K π) K$ channels
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
B. Adeva,
M. Adinolfi,
P. Adlarson,
H. Afsharnia,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey
, et al. (1041 additional authors not shown)
Abstract:
A study of the $B^+\to K^0_SK^+K^-π^+$ and $B^+\to K^0_SK^+K^+π^-$ decays is performed using proton-proton collisions at center-of-mass energies of 7, 8 and 13 TeV at the LHCb experiment. The $K^0_SK π$ invariant mass spectra from both decay modes reveal a rich content of charmonium resonances. New precise measurements of the $η_c$ and $η_c(2S)$ resonance parameters are performed and branching fra…
▽ More
A study of the $B^+\to K^0_SK^+K^-π^+$ and $B^+\to K^0_SK^+K^+π^-$ decays is performed using proton-proton collisions at center-of-mass energies of 7, 8 and 13 TeV at the LHCb experiment. The $K^0_SK π$ invariant mass spectra from both decay modes reveal a rich content of charmonium resonances. New precise measurements of the $η_c$ and $η_c(2S)$ resonance parameters are performed and branching fraction measurements are obtained for $B^+$ decays to $η_c$, $J/ψ$, $η_c(2S)$ and $χ_{c1}$ resonances. In particular, the first observation and branching fraction measurement of $B^+ \to χ_{c0} K^0 π^+$ is reported as well as first measurements of the $B^+\to K^0K^+K^-π^+$ and $B^+\to K^0K^+K^+π^-$ branching fractions. Dalitz plot analyses of $η_c \to K^0_SKπ$ and $η_c(2S) \to K^0_SKπ$ decays are performed. A new measurement of the amplitude and phase of the $K π$ $S$-wave as functions of the $K π$ mass is performed, together with measurements of the $K^*_0(1430)$, $K^*_0(1950)$ and $a_0(1700)$ parameters. Finally, the branching fractions of $χ_{c1}$ decays to $K^*$ resonances are also measured.
△ Less
Submitted 20 August, 2023; v1 submitted 28 April, 2023;
originally announced April 2023.
-
X-ray Polarimetry of the accreting pulsar 1A~0535+262 in the supercritical state with PolarLight
Authors:
Xiangyun Long,
Hua Feng,
Hong Li,
Ling-Da Kong,
Jeremy Heyl,
Long Ji,
Lian Tao,
Fabio Muleri,
Qiong Wu,
Jiahuan Zhu,
Jiahui Huang,
Massimo Minuti,
Weichun Jiang,
Saverio Citraro,
Hikmat Nasimi,
Jiandong Yu,
Ge **,
Ming Zeng,
Peng An,
Luca Baldini,
Ronaldo Bellazzini,
Alessandro Brez,
Luca Latronico,
Carmelo Sgrò,
Gloria Spandre
, et al. (3 additional authors not shown)
Abstract:
The X-ray pulsar 1A 0535+262 exhibited a giant outburst in 2020, offering us a unique opportunity for X-ray polarimetry of an accreting pulsar in the supercritical state. Measurement with PolarLight yielded a non-detection in 3-8 keV; the 99% upper limit of the polarization fraction (PF) is found to be 0.34 averaged over spin phases, or 0.51 based on the rotating vector model. No useful constraint…
▽ More
The X-ray pulsar 1A 0535+262 exhibited a giant outburst in 2020, offering us a unique opportunity for X-ray polarimetry of an accreting pulsar in the supercritical state. Measurement with PolarLight yielded a non-detection in 3-8 keV; the 99% upper limit of the polarization fraction (PF) is found to be 0.34 averaged over spin phases, or 0.51 based on the rotating vector model. No useful constraint can be placed with phase resolved polarimetry. These upper limits are lower than a previous theoretical prediction of 0.6-0.8, but consistent with those found in other accreting pulsars, like Her X-1, Cen X-3, 4U 1626-67, and GRO J1008-57, which were in the subcritical state, or at least not confidently in the supercritical state, during the polarization measurements. Our results suggest that the relatively low PF seen in accreting pulsars cannot be attributed to the source not being in the supercritical state, but could be a general feature.
△ Less
Submitted 26 April, 2023;
originally announced April 2023.
-
Rational Function Simplification for Integration-by-Parts Reduction and Beyond
Authors:
Kirill Mokrov,
Alexander Smirnov,
Mao Zeng
Abstract:
We present FUEL (Fractional Universal Evaluation Library), a C++ library for performing rational function arithmetic with a flexible choice of third-party computer algebra systems as simplifiers. FUEL is an outgrowth of a C++ interface to Fermat which was originally part of the FIRE code for integration-by-parts (IBP) reduction for Feynman integrals, now promoted to be a standalone library and wit…
▽ More
We present FUEL (Fractional Universal Evaluation Library), a C++ library for performing rational function arithmetic with a flexible choice of third-party computer algebra systems as simplifiers. FUEL is an outgrowth of a C++ interface to Fermat which was originally part of the FIRE code for integration-by-parts (IBP) reduction for Feynman integrals, now promoted to be a standalone library and with access to simplifiers other than Fermat. We compare the performance of various simplifiers for standalone benchmark problems as well as IBP reduction runs with FIRE. A speedup of more than 10 times is achieved for an example IBP problem related to off-shell three-particle form factors in $\mathcal N=4$ super-Yang-Mills theory.
△ Less
Submitted 22 October, 2023; v1 submitted 26 April, 2023;
originally announced April 2023.
-
High quality beam produced by tightly focused laser driven wakefield accelerators
Authors:
Jia Wang,
Ming Zeng,
Dazhang Li,
Xiaoning Wang,
Jie Gao
Abstract:
We propose to use tightly focused lasers to generate high quality electron beams in laser wakefield accelerators. In this scheme, the expansion of the laser beam after the focal position enlarges the size of wakefield bubble, which reduces the effective phase velocity of the wake and triggers injection of plasma electrons. This scheme injects a relatively long beam with high charge. The energy spr…
▽ More
We propose to use tightly focused lasers to generate high quality electron beams in laser wakefield accelerators. In this scheme, the expansion of the laser beam after the focal position enlarges the size of wakefield bubble, which reduces the effective phase velocity of the wake and triggers injection of plasma electrons. This scheme injects a relatively long beam with high charge. The energy spread of the injected beam can be minimized if an optimal acceleration distance is chosen so that the beam chirp is suppressed. Particle-in-cell simulations indicate that electron beams with the charge in the order of nanocoulomb, the energy spread of $\sim 1\%$, and the normalized emittance of $\rm \sim 0.1\ mm\cdot mrad$ can be generated in uniform plasma using $\sim 100\ \rm TW$ laser pulses. An empirical formula is also given for predicting the beam charge. This injection scheme, with a very simple setup, paves the way towards practical high-quality laser wakefield accelerators for table-top electron and radiation sources.
△ Less
Submitted 20 April, 2023;
originally announced April 2023.
-
Comparison of post-Minkowskian and self-force expansions: Scattering in a scalar charge toy model
Authors:
Leor Barack,
Zvi Bern,
Enrico Herrmann,
Oliver Long,
Julio Parra-Martinez,
Radu Roiban,
Michael S. Ruf,
Chia-Hsien Shen,
Mikhail P. Solon,
Fei Teng,
Mao Zeng
Abstract:
We compare numerical self-force results and analytical fourth-order post-Minkowskian (PM) calculations for hyperbolic-type scattering of a point-like particle carrying a scalar charge $Q$ off a Schwarzschild black hole, showing a remarkably good agreement. Specifically, we numerically compute the scattering angle including the full $O(Q^2)$ scalar-field self-force term (but ignoring the gravitatio…
▽ More
We compare numerical self-force results and analytical fourth-order post-Minkowskian (PM) calculations for hyperbolic-type scattering of a point-like particle carrying a scalar charge $Q$ off a Schwarzschild black hole, showing a remarkably good agreement. Specifically, we numerically compute the scattering angle including the full $O(Q^2)$ scalar-field self-force term (but ignoring the gravitational self-force), and compare with analytical expressions obtained in a PM framework using scattering-amplitude methods. This example provides a nontrivial, high-precision test of both calculation methods, and illustrates the complementarity of the two approaches in the context of the program to provide high-precision models of gravitational two-body dynamics. Our PM calculation is carried out through 4PM order, i.e., including all terms through $O(Q^2 G^3)$. At the fourth post-Minkowskian order the point-particle description involves two a-priori undetermined coefficients, due to contributions from tidal effects in the model under consideration. These coefficients are chosen to align the post-Minkowskian results with the self-force ones.
△ Less
Submitted 12 July, 2023; v1 submitted 18 April, 2023;
originally announced April 2023.
-
Precision measurement of $\it{CP} $ violation in the penguin-mediated decay $B_s^{0}\rightarrowφφ$
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
B. Adeva,
M. Adinolfi,
P. Adlarson,
H. Afsharnia,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey
, et al. (1037 additional authors not shown)
Abstract:
A flavor-tagged time-dependent angular analysis of the decay $B_s^{0}\rightarrowφφ$ is performed using $pp$ collision data collected by the LHCb experiment at $\sqrt{s}=13$ TeV, the center-of-mass energy of 13 TeV, corresponding to an integrated luminosity of 6 fb^{-1}. The $\it{CP}$-violating phase and direct $\it{CP}$-violation parameter are measured to be…
▽ More
A flavor-tagged time-dependent angular analysis of the decay $B_s^{0}\rightarrowφφ$ is performed using $pp$ collision data collected by the LHCb experiment at $\sqrt{s}=13$ TeV, the center-of-mass energy of 13 TeV, corresponding to an integrated luminosity of 6 fb^{-1}. The $\it{CP}$-violating phase and direct $\it{CP}$-violation parameter are measured to be $φ_{s\bar{s}s} = -0.042 \pm 0.075 \pm 0.009 $ rad and $|λ|=1.004\pm 0.030 \pm 0.009 $, respectively, assuming the same values for all polarization states of the $φφ$ system. In these results, the first uncertainties are statistical and the second systematic. These parameters are also determined separately for each polarization state, showing no evidence for polarization dependence. The results are combined with previous LHCb measurements using $pp$ collisions at center-of-mass energies of 7 and 8 TeV, yielding $φ_{s\bar{s}s} = -0.074 \pm 0.069 $ rad and $|λ|=1.009 \pm 0.030$. This is the most precise study of time-dependent $\it{CP} $ violation in a penguin-dominated $B$ meson decay. The results are consistent with $\it{CP} $ symmetry and with the Standard Model predictions.
△ Less
Submitted 25 October, 2023; v1 submitted 12 April, 2023;
originally announced April 2023.
-
Search for $D^{*}(2007)^0\toμ^+μ^-$ in $B^-\toπ^-μ^+μ^-$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
B. Adeva,
M. Adinolfi,
P. Adlarson,
H. Afsharnia,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey
, et al. (1040 additional authors not shown)
Abstract:
The very rare $D^{*}(2007)^0\toμ^+μ^-$ decay is searched for by analysing $B^-\toπ^-μ^+μ^-$ decays. The analysis uses a sample of beauty mesons produced in proton-proton collisions collected with the LHCb detector between 2011 and 2018, corresponding to an integrated luminosity of 9 fb$^{-1}$. The signal signature corresponds to simultaneous peaks in the $μ^+μ^-$ and $π^-μ^+μ^-$ invariant masses.…
▽ More
The very rare $D^{*}(2007)^0\toμ^+μ^-$ decay is searched for by analysing $B^-\toπ^-μ^+μ^-$ decays. The analysis uses a sample of beauty mesons produced in proton-proton collisions collected with the LHCb detector between 2011 and 2018, corresponding to an integrated luminosity of 9 fb$^{-1}$. The signal signature corresponds to simultaneous peaks in the $μ^+μ^-$ and $π^-μ^+μ^-$ invariant masses. No evidence for an excess of events over background is observed and an upper limit is set on the branching fraction of the decay at ${\cal B}(D^{*}(2007)^0\toμ^+μ^-) < 2.6\times 10^{-8}$ at $90\%$ confidence level. This is the first limit on the branching fraction of $D^{*}(2007)^0\toμ^+μ^-$ decays and the most stringent limit on $D^{*}(2007)^0$ decays to leptonic final states. The analysis is the first search for a rare charm-meson decay exploiting production via beauty decays.
△ Less
Submitted 15 August, 2023; v1 submitted 4 April, 2023;
originally announced April 2023.
-
STCF Conceptual Design Report: Volume 1 -- Physics & Detector
Authors:
M. Achasov,
X. C. Ai,
R. Aliberti,
L. P. An,
Q. An,
X. Z. Bai,
Y. Bai,
O. Bakina,
A. Barnyakov,
V. Blinov,
V. Bobrovnikov,
D. Bodrov,
A. Bogomyagkov,
A. Bondar,
I. Boyko,
Z. H. Bu,
F. M. Cai,
H. Cai,
J. J. Cao,
Q. H. Cao,
Z. Cao,
Q. Chang,
K. T. Chao,
D. Y. Chen,
H. Chen
, et al. (413 additional authors not shown)
Abstract:
The Super $τ$-Charm facility (STCF) is an electron-positron collider proposed by the Chinese particle physics community. It is designed to operate in a center-of-mass energy range from 2 to 7 GeV with a peak luminosity of $0.5\times 10^{35}{\rm cm}^{-2}{\rm s}^{-1}$ or higher. The STCF will produce a data sample about a factor of 100 larger than that by the present $τ$-Charm factory -- the BEPCII,…
▽ More
The Super $τ$-Charm facility (STCF) is an electron-positron collider proposed by the Chinese particle physics community. It is designed to operate in a center-of-mass energy range from 2 to 7 GeV with a peak luminosity of $0.5\times 10^{35}{\rm cm}^{-2}{\rm s}^{-1}$ or higher. The STCF will produce a data sample about a factor of 100 larger than that by the present $τ$-Charm factory -- the BEPCII, providing a unique platform for exploring the asymmetry of matter-antimatter (charge-parity violation), in-depth studies of the internal structure of hadrons and the nature of non-perturbative strong interactions, as well as searching for exotic hadrons and physics beyond the Standard Model. The STCF project in China is under development with an extensive R\&D program. This document presents the physics opportunities at the STCF, describes conceptual designs of the STCF detector system, and discusses future plans for detector R\&D and physics case studies.
△ Less
Submitted 5 October, 2023; v1 submitted 28 March, 2023;
originally announced March 2023.
-
Feynman Integrals from Positivity Constraints
Authors:
Mao Zeng
Abstract:
We explore inequality constraints as a new tool for numerically evaluating Feynman integrals. A convergent Feynman integral is non-negative if the integrand is non-negative in either loop momentum space or Feynman parameter space. Applying various identities, all such integrals can be reduced to linear sums of a small set of master integrals, leading to infinitely many linear constraints on the va…
▽ More
We explore inequality constraints as a new tool for numerically evaluating Feynman integrals. A convergent Feynman integral is non-negative if the integrand is non-negative in either loop momentum space or Feynman parameter space. Applying various identities, all such integrals can be reduced to linear sums of a small set of master integrals, leading to infinitely many linear constraints on the values of the master integrals. The constraints can be solved as a semidefinite programming problem in mathematical optimization, producing rigorous two-sided bounds for the integrals which are observed to converge rapidly as more constraints are included, enabling high-precision determination of the integrals. Positivity constraints can also be formulated for the $ε$ expansion terms in dimensional regularization and reveal hidden consistency relations between terms at different orders in $ε$. We introduce the main methods using one-loop bubble integrals, then present a nontrivial example of three-loop banana integrals with unequal masses, where 11 top-level master integrals are evaluated to high precision.
△ Less
Submitted 3 October, 2023; v1 submitted 27 March, 2023;
originally announced March 2023.
-
MusicFace: Music-driven Expressive Singing Face Synthesis
Authors:
Pengfei Liu,
Wen** Deng,
Hengda Li,
**tai Wang,
Yinglin Zheng,
Yiwei Ding,
Xiaohu Guo,
Ming Zeng
Abstract:
It is still an interesting and challenging problem to synthesize a vivid and realistic singing face driven by music signal. In this paper, we present a method for this task with natural motions of the lip, facial expression, head pose, and eye states. Due to the coupling of the mixed information of human voice and background music in common signals of music audio, we design a decouple-and-fuse str…
▽ More
It is still an interesting and challenging problem to synthesize a vivid and realistic singing face driven by music signal. In this paper, we present a method for this task with natural motions of the lip, facial expression, head pose, and eye states. Due to the coupling of the mixed information of human voice and background music in common signals of music audio, we design a decouple-and-fuse strategy to tackle the challenge. We first decompose the input music audio into human voice stream and background music stream. Due to the implicit and complicated correlation between the two-stream input signals and the dynamics of the facial expressions, head motions and eye states, we model their relationship with an attention scheme, where the effects of the two streams are fused seamlessly. Furthermore, to improve the expressiveness of the generated results, we propose to decompose head movements generation into speed generation and direction generation, and decompose eye states generation into the short-time eye blinking generation and the long-time eye closing generation to model them separately. We also build a novel SingingFace Dataset to support the training and evaluation of this task, and to facilitate future works on this topic. Extensive experiments and user study show that our proposed method is capable of synthesizing vivid singing face, which is better than state-of-the-art methods qualitatively and quantitatively.
△ Less
Submitted 24 March, 2023;
originally announced March 2023.
-
MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action
Authors:
Zhengyuan Yang,
Linjie Li,
Jianfeng Wang,
Kevin Lin,
Ehsan Azarnasab,
Faisal Ahmed,
Zicheng Liu,
Ce Liu,
Michael Zeng,
Lijuan Wang
Abstract:
We propose MM-REACT, a system paradigm that integrates ChatGPT with a pool of vision experts to achieve multimodal reasoning and action. In this paper, we define and explore a comprehensive list of advanced vision tasks that are intriguing to solve, but may exceed the capabilities of existing vision and vision-language models. To achieve such advanced visual intelligence, MM-REACT introduces a tex…
▽ More
We propose MM-REACT, a system paradigm that integrates ChatGPT with a pool of vision experts to achieve multimodal reasoning and action. In this paper, we define and explore a comprehensive list of advanced vision tasks that are intriguing to solve, but may exceed the capabilities of existing vision and vision-language models. To achieve such advanced visual intelligence, MM-REACT introduces a textual prompt design that can represent text descriptions, textualized spatial coordinates, and aligned file names for dense visual signals such as images and videos. MM-REACT's prompt design allows language models to accept, associate, and process multimodal information, thereby facilitating the synergetic combination of ChatGPT and various vision experts. Zero-shot experiments demonstrate MM-REACT's effectiveness in addressing the specified capabilities of interests and its wide application in different scenarios that require advanced visual understanding. Furthermore, we discuss and compare MM-REACT's system paradigm with an alternative approach that extends language models for multimodal scenarios through joint finetuning. Code, demo, video, and visualization are available at https://multimodal-react.github.io/
△ Less
Submitted 20 March, 2023;
originally announced March 2023.
-
Code-Switching Text Generation and Injection in Mandarin-English ASR
Authors:
Haibin Yu,
Yuxuan Hu,
Yao Qian,
Ma **,
Linquan Liu,
Shujie Liu,
Yu Shi,
Yanmin Qian,
Edward Lin,
Michael Zeng
Abstract:
Code-switching speech refers to a means of expression by mixing two or more languages within a single utterance. Automatic Speech Recognition (ASR) with End-to-End (E2E) modeling for such speech can be a challenging task due to the lack of data. In this study, we investigate text generation and injection for improving the performance of an industry commonly-used streaming model, Transformer-Transd…
▽ More
Code-switching speech refers to a means of expression by mixing two or more languages within a single utterance. Automatic Speech Recognition (ASR) with End-to-End (E2E) modeling for such speech can be a challenging task due to the lack of data. In this study, we investigate text generation and injection for improving the performance of an industry commonly-used streaming model, Transformer-Transducer (T-T), in Mandarin-English code-switching speech recognition. We first propose a strategy to generate code-switching text data and then investigate injecting generated text into T-T model explicitly by Text-To-Speech (TTS) conversion or implicitly by tying speech and text latent spaces. Experimental results on the T-T model trained with a dataset containing 1,800 hours of real Mandarin-English code-switched speech show that our approaches to inject generated code-switching text significantly boost the performance of T-T models, i.e., 16% relative Token-based Error Rate (TER) reduction averaged on three evaluation sets, and the approach of tying speech and text latent spaces is superior to that of TTS conversion on the evaluation set which contains more homogeneous data with the training set.
△ Less
Submitted 20 March, 2023;
originally announced March 2023.
-
Observation of the $B^+ \rightarrow J/ψη^{\prime} K^+$ decay
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
B. Adeva,
M. Adinolfi,
P. Adlarson,
H. Afsharnia,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey
, et al. (1041 additional authors not shown)
Abstract:
The $B^+ \rightarrow J/ψη^{\prime} K^+$ decay is observed for the first time using proton-proton collision data collected by the LHCb experiment at centre-of-mass energies of 7, 8, and 13TeV, corresponding to a total integrated luminosity of 9fb$^{-1}$. The branching fraction of this decay is measured relative to the known branching fraction of the $B^+ \rightarrow ψ(2S) K^+$ decays and found to b…
▽ More
The $B^+ \rightarrow J/ψη^{\prime} K^+$ decay is observed for the first time using proton-proton collision data collected by the LHCb experiment at centre-of-mass energies of 7, 8, and 13TeV, corresponding to a total integrated luminosity of 9fb$^{-1}$. The branching fraction of this decay is measured relative to the known branching fraction of the $B^+ \rightarrow ψ(2S) K^+$ decays and found to be $$ \frac{\mathcal{B}( B^+ \rightarrow J/ψη^{\prime}K^+)}{\mathcal{B}( B^+ \rightarrow ψ(2S)K^+)} = \left(4.91\pm 0.47\pm0.29\pm0.07\right)\times10^{-2}, $$ where the first uncertainty is statistical, the second is systematic and the third is related to external branching fractions. A first look at the $J/ψη^{\prime}$ mass distribution is performed and no signal of intermediate resonances is observed.
△ Less
Submitted 13 December, 2023; v1 submitted 16 March, 2023;
originally announced March 2023.
-
Target Sound Extraction with Variable Cross-modality Clues
Authors:
Chenda Li,
Yao Qian,
Zhuo Chen,
Dongmei Wang,
Takuya Yoshioka,
Shujie Liu,
Yanmin Qian,
Michael Zeng
Abstract:
Automatic target sound extraction (TSE) is a machine learning approach to mimic the human auditory perception capability of attending to a sound source of interest from a mixture of sources. It often uses a model conditioned on a fixed form of target sound clues, such as a sound class label, which limits the ways in which users can interact with the model to specify the target sounds. To leverage…
▽ More
Automatic target sound extraction (TSE) is a machine learning approach to mimic the human auditory perception capability of attending to a sound source of interest from a mixture of sources. It often uses a model conditioned on a fixed form of target sound clues, such as a sound class label, which limits the ways in which users can interact with the model to specify the target sounds. To leverage variable number of clues cross modalities available in the inference phase, including a video, a sound event class, and a text caption, we propose a unified transformer-based TSE model architecture, where a multi-clue attention module integrates all the clues across the modalities. Since there is no off-the-shelf benchmark to evaluate our proposed approach, we build a dataset based on public corpora, Audioset and AudioCaps. Experimental results for seen and unseen target-sound evaluation sets show that our proposed TSE model can effectively deal with a varying number of clues which improves the TSE performance and robustness against partially compromised clues.
△ Less
Submitted 15 March, 2023;
originally announced March 2023.
-
Towards a transportable Ca$^+$ optical clock with a systematic uncertainty of $4.8\times 10^{-18}$
Authors:
Mengyan Zeng,
Yao Huang,
Baolin Zhang,
Yanmei Hao,
Zixiao Ma,
Ruming Hu,
Huaqing Zhang,
Zheng Chen,
Miao Wang,
Hua Guan,
Kelin Gao
Abstract:
We present a compact, long-term nearly continuous operation of a room-temperature Ca$^+$ optical clock setup towards a transportable clock, achieving an overall systematic uncertainty of $4.8\times 10^{-18}$ and an uptime rate of 97.8% over an 8-day period. The active liquid-cooling scheme is adopted, combined with the precise temperature measurement with 13 temperature sensors both inside and out…
▽ More
We present a compact, long-term nearly continuous operation of a room-temperature Ca$^+$ optical clock setup towards a transportable clock, achieving an overall systematic uncertainty of $4.8\times 10^{-18}$ and an uptime rate of 97.8% over an 8-day period. The active liquid-cooling scheme is adopted, combined with the precise temperature measurement with 13 temperature sensors both inside and outside the vacuum chamber to ensure the accurate evaluation of the thermal environment for the optical clock. The environmental temperature uncertainty is evaluated as 293.31(0.4) K, corresponding to a blackbody radiation (BBR) frequency shift uncertainty of $4.6\times 10^{-18}$, which is reduced more than two times compared to our previous work. Through the frequency comparison between the room temperature Ca$^+$ optical clock and a cryogenic Ca$^+$ optical clock, the overall uncertainty of the clock comparison is $7.5\times 10^{-18}$, including a statistic uncertainty of $4.9\times 10^{-18}$ and a systematic uncertainty of $5.7\times 10^{-18}$. This work provides a set of feasible implementations for high-precision transportable ion optical clocks.
△ Less
Submitted 13 March, 2023;
originally announced March 2023.
-
Bag of Tricks with Quantized Convolutional Neural Networks for image classification
Authors:
Jie Hu,
Mengze Zeng,
Enhua Wu
Abstract:
Deep neural networks have been proven effective in a wide range of tasks. However, their high computational and memory costs make them impractical to deploy on resource-constrained devices. To address this issue, quantization schemes have been proposed to reduce the memory footprint and improve inference speed. While numerous quantization methods have been proposed, they lack systematic analysis f…
▽ More
Deep neural networks have been proven effective in a wide range of tasks. However, their high computational and memory costs make them impractical to deploy on resource-constrained devices. To address this issue, quantization schemes have been proposed to reduce the memory footprint and improve inference speed. While numerous quantization methods have been proposed, they lack systematic analysis for their effectiveness. To bridge this gap, we collect and improve existing quantization methods and propose a gold guideline for post-training quantization. We evaluate the effectiveness of our proposed method with two popular models, ResNet50 and MobileNetV2, on the ImageNet dataset. By following our guidelines, no accuracy degradation occurs even after directly quantizing the model to 8-bits without additional training. A quantization-aware training based on the guidelines can further improve the accuracy in lower-bits quantization. Moreover, we have integrated a multi-stage fine-tuning strategy that works harmoniously with existing pruning techniques to reduce costs even further. Remarkably, our results reveal that a quantized MobileNetV2 with 30\% sparsity actually surpasses the performance of the equivalent full-precision model, underscoring the effectiveness and resilience of our proposed scheme.
△ Less
Submitted 13 March, 2023;
originally announced March 2023.
-
A Lite Fireworks Algorithm with Fractal Dimension Constraint for Feature Selection
Authors:
Min Zeng,
Haimiao Mo,
Zhiming Liang,
Hua Wang
Abstract:
As the use of robotics becomes more widespread, the huge amount of vision data leads to a dramatic increase in data dimensionality. Although deep learning methods can effectively process these high-dimensional vision data. Due to the limitation of computational resources, some special scenarios still rely on traditional machine learning methods. However, these high-dimensional visual data lead to…
▽ More
As the use of robotics becomes more widespread, the huge amount of vision data leads to a dramatic increase in data dimensionality. Although deep learning methods can effectively process these high-dimensional vision data. Due to the limitation of computational resources, some special scenarios still rely on traditional machine learning methods. However, these high-dimensional visual data lead to great challenges for traditional machine learning methods. Therefore, we propose a Lite Fireworks Algorithm with Fractal Dimension constraint for feature selection (LFWA+FD) and use it to solve the feature selection problem driven by robot vision. The "LFWA+FD" focuses on searching the ideal feature subset by simplifying the fireworks algorithm and constraining the dimensionality of selected features by fractal dimensionality, which in turn reduces the approximate features and reduces the noise in the original data to improve the accuracy of the model. The comparative experimental results of two publicly available datasets from UCI show that the proposed method can effectively select a subset of features useful for model inference and remove a large amount of noise noise present in the original data to improve the performance.
△ Less
Submitted 8 March, 2023;
originally announced March 2023.
-
Observation of plaid-like spin splitting in a noncoplanar antiferromagnet
Authors:
Yu-Peng Zhu,
Xiaobing Chen,
Xiang-Rui Liu,
Yuntian Liu,
Pengfei Liu,
Heming Zha,
Gexing Qu,
Caiyun Hong,
Jiayu Li,
Zhicheng Jiang,
Xiao-Ming Ma,
Yu-Jie Hao,
Ming-Yuan Zhu,
Wen**g Liu,
Meng Zeng,
Sreehari Jayaram,
Malik Lenger,
Jianyang Ding,
Shu Mo,
Kiyohisa Tanaka,
Masashi Arita,
Zhengtai Liu,
Mao Ye,
Dawei Shen,
Jörg Wrachtrup
, et al. (5 additional authors not shown)
Abstract:
Spatial, momentum and energy separation of electronic spins in condensed matter systems guides the development of novel devices where spin-polarized current is generated and manipulated. Recent attention on a set of previously overlooked symmetry operations in magnetic materials leads to the emergence of a new type of spin splitting, enabling giant and momentum-dependent spin polarization of energ…
▽ More
Spatial, momentum and energy separation of electronic spins in condensed matter systems guides the development of novel devices where spin-polarized current is generated and manipulated. Recent attention on a set of previously overlooked symmetry operations in magnetic materials leads to the emergence of a new type of spin splitting, enabling giant and momentum-dependent spin polarization of energy bands on selected antiferromagnets. Despite the ever-growing theoretical predictions, the direct spectroscopic proof of such spin splitting is still lacking. Here, we provide solid spectroscopic and computational evidence for the existence of such materials. In the noncoplanar antiferromagnet MnTe$_2$, the in-plane components of spin are found to be antisymmetric about the high-symmetry planes of the Brillouin zone, comprising a plaid-like spin texture in the antiferromagnetic (AFM) ground state. Such an unconventional spin pattern, further found to diminish at the high-temperature paramagnetic state, stems from the intrinsic AFM order instead of spin-orbit coupling (SOC). Our finding demonstrates a new type of quadratic spin texture induced by time-reversal breaking, placing AFM spintronics on a firm basis and paving the way for studying exotic quantum phenomena in related materials.
△ Less
Submitted 4 January, 2024; v1 submitted 8 March, 2023;
originally announced March 2023.
-
Insight-HXMT and GECAM-C observations of the brightest-of-all-time GRB 221009A
Authors:
Zheng-Hua An,
S. Antier,
Xing-Zi Bi,
Qing-Cui Bu,
Ce Cai,
Xue-Lei Cao,
Anna-Elisa Camisasca,
Zhi Chang,
Gang Chen,
Li Chen,
Tian-Xiang Chen,
Wen Chen,
Yi-Bao Chen,
Yong Chen,
Yu-Peng Chen,
Michael W. Coughlin,
Wei-Wei Cui,
Zi-Gao Dai,
T. Hussenot-Desenonges,
Yan-Qi Du,
Yuan-Yuan Du,
Yun-Fei Du,
Cheng-Cheng Fan,
Filippo Frontera,
He Gao
, et al. (153 additional authors not shown)
Abstract:
GRB 221009A is the brightest gamma-ray burst ever detected since the discovery of this kind of energetic explosions. However, an accurate measurement of the prompt emission properties of this burst is very challenging due to its exceptional brightness. With joint observations of \textit{Insight}-HXMT and GECAM-C, we made an unprecedentedly accurate measurement of the emission during the first…
▽ More
GRB 221009A is the brightest gamma-ray burst ever detected since the discovery of this kind of energetic explosions. However, an accurate measurement of the prompt emission properties of this burst is very challenging due to its exceptional brightness. With joint observations of \textit{Insight}-HXMT and GECAM-C, we made an unprecedentedly accurate measurement of the emission during the first $\sim$1800 s of GRB 221009A, including its precursor, main emission (ME, which dominates the burst in flux), flaring emission and early afterglow, in the hard X-ray to soft gamma-ray band from $\sim$ 10 keV to $\sim$ 6 MeV. Based on the GECAM-C unsaturated data of the ME, we measure a record-breaking isotropic equivalent energy ($E_{\rm iso}$) of $\bf \sim 1.5 \times 10^{55}$ erg, which is about eight times the total rest-mass energy of the Sun. The early afterglow data require a significant jet break between 650 s and 1100 s, most likely at $\sim950$ s from the afterglow starting time $T_{AG}$, which corresponds to a jet opening angle of $\sim {0.7^\circ} \ (η_γn)^{1/8}$, where $n$ is the ambient medium density in units of $\rm cm^{-3}$ and $η_γ$ is the ratio between $γ$-ray energy and afterglow kinetic energy. The beaming-corrected total $γ$-ray energy $E_γ$ is $\sim 1.15 \times10^{51} \ (η_γn)^{1/4}$ erg, which is typical for long GRBs. These results suggest that this GRB may have a special central engine, which could launch and collimate a very narrowly beamed jet with an ordinary energy budget, leading to exceptionally luminous gamma-ray radiation per unit solid angle. Alternatively, more GRBs might have such a narrow and bright beam, which are missed by an unfavorable viewing angle or have been detected without distance measurement.
△ Less
Submitted 3 March, 2023; v1 submitted 2 March, 2023;
originally announced March 2023.
-
Absolute frequency measurements with a robust, transportable ^{40}Ca^{+} optical clock
Authors:
Huaqing Zhang,
Yao Huang,
Baolin Zhang,
Yanmei Hao,
Mengyan Zeng,
Qunfeng Chen,
Yuzhuo Wang,
Shiying Cao,
Yige Lin,
Zhanjun Fang,
Hua Guan,
Kelin Gao
Abstract:
We constructed a transportable 40Ca+ optical clock (with an estimated minimum systematic shift uncertainty of 1.3*10^(-17) and a stability of 5*10^(-15)/sqrt{tau} ) that can operate outside the laboratory. We transported it from the Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences, Wuhan to the National Institute of Metrology, Bei**g. The absolute f…
▽ More
We constructed a transportable 40Ca+ optical clock (with an estimated minimum systematic shift uncertainty of 1.3*10^(-17) and a stability of 5*10^(-15)/sqrt{tau} ) that can operate outside the laboratory. We transported it from the Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences, Wuhan to the National Institute of Metrology, Bei**g. The absolute frequency of the 729 nm clock transition was measured for up to 35 days by tracing its frequency to the second of International System of Units. Some improvements were implemented in the measurement process, such as the increased effective up-time of 91.3 % of the 40Ca+ optical clock over a 35-day-period, the reduced statistical uncertainty of the comparison between the optical clock and hydrogen maser, and the use of longer measurement times to reduce the uncertainty of the frequency traceability link. The absolute frequency measurement of the 40Ca+ optical clock yielded a value of 411042129776400.26 (13) Hz with an uncertainty of 3.2*10^(-16), which is reduced by a factor of 1.7 compared with our previous results. As a result of the increase in the operating rate of the optical clock, the accuracy of 35 days of absolute frequency measurement can be comparable to the best results of different institutions in the world based on different optical frequency measurements.
△ Less
Submitted 1 March, 2023;
originally announced March 2023.
-
Observation of the $B^0_s\rightarrow χ_{c1}(3872)π^+π^-$ decay
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
B. Adeva,
M. Adinolfi,
P. Adlarson,
H. Afsharnia,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey
, et al. (1037 additional authors not shown)
Abstract:
The first observation of the $B^0_s \rightarrow \left( χ_{c1}(3872) \rightarrow J/ψπ^+π^-\right) π^+ π^-$ decay is reported using proton-proton collision data, corresponding to integrated luminosities of 1, 2 and 6fb$^{-1}$, collected by the LHCb experiment at centre-of-mass energies of 7, 8 and 13TeV, respectively. The ratio of branching fractions relative to the…
▽ More
The first observation of the $B^0_s \rightarrow \left( χ_{c1}(3872) \rightarrow J/ψπ^+π^-\right) π^+ π^-$ decay is reported using proton-proton collision data, corresponding to integrated luminosities of 1, 2 and 6fb$^{-1}$, collected by the LHCb experiment at centre-of-mass energies of 7, 8 and 13TeV, respectively. The ratio of branching fractions relative to the $B^0_s \rightarrow \left( ψ(2S) \rightarrow Jψπ^+π^- \right) π^+ π^-$ decay is measured to be $$ \frac{ \mathcal{B} \left( B^0_s \rightarrow χ_{c1}(3872) π^+π^-\right)
\times \mathcal{B} \left( χ_{c1}(3872) \rightarrow Jψπ^+π^-\right)}
{ \mathcal{B} \left( B^0_s \rightarrow ψ(2S) π^+ π^- \right)
\times \mathcal{B} \left( ψ(2S) \rightarrow Jψπ^+π^-\right) }
= \left( 6.8 \pm 1.1 \pm 0.2 \right) \times 10^{-2} , $$ where the first uncertainty is statistical and the second systematic. The mass spectrum of the $π^+π^-$ system recoiling against the $χ_{c1}(3872)$ meson exhibits a large contribution from $B^0_s \rightarrow χ_{c1}(3872) \left( f_0(980) \rightarrow π^+ π^-\right)$ decays.
△ Less
Submitted 13 December, 2023; v1 submitted 21 February, 2023;
originally announced February 2023.
-
Measurement of the $Λ_{b}^{0}\to Λ(1520) μ^{+}μ^{-}$ differential branching fraction
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
B. Adeva,
M. Adinolfi,
P. Adlarson,
H. Afsharnia,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey
, et al. (1038 additional authors not shown)
Abstract:
The branching fraction of the rare decay $Λ_{b}^{0}\to Λ(1520) μ^{+}μ^{-}$ is measured for the first time, in the squared dimuon mass intervals, $q^2$, excluding the $J/ψ$ and $ψ(2S)$ regions. The data sample analyzed was collected by the LHCb experiment at center-of-mass energies of 7, 8, and 13 TeV, corresponding to a total integrated luminosity of $9\ \mathrm{fb}^{-1}$. The result in the highes…
▽ More
The branching fraction of the rare decay $Λ_{b}^{0}\to Λ(1520) μ^{+}μ^{-}$ is measured for the first time, in the squared dimuon mass intervals, $q^2$, excluding the $J/ψ$ and $ψ(2S)$ regions. The data sample analyzed was collected by the LHCb experiment at center-of-mass energies of 7, 8, and 13 TeV, corresponding to a total integrated luminosity of $9\ \mathrm{fb}^{-1}$. The result in the highest $q^{2}$ interval, $q^{2} >15.0\ \mathrm{GeV}^2/c^4$, where theoretical predictions have the smallest model dependence, agrees with the predictions.
△ Less
Submitted 24 October, 2023; v1 submitted 16 February, 2023;
originally announced February 2023.
-
Practical Cross-System Shilling Attacks with Limited Access to Data
Authors:
Meifang Zeng,
Ke Li,
Bingchuan Jiang,
Liujuan Cao,
Hui Li
Abstract:
In shilling attacks, an adversarial party injects a few fake user profiles into a Recommender System (RS) so that the target item can be promoted or demoted. Although much effort has been devoted to develo** shilling attack methods, we find that existing approaches are still far from practical. In this paper, we analyze the properties a practical shilling attack method should have and propose a…
▽ More
In shilling attacks, an adversarial party injects a few fake user profiles into a Recommender System (RS) so that the target item can be promoted or demoted. Although much effort has been devoted to develo** shilling attack methods, we find that existing approaches are still far from practical. In this paper, we analyze the properties a practical shilling attack method should have and propose a new concept of Cross-system Attack. With the idea of Cross-system Attack, we design a Practical Cross-system Shilling Attack (PC-Attack) framework that requires little information about the victim RS model and the target RS data for conducting attacks. PC-Attack is trained to capture graph topology knowledge from public RS data in a self-supervised manner. Then, it is fine-tuned on a small portion of target data that is easy to access to construct fake profiles. Extensive experiments have demonstrated the superiority of PC-Attack over state-of-the-art baselines. Our implementation of PC-Attack is available at https://github.com/KDEGroup/PC-Attack.
△ Less
Submitted 18 March, 2023; v1 submitted 14 February, 2023;
originally announced February 2023.
-
Colossal reversible barocaloric effects in a plastic crystal mediated by lattice vibrations and ion diffusion
Authors:
Ming Zeng,
Carlos Escorihuela-Sayalero,
Tamio Ikeshoji,
Shigeyuki Takagi,
Sangryun Kim,
Shin-ichi Orimo,
María Barrio,
Josep-Lluís Tamarit,
Pol Lloveras,
Claudio Cazorla,
Kartik Sau
Abstract:
Solid-state methods for cooling and heating promise a more sustainable alternative to current compression cycles of greenhouse gases and inefficient fuel-burning heaters. Barocaloric effects (BCE) driven by hydrostatic pressure ($p$) are especially encouraging in terms of large adiabatic temperature changes ($|ΔT| \sim 10$ K) and colossal isothermal entropy changes ($|ΔS| \sim 100$ JK$^{-1}$kg…
▽ More
Solid-state methods for cooling and heating promise a more sustainable alternative to current compression cycles of greenhouse gases and inefficient fuel-burning heaters. Barocaloric effects (BCE) driven by hydrostatic pressure ($p$) are especially encouraging in terms of large adiabatic temperature changes ($|ΔT| \sim 10$ K) and colossal isothermal entropy changes ($|ΔS| \sim 100$ JK$^{-1}$kg$^{-1}$). However, BCE typically require large pressure shifts due to irreversibility issues, and sizeable $|ΔT|$ and $|ΔS|$ seldom are realized in a same material. Here, we demonstrate the existence of colossal and reversible BCE in LiCB$_{11}$H$_{12}$, a well-known solid electrolyte, near its order-disorder phase transition at $\approx 380$ K. Specifically, for $Δp \approx 0.23$ $(0.10)$ GPa we measured $|ΔS_{\rm rev}| = 280$ $(200)$ JK$^{-1}$kg$^{-1}$ and $|ΔT_{\rm rev}| = 32$ $(10)$ K, which individually rival with state-of-the-art barocaloric shifts obtained under similar pressure conditions. Furthermore, over a wide temperature range, pressure shifts of the order of $0.1$ GPa yield huge reversible barocaloric strengths of $\approx 2$ JK$^{-1}$kg$^{-1}$MPa$^{-1}$. Molecular dynamics simulations were carried out to quantify the role of lattice vibrations, molecular reorientations and ion diffusion on the disclosed colossal BCE. Interestingly, lattice vibrations were found to contribute the most to $|ΔS|$ while the diffusion of lithium ions, despite adding up only slightly to the accompanying entropy change, was crucial in enabling the molecular order-disorder phase transition. Our work expands the knowledge on plastic crystals and should motivate the investigation of BCE in a variety of solid electrolytes displaying ion diffusion and concomitant molecular orientational disorder.
△ Less
Submitted 14 February, 2023;
originally announced February 2023.