-
Search for long-lived particles decaying to $e^\pm μ^\mp ν$
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
G. Alkhazov,
P. Alvarez Cartelle,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
A. Andreianov,
M. Andreotti,
F. Archilli
, et al. (961 additional authors not shown)
Abstract:
Long-lived particles decaying to $e^\pm μ^\mp ν$, with masses between 7 and $50$ GeV/c$^2$ and lifetimes between 2 and $50$ ps, are searched for by looking at displaced vertices containing electrons and muons of opposite charges. The search is performed using $5.4$ fb$^{-1}$ of $pp$ collisions collected with the LHCb detector at a centre-of-mass energy of $\sqrt{s} = 13$ TeV. Three mechanisms of p…
▽ More
Long-lived particles decaying to $e^\pm μ^\mp ν$, with masses between 7 and $50$ GeV/c$^2$ and lifetimes between 2 and $50$ ps, are searched for by looking at displaced vertices containing electrons and muons of opposite charges. The search is performed using $5.4$ fb$^{-1}$ of $pp$ collisions collected with the LHCb detector at a centre-of-mass energy of $\sqrt{s} = 13$ TeV. Three mechanisms of production of long-lived particles are considered: the direct pair production from quark interactions, the pair production from the decay of a Standard-Model-like Higgs boson with a mass of $125$ GeV/c$^2$, and the charged current production from an on-shell $W$ boson with an additional lepton. No evidence of these long-lived states is obtained and upper limits on the production cross-section times branching fraction are set on the different production modes.
△ Less
Submitted 31 March, 2021; v1 submitted 4 December, 2020;
originally announced December 2020.
-
Observation of the $\varLambda^0_b \to \varLambda^+_c K^+ K^- π^-$ decay
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
G. Alkhazov,
P. Alvarez Cartelle,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
A. Andreianov,
M. Andreotti,
F. Archilli
, et al. (961 additional authors not shown)
Abstract:
The $\varLambda^0_b \to \varLambda^+_c K^+ K^- π^-$ decay is observed for the first time using a data sample of proton-proton collisions at centre-of-mass energies of $\sqrt{s}=7$ and 8 $\rm{TeV}$ collected by the $\mbox{LHCb}$ detector, corresponding to an integrated luminosity of $3{\rm{fb}^{-1}}$. The ratio of branching fractions between the $\varLambda^0_b \to \varLambda^+_c K^+ K^- π^-$ and t…
▽ More
The $\varLambda^0_b \to \varLambda^+_c K^+ K^- π^-$ decay is observed for the first time using a data sample of proton-proton collisions at centre-of-mass energies of $\sqrt{s}=7$ and 8 $\rm{TeV}$ collected by the $\mbox{LHCb}$ detector, corresponding to an integrated luminosity of $3{\rm{fb}^{-1}}$. The ratio of branching fractions between the $\varLambda^0_b \to \varLambda^+_c K^+ K^- π^-$ and the $\varLambda^{0}_{b}\to\varLambda^{+}_{c}D^{-}_{s}$ decays is measured to be \begin{equation*} \frac{\mathcal{B} ( \varLambda^0_b \to \varLambda^+_c K^+ K^- π^-) } {\mathcal{B} ( \varLambda^0_b \to \varLambda^+_c D^-_s)} = (9.26 \pm 0.29 \pm 0.46 \pm 0.26)\times10^{-2}, \end{equation*} where the first uncertainty is statistical, the second systematic and the third is due to the knowledge of the $D^-_s \to K^+ K^- π^-$ branching fraction. No structure on the invariant mass distribution of the $\varLambda^+_c K^+$ system is found, consistent with no open-charm pentaquark signature.
△ Less
Submitted 5 March, 2021; v1 submitted 27 November, 2020;
originally announced November 2020.
-
Measurement of the CKM angle $γ$ and $B^0_s$-$\bar{B}^0_s$ mixing frequency with $B^0_s \rightarrow D_s^\mp h^\pm π^\pm π^\mp$ decays
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
G. Alkhazov,
P. Alvarez Cartelle,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
A. Andreianov,
M. Andreotti,
F. Archilli
, et al. (949 additional authors not shown)
Abstract:
The CKM angle $γ$ is measured for the first time from mixing-induced $CP$ violation between $B^0_s \rightarrow D_s^\mp K^\pm π^\pm π^\mp$ and $\bar{B}^0_s \rightarrow D_s^\pm K^\mp π^\mp π^\pm$ decays reconstructed in proton-proton collision data corresponding to an integrated luminosity of 9 ${\rm fb}^{-1}$ recorded with the LHCb detector. A time-dependent amplitude analysis is performed to extra…
▽ More
The CKM angle $γ$ is measured for the first time from mixing-induced $CP$ violation between $B^0_s \rightarrow D_s^\mp K^\pm π^\pm π^\mp$ and $\bar{B}^0_s \rightarrow D_s^\pm K^\mp π^\mp π^\pm$ decays reconstructed in proton-proton collision data corresponding to an integrated luminosity of 9 ${\rm fb}^{-1}$ recorded with the LHCb detector. A time-dependent amplitude analysis is performed to extract the $CP$-violating weak phase $γ-2β_s$ and, subsequently, $γ$ by taking the $B^0_s$-$\bar{B}^0_s$ mixing phase $β_{s}$ as an external input. The measurement yields $γ= (44 \pm 12)^\circ$ modulo $180^\circ$, where statistical and systematic uncertainties are combined. An alternative model-independent measurement, integrating over the five-dimensional phase space of the decay, yields $γ= (44^{\,+\,20}_{\,-\,13})^\circ$ modulo $180^\circ$. Moreover, the $B^0_s$-$\bar{B}^0_s$ oscillation frequency is measured from the flavour-specific control channel $B^0_s \rightarrow D_s^- π^+ π^+ π^-$ to be $Δm_s = (17.757 \pm 0.007 \,({\rm stat.}) \pm 0.008 \,({\rm syst.})) \text{ps}^{-1}$, consistent with and more precise than the current world-average value.
△ Less
Submitted 12 April, 2021; v1 submitted 24 November, 2020;
originally announced November 2020.
-
What do we expect from Multiple-choice QA Systems?
Authors:
Krunal Shah,
Nitish Gupta,
Dan Roth
Abstract:
The recent success of machine learning systems on various QA datasets could be interpreted as a significant improvement in models' language understanding abilities. However, using various perturbations, multiple recent works have shown that good performance on a dataset might not indicate performance that correlates well with human's expectations from models that "understand" language. In this wor…
▽ More
The recent success of machine learning systems on various QA datasets could be interpreted as a significant improvement in models' language understanding abilities. However, using various perturbations, multiple recent works have shown that good performance on a dataset might not indicate performance that correlates well with human's expectations from models that "understand" language. In this work we consider a top performing model on several Multiple Choice Question Answering (MCQA) datasets, and evaluate it against a set of expectations one might have from such a model, using a series of zero-information perturbations of the model's inputs. Our results show that the model clearly falls short of our expectations, and motivates a modified training approach that forces the model to better attend to the inputs. We show that the new training paradigm leads to a model that performs on par with the original model while better satisfying our expectations.
△ Less
Submitted 20 November, 2020;
originally announced November 2020.
-
Observation of a new excited $D_s^+$ meson in $B^0\to D^-D^+K^+π^-$ decays
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
G. Alkhazov,
P. Alvarez Cartelle,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
A. Andreianov,
M. Andreotti,
F. Archilli
, et al. (947 additional authors not shown)
Abstract:
Using $pp$ collision data corresponding to an integrated luminosity of $5.4\,{\rm fb}^{-1}$ collected with the LHCb detector at a center-of-mass energy of $13\,{\rm TeV}$, the $B^0\to D^-D^+K^+π^-$ decay is studied. A new excited $D_s^+$ meson is observed decaying into the $D^+K^+π^-$ final state with large statistical significance. The pole mass and width, and the spin-parity of the new state are…
▽ More
Using $pp$ collision data corresponding to an integrated luminosity of $5.4\,{\rm fb}^{-1}$ collected with the LHCb detector at a center-of-mass energy of $13\,{\rm TeV}$, the $B^0\to D^-D^+K^+π^-$ decay is studied. A new excited $D_s^+$ meson is observed decaying into the $D^+K^+π^-$ final state with large statistical significance. The pole mass and width, and the spin-parity of the new state are measured with an amplitude analysis to be $m_R=2591\pm6\pm7\,{\rm MeV}$, $Γ_R=89\pm16\pm12\,{\rm MeV}$ and $J^P=0^-$, where the first uncertainty is statistical and the second systematic. Fit fractions for all components in the amplitude analysis are also reported. The new resonance, denoted as $D_{s0}(2590)^+$, is a strong candidate to be the $D_s(2^1{S}_0)^+$ state, the radial excitation of the pseudoscalar ground-state $D_s^+$ meson.
△ Less
Submitted 27 March, 2021; v1 submitted 18 November, 2020;
originally announced November 2020.
-
Search for the rare decay $B^0 \rightarrow J/ψφ$
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
G. Alkhazov,
P. Alvarez Cartelle,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
A. Andreianov,
M. Andreotti,
F. Archilli
, et al. (948 additional authors not shown)
Abstract:
A search for the rare decay $B^0 \rightarrow J/ψφ$ is performed using $pp$ collision data collected with the LHCb detector at centre-of-mass energies of 7, 8 and 13 TeV, corresponding to an integrated luminosity of 9 ${\rm fb}^{-1}$. No significant signal of the decay is observed and an upper limit of $1.1 \times 10^{-7}$ at 90% confidence level is set on the branching fraction.
A search for the rare decay $B^0 \rightarrow J/ψφ$ is performed using $pp$ collision data collected with the LHCb detector at centre-of-mass energies of 7, 8 and 13 TeV, corresponding to an integrated luminosity of 9 ${\rm fb}^{-1}$. No significant signal of the decay is observed and an upper limit of $1.1 \times 10^{-7}$ at 90% confidence level is set on the branching fraction.
△ Less
Submitted 21 March, 2021; v1 submitted 13 November, 2020;
originally announced November 2020.
-
Search for heavy neutral leptons in $W^+\toμ^{+}μ^{\pm}\text{jet}$ decays
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
G. Alkhazov,
P. Alvarez Cartelle,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
A. Andreianov,
M. Andreotti,
F. Archilli
, et al. (966 additional authors not shown)
Abstract:
A search is performed for heavy neutrinos in the decay of a $W$ boson into two muons and a jet. The data set corresponds to an integrated luminosity of approximately $3.0 \text{ fb}^{-1}$ of proton-proton collision data at centre-of-mass energies of 7 and $8 \text{ TeV}$ collected with the LHCb experiment. Both same-sign and opposite-sign muons in the final state are considered. Data are found to…
▽ More
A search is performed for heavy neutrinos in the decay of a $W$ boson into two muons and a jet. The data set corresponds to an integrated luminosity of approximately $3.0 \text{ fb}^{-1}$ of proton-proton collision data at centre-of-mass energies of 7 and $8 \text{ TeV}$ collected with the LHCb experiment. Both same-sign and opposite-sign muons in the final state are considered. Data are found to be consistent with the expected background. Upper limits on the coupling of a heavy neutrino with the Standard Model neutrino are set at $95\%$ confidence level in the heavy-neutrino mass range from 5 to $50 \text{ GeV}/c^2$. These are of the order of $10^{-3}$ for lepton-number-conserving decays and of the order of $10^{-4}$ for lepton-number-violating heavy-neutrino decays.
△ Less
Submitted 25 March, 2021; v1 submitted 10 November, 2020;
originally announced November 2020.
-
Study of $B^0_s \rightarrow J/ψπ^+π^-K^+K^-$ decays
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
G. Alkhazov,
P. Alvarez Cartelle,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
A. Andreianov,
M. Andreotti,
F. Archilli
, et al. (944 additional authors not shown)
Abstract:
The decays $B^0_s \rightarrow J/ψπ^+π^- K^+ K^-$ are studied using a data set corresponding to an integrated luminosity of 9fb$^{-1}$, collected with the LHCb detector in proton-proton collisions at centre-of-mass energies of 7, 8 and 13TeV. The decays $B^0_s \rightarrow J/ψK^{\ast0} \bar{K}^{\ast0}$ and $B^0_s \rightarrow χ_{c1}(3872)K^+K^-$, where the $K^+K^-$ pair does not originate from a $φ$…
▽ More
The decays $B^0_s \rightarrow J/ψπ^+π^- K^+ K^-$ are studied using a data set corresponding to an integrated luminosity of 9fb$^{-1}$, collected with the LHCb detector in proton-proton collisions at centre-of-mass energies of 7, 8 and 13TeV. The decays $B^0_s \rightarrow J/ψK^{\ast0} \bar{K}^{\ast0}$ and $B^0_s \rightarrow χ_{c1}(3872)K^+K^-$, where the $K^+K^-$ pair does not originate from a $φ$ meson, are observed for the first time. Precise measurements of the ratios of branching fractions between intermediate $χ_{c1}(3872)φ$, $J/ψK^{\ast0}\bar{K}^{\ast0}$, $ψ(2S)φ$ and $χ_{c1}(3872)K^+K^-$ states are reported. A structure, denoted as $X(4740)$, is observed in the $J/ψφ$ mass spectrum and, assuming a Breit-Wigner parameterisation, its mass and width are determined to be \begin{eqnarray*} m_{X(4740)} & = & 4741 \pm 6 \pm 6\,{\mathrm{MeV}}/c^2 \,, \\ Γ_{X(4740)} & = & 53 \pm 15 \pm 11\,{\mathrm{MeV}} \,, \end{eqnarray*} where the first uncertainty is statistical and the second is systematic. In addition, the most precise single measurement of the mass of the $B^0_s$ meson is performed and gives a value of
$$ m_{B^0_s} = 5366.98 \pm 0.07 \pm 0.13\,{\mathrm{MeV}}/c^2\,. $$
△ Less
Submitted 8 February, 2021; v1 submitted 3 November, 2020;
originally announced November 2020.
-
Searches for 25 rare and forbidden decays of $D^+$ and $D_s^+$ mesons
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
G. Alkhazov,
P. Alvarez Cartelle,
A. A. Alves Jr,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
G. Andreassi,
A. Andreianov
, et al. (983 additional authors not shown)
Abstract:
A search is performed for rare and forbidden charm decays of the form $D_{(s)}^+ \to h^\pm \ell^+ \ell^{(\prime)\mp}$, where $h^\pm$ is a pion or kaon and $\ell^{(')\pm}$ is an electron or muon. The measurements are performed using proton-proton collision data, corresponding to an integrated luminosity of $1.6\text{fb}^{-1}$, collected by the LHCb experiment in 2016. No evidence is observed for th…
▽ More
A search is performed for rare and forbidden charm decays of the form $D_{(s)}^+ \to h^\pm \ell^+ \ell^{(\prime)\mp}$, where $h^\pm$ is a pion or kaon and $\ell^{(')\pm}$ is an electron or muon. The measurements are performed using proton-proton collision data, corresponding to an integrated luminosity of $1.6\text{fb}^{-1}$, collected by the LHCb experiment in 2016. No evidence is observed for the 25 decay modes that are investigated and $90\%$ confidence level limits on the branching fractions are set between $1.4\times10^{-8}$ and $6.4\times10^{-6}$. In most cases, these results represent an improvement on existing limits by one to two orders of magnitude.
△ Less
Submitted 31 October, 2020;
originally announced November 2020.
-
Observation of new excited $B_s^0$ states
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
G. Alkhazov,
P. Alvarez Cartelle,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
A. Andreianov,
M. Andreotti,
F. Archilli
, et al. (958 additional authors not shown)
Abstract:
A structure is observed in the $B^+K^-$ mass spectrum in a sample of proton--proton collisions at centre-of-mass energies of 7, 8, and 13 TeV, collected with the LHCb detector and corresponding to a total integrated luminosity of 9 fb${}^-1$. The structure is interpreted as the result of overlap** excited $B_s^0$ states. With high significance, a two-peak hypothesis provides a better description…
▽ More
A structure is observed in the $B^+K^-$ mass spectrum in a sample of proton--proton collisions at centre-of-mass energies of 7, 8, and 13 TeV, collected with the LHCb detector and corresponding to a total integrated luminosity of 9 fb${}^-1$. The structure is interpreted as the result of overlap** excited $B_s^0$ states. With high significance, a two-peak hypothesis provides a better description of the data than a single resonance. Under this hypothesis the masses and widths of the two states, assuming they decay directly to $B^+K^-$, are determined to be
$m_1 = 6063.5 \pm 1.2 \text{ (stat)} \pm 0.8\text{ (syst) MeV},$
$Γ_1 = 26 \pm 4 \text{ (stat)} \pm 4\text{ (syst) MeV},$
$m_2 = 6114 \pm 3 \text{ (stat)} \pm 5\text{ (syst) MeV},$
$Γ_2 = 66 \pm 18 \text{ (stat)} \pm 21\text{ (syst) MeV}.$
Alternative values assuming a decay through $B^{*+}K^-$, with a missing photon from the $B^{*+} \rightarrow B^+γ$ decay, which are shifted by approximately 45 MeV are also determined. The possibility of a single state decaying in both channels is also considered. The ratio of the total production cross-section times branching fraction of the new states relative to the previously observed $B_{s2}^{*0}$ state is determined to be $0.87 \pm 0.15 \text{ (stat)} \pm 0.19 \text{ (syst)}$.
△ Less
Submitted 28 July, 2021; v1 submitted 29 October, 2020;
originally announced October 2020.
-
Observation of a new $Ξ_b^0$ state
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
G. Alkhazov,
P. Alvarez Cartelle,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
A. Andreianov,
M. Andreotti,
F. Archilli
, et al. (943 additional authors not shown)
Abstract:
Using a proton-proton collision data sample collected by the LHCb experiment, corresponding to an integrated luminosity of 8.5 fb$^{-1}$, the observation of a new excited $Ξ_b^0$ resonance decaying to the $Ξ_b^-π^+$ final state is presented. The state, referred to as $Ξ_b(6227)^0$, has a measured mass and natural width of
$m(Ξ_b(6227)^0) = 6227.1^{\,+1.4}_{\,-1.5}\pm0.5$ MeV,…
▽ More
Using a proton-proton collision data sample collected by the LHCb experiment, corresponding to an integrated luminosity of 8.5 fb$^{-1}$, the observation of a new excited $Ξ_b^0$ resonance decaying to the $Ξ_b^-π^+$ final state is presented. The state, referred to as $Ξ_b(6227)^0$, has a measured mass and natural width of
$m(Ξ_b(6227)^0) = 6227.1^{\,+1.4}_{\,-1.5}\pm0.5$ MeV,
$Γ(Ξ_b(6227)^0) = 18.6^{\,+5.0}_{\,-4.1}\pm1.4$ MeV,
where the uncertainties are statistical and systematic. The production rate of the $Ξ_b(6227)^0$ state relative to that of the $Ξ_b^-$ baryon in the kinematic region $2<η<5$ and $p_{\rm T}<30$ GeV is measured to be
$\frac{f_{Ξ_b(6227)^0}}{f_{Ξ_b^-}}{\mathcal{B}}(Ξ_b(6227)^0\toΞ_b^-π^+) = 0.045\pm0.008\pm0.004$,
where ${\mathcal{B}}(Ξ_b(6227)^0\toΞ_b^-π^+)$ is the branching fraction of the decay, and $f_{Ξ_b(6227)^0}$ and $f_{Ξ_b^-}$ represent fragmentation fractions.
Improved measurements of the mass and natural width of the previously observed $Ξ_b(6227)^-$ state, along with the mass of the $Ξ_b^-$ baryon, are also reported. Both measurements are significantly more precise than, and consistent with, previously reported values.
△ Less
Submitted 7 January, 2021; v1 submitted 27 October, 2020;
originally announced October 2020.
-
Pairwise Representation Learning for Event Coreference
Authors:
Xiaodong Yu,
Wenpeng Yin,
Dan Roth
Abstract:
Natural Language Processing tasks such as resolving the coreference of events require understanding the relations between two text snippets. These tasks are typically formulated as (binary) classification problems over independently induced representations of the text snippets. In this work, we develop a Pairwise Representation Learning (PairwiseRL) scheme for the event mention pairs, in which we…
▽ More
Natural Language Processing tasks such as resolving the coreference of events require understanding the relations between two text snippets. These tasks are typically formulated as (binary) classification problems over independently induced representations of the text snippets. In this work, we develop a Pairwise Representation Learning (PairwiseRL) scheme for the event mention pairs, in which we jointly encode a pair of text snippets so that the representation of each mention in the pair is induced in the context of the other one. Furthermore, our representation supports a finer, structured representation of the text snippet to facilitate encoding events and their arguments. We show that PairwiseRL, despite its simplicity, outperforms the prior state-of-the-art event coreference systems on both cross-document and within-document event coreference benchmarks. We also conduct in-depth analysis in terms of the improvement and the limitation of pairwise representation so as to provide insights for future work.
△ Less
Submitted 15 February, 2023; v1 submitted 24 October, 2020;
originally announced October 2020.
-
Temporal Reasoning on Implicit Events from Distant Supervision
Authors:
Ben Zhou,
Kyle Richardson,
Qiang Ning,
Tushar Khot,
Ashish Sabharwal,
Dan Roth
Abstract:
We propose TRACIE, a novel temporal reasoning dataset that evaluates the degree to which systems understand implicit events -- events that are not mentioned explicitly in natural language text but can be inferred from it. This introduces a new challenge in temporal reasoning research, where prior work has focused on explicitly mentioned events. Human readers can infer implicit events via commonsen…
▽ More
We propose TRACIE, a novel temporal reasoning dataset that evaluates the degree to which systems understand implicit events -- events that are not mentioned explicitly in natural language text but can be inferred from it. This introduces a new challenge in temporal reasoning research, where prior work has focused on explicitly mentioned events. Human readers can infer implicit events via commonsense reasoning, resulting in a more comprehensive understanding of the situation and, consequently, better reasoning about time. We find, however, that state-of-the-art models struggle when predicting temporal relationships between implicit and explicit events. To address this, we propose a neuro-symbolic temporal reasoning model, SYMTIME, which exploits distant supervision signals from large-scale text and uses temporal rules to combine start times and durations to infer end times. SYMTIME outperforms strong baseline systems on TRACIE by 5%, and by 11% in a zero prior knowledge training setting. Our approach also generalizes to other temporal reasoning tasks, as evidenced by a gain of 1%-9% on MATRES, an explicit event benchmark.
△ Less
Submitted 7 May, 2021; v1 submitted 23 October, 2020;
originally announced October 2020.
-
Understanding the Extent to which Summarization Evaluation Metrics Measure the Information Quality of Summaries
Authors:
Daniel Deutsch,
Dan Roth
Abstract:
Reference-based metrics such as ROUGE or BERTScore evaluate the content quality of a summary by comparing the summary to a reference. Ideally, this comparison should measure the summary's information quality by calculating how much information the summaries have in common. In this work, we analyze the token alignments used by ROUGE and BERTScore to compare summaries and argue that their scores lar…
▽ More
Reference-based metrics such as ROUGE or BERTScore evaluate the content quality of a summary by comparing the summary to a reference. Ideally, this comparison should measure the summary's information quality by calculating how much information the summaries have in common. In this work, we analyze the token alignments used by ROUGE and BERTScore to compare summaries and argue that their scores largely cannot be interpreted as measuring information overlap, but rather the extent to which they discuss the same topics. Further, we provide evidence that this result holds true for many other summarization evaluation metrics. The consequence of this result is that it means the summarization community has not yet found a reliable automatic metric that aligns with its research goal, to generate summaries with high-quality information. Then, we propose a simple and interpretable method of evaluating summaries which does directly measure information overlap and demonstrate how it can be used to gain insights into model behavior that could not be provided by other methods alone.
△ Less
Submitted 23 October, 2020;
originally announced October 2020.
-
Measurement of the branching fraction of the $B^{0}\rightarrow D_{s}^{+}π^{-}$ decay
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
G. Alkhazov,
P. Alvarez Cartelle,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
A. Andreianov,
M. Andreotti,
F. Archilli
, et al. (935 additional authors not shown)
Abstract:
A branching fraction measurement of the $B^{0}\rightarrow D_{s}^{+}π^{-}$ decay is presented using proton-proton collision data collected with the LHCb experiment, corresponding to an integrated luminosity of $5.0\,$fb$^{-1}$. The branching fraction is found to be ${\mathcal{B}(B^{0}\rightarrow D_{s}^{+}π^{-}) = (19.4 \pm 1.8\pm 1.3 \pm 1.2)\times 10^{-6}}$, where the first uncertainty is statisti…
▽ More
A branching fraction measurement of the $B^{0}\rightarrow D_{s}^{+}π^{-}$ decay is presented using proton-proton collision data collected with the LHCb experiment, corresponding to an integrated luminosity of $5.0\,$fb$^{-1}$. The branching fraction is found to be ${\mathcal{B}(B^{0}\rightarrow D_{s}^{+}π^{-}) = (19.4 \pm 1.8\pm 1.3 \pm 1.2)\times 10^{-6}}$, where the first uncertainty is statistical, the second systematic and the third is due to the uncertainty on the $B^0 \to D^{-}π^{+}$, $D_{s}^{+}\rightarrow K^{+}K^{-}π^{+}$ and $D^{-}\rightarrow K^{+}π^{-}π^{-}$ branching fractions. This is the most precise single measurement of this quantity to date. As this decay proceeds through a single amplitude involving a $b \to u$ charged-current transition, the result provides information on non-factorisable strong interaction effects and the magnitude of the Cabibbo-Kobayashi-Maskawa matrix element $V_{ub}$. Additionally, the collision energy dependence of the hadronisation-fraction ratio $f_s/f_d$ is measured through $\bar{B}{}_{s}^{0}\rightarrow D_{s}^{+}π^{-}$ and $B^0 \to D^{-}π^{+}$ decays.
△ Less
Submitted 19 April, 2021; v1 submitted 22 October, 2020;
originally announced October 2020.
-
Measurement of the relative branching fractions of $B^+ \to h^+h^{\prime +}h^{\prime -}$ decays
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
G. Alkhazov,
P. Alvarez Cartelle,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
A. Andreianov,
M. Andreotti,
F. Archilli
, et al. (943 additional authors not shown)
Abstract:
The relative branching fractions of $B^+ \to h^+h^{\prime +}h^{\prime -}$ decays, where $h^{(\prime)}$ is a pion or kaon, are measured. The analysis is performed with a data sample, collected with the LHCb detector, corresponding to an integrated luminosity of $3.0 {\rm fb}^{-1}$ of $pp$ collisions. The results obtained improve significantly on previous measurements of these quantities, and are im…
▽ More
The relative branching fractions of $B^+ \to h^+h^{\prime +}h^{\prime -}$ decays, where $h^{(\prime)}$ is a pion or kaon, are measured. The analysis is performed with a data sample, collected with the LHCb detector, corresponding to an integrated luminosity of $3.0 {\rm fb}^{-1}$ of $pp$ collisions. The results obtained improve significantly on previous measurements of these quantities, and are important for the interpretation of Dalitz plot analyses of three-body charmless hadronic decays of $B^+$ mesons.
△ Less
Submitted 18 December, 2020; v1 submitted 22 October, 2020;
originally announced October 2020.
-
Measurement of differential $b\bar{b}$- and $c\bar{c}$-dijet cross-sections in the forward region of $pp$ collisions at $\sqrt{s}=13 ~ \mathrm{TeV}$
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
G. Alkhazov,
P. Alvarez Cartelle,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
A. Andreianov,
M. Andreotti,
F. Archilli
, et al. (961 additional authors not shown)
Abstract:
The inclusive $b \bar{b}$- and $c \bar{c}$-dijet production cross-sections in the forward region of $pp$ collisions are measured using a data sample collected with the LHCb detector at a centre-of-mass energy of 13 TeV in 2016. The data sample corresponds to an integrated luminosity of 1.6 fb$^{-1}$. Differential cross-sections are measured as a function of the transverse momentum and of the pseud…
▽ More
The inclusive $b \bar{b}$- and $c \bar{c}$-dijet production cross-sections in the forward region of $pp$ collisions are measured using a data sample collected with the LHCb detector at a centre-of-mass energy of 13 TeV in 2016. The data sample corresponds to an integrated luminosity of 1.6 fb$^{-1}$. Differential cross-sections are measured as a function of the transverse momentum and of the pseudorapidity of the leading jet, of the rapidity difference between the jets, and of the dijet invariant mass. A fiducial region for the measurement is defined by requiring that the two jets originating from the two $b$ or $c$ quarks are emitted with transverse momentum greater than 20 GeV$/c$, pseudorapidity in the range $2.2 < η< 4.2$, and with a difference in the azimuthal angle between the two jets greater than 1.5. The integrated $b \bar{b}$-dijet cross-section is measured to be $53.0 \pm 9.7$ nb, and the total $c \bar{c}$-dijet cross-section is measured to be $73 \pm 16$ nb. The ratio between $c \bar{c}$- and $b \bar{b}$-dijet cross-sections is also measured and found to be $1.37 \pm 0.27$. The results are in agreement with theoretical predictions at next-to-leading order.
△ Less
Submitted 11 February, 2021; v1 submitted 19 October, 2020;
originally announced October 2020.
-
Analogous Process Structure Induction for Sub-event Sequence Prediction
Authors:
Hongming Zhang,
Muhao Chen,
Haoyu Wang,
Yangqiu Song,
Dan Roth
Abstract:
Computational and cognitive studies of event understanding suggest that identifying, comprehending, and predicting events depend on having structured representations of a sequence of events and on conceptualizing (abstracting) its components into (soft) event categories. Thus, knowledge about a known process such as "buying a car" can be used in the context of a new but analogous process such as "…
▽ More
Computational and cognitive studies of event understanding suggest that identifying, comprehending, and predicting events depend on having structured representations of a sequence of events and on conceptualizing (abstracting) its components into (soft) event categories. Thus, knowledge about a known process such as "buying a car" can be used in the context of a new but analogous process such as "buying a house". Nevertheless, most event understanding work in NLP is still at the ground level and does not consider abstraction. In this paper, we propose an Analogous Process Structure Induction APSI framework, which leverages analogies among processes and conceptualization of sub-event instances to predict the whole sub-event sequence of previously unseen open-domain processes. As our experiments and analysis indicate, APSI supports the generation of meaningful sub-event sequences for unseen processes and can help predict missing events.
△ Less
Submitted 16 October, 2020;
originally announced October 2020.
-
Measurement of the CKM angle $γ$ in $B^\pm\to D K^\pm$ and $B^\pm \to D π^\pm$ decays with $D \to K_\mathrm S^0 h^+ h^-$
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
G. Alkhazov,
P. Alvarez Cartelle,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
A. Andreianov,
M. Andreotti,
F. Archilli
, et al. (961 additional authors not shown)
Abstract:
A measurement of $CP$-violating observables is performed using the decays $B^\pm\to D K^\pm$ and $B^\pm\to D π^\pm$, where the $D$ meson is reconstructed in one of the self-conjugate three-body final states $K_{\mathrm S}π^+π^-$ and $K_{\mathrm S}K^+K^-$ (commonly denoted $K_{\mathrm S} h^+h^-$). The decays are analysed in bins of the $D$-decay phase space, leading to a measurement that is indepen…
▽ More
A measurement of $CP$-violating observables is performed using the decays $B^\pm\to D K^\pm$ and $B^\pm\to D π^\pm$, where the $D$ meson is reconstructed in one of the self-conjugate three-body final states $K_{\mathrm S}π^+π^-$ and $K_{\mathrm S}K^+K^-$ (commonly denoted $K_{\mathrm S} h^+h^-$). The decays are analysed in bins of the $D$-decay phase space, leading to a measurement that is independent of the modelling of the $D$-decay amplitude. The observables are interpreted in terms of the CKM angle $γ$. Using a data sample corresponding to an integrated luminosity of $9\,\text{fb}^{-1}$ collected in proton-proton collisions at centre-of-mass energies of $7$, $8$, and $13\,\text{TeV}$ with the LHCb experiment, $γ$ is measured to be $\left(68.7^{+5.2}_{-5.1}\right)^\circ$. The hadronic parameters $r_B^{DK}$, $r_B^{Dπ}$, $δ_B^{DK}$, and $δ_B^{Dπ}$, which are the ratios and strong-phase differences of the suppressed and favoured $B^\pm$ decays, are also reported.
△ Less
Submitted 7 March, 2021; v1 submitted 16 October, 2020;
originally announced October 2020.
-
Joint Constrained Learning for Event-Event Relation Extraction
Authors:
Haoyu Wang,
Muhao Chen,
Hongming Zhang,
Dan Roth
Abstract:
Understanding natural language involves recognizing how multiple event mentions structurally and temporally interact with each other. In this process, one can induce event complexes that organize multi-granular events with temporal order and membership relations interweaving among them. Due to the lack of jointly labeled data for these relational phenomena and the restriction on the structures the…
▽ More
Understanding natural language involves recognizing how multiple event mentions structurally and temporally interact with each other. In this process, one can induce event complexes that organize multi-granular events with temporal order and membership relations interweaving among them. Due to the lack of jointly labeled data for these relational phenomena and the restriction on the structures they articulate, we propose a joint constrained learning framework for modeling event-event relations. Specifically, the framework enforces logical constraints within and across multiple temporal and subevent relations by converting these constraints into differentiable learning objectives. We show that our joint constrained learning approach effectively compensates for the lack of jointly labeled data, and outperforms SOTA methods on benchmarks for both temporal relation extraction and event hierarchy construction, replacing a commonly used but more expensive global inference process. We also present a promising case study showing the effectiveness of our approach in inducing event complexes on an external corpus.
△ Less
Submitted 2 May, 2021; v1 submitted 13 October, 2020;
originally announced October 2020.
-
"What Are You Trying to Do?" Semantic Ty** of Event Processes
Authors:
Muhao Chen,
Hongming Zhang,
Haoyu Wang,
Dan Roth
Abstract:
This paper studies a new cognitively motivated semantic ty** task, multi-axis event process ty**, that, given an event process, attempts to infer free-form type labels describing (i) the type of action made by the process and (ii) the type of object the process seeks to affect. This task is inspired by computational and cognitive studies of event understanding, which suggest that understanding…
▽ More
This paper studies a new cognitively motivated semantic ty** task, multi-axis event process ty**, that, given an event process, attempts to infer free-form type labels describing (i) the type of action made by the process and (ii) the type of object the process seeks to affect. This task is inspired by computational and cognitive studies of event understanding, which suggest that understanding processes of events is often directed by recognizing the goals, plans or intentions of the protagonist(s). We develop a large dataset containing over 60k event processes, featuring ultra fine-grained ty** on both the action and object type axes with very large ($10^3\sim 10^4$) label vocabularies. We then propose a hybrid learning framework, P2GT, which addresses the challenging ty** problem with indirect supervision from glosses1and a joint learning-to-rank framework. As our experiments indicate, P2GT supports identifying the intent of processes, as well as the fine semantic type of the affected object. It also demonstrates the capability of handling few-shot cases, and strong generalizability on out-of-domain event processes.
△ Less
Submitted 13 October, 2020;
originally announced October 2020.
-
Strong constraints on the $b \to sγ$ photon polarisation from $B^0 \to K^{*0} e^+ e^-$ decays
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
G. Alkhazov,
P. Alvarez Cartelle,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
A. Andreianov,
M. Andreotti,
F. Archilli
, et al. (962 additional authors not shown)
Abstract:
An angular analysis of the $B^0 \to K^{*0} e^+ e^-$ decay is performed using a data sample corresponding to an integrated luminosity of $9~{\rm fb}^{-1}$ of $pp$ collisions collected with the LHCb experiment. The analysis is conducted in the very low dielectron mass squared ($q^2$) interval between $0.0008$ and $0.257~{\rm GeV}^2$, where the rate is dominated by the $B^0\to K^{\ast 0}γ$ transition…
▽ More
An angular analysis of the $B^0 \to K^{*0} e^+ e^-$ decay is performed using a data sample corresponding to an integrated luminosity of $9~{\rm fb}^{-1}$ of $pp$ collisions collected with the LHCb experiment. The analysis is conducted in the very low dielectron mass squared ($q^2$) interval between $0.0008$ and $0.257~{\rm GeV}^2$, where the rate is dominated by the $B^0\to K^{\ast 0}γ$ transition with a virtual photon. The fraction of longitudinal polarisation of the $K^{\ast 0}$ meson, $F_{\rm L}$, is measured to be $F_{\rm L} = (4.4 \pm 2.6 \pm 1.4)\%$, where the first uncertainty is statistical and the second systematic. The $A_{\rm T}^{\rm Re}$ observable, which is related to the lepton forward-backward asymmetry, is measured to be $A_{\rm T}^{\rm Re}=-0.06 \pm 0.08 \pm 0.02$. The $A_{\rm T}^{(2)}$ and $A_{\rm T}^{\rm Im}$ transverse asymmetries, which are sensitive to the virtual photon polarisation, are found to be $A_{\rm T}^{(2)} = 0.11 \pm 0.10 \ \pm 0.02$ and $A_{\rm T}^{\rm Im} = 0.02 \pm 0.10 \pm 0.01$. The results are consistent with Standard Model predictions and provide the world's best constraint on the $b\to sγ$ photon polarisation.
△ Less
Submitted 16 December, 2020; v1 submitted 12 October, 2020;
originally announced October 2020.
-
Do Language Embeddings Capture Scales?
Authors:
Xikun Zhang,
Deepak Ramachandran,
Ian Tenney,
Yanai Elazar,
Dan Roth
Abstract:
Pretrained Language Models (LMs) have been shown to possess significant linguistic, common sense, and factual knowledge. One form of knowledge that has not been studied yet in this context is information about the scalar magnitudes of objects. We show that pretrained language models capture a significant amount of this information but are short of the capability required for general common-sense r…
▽ More
Pretrained Language Models (LMs) have been shown to possess significant linguistic, common sense, and factual knowledge. One form of knowledge that has not been studied yet in this context is information about the scalar magnitudes of objects. We show that pretrained language models capture a significant amount of this information but are short of the capability required for general common-sense reasoning. We identify contextual information in pre-training and numeracy as two key factors affecting their performance and show that a simple method of canonicalizing numbers can have a significant effect on the results.
△ Less
Submitted 24 November, 2020; v1 submitted 11 October, 2020;
originally announced October 2020.
-
"I'd rather just go to bed": Understanding Indirect Answers
Authors:
Annie Louis,
Dan Roth,
Filip Radlinski
Abstract:
We revisit a pragmatic inference problem in dialog: understanding indirect responses to questions. Humans can interpret 'I'm starving.' in response to 'Hungry?', even without direct cue words such as 'yes' and 'no'. In dialog systems, allowing natural responses rather than closed vocabularies would be similarly beneficial. However, today's systems are only as sensitive to these pragmatic moves as…
▽ More
We revisit a pragmatic inference problem in dialog: understanding indirect responses to questions. Humans can interpret 'I'm starving.' in response to 'Hungry?', even without direct cue words such as 'yes' and 'no'. In dialog systems, allowing natural responses rather than closed vocabularies would be similarly beneficial. However, today's systems are only as sensitive to these pragmatic moves as their language model allows. We create and release the first large-scale English language corpus 'Circa' with 34,268 (polar question, indirect answer) pairs to enable progress on this task. The data was collected via elaborate crowdsourcing, and contains utterances with yes/no meaning, as well as uncertain, middle-ground, and conditional responses. We also present BERT-based neural models to predict such categories for a question-answer pair. We find that while transfer learning from entailment works reasonably, performance is not yet sufficient for robust dialog. Our models reach 82-88% accuracy for a 4-class distinction, and 74-85% for 6 classes.
△ Less
Submitted 7 October, 2020;
originally announced October 2020.
-
Pruning Redundant Map**s in Transformer Models via Spectral-Normalized Identity Prior
Authors:
Zi Lin,
Jeremiah Zhe Liu,
Zi Yang,
Nan Hua,
Dan Roth
Abstract:
Traditional (unstructured) pruning methods for a Transformer model focus on regularizing the individual weights by penalizing them toward zero. In this work, we explore spectral-normalized identity priors (SNIP), a structured pruning approach that penalizes an entire residual module in a Transformer model toward an identity map**. Our method identifies and discards unimportant non-linear map**…
▽ More
Traditional (unstructured) pruning methods for a Transformer model focus on regularizing the individual weights by penalizing them toward zero. In this work, we explore spectral-normalized identity priors (SNIP), a structured pruning approach that penalizes an entire residual module in a Transformer model toward an identity map**. Our method identifies and discards unimportant non-linear map**s in the residual connections by applying a thresholding operator on the function norm. It is applicable to any structured module, including a single attention head, an entire attention block, or a feed-forward subnetwork. Furthermore, we introduce spectral normalization to stabilize the distribution of the post-activation values of the Transformer layers, further improving the pruning effectiveness of the proposed methodology. We conduct experiments with BERT on 5 GLUE benchmark tasks to demonstrate that SNIP achieves effective pruning results while maintaining comparable performance. Specifically, we improve the performance over the state-of-the-art by 0.5 to 1.0% on average at 50% compression ratio.
△ Less
Submitted 5 October, 2020;
originally announced October 2020.
-
Towards Question-Answering as an Automatic Metric for Evaluating the Content Quality of a Summary
Authors:
Daniel Deutsch,
Tania Bedrax-Weiss,
Dan Roth
Abstract:
A desirable property of a reference-based evaluation metric that measures the content quality of a summary is that it should estimate how much information that summary has in common with a reference. Traditional text overlap based metrics such as ROUGE fail to achieve this because they are limited to matching tokens, either lexically or via embeddings. In this work, we propose a metric to evaluate…
▽ More
A desirable property of a reference-based evaluation metric that measures the content quality of a summary is that it should estimate how much information that summary has in common with a reference. Traditional text overlap based metrics such as ROUGE fail to achieve this because they are limited to matching tokens, either lexically or via embeddings. In this work, we propose a metric to evaluate the content quality of a summary using question-answering (QA). QA-based methods directly measure a summary's information overlap with a reference, making them fundamentally different than text overlap metrics. We demonstrate the experimental benefits of QA-based metrics through an analysis of our proposed metric, QAEval. QAEval out-performs current state-of-the-art metrics on most evaluations using benchmark datasets, while being competitive on others due to limitations of state-of-the-art models. Through a careful analysis of each component of QAEval, we identify its performance bottlenecks and estimate that its potential upper-bound performance surpasses all other automatic metrics, approaching that of the gold-standard Pyramid Method.
△ Less
Submitted 26 July, 2021; v1 submitted 1 October, 2020;
originally announced October 2020.
-
Visual Pivoting for (Unsupervised) Entity Alignment
Authors:
Fangyu Liu,
Muhao Chen,
Dan Roth,
Nigel Collier
Abstract:
This work studies the use of visual semantic representations to align entities in heterogeneous knowledge graphs (KGs). Images are natural components of many existing KGs. By combining visual knowledge with other auxiliary information, we show that the proposed new approach, EVA, creates a holistic entity representation that provides strong signals for cross-graph entity alignment. Besides, previo…
▽ More
This work studies the use of visual semantic representations to align entities in heterogeneous knowledge graphs (KGs). Images are natural components of many existing KGs. By combining visual knowledge with other auxiliary information, we show that the proposed new approach, EVA, creates a holistic entity representation that provides strong signals for cross-graph entity alignment. Besides, previous entity alignment methods require human labelled seed alignment, restricting availability. EVA provides a completely unsupervised solution by leveraging the visual similarity of entities to create an initial seed dictionary (visual pivots). Experiments on benchmark data sets DBP15k and DWY15k show that EVA offers state-of-the-art performance on both monolingual and cross-lingual entity alignment tasks. Furthermore, we discover that images are particularly useful to align long-tail KG entities, which inherently lack the structural contexts necessary for capturing the correspondences.
△ Less
Submitted 16 December, 2020; v1 submitted 28 September, 2020;
originally announced September 2020.
-
Task-Oriented Dialogue as Dataflow Synthesis
Authors:
Semantic Machines,
Jacob Andreas,
John Bufe,
David Burkett,
Charles Chen,
Josh Clausman,
Jean Crawford,
Kate Crim,
Jordan DeLoach,
Leah Dorner,
Jason Eisner,
Hao Fang,
Alan Guo,
David Hall,
Kristin Hayes,
Kellie Hill,
Diana Ho,
Wendy Iwaszuk,
Smriti Jha,
Dan Klein,
Jayant Krishnamurthy,
Theo Lanman,
Percy Liang,
Christopher H Lin,
Ilya Lintsbakh
, et al. (21 additional authors not shown)
Abstract:
We describe an approach to task-oriented dialogue in which dialogue state is represented as a dataflow graph. A dialogue agent maps each user utterance to a program that extends this graph. Programs include metacomputation operators for reference and revision that reuse dataflow fragments from previous turns. Our graph-based state enables the expression and manipulation of complex user intents, an…
▽ More
We describe an approach to task-oriented dialogue in which dialogue state is represented as a dataflow graph. A dialogue agent maps each user utterance to a program that extends this graph. Programs include metacomputation operators for reference and revision that reuse dataflow fragments from previous turns. Our graph-based state enables the expression and manipulation of complex user intents, and explicit metacomputation makes these intents easier for learned models to predict. We introduce a new dataset, SMCalFlow, featuring complex dialogues about events, weather, places, and people. Experiments show that dataflow graphs and metacomputation substantially improve representability and predictability in these natural dialogues. Additional experiments on the MultiWOZ dataset show that our dataflow representation enables an otherwise off-the-shelf sequence-to-sequence model to match the best existing task-specific state tracking model. The SMCalFlow dataset and code for replicating experiments are available at https://www.microsoft.com/en-us/research/project/dataflow-based-dialogue-semantic-machines.
△ Less
Submitted 10 February, 2021; v1 submitted 23 September, 2020;
originally announced September 2020.
-
Observation of multiplicity-dependent prompt $χ_{c1}(3872)$ and $ψ(2S)$ production in $pp$ collisions
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
G. Alkhazov,
P. Alvarez Cartelle,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
A. Andreianov,
M. Andreotti,
F. Archilli
, et al. (953 additional authors not shown)
Abstract:
The production of $χ_{c1}(3872)$ and $ψ(2S)$ hadrons is studied as a function of charged particle multiplicity in $pp$ collisions at a center-of-mass energy of 8 TeV, corresponding to an integrated luminosity of 2 fb$^{-1}$. For both states, the fraction that is produced promptly at the collision vertex is found to decrease as charged particle multiplicity increases. The ratio of $χ_{c1}(3872)$ to…
▽ More
The production of $χ_{c1}(3872)$ and $ψ(2S)$ hadrons is studied as a function of charged particle multiplicity in $pp$ collisions at a center-of-mass energy of 8 TeV, corresponding to an integrated luminosity of 2 fb$^{-1}$. For both states, the fraction that is produced promptly at the collision vertex is found to decrease as charged particle multiplicity increases. The ratio of $χ_{c1}(3872)$ to $ψ(2S)$ cross-sections for promptly produced particles is also found to decrease with multiplicity, while no significant dependence on multiplicity is observed for the equivalent ratio of particles produced away from the collision vertex in $b$-hadron decays. This behavior is consistent with a calculation that models the $χ_{c1}(3872)$ structure as a compact tetraquark. Comparisons with model calculations and implications for the binding energy of the $χ_{c1}(3872)$ state are discussed.
△ Less
Submitted 10 March, 2021; v1 submitted 14 September, 2020;
originally announced September 2020.
-
Search for the doubly heavy $\mathitΞ_{bc}^{0}$ baryon via decays to $D^0pK^-$
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
G. Alkhazov,
P. Alvarez Cartelle,
A. A. Alves Jr,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
G. Andreassi,
A. Andreianov
, et al. (973 additional authors not shown)
Abstract:
A search for the doubly heavy $\mathitΞ_{bc}^{0}$ baryon using its decay to the $D^0pK^-$ final state is performed using proton-proton collision data at a centre-of-mass energy of 13 TeV collected by the LHCb experiment between 2016 and 2018, corresponding to an integrated luminosity of 5.4 $\mathrm{fb}^{-1}$. No significant signal is found in the invariant mass range from 6.7 to 7.2…
▽ More
A search for the doubly heavy $\mathitΞ_{bc}^{0}$ baryon using its decay to the $D^0pK^-$ final state is performed using proton-proton collision data at a centre-of-mass energy of 13 TeV collected by the LHCb experiment between 2016 and 2018, corresponding to an integrated luminosity of 5.4 $\mathrm{fb}^{-1}$. No significant signal is found in the invariant mass range from 6.7 to 7.2 $\mathrm{GeV}/c^2$. Upper limits are set at $95\%$ credibility level on the ratio of the $\mathitΞ_{bc}^{0}$ production cross-section times its branching fraction to $D^0pK^-$ relative to that of the $\mathitΛ_{b}^{0} \to D^0pK^-$ decay. The limits are set as a function of the $\mathitΞ_{bc}^{0}$ mass and lifetime hypotheses, in the rapidity range from 2.0 to 4.5 and in the transverse momentum region from 5 to 25 $\mathrm{GeV}/c$. Upper limits range from $1.7\times10^{-2}$ to $3.0\times10^{-1}$ for the considered $\mathitΞ_{bc}^{0}$ mass and lifetime hypotheses.
△ Less
Submitted 23 November, 2020; v1 submitted 5 September, 2020;
originally announced September 2020.
-
Amplitude analysis of the $B^+\to D^+D^-K^+$ decay
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
G. Alkhazov,
P. Alvarez Cartelle,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
A. Andreianov,
M. Andreotti,
F. Archilli
, et al. (948 additional authors not shown)
Abstract:
Results are reported from an amplitude analysis of the $B^+\to D^+D^-K^+$ decay. The analysis is carried out using LHCb proton-proton collision data taken at $\sqrt{s}=7,8,$ and $13$ TeV, corresponding to a total integrated luminosity of 9 fb$^{-1}$. In order to obtain a good description of the data, it is found to be necessary to include new spin-0 and spin-1 resonances in the $D^-K^+$ channel wi…
▽ More
Results are reported from an amplitude analysis of the $B^+\to D^+D^-K^+$ decay. The analysis is carried out using LHCb proton-proton collision data taken at $\sqrt{s}=7,8,$ and $13$ TeV, corresponding to a total integrated luminosity of 9 fb$^{-1}$. In order to obtain a good description of the data, it is found to be necessary to include new spin-0 and spin-1 resonances in the $D^-K^+$ channel with masses around 2.9 GeV$/c^2$, and a new spin-0 charmonium resonance in proximity to the spin-2 $χ_{c2}(3930)$ state. The masses and widths of these resonances are determined, as are the relative contributions of all components in the amplitude model, which additionally include the vector charmonia $ψ(3770)$, $ψ(4040)$, $ψ(4160)$ and $ψ(4415)$ states and a nonresonant component.
△ Less
Submitted 17 December, 2020; v1 submitted 31 August, 2020;
originally announced September 2020.
-
Model-independent study of structure in $B^+\to D^+D^-K^+$ decays
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
G. Alkhazov,
P. Alvarez Cartelle,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
A. Andreianov,
M. Andreotti,
F. Archilli
, et al. (948 additional authors not shown)
Abstract:
The only anticipated resonant contributions to $B^+\to D^+D^-K^+$ decays are charmonium states in the $D^+D^-$ channel. A model-independent analysis, using LHCb proton-proton collision data taken at centre-of-mass energies of $\sqrt{s}=7,8,$ and $13$ TeV, corresponding to a total integrated luminosity of 9 fb$^{-1}$, is carried out to test this hypothesis. The description of the data assuming that…
▽ More
The only anticipated resonant contributions to $B^+\to D^+D^-K^+$ decays are charmonium states in the $D^+D^-$ channel. A model-independent analysis, using LHCb proton-proton collision data taken at centre-of-mass energies of $\sqrt{s}=7,8,$ and $13$ TeV, corresponding to a total integrated luminosity of 9 fb$^{-1}$, is carried out to test this hypothesis. The description of the data assuming that resonances only manifest in decays to the $D^+D^-$ pair is shown to be incomplete. This constitutes evidence for a new contribution to the decay, potentially one or more new charm-strange resonances in the $D^-K^+$ channel with masses around 2.9 GeV$/c^2$.
△ Less
Submitted 17 December, 2020; v1 submitted 31 August, 2020;
originally announced September 2020.
-
First branching fraction measurement of the suppressed decay $Ξ_c^0\to π^-Λ_c^+$
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
G. Alkhazov,
P. Alvarez Cartelle,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
G. Andreassi,
A. Andreianov,
M. Andreotti
, et al. (948 additional authors not shown)
Abstract:
The $Ξ_c^0$ baryon is unstable and usually decays into charmless final states by the $c \to s u\overline{d}$ transition. It can, however, also disintegrate into a $π^-$ meson and a $Λ_c^+$ baryon via $s$ quark decay or via $cs\to d c$ weak scattering. The interplay between the latter two processes governs the size of the branching fraction ${\cal{B}}$$(Ξ_c^0\to π^-Λ_c^+)$, first measured here to b…
▽ More
The $Ξ_c^0$ baryon is unstable and usually decays into charmless final states by the $c \to s u\overline{d}$ transition. It can, however, also disintegrate into a $π^-$ meson and a $Λ_c^+$ baryon via $s$ quark decay or via $cs\to d c$ weak scattering. The interplay between the latter two processes governs the size of the branching fraction ${\cal{B}}$$(Ξ_c^0\to π^-Λ_c^+)$, first measured here to be $(0.55\pm 0.02 \pm 0.18)$%, where the first uncertainty is statistical and second systematic. This result is compatible with the larger of the theoretical predictions that connect models of hyperon decays using partially conserved axial currents and SU(3) symmetry with those involving the heavy-quark expansion and heavy-quark symmetry. In addition, the branching fraction of the normalization channel, ${\cal{B}}(Ξ_c^+\to p K^- π^+) = (1.135 \pm 0.002 \pm 0.387)$% is measured.
△ Less
Submitted 11 September, 2020; v1 submitted 23 July, 2020;
originally announced July 2020.
-
First observation of the decay $Λ_b^0 \to η_c(1S) p K^-$
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
G. Alkhazov,
P. Alvarez Cartelle,
A. A. Alves Jr,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
G. Andreassi,
A. Andreianov
, et al. (971 additional authors not shown)
Abstract:
The decay $Λ_b^0 \to η_c(1S) p K^-$ is observed for the first time using a data sample of proton-proton collisions, corresponding to an integrated luminosity of 5.5 $fb^{-1}$, collected with the LHCb experiment at a center-of-mass energy of 13 TeV. The branching fraction of the decay is measured, using the $Λ_b^0 \to J/ψp K^-$ decay as a normalization mode, to be…
▽ More
The decay $Λ_b^0 \to η_c(1S) p K^-$ is observed for the first time using a data sample of proton-proton collisions, corresponding to an integrated luminosity of 5.5 $fb^{-1}$, collected with the LHCb experiment at a center-of-mass energy of 13 TeV. The branching fraction of the decay is measured, using the $Λ_b^0 \to J/ψp K^-$ decay as a normalization mode, to be $\mathcal{B}(Λ_b^0 \to η_c(1S) p K^-)=(1.06\pm0.16\pm0.06^{+0.22}_{-0.19})\times10^{-4}$, where the quoted uncertainties are statistical, systematic and due to external inputs, respectively. A study of the $η_c(1S) p$ mass spectrum is performed to search for the $P_c(4312)^+$ pentaquark state. No evidence is observed and an upper limit of \begin{equation*} \frac{\mathcal{B}(Λ_b^0 \to P_c(4312)^+ K^-)\times \mathcal{B}(P_c(4312)^+ \to η_c(1S) p)}{\mathcal{B}(Λ_b^0 \to η_c(1S) p K^-)} < 0.24 \end{equation*} is obtained at the 95% confidence level.
△ Less
Submitted 23 December, 2020; v1 submitted 22 July, 2020;
originally announced July 2020.
-
From Spatial Relations to Spatial Configurations
Authors:
Soham Dan,
Parisa Kordjamshidi,
Julia Bonn,
Archna Bhatia,
Jon Cai,
Martha Palmer,
Dan Roth
Abstract:
Spatial Reasoning from language is essential for natural language understanding. Supporting it requires a representation scheme that can capture spatial phenomena encountered in language as well as in images and videos. Existing spatial representations are not sufficient for describing spatial configurations used in complex tasks. This paper extends the capabilities of existing spatial representat…
▽ More
Spatial Reasoning from language is essential for natural language understanding. Supporting it requires a representation scheme that can capture spatial phenomena encountered in language as well as in images and videos. Existing spatial representations are not sufficient for describing spatial configurations used in complex tasks. This paper extends the capabilities of existing spatial representation languages and increases coverage of the semantic aspects that are needed to ground the spatial meaning of natural language text in the world. Our spatial relation language is able to represent a large, comprehensive set of spatial concepts crucial for reasoning and is designed to support the composition of static and dynamic spatial configurations. We integrate this language with the Abstract Meaning Representation(AMR) annotation schema and present a corpus annotated by this extended AMR. To exhibit the applicability of our representation scheme, we annotate text taken from diverse datasets and show how we extend the capabilities of existing spatial representation languages with the fine-grained decomposition of semantics and blend it seamlessly with AMRs of sentences and discourse representations as a whole.
△ Less
Submitted 18 July, 2020;
originally announced July 2020.
-
Understanding Spatial Relations through Multiple Modalities
Authors:
Soham Dan,
Hangfeng He,
Dan Roth
Abstract:
Recognizing spatial relations and reasoning about them is essential in multiple applications including navigation, direction giving and human-computer interaction in general. Spatial relations between objects can either be explicit -- expressed as spatial prepositions, or implicit -- expressed by spatial verbs such as moving, walking, shifting, etc. Both these, but implicit relations in particular…
▽ More
Recognizing spatial relations and reasoning about them is essential in multiple applications including navigation, direction giving and human-computer interaction in general. Spatial relations between objects can either be explicit -- expressed as spatial prepositions, or implicit -- expressed by spatial verbs such as moving, walking, shifting, etc. Both these, but implicit relations in particular, require significant common sense understanding. In this paper, we introduce the task of inferring implicit and explicit spatial relations between two entities in an image. We design a model that uses both textual and visual information to predict the spatial relations, making use of both positional and size information of objects and image embeddings. We contrast our spatial model with powerful language models and show how our modeling complements the power of these, improving prediction accuracy and coverage and facilitates dealing with unseen subjects, objects and relations.
△ Less
Submitted 18 July, 2020;
originally announced July 2020.
-
Observation of enhanced double parton scattering in proton-lead collisions at $\sqrt{s_\mathrm{NN}}=8.16$ TeV
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
G. Alkhazov,
P. Alvarez Cartelle,
A. A. Alves Jr,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
G. Andreassi,
A. Andreianov
, et al. (978 additional authors not shown)
Abstract:
A study of prompt charm-hadron pair production in proton-lead collisions at $\sqrt{s_\mathrm{NN}}= 8.16$ TeV is performed using data corresponding to an integrated luminosity of about 30 nb${}^{-1}$, collected with the LHCb experiment. Production cross-sections for different pairs of charm hadrons are measured and kinematic correlations between the two charm hadrons are investigated. This is the f…
▽ More
A study of prompt charm-hadron pair production in proton-lead collisions at $\sqrt{s_\mathrm{NN}}= 8.16$ TeV is performed using data corresponding to an integrated luminosity of about 30 nb${}^{-1}$, collected with the LHCb experiment. Production cross-sections for different pairs of charm hadrons are measured and kinematic correlations between the two charm hadrons are investigated. This is the first measurement of associated production of two charm hadrons in proton-lead collisions. The results confirm the predicted enhancement of double parton scattering production in proton-lead collisions compared to the single parton scattering production.
△ Less
Submitted 24 November, 2020; v1 submitted 14 July, 2020;
originally announced July 2020.
-
SacreROUGE: An Open-Source Library for Using and Develo** Summarization Evaluation Metrics
Authors:
Daniel Deutsch,
Dan Roth
Abstract:
We present SacreROUGE, an open-source library for using and develo** summarization evaluation metrics. SacreROUGE removes many obstacles that researchers face when using or develo** metrics: (1) The library provides Python wrappers around the official implementations of existing evaluation metrics so they share a common, easy-to-use interface; (2) it provides functionality to evaluate how well…
▽ More
We present SacreROUGE, an open-source library for using and develo** summarization evaluation metrics. SacreROUGE removes many obstacles that researchers face when using or develo** metrics: (1) The library provides Python wrappers around the official implementations of existing evaluation metrics so they share a common, easy-to-use interface; (2) it provides functionality to evaluate how well any metric implemented in the library correlates to human-annotated judgments, so no additional code needs to be written for a new evaluation metric; and (3) it includes scripts for loading datasets that contain human judgments so they can easily be used for evaluation. This work describes the design of the library, including the core Metric interface, the command-line API for evaluating summarization models and metrics, and the scripts to load and reformat publicly available datasets. The development of SacreROUGE is ongoing and open to contributions from the community.
△ Less
Submitted 10 July, 2020;
originally announced July 2020.
-
First observation of the decay $B^0 \rightarrow D^0 \overline{D}{}^0 K^+ π^-$
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
G. Alkhazov,
P. Alvarez Cartelle,
A. A. Alves Jr,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
G. Andreassi,
A. Andreianov
, et al. (949 additional authors not shown)
Abstract:
The first observation of the decay $B^0 \rightarrow D^0 \overline{D}{}^0 K^+ π^-$ is reported using proton-proton collision data corresponding to an integrated luminosity of 4.7 $\mathrm{fb}^{-1}$ collected by the LHCb experiment in 2011, 2012 and 2016. The measurement is performed in the full kinematically allowed range of the decay outside of the $D^{*-}$ region. The ratio of the branching fract…
▽ More
The first observation of the decay $B^0 \rightarrow D^0 \overline{D}{}^0 K^+ π^-$ is reported using proton-proton collision data corresponding to an integrated luminosity of 4.7 $\mathrm{fb}^{-1}$ collected by the LHCb experiment in 2011, 2012 and 2016. The measurement is performed in the full kinematically allowed range of the decay outside of the $D^{*-}$ region. The ratio of the branching fraction relative to that of the control channel $B^0 \rightarrow D^{*-} D^0 K^+$ is measured to be $\mathcal{R} = (14.2 \pm 1.1 \pm 1.0)\%$, where the first uncertainty is statistical and the second is systematic. The absolute branching fraction of $B^0 \rightarrow D^0 \overline{D}{}^0 K^+ π^-$ decays is thus determined to be $\mathcal{B}(B^0 \rightarrow D^0 \overline{D}{}^0 K^+ π^-) = (3.50 \pm 0.27 \pm 0.26 \pm 0.30) \times 10^{-4}$, where the third uncertainty is due to the branching fraction of the control channel. This decay mode is expected to provide insights to spectroscopy and the charm-loop contributions in rare semileptonic decays.
△ Less
Submitted 22 September, 2020; v1 submitted 8 July, 2020;
originally announced July 2020.
-
Searches for low-mass dimuon resonances
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
G. Alkhazov,
P. Alvarez Cartelle,
A. A. Alves Jr,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
G. Andreassi,
A. Andreianov
, et al. (949 additional authors not shown)
Abstract:
Searches are performed for a low-mass dimuon resonance, $X$, produced in proton-proton collisions at a center-of-mass energy of 13 TeV, using a data sample corresponding to an integrated luminosity of 5.1 fb$^{-1}$ and collected with the LHCb detector. The $X$ bosons can either decay promptly or displaced from the proton-proton collision, where in both cases the requirements placed on the event an…
▽ More
Searches are performed for a low-mass dimuon resonance, $X$, produced in proton-proton collisions at a center-of-mass energy of 13 TeV, using a data sample corresponding to an integrated luminosity of 5.1 fb$^{-1}$ and collected with the LHCb detector. The $X$ bosons can either decay promptly or displaced from the proton-proton collision, where in both cases the requirements placed on the event and the assumptions made about the production mechanisms are kept as minimal as possible. The searches for promptly decaying $X$ bosons explore the mass range from near the dimuon threshold up to 60 GeV, with nonnegligible $X$ widths considered above 20 GeV. The searches for displaced $X \to μ^+μ^-$ decays consider masses up to 3 GeV. None of the searches finds evidence for a signal and 90% confidence-level exclusion limits are placed on the $X \to μ^+μ^-$ cross sections, each with minimal model dependence. In addition, these results are used to place world-leading constraints on GeV-scale bosons in the two-Higgs-doublet and hidden-valley scenarios.
△ Less
Submitted 2 November, 2020; v1 submitted 8 July, 2020;
originally announced July 2020.
-
Observation of structure in the $J/ψ$-pair mass spectrum
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
G. Alkhazov,
P. Alvarez Cartelle,
A. A. Alves Jr,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
G. Andreassi,
A. Andreianov
, et al. (948 additional authors not shown)
Abstract:
Using proton-proton collision data at centre-of-mass energies of $\sqrt{s} = 7$, $8$ and $13\mathrm{\,TeV}$ recorded by the LHCb experiment at the Large Hadron Collider, corresponding to an integrated luminosity of $9\mathrm{\,fb}^{-1}$, the invariant mass spectrum of $J/ψ$ pairs is studied. A narrow structure around $6.9\mathrm{\,GeV/}c^2$ matching the lineshape of a resonance and a broad structu…
▽ More
Using proton-proton collision data at centre-of-mass energies of $\sqrt{s} = 7$, $8$ and $13\mathrm{\,TeV}$ recorded by the LHCb experiment at the Large Hadron Collider, corresponding to an integrated luminosity of $9\mathrm{\,fb}^{-1}$, the invariant mass spectrum of $J/ψ$ pairs is studied. A narrow structure around $6.9\mathrm{\,GeV/}c^2$ matching the lineshape of a resonance and a broad structure just above twice the $J/ψ$ mass are observed. The deviation of the data from nonresonant $J/ψ$-pair production is above five standard deviations in the mass region between $6.2$ and $7.4\mathrm{\,GeV/}c^2$, covering predicted masses of states composed of four charm quarks. The mass and natural width of the narrow $X(6900)$ structure are measured assuming a Breit--Wigner lineshape.
△ Less
Submitted 10 November, 2020; v1 submitted 30 June, 2020;
originally announced June 2020.
-
Building Low-Resource NER Models Using Non-Speaker Annotation
Authors:
Tatiana Tsygankova,
Francesca Marini,
Stephen Mayhew,
Dan Roth
Abstract:
In low-resource natural language processing (NLP), the key problems are a lack of target language training data, and a lack of native speakers to create it. Cross-lingual methods have had notable success in addressing these concerns, but in certain common circumstances, such as insufficient pre-training corpora or languages far from the source language, their performance suffers. In this work we p…
▽ More
In low-resource natural language processing (NLP), the key problems are a lack of target language training data, and a lack of native speakers to create it. Cross-lingual methods have had notable success in addressing these concerns, but in certain common circumstances, such as insufficient pre-training corpora or languages far from the source language, their performance suffers. In this work we propose a complementary approach to building low-resource Named Entity Recognition (NER) models using ``non-speaker'' (NS) annotations, provided by annotators with no prior experience in the target language. We recruit 30 participants in a carefully controlled annotation experiment with Indonesian, Russian, and Hindi. We show that use of NS annotators produces results that are consistently on par or better than cross-lingual methods built on modern contextual representations, and have the potential to outperform with additional effort. We conclude with observations of common annotation patterns and recommended implementation practices, and motivate how NS annotations can be used in addition to prior methods for improved performance. For more details, http://cogcomp.org/page/publication_view/941
△ Less
Submitted 26 April, 2021; v1 submitted 16 June, 2020;
originally announced June 2020.
-
Learnability with Indirect Supervision Signals
Authors:
Kaifu Wang,
Qiang Ning,
Dan Roth
Abstract:
Learning from indirect supervision signals is important in real-world AI applications when, often, gold labels are missing or too costly. In this paper, we develop a unified theoretical framework for multi-class classification when the supervision is provided by a variable that contains nonzero mutual information with the gold label. The nature of this problem is determined by (i) the transition p…
▽ More
Learning from indirect supervision signals is important in real-world AI applications when, often, gold labels are missing or too costly. In this paper, we develop a unified theoretical framework for multi-class classification when the supervision is provided by a variable that contains nonzero mutual information with the gold label. The nature of this problem is determined by (i) the transition probability from the gold labels to the indirect supervision variables and (ii) the learner's prior knowledge about the transition. Our framework relaxes assumptions made in the literature, and supports learning with unknown, non-invertible and instance-dependent transitions. Our theory introduces a novel concept called \emph{separation}, which characterizes the learnability and generalization bounds. We also demonstrate the application of our framework via concrete novel results in a variety of learning scenarios such as learning with superset annotations and joint supervision signals.
△ Less
Submitted 11 November, 2020; v1 submitted 15 June, 2020;
originally announced June 2020.
-
Foreseeing the Benefits of Incidental Supervision
Authors:
Hangfeng He,
Mingyuan Zhang,
Qiang Ning,
Dan Roth
Abstract:
Real-world applications often require improved models by leveraging a range of cheap incidental supervision signals. These could include partial labels, noisy labels, knowledge-based constraints, and cross-domain or cross-task annotations -- all having statistical associations with gold annotations but not exactly the same. However, we currently lack a principled way to measure the benefits of the…
▽ More
Real-world applications often require improved models by leveraging a range of cheap incidental supervision signals. These could include partial labels, noisy labels, knowledge-based constraints, and cross-domain or cross-task annotations -- all having statistical associations with gold annotations but not exactly the same. However, we currently lack a principled way to measure the benefits of these signals to a given target task, and the common practice of evaluating these benefits is through exhaustive experiments with various models and hyperparameters. This paper studies whether we can, in a single framework, quantify the benefits of various types of incidental signals for a given target task without going through combinatorial experiments. We propose a unified PAC-Bayesian motivated informativeness measure, PABI, that characterizes the uncertainty reduction provided by incidental supervision signals. We demonstrate PABI's effectiveness by quantifying the value added by various types of incidental signals to sequence tagging tasks. Experiments on named entity recognition (NER) and question answering (QA) show that PABI's predictions correlate well with learning performance, providing a promising way to determine, ahead of learning, which supervision signals would be beneficial.
△ Less
Submitted 10 September, 2021; v1 submitted 9 June, 2020;
originally announced June 2020.
-
Search for $CP$ violation in $Ξ_c^+\rightarrow pK^-π^+$ decays using model-independent techniques
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
G. Alkhazov,
P. Alvarez Cartelle,
A. A. Alves Jr,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
G. Andreassi,
M. Andreotti
, et al. (932 additional authors not shown)
Abstract:
A first search for $CP$ violation in the Cabibbo-suppressed $Ξ_c^+\rightarrow pK^-π^+$ decay is performed using both a binned and an unbinned model-independent technique in the Dalitz plot. The studies are based on a sample of proton-proton collision data, corresponding to an integrated luminosity of $3.0~{\rm fb^{-1}}$, and collected by the LHCb experiment at centre-of-mass energies of $7$ and…
▽ More
A first search for $CP$ violation in the Cabibbo-suppressed $Ξ_c^+\rightarrow pK^-π^+$ decay is performed using both a binned and an unbinned model-independent technique in the Dalitz plot. The studies are based on a sample of proton-proton collision data, corresponding to an integrated luminosity of $3.0~{\rm fb^{-1}}$, and collected by the LHCb experiment at centre-of-mass energies of $7$ and $8~\rm TeV$. The data are consistent with the hypothesis of no $CP$ violation.
△ Less
Submitted 2 November, 2020; v1 submitted 4 June, 2020;
originally announced June 2020.
-
Study of the $ψ_2(3823)$ and $χ_{c1}(3872)$ states in $B^+ \rightarrow \left( Jψπ^+π^-\right)K^+$ decays
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
G. Alkhazov,
P. Alvarez Cartelle,
A. A. Alves Jr,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
G. Andreassi,
A. Andreianov
, et al. (940 additional authors not shown)
Abstract:
The decays $B^+\rightarrow J/ψπ^+ π^- K^+$ are studied using a data set corresponding to an integrated luminosity of 9fb$^{-1}$ collected with the LHCb detector in proton-proton collisions between 2011 and 2018. Precise measurements of the ratios of branching fractions with the intermediate $ψ_2(3823)$, $χ_{c1}(3872)$ and $ψ(2S)$ states are reported. The decay of $B^+\rightarrow ψ_2(3823)K^+$ with…
▽ More
The decays $B^+\rightarrow J/ψπ^+ π^- K^+$ are studied using a data set corresponding to an integrated luminosity of 9fb$^{-1}$ collected with the LHCb detector in proton-proton collisions between 2011 and 2018. Precise measurements of the ratios of branching fractions with the intermediate $ψ_2(3823)$, $χ_{c1}(3872)$ and $ψ(2S)$ states are reported. The decay of $B^+\rightarrow ψ_2(3823)K^+$ with $ψ_2(3823)\rightarrow Jψπ^+π^-$ is observed for the first time with a significance of 5.1 standard deviations. The mass differences between the $ψ_2(3823)$, $χ_{c1}(3872)$ and $ψ(2S)$ states are measured to be $$ \begin{array}{rcl} m_{χ_{c1(3872)}} - m_{ψ_2(3823)} &= & 47.50 \pm 0.53 \pm 0.13\,\mathrm{MeV/}c^2\,, \\ m_{ψ_2(3823)} - m_{ψ(2S)} &= & 137.98 \pm 0.53 \pm 0.14\,\mathrm{MeV/}c^2\,, \\ m_{χ_{c1}(3872)} - m_{ψ(2S)} &= & 185.49 \pm 0.06 \pm 0.03\,\mathrm{MeV/}c^2\,, \end{array} $$ resulting in the most precise determination of the $χ_{c1}(3782)$ mass. The width of the $ψ_2(3823)$ state is found to be below 5.2MeV at 90\% confidence level. The Breit-Wigner width of the $χ_{c1}(3872)$ state is measured to be $$ Γ^{\mathrm{BW}}_{χ_{c1}(3872)} = 0.96^{+0.19}_{-0.18}\pm0.21 \mathrm{MeV},$$ which is inconsistent with zero by 5.5 standard deviations.
△ Less
Submitted 17 September, 2021; v1 submitted 27 May, 2020;
originally announced May 2020.
-
Study of the lineshape of the $χ_{c1}(3872)$ state
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
Z. Aliouche,
G. Alkhazov,
P. Alvarez Cartelle,
A. A. Alves Jr,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
G. Andreassi,
A. Andreianov
, et al. (949 additional authors not shown)
Abstract:
A study of the lineshape of the $χ_{c1}(3872)$ state is made using a data sample corresponding to an integrated luminosity of $3\,$fb$^{-1}$ collected in $pp$ collisions at centre-of-mass energies of 7 and 8\,TeV with the LHCb detector. Candidate $χ_{c1}(3872)$ and $ψ(2S)$ mesons from b-hadron decays are selected in the $ J/ψπ^+ π^-$ decay mode. Describing the {\mbox{lineshape}} with a Breit--Wign…
▽ More
A study of the lineshape of the $χ_{c1}(3872)$ state is made using a data sample corresponding to an integrated luminosity of $3\,$fb$^{-1}$ collected in $pp$ collisions at centre-of-mass energies of 7 and 8\,TeV with the LHCb detector. Candidate $χ_{c1}(3872)$ and $ψ(2S)$ mesons from b-hadron decays are selected in the $ J/ψπ^+ π^-$ decay mode. Describing the {\mbox{lineshape}} with a Breit--Wigner function, the mass splitting between the $χ_{c1}(3872)$ and $ψ(2S)$ states, $Δm$, and the width of the $χ_{c1}(3872)$ state, $Γ_{\mathrm{BW}}$, are determined to be \begin{eqnarray*} Δm & = & 185.598 \pm 0.067 \pm 0.068\, \mathrm{MeV} \,, \\ Γ_{\mathrm{BW}} & = & \phantom{00}1.39\phantom{0} \pm 0.24\phantom{0} \pm 0.10\phantom{0} \mathrm{MeV} \,, \end{eqnarray*} where the first uncertainty is statistical and the second systematic. Using a Flatté-inspired model, the mode and full width at half maximum of the lineshape are determined to be \begin{eqnarray*} \mathrm{mode} & = 3871.69^{\,+\,0.00\,+\,0.05}_{\,-\,0.04\,-\,0.13} &\mathrm{MeV} \\ \mathrm{FWHM} & = 0.22^{\,+\,0.07\,+\,0.11}_{\,-\,0.06\,-\,0.13}& \mathrm{MeV} . \end{eqnarray*} An investigation of the analytic structure of the Flatté amplitude reveals a pole structure, which is compatible with a quasi-bound $D^0\bar{D}^{*0}$ state but a quasi-virtual state is still allowed at the level of $2$ standard deviations.
△ Less
Submitted 12 March, 2021; v1 submitted 27 May, 2020;
originally announced May 2020.
-
Incidental Supervision: Moving beyond Supervised Learning
Authors:
Dan Roth
Abstract:
Machine Learning and Inference methods have become ubiquitous in our attempt to induce more abstract representations of natural language text, visual scenes, and other messy, naturally occurring data, and support decisions that depend on it. However, learning models for these tasks is difficult partly because generating the necessary supervision signals for it is costly and does not scale. This pa…
▽ More
Machine Learning and Inference methods have become ubiquitous in our attempt to induce more abstract representations of natural language text, visual scenes, and other messy, naturally occurring data, and support decisions that depend on it. However, learning models for these tasks is difficult partly because generating the necessary supervision signals for it is costly and does not scale. This paper describes several learning paradigms that are designed to alleviate the supervision bottleneck. It will illustrate their benefit in the context of multiple problems, all pertaining to inducing various levels of semantic representations from text.
△ Less
Submitted 25 May, 2020;
originally announced May 2020.
-
Measurement of branching fraction ratios for $B^+\to D^{*+}D^-K^+$, $B^+\to D^{*-}D^+K^+$, and $B^0\to D^{*-}D^0K^+$ decays
Authors:
LHCb collaboration,
R. Aaij,
C. Abellán Beteta,
T. Ackernley,
B. Adeva,
M. Adinolfi,
H. Afsharnia,
C. A. Aidala,
S. Aiola,
Z. Ajaltouni,
S. Akar,
J. Albrecht,
F. Alessio,
M. Alexander,
A. Alfonso Albero,
G. Alkhazov,
P. Alvarez Cartelle,
A. A. Alves Jr,
S. Amato,
Y. Amhis,
L. An,
L. Anderlini,
G. Andreassi,
A. Andreianov,
M. Andreotti
, et al. (896 additional authors not shown)
Abstract:
A measurement of four branching-fraction ratios for three-body decays of $B$ mesons involving two open-charm hadrons in the final state is presented. Run 1 and Run 2 $pp$ collision data are used, recorded by the LHCb experiment at centre-of-mass energies $7$, $8$, and $13$ TeV and corresponding to an integrated luminosity of $9$ fb$^{-1}$. The measured branching-fraction ratios are \[ \begin{eqnar…
▽ More
A measurement of four branching-fraction ratios for three-body decays of $B$ mesons involving two open-charm hadrons in the final state is presented. Run 1 and Run 2 $pp$ collision data are used, recorded by the LHCb experiment at centre-of-mass energies $7$, $8$, and $13$ TeV and corresponding to an integrated luminosity of $9$ fb$^{-1}$. The measured branching-fraction ratios are \[ \begin{eqnarray} \frac{\mathcal{B} (B^+\to D^{*+}D^-K^+)}{\mathcal{B} (B^+\to \kern 0.2em\overline{\kern -0.2em D}{}^0 D^0 K^+)} &=& 0.517 \pm 0.015 \pm 0.013 \pm 0.011 , \\ \frac{\mathcal{B} (B^+\to D^{*-}D^+K^+)}{\mathcal{B} (B^+\to \kern 0.2em\overline{\kern -0.2em D}{}^0 D^0 K^+)} &=& 0.577 \pm 0.016 \pm 0.013 \pm 0.013 , \\ \frac{\mathcal{B} (B^0\to D^{*-}D^0K^+)}{\mathcal{B} (B^0\to D^- D^0 K^+)} &=& 1.754 \pm 0.028 \pm 0.016 \pm 0.035 , \\ \frac{\mathcal{B} (B^+\to D^{*+}D^-K^+)}{\mathcal{B} (B^+\to D^{*-}D^+K^+)} &=& 0.907 \pm 0.033 \pm 0.014 ,\end{eqnarray} \] where the first of the uncertainties is statistical, the second systematic, and the third is due to the uncertainties on the $D$-meson branching fractions. These are the most accurate measurements of these ratios to date.
△ Less
Submitted 6 January, 2021; v1 submitted 20 May, 2020;
originally announced May 2020.
-
Text Classification with Few Examples using Controlled Generalization
Authors:
Abhijit Mahabal,
Jason Baldridge,
Burcu Karagol Ayan,
Vincent Perot,
Dan Roth
Abstract:
Training data for text classification is often limited in practice, especially for applications with many output classes or involving many related classification problems. This means classifiers must generalize from limited evidence, but the manner and extent of generalization is task dependent. Current practice primarily relies on pre-trained word embeddings to map words unseen in training to sim…
▽ More
Training data for text classification is often limited in practice, especially for applications with many output classes or involving many related classification problems. This means classifiers must generalize from limited evidence, but the manner and extent of generalization is task dependent. Current practice primarily relies on pre-trained word embeddings to map words unseen in training to similar seen ones. Unfortunately, this squishes many components of meaning into highly restricted capacity. Our alternative begins with sparse pre-trained representations derived from unlabeled parsed corpora; based on the available training data, we select features that offers the relevant generalizations. This produces task-specific semantic vectors; here, we show that a feed-forward network over these vectors is especially effective in low-data scenarios, compared to existing state-of-the-art methods. By further pairing this network with a convolutional neural network, we keep this edge in low data scenarios and remain competitive when using full training sets.
△ Less
Submitted 18 May, 2020;
originally announced May 2020.