-
Semi-inclusive single-jet production in DIS at next-to-leading order in the Color Glass Condensate
Authors:
Paul Caucal,
Elouan Ferrand,
Farid Salazar
Abstract:
Within the Color Glass Condensate (CGC) effective field theory, we derive the next-to-leading order (NLO) cross-section for the single-jet semi-inclusive cross-section in deep inelastic scattering (DIS) at small $x$, for both longitudinally and transversely polarized virtual photons. We provide analytic expressions, valid at finite $N_c$ and suitable for numerical evaluation, for both the cross-se…
▽ More
Within the Color Glass Condensate (CGC) effective field theory, we derive the next-to-leading order (NLO) cross-section for the single-jet semi-inclusive cross-section in deep inelastic scattering (DIS) at small $x$, for both longitudinally and transversely polarized virtual photons. We provide analytic expressions, valid at finite $N_c$ and suitable for numerical evaluation, for both the cross-section differential in rapidity and transverse momentum and the cross-section differential in rapidity only. Our NLO formulae demonstrate that the very forward rapidity regime is plagued by large double logarithmic corrections coming from phase space constraints on soft gluons close to the kinematic threshold for jet production. A joint resummation of small-$x$ and threshold logarithms at single logarithmic accuracy is proposed to remedy the instability of the cross-section in this regime. By integrating over the single-jet phase space, we recover known results for the NLO DIS structure functions at small $x$, previously obtained using the optical theorem.
△ Less
Submitted 3 January, 2024;
originally announced January 2024.
-
Spoken Term Detection Methods for Sparse Transcription in Very Low-resource Settings
Authors:
Éric Le Ferrand,
Steven Bird,
Laurent Besacier
Abstract:
We investigate the efficiency of two very different spoken term detection approaches for transcription when the available data is insufficient to train a robust ASR system. This work is grounded in very low-resource language documentation scenario where only few minutes of recording have been transcribed for a given language so far.Experiments on two oral languages show that a pretrained universal…
▽ More
We investigate the efficiency of two very different spoken term detection approaches for transcription when the available data is insufficient to train a robust ASR system. This work is grounded in very low-resource language documentation scenario where only few minutes of recording have been transcribed for a given language so far.Experiments on two oral languages show that a pretrained universal phone recognizer, fine-tuned with only a few minutes of target language speech, can be used for spoken term detection with a better overall performance than a dynamic time war** approach. In addition, we show that representing phoneme recognition ambiguity in a graph structure can further boost the recall while maintaining high precision in the low resource spoken term detection task.
△ Less
Submitted 11 June, 2021;
originally announced June 2021.
-
Enabling Interactive Transcription in an Indigenous Community
Authors:
Éric Le Ferrand,
Steven Bird,
Laurent Besacier
Abstract:
We propose a novel transcription workflow which combines spoken term detection and human-in-the-loop, together with a pilot experiment. This work is grounded in an almost zero-resource scenario where only a few terms have so far been identified, involving two endangered languages. We show that in the early stages of transcription, when the available data is insufficient to train a robust ASR syste…
▽ More
We propose a novel transcription workflow which combines spoken term detection and human-in-the-loop, together with a pilot experiment. This work is grounded in an almost zero-resource scenario where only a few terms have so far been identified, involving two endangered languages. We show that in the early stages of transcription, when the available data is insufficient to train a robust ASR system, it is possible to take advantage of the transcription of a small number of isolated words in order to bootstrap the transcription of a speech collection.
△ Less
Submitted 11 November, 2020;
originally announced November 2020.
-
MaSS: A Large and Clean Multilingual Corpus of Sentence-aligned Spoken Utterances Extracted from the Bible
Authors:
Marcely Zanon Boito,
William N. Havard,
Mahault Garnerin,
Éric Le Ferrand,
Laurent Besacier
Abstract:
The CMU Wilderness Multilingual Speech Dataset (Black, 2019) is a newly published multilingual speech dataset based on recorded readings of the New Testament. It provides data to build Automatic Speech Recognition (ASR) and Text-to-Speech (TTS) models for potentially 700 languages. However, the fact that the source content (the Bible) is the same for all the languages is not exploited to date.Ther…
▽ More
The CMU Wilderness Multilingual Speech Dataset (Black, 2019) is a newly published multilingual speech dataset based on recorded readings of the New Testament. It provides data to build Automatic Speech Recognition (ASR) and Text-to-Speech (TTS) models for potentially 700 languages. However, the fact that the source content (the Bible) is the same for all the languages is not exploited to date.Therefore, this article proposes to add multilingual links between speech segments in different languages, and shares a large and clean dataset of 8,130 parallel spoken utterances across 8 languages (56 language pairs). We name this corpus MaSS (Multilingual corpus of Sentence-aligned Spoken utterances). The covered languages (Basque, English, Finnish, French, Hungarian, Romanian, Russian and Spanish) allow researches on speech-to-speech alignment as well as on translation for typologically different language pairs. The quality of the final corpus is attested by human evaluation performed on a corpus subset (100 utterances, 8 language pairs). Lastly, we showcase the usefulness of the final product on a bilingual speech retrieval task.
△ Less
Submitted 26 February, 2020; v1 submitted 30 July, 2019;
originally announced July 2019.
-
Positive isotopies of Legendrian submanifolds and applications
Authors:
Vincent Colin,
Emmanuel Ferrand,
Petya Pushkar
Abstract:
We show that there is no positive loop inside the component of a fiber in the space of Legendrian embeddings in the contact manifold $ST^*M$, provided that the universal cover of $M$ is $\RM^n$. We consider some related results in the space of one-jets of functions on a compact manifold. We give an application to the positive isotopies in homogeneous neighborhoods of surfaces in a tight contact 3-…
▽ More
We show that there is no positive loop inside the component of a fiber in the space of Legendrian embeddings in the contact manifold $ST^*M$, provided that the universal cover of $M$ is $\RM^n$. We consider some related results in the space of one-jets of functions on a compact manifold. We give an application to the positive isotopies in homogeneous neighborhoods of surfaces in a tight contact 3-manifold.
△ Less
Submitted 29 April, 2010;
originally announced April 2010.
-
Problems on invariants of knots and 3-manifolds
Authors:
J. E. Andersen,
N. Askitas,
D. Bar-Natan,
S. Baseilhac,
R. Benedetti,
S. Bigelow,
M. Boileau,
R. Bott,
J. S. Carter,
F. Deloup,
N. Dunfield,
R. Fenn,
E. Ferrand,
S. Garoufalidis,
M. Goussarov,
E. Guadagnini,
H. Habiro,
S. K. Hansen,
T. Harikae,
A. Haviv,
M. -J. Jeong,
V. Jones,
R. Kashaev,
Y. Kawahigashi,
T. Kerler
, et al. (35 additional authors not shown)
Abstract:
This is a list of open problems on invariants of knots and 3-manifolds with expositions of their history, background, significance, or importance. This list was made by editing open problems given in problem sessions in the workshop and seminars on `Invariants of Knots and 3-Manifolds' held at Kyoto in 2001.
This is a list of open problems on invariants of knots and 3-manifolds with expositions of their history, background, significance, or importance. This list was made by editing open problems given in problem sessions in the workshop and seminars on `Invariants of Knots and 3-Manifolds' held at Kyoto in 2001.
△ Less
Submitted 9 June, 2004;
originally announced June 2004.
-
On Legendrian knots and polynomial invariants
Authors:
Emmanuel Ferrand
Abstract:
It is proved in this note that the analogues of the Bennequin inequality which provide an upper bound for the Bennequin invariant of a Legendrian knot in the standard contact three dimensional space in terms of the lower degree in the framing variable of the HOMFLY and the Kauffman polynomials are not sharp. Furthermore, the relationships between these restrictions on the range of the Bennequin…
▽ More
It is proved in this note that the analogues of the Bennequin inequality which provide an upper bound for the Bennequin invariant of a Legendrian knot in the standard contact three dimensional space in terms of the lower degree in the framing variable of the HOMFLY and the Kauffman polynomials are not sharp. Furthermore, the relationships between these restrictions on the range of the Bennequin invariant are investigated, which leads to a new simple proof of the inequality involving the Kauffman polynomial.
△ Less
Submitted 21 July, 2000; v1 submitted 29 February, 2000;
originally announced February 2000.