Showing 1–33 of 33 results for author: Laurent, A

  1. arXiv:2407.02127  [pdf, ps, other]

    math.NA math.OC

    Control theory and splitting methods

    Authors: Karine Beauchard, Adrien Laurent, Frédéric Marbach

    Abstract: Our goal is to highlight some of the deep links between numerical splitting methods and control theory. We consider evolution equations of the form $\dot{x} = f_0(x) + f_1(x)$, where $f_0$ encodes a non-reversible dynamic, so that one is interested in schemes only involving forward flows of $f_0$. In this context, a splitting method can be interpreted as a trajectory of the control-affine system…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 35 pages

  2. arXiv:2406.10073  [pdf, other]

    eess.AS cs.CL cs.HC cs.SD

    Detecting the terminality of speech-turn boundary for spoken interactions in French TV and Radio content

    Authors: Rémi Uro, Marie Tahon, David Doukhan, Antoine Laurent, Albert Rilliard

    Abstract: Transition Relevance Places are defined as the end of an utterance where the interlocutor may take the floor without interrupting the current speaker, i.e., a place where the turn is terminal. Analyzing turn terminality is useful to study the dynamic of turn-taking in spontaneous conversations. This paper presents an automatic classification of spoken utterances as Terminal or Non-Terminal in mul…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Keywords: Spoken interaction, Media, TV, Radio, Transition-Relevance Places, Turn Taking, Interruption. Accepted to InterSpeech 2024, Kos Island, Greece

  3. arXiv:2404.17552  [pdf, other]

    eess.AS cs.CL cs.DL cs.LG cs.SD

    A Semi-Automatic Approach to Create Large Gender- and Age-Balanced Speaker Corpora: Usefulness of Speaker Diarization & Identification

    Authors: Rémi Uro, David Doukhan, Albert Rilliard, Laëtitia Larcher, Anissa-Claire Adgharouamane, Marie Tahon, Antoine Laurent

    Abstract: This paper presents a semi-automatic approach to create a diachronic corpus of voices balanced for speaker's age, gender, and recording period, according to 32 categories (2 genders, 4 age ranges and 4 recording periods). Corpora were selected at the French National Institute of Audiovisual (INA) to obtain at least 30 speakers per category (a total of 960 speakers; only 874 have been found yet). For eac…

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: Keywords: semi-automatic processing, corpus creation, diarization, speaker identification, gender-balanced, age-balanced, speaker corpus, diachrony

    Journal ref: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), pages 3271-3280, Marseille, 20-25 June 2022. European Language Resources Association (ELRA)

  4. arXiv:2309.07478  [pdf, other]

    cs.CL cs.LG cs.SD eess.AS

    Direct Text to Speech Translation System using Acoustic Units

    Authors: Victoria Mingote, Pablo Gimeno, Luis Vicente, Sameer Khurana, Antoine Laurent, Jarod Duret

    Abstract: This paper proposes a direct text to speech translation system using discrete acoustic units. This framework employs text in different source languages as input to generate speech in the target language without the need for text transcriptions in this language. Motivated by the success of acoustic units in previous works for direct speech to speech translation systems, we use the same pipeline to…

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: 5 pages, 4 figures

  5. arXiv:2307.13012  [pdf, other]

    cs.SD cs.AI cs.NE eess.AS eess.SP

    Joint speech and overlap detection: a benchmark over multiple audio setup and speech domains

    Authors: Martin Lebourdais, Théo Mariotte, Marie Tahon, Anthony Larcher, Antoine Laurent, Silvio Montresor, Sylvain Meignier, Jean-Hugh Thomas

    Abstract: Voice activity and overlapped speech detection (respectively VAD and OSD) are key pre-processing tasks for speaker diarization. The final segmentation performance highly relies on the robustness of these sub-tasks. Recent studies have shown VAD and OSD can be trained jointly using a multi-class classification model. However, these works are often restricted to a specific speech domain, lacking inf…

    Submitted 24 July, 2023; originally announced July 2023.

  6. The Lie derivative and Noether's theorem on the aromatic bicomplex for the study of volume-preserving numerical integrators

    Authors: Adrien Laurent

    Abstract: The aromatic bicomplex is an algebraic tool based on aromatic Butcher trees and used in particular for the explicit description of volume-preserving affine-equivariant numerical integrators. The present work defines new tools inspired from variational calculus such as the Lie derivative, different concepts of symmetries, and Noether's theory in the context of aromatic forests. The approach allows…

    Submitted 28 November, 2023; v1 submitted 16 July, 2023; originally announced July 2023.

    Comments: 14 pages

    MSC Class: 58E30; 58J10; 05C05; 41A58; 37M15; 58A12

  7. arXiv:2306.00789  [pdf, other]

    cs.CL cs.AI eess.AS eess.SP

    Improved Cross-Lingual Transfer Learning For Automatic Speech Translation

    Authors: Sameer Khurana, Nauman Dawalatabad, Antoine Laurent, Luis Vicente, Pablo Gimeno, Victoria Mingote, James Glass

    Abstract: Research in multilingual speech-to-text translation is topical. Having a single model that supports multiple translation tasks is desirable. The goal of this work is to improve cross-lingual transfer learning in multilingual speech-to-text translation via semantic knowledge distillation. We show that by initializing the encoder of the encoder-decoder sequence-to-sequence translation model with SAM…

    Submitted 25 January, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

  8. arXiv:2305.10993  [pdf, ps, other]

    math.NA math.PR

    The universal equivariance properties of exotic aromatic B-series

    Authors: Adrien Laurent, Hans Z. Munthe-Kaas

    Abstract: Exotic aromatic B-series were originally introduced for the calculation of order conditions for the high order numerical integration of ergodic stochastic differential equations in $\mathbb{R}^d$ and on manifolds. We prove in this paper that exotic aromatic B-series satisfy a universal geometric property, namely that they are characterised by locality and orthogonal-equivariance. This characterisa…

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: 25 pages

    MSC Class: 15A72; 37C81; 41A58; 60H35; 65C30

  9. The aromatic bicomplex for the description of divergence-free aromatic forms and volume-preserving integrators

    Authors: Adrien Laurent, Robert I. McLachlan, Hans Z. Munthe-Kaas, Olivier Verdier

    Abstract: Aromatic B-series were introduced as an extension of standard Butcher-series for the study of volume-preserving integrators. It was proven with their help that the only volume-preserving B-series method is the exact flow of the differential equation. The question was raised whether there exists a volume-preserving integrator that can be expanded as an aromatic B-series. In this work, we introduce…

    Submitted 26 January, 2023; originally announced January 2023.

    Comments: 41 pages

    MSC Class: 65L06; 41A58; 58J10; 58A12; 37M15; 05C05

    Journal ref: Forum of Mathematics Sigma 11 (2023), E69

  10. A Metaheuristic Approach for Mining Gradual Patterns

    Authors: Dickson Odhiambo Owuor, Thomas Runkler, Anne Laurent

    Abstract: Swarm intelligence is a discipline that studies the collective behavior that is produced by local interactions of a group of individuals with each other and with their environment. In the Computer Science domain, numerous swarm intelligence techniques are applied to optimization problems that seek to efficiently find best solutions within a search space. Gradual pattern mining is another Computer Scie…

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: 42 pages

  11. arXiv:2211.07795  [pdf, other]

    eess.AS cs.AI cs.LG

    On Unsupervised Uncertainty-Driven Speech Pseudo-Label Filtering and Model Calibration

    Authors: Nauman Dawalatabad, Sameer Khurana, Antoine Laurent, James Glass

    Abstract: Pseudo-label (PL) filtering forms a crucial part of Self-Training (ST) methods for unsupervised domain adaptation. Dropout-based Uncertainty-driven Self-Training (DUST) proceeds by first training a teacher model on source domain labeled data. Then, the teacher model is used to provide PLs for the unlabeled target domain data. Finally, we train a student on augmented labeled and pseudo-labeled data…

    Submitted 14 November, 2022; originally announced November 2022.

  12. arXiv:2209.04167  [pdf, other]

    cs.SD cs.AI eess.AS

    Overlapped speech and gender detection with WavLM pre-trained features

    Authors: Martin Lebourdais, Marie Tahon, Antoine Laurent, Sylvain Meignier

    Abstract: This article focuses on overlapped speech and gender detection in order to study interactions between women and men in French audiovisual media (Gender Equality Monitoring project). In this application context, we need to automatically segment the speech signal according to speakers' gender, and to identify when at least two speakers speak at the same time. We propose to use WavLM model which has t…

    Submitted 9 September, 2022; originally announced September 2022.

    Comments: Submitted and accepted to Interspeech 2022

  13. Ant Colony Optimization for Mining Gradual Patterns

    Authors: Dickson Odhiambo Owuor, Thomas Runkler, Anne Laurent, Joseph Orero, Edmond Menya

    Abstract: Gradual pattern extraction is a field in Knowledge Discovery in Databases (KDD) that maps correlations between attributes of a data set as gradual dependencies. A gradual dependency may take the form of "the more Attribute K, the less Attribute L". In this paper, we propose an ant colony optimization technique that uses a probabilistic approach to learn and extract frequent gradual patterns. Throug…

    Submitted 31 August, 2022; originally announced August 2022.

    Comments: 35 pages, journal article

    Journal ref: Int. J. Mach. Learn. & Cyber. 12, 2989--3009 (2021)

  14. arXiv:2207.01893  [pdf, other]

    cs.CL

    ASR-Generated Text for Language Model Pre-training Applied to Speech Tasks

    Authors: Valentin Pelloin, Franck Dary, Nicolas Herve, Benoit Favre, Nathalie Camelin, Antoine Laurent, Laurent Besacier

    Abstract: We aim at improving spoken language modeling (LM) using a very large amount of automatically transcribed speech. We leverage the INA (French National Audiovisual Institute) collection and obtain 19GB of text after applying ASR on 350,000 hours of diverse TV shows. From this, spoken language models are trained either by fine-tuning an existing LM (FlauBERT) or through training a LM from scratch. New…

    Submitted 5 July, 2022; originally announced July 2022.

    Comments: Interspeech 2022 (Camera Ready)

  15. arXiv:2205.08180  [pdf, other]

    cs.CL cs.LG cs.SD eess.AS

    SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation

    Authors: Sameer Khurana, Antoine Laurent, James Glass

    Abstract: We propose the SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation learning framework. Unlike previous works on speech representation learning, which learns multilingual contextual speech embedding at the resolution of an acoustic frame (10-20ms), this work focuses on learning multimodal (speech-text) multilingual speech embedding at the resolution of a s…

    Submitted 17 May, 2022; originally announced May 2022.

  16. arXiv:2205.01987  [pdf, ps, other]

    cs.CL cs.SD eess.AS

    ON-TRAC Consortium Systems for the IWSLT 2022 Dialect and Low-resource Speech Translation Tasks

    Authors: Marcely Zanon Boito, John Ortega, Hugo Riguidel, Antoine Laurent, Loïc Barrault, Fethi Bougares, Firas Chaabani, Ha Nguyen, Florentin Barbier, Souhir Gahbiche, Yannick Estève

    Abstract: This paper describes the ON-TRAC Consortium translation systems developed for two challenge tracks featured in the Evaluation Campaign of IWSLT 2022: low-resource and dialect speech translation. For the Tunisian Arabic-English dataset (low-resource and dialect tracks), we build an end-to-end model as our joint primary submission, and compare it against cascaded models that leverage a large fine-tu…

    Submitted 4 May, 2022; originally announced May 2022.

    Comments: IWSLT 2022 system paper

  17. Magic dust for cross-lingual adaptation of monolingual wav2vec-2.0

    Authors: Sameer Khurana, Antoine Laurent, James Glass

    Abstract: We propose a simple and effective cross-lingual transfer learning method to adapt monolingual wav2vec-2.0 models for Automatic Speech Recognition (ASR) in resource-scarce languages. We show that a monolingual wav2vec-2.0 is a good few-shot ASR learner in several languages. We improve its performance further via several iterations of Dropout Uncertainty-Driven Self-Training (DUST) by using a modera…

    Submitted 7 October, 2021; originally announced October 2021.

  18. A uniformly accurate scheme for the numerical integration of penalized Langevin dynamics

    Authors: Adrien Laurent

    Abstract: In molecular dynamics, penalized overdamped Langevin dynamics are used to model the motion of a set of particles that follow constraints up to a parameter $\varepsilon$. The most used schemes for simulating these dynamics are the Euler integrator in $\mathbb{R}^d$ and the constrained Euler integrator. Both have weak order one of accuracy, but work properly only in specific regimes depending on the…

    Submitted 31 August, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: 27 pages

    MSC Class: 60H35; 70H45; 37M25

    Journal ref: SIAM J. Sci. Comput. 44 (2022), no. 5, A2895-C398

  19. arXiv:2104.04045  [pdf, other]

    eess.AS cs.SD

    End-to-end speaker segmentation for overlap-aware resegmentation

    Authors: Hervé Bredin, Antoine Laurent

    Abstract: Speaker segmentation consists in partitioning a conversation between one or more speakers into speaker turns. Usually addressed as the late combination of three sub-tasks (voice activity detection, speaker change detection, and overlapped speech detection), we propose to train an end-to-end segmentation model that does it directly. Inspired by the original end-to-end neural speaker diarization app…

    Submitted 10 June, 2021; v1 submitted 8 April, 2021; originally announced April 2021.

    Comments: Camera-ready version for Interspeech 2021 with significantly better voice activity detection, overlapped speech detection, and speaker diarization results. The code used for results reported in v1 contained a small bug that has now been fixed

  20. End2End Acoustic to Semantic Transduction

    Authors: Valentin Pelloin, Nathalie Camelin, Antoine Laurent, Renato De Mori, Antoine Caubrière, Yannick Estève, Sylvain Meignier

    Abstract: In this paper, we propose a novel end-to-end sequence-to-sequence spoken language understanding model using an attention mechanism. It reliably selects contextual acoustic features in order to hypothesize semantic contents. An initial architecture capable of extracting all pronounced words and concepts from acoustic spans is designed and tested. With a shallow fusion language model, this system re…

    Submitted 1 February, 2021; originally announced February 2021.

    Comments: Accepted at IEEE ICASSP 2021

    Journal ref: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  21. Order conditions for sampling the invariant measure of ergodic stochastic differential equations on manifolds

    Authors: Adrien Laurent, Gilles Vilmart

    Abstract: We derive a new methodology for the construction of high order integrators for sampling the invariant measure of ergodic stochastic differential equations with dynamics constrained on a manifold. We obtain the order conditions for sampling the invariant measure for a class of Runge-Kutta methods applied to the constrained overdamped Langevin equation. The analysis is valid for arbitrarily high ord…

    Submitted 26 January, 2021; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: 40 pages

    MSC Class: 60H35; 70H45; 37M25; 65L06

    Journal ref: Found. Comput. Math. 22, 649-695 (2022)

  22. arXiv:2006.02814  [pdf, other]

    eess.AS cs.CL cs.LG cs.SD

    CSTNet: Contrastive Speech Translation Network for Self-Supervised Speech Representation Learning

    Authors: Sameer Khurana, Antoine Laurent, James Glass

    Abstract: More than half of the 7,000 languages in the world are in imminent danger of going extinct. Traditional methods of documenting language proceed by collecting audio data followed by manual annotation by trained linguists at different levels of granularity. This time-consuming and painstaking process could benefit from machine learning. Many endangered languages do not have any orthographic form but…

    Submitted 5 August, 2020; v1 submitted 4 June, 2020; originally announced June 2020.

  23. arXiv:2006.02547  [pdf, other]

    eess.AS cs.CL cs.LG cs.SD

    A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning

    Authors: Sameer Khurana, Antoine Laurent, Wei-Ning Hsu, Jan Chorowski, Adrian Lancucki, Ricard Marxer, James Glass

    Abstract: Probabilistic Latent Variable Models (LVMs) provide an alternative to self-supervised learning approaches for linguistic representation learning from speech. LVMs admit an intuitive probabilistic interpretation where the latent structure shapes the information extracted from the signal. Even though LVMs have recently seen a renewed interest due to the introduction of Variational Autoencoders (VAEs…

    Submitted 8 September, 2020; v1 submitted 3 June, 2020; originally announced June 2020.

    Comments: Proceedings of Interspeech, 2020

  24. arXiv:2005.08520  [pdf, other]

    cs.LG cs.CL stat.ML

    Robust Training of Vector Quantized Bottleneck Models

    Authors: Adrian Łańcucki, Jan Chorowski, Guillaume Sanchez, Ricard Marxer, Nanxin Chen, Hans J. G. A. Dolfing, Sameer Khurana, Tanel Alumäe, Antoine Laurent

    Abstract: In this paper we demonstrate methods for reliable and efficient training of discrete representation using Vector-Quantized Variational Auto-Encoder models (VQ-VAEs). Discrete latent variable models have been shown to learn nontrivial representations of speech, applicable to unsupervised voice conversion and reaching state-of-the-art performance on unit discovery tasks. For unsupervised representat…

    Submitted 18 May, 2020; originally announced May 2020.

    Comments: Published at IJCNN 2020

  25. arXiv:2003.14188  [pdf, other]

    physics.optics

    Realization and simulation of high power holmium doped fiber laser for long-range transmission

    Authors: Julien Le Gouët, François Gustave, Pierre Bourdon, Thierry Robin, Arnaud Laurent, Benoit Cadier

    Abstract: We report on our realization of a high power holmium doped fiber laser, together with the validation of our numerical simulation of the laser. We first present the rare absolute measurements of the physical parameters that are mandatory to model accurately the laser-holmium interactions in our silica fiber. We then describe the realization of the clad-pumped laser, based on a triple-clad large mod…

    Submitted 31 March, 2020; originally announced March 2020.

    Comments: 14 pages, 7 figures

  26. Recent Advances in End-to-End Spoken Language Understanding

    Authors: Natalia Tomashenko, Antoine Caubriere, Yannick Esteve, Antoine Laurent, Emmanuel Morin

    Abstract: This work investigates spoken language understanding (SLU) systems in the scenario when the semantic information is extracted directly from the speech signal by means of a single end-to-end neural network model. Two SLU tasks are considered: named entity recognition (NER) and semantic slot filling (SF). For these tasks, in order to improve the model performance, we explore various techniques inclu…

    Submitted 29 September, 2019; originally announced September 2019.

    Journal ref: Statistical Language and Speech Processing. SLSP 2019

  27. arXiv:1906.07601  [pdf, other]

    cs.CL cs.SD eess.AS

    Curriculum-based transfer learning for an effective end-to-end spoken language understanding and domain portability

    Authors: Antoine Caubrière, Natalia Tomashenko, Antoine Laurent, Emmanuel Morin, Nathalie Camelin, Yannick Estève

    Abstract: We present an end-to-end approach to extract semantic concepts directly from the speech audio signal. To overcome the lack of data available for this spoken language understanding approach, we investigate the use of a transfer learning strategy based on the principles of curriculum learning. This approach allows us to exploit out-of-domain data that can help to prepare a fully neural architecture.…

    Submitted 18 June, 2019; originally announced June 2019.

    Comments: Accepted to the INTERSPEECH 2019 conference. Submitted on March 29, 2019 (Paper submission deadline)

  28. Multirevolution integrators for differential equations with fast stochastic oscillations

    Authors: Adrien Laurent, Gilles Vilmart

    Abstract: We introduce a new methodology based on the multirevolution idea for constructing integrators for stochastic differential equations in the situation where the fast oscillations themselves are driven by a Stratonovich noise. Applications include in particular highly-oscillatory Kubo oscillators and spatial discretizations of the nonlinear Schrödinger equation with fast white noise dispersion. We co…

    Submitted 5 October, 2019; v1 submitted 5 February, 2019; originally announced February 2019.

    Comments: 27 pages

    MSC Class: 60H35; 35Q55; 34E13

    Journal ref: SIAM J. Sci. Comput. 42 (2020), no. 1, A115-A139

  29. arXiv:1805.12045  [pdf, other]

    cs.CL

    End-to-end named entity extraction from speech

    Authors: Sahar Ghannay, Antoine Caubrière, Yannick Estève, Antoine Laurent, Emmanuel Morin

    Abstract: Named entity recognition (NER) is among SLU tasks that usually extract semantic information from textual documents. Until now, NER from speech has been done through a pipeline process that consists in first running automatic speech recognition (ASR) on the audio and then running NER on the ASR outputs. Such an approach has some disadvantages (error propagation, metric to tune ASR systems sub-opt…

    Submitted 30 May, 2018; originally announced May 2018.

    Comments: Submitted to Interspeech 2018

    ACM Class: I.2.7

  30. Exotic aromatic B-series for the study of long time integrators for a class of ergodic SDEs

    Authors: Adrien Laurent, Gilles Vilmart

    Abstract: We introduce a new algebraic framework based on a modification (called exotic) of aromatic Butcher-series for the systematic study of the accuracy of numerical integrators for the invariant measure of a class of ergodic stochastic differential equations (SDEs) with additive noise. The proposed analysis covers Runge-Kutta type schemes including the cases of partitioned methods and postprocessed met…

    Submitted 1 July, 2019; v1 submitted 10 July, 2017; originally announced July 2017.

    Comments: 33 pages

    MSC Class: 60H35; 37M25; 65L06; 41A58

    Journal ref: Math. Comp. 89 (2020), 169-202

  31. arXiv:1702.06154  [pdf, other]

    cs.SI physics.soc-ph

    Role model detection using low rank similarity matrix

    Authors: Sibo Cheng, Adissa Laurent, Paul Van Dooren

    Abstract: Computing meaningful clusters of nodes is crucial to analyse large networks. In this paper, we apply new clustering methods to improve the computational time. We use the properties of the adjacency matrix to obtain better role extraction. We also define a new non-recursive similarity measure and compare its results with the ones obtained with Browet's similarity measure. We will show the extractio…

    Submitted 28 January, 2017; originally announced February 2017.

  32. Tutorial in Joint Modeling and Prediction: a Statistical Software for Correlated Longitudinal Outcomes, Recurrent Events and a Terminal Event

    Authors: Agnieszka Król, Audrey Mauguen, Yassin Mazroui, Alexandre Laurent, Stefan Michiels, Virginie Rondeau

    Abstract: Extensions in the field of joint modeling of correlated data and dynamic predictions improve the development of prognosis research. The R package frailtypack provides estimations of various joint models for longitudinal data and survival events. In particular, it fits models for recurrent events and a terminal event (frailtyPenal), models for two survival outcomes for clustered data (frailtyPenal)…

    Submitted 13 January, 2017; originally announced January 2017.

    Comments: Journal of Statistical Software (conditionally accepted for publication)

  33. arXiv:1502.02053  [pdf, other]

    math.DS

    Negative refraction and tiling billiards

    Authors: Diana Davis, Kelsey DiPietro, Jenny Rustad, Alexander St Laurent

    Abstract: We introduce a new dynamical system that we call "tiling billiards," where trajectories refract through planar tilings. This system is motivated by a recent discovery of physical substances with negative indices of refraction. We investigate several special cases where the planar tiling is created by dividing the plane by lines, and we describe the results of computer experiments.

    Submitted 20 September, 2017; v1 submitted 6 February, 2015; originally announced February 2015.

    Comments: 28 pages, 25 figures

    MSC Class: 37E99