Skip to main content

Showing 1–14 of 14 results for author: Lecouteux, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.12621  [pdf, other

    cs.CL

    Growing Trees on Sounds: Assessing Strategies for End-to-End Dependency Parsing of Speech

    Authors: Adrien Pupier, Maximin Coavoux, Jérôme Goulian, Benjamin Lecouteux

    Abstract: Direct dependency parsing of the speech signal -- as opposed to parsing speech transcriptions -- has recently been proposed as a task (Pupier et al. 2022), as a way of incorporating prosodic information in the parsing system and bypassing the limitations of a pipeline approach that would consist of using first an Automatic Speech Recognition (ASR) system and then a syntactic parser. In this articl… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted at ACL 2024

  2. arXiv:2403.02173  [pdf, other

    cs.CL

    What has LeBenchmark Learnt about French Syntax?

    Authors: Zdravko Dugonjić, Adrien Pupier, Benjamin Lecouteux, Maximin Coavoux

    Abstract: The paper reports on a series of experiments aiming at probing LeBenchmark, a pretrained acoustic model trained on 7k hours of spoken French, for syntactic information. Pretrained acoustic models are increasingly used for downstream speech tasks such as automatic speech recognition, speech translation, spoken language understanding or speech parsing. They are trained on very low level information… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: Accepted to LREC-COLING 2024

  3. arXiv:2309.05472  [pdf, other

    cs.CL cs.AI cs.SD eess.AS

    LeBenchmark 2.0: a Standardized, Replicable and Enhanced Framework for Self-supervised Representations of French Speech

    Authors: Titouan Parcollet, Ha Nguyen, Solene Evain, Marcely Zanon Boito, Adrien Pupier, Salima Mdhaffar, Hang Le, Sina Alisamir, Natalia Tomashenko, Marco Dinarelli, Shucong Zhang, Alexandre Allauzen, Maximin Coavoux, Yannick Esteve, Mickael Rouvier, Jerome Goulian, Benjamin Lecouteux, Francois Portet, Solange Rossato, Fabien Ringeval, Didier Schwab, Laurent Besacier

    Abstract: Self-supervised learning (SSL) is at the origin of unprecedented improvements in many different domains including computer vision and natural language processing. Speech processing drastically benefitted from SSL as most of the current domain-related tasks are now being approached with pre-trained models. This work introduces LeBenchmark 2.0 an open-source framework for assessing and building SSL-… ▽ More

    Submitted 18 March, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

    Comments: Published in Computer Science and Language. Preprint allowed

  4. arXiv:2301.11716  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Pre-training for Speech Translation: CTC Meets Optimal Transport

    Authors: Phuong-Hang Le, Hongyu Gong, Changhan Wang, Juan Pino, Benjamin Lecouteux, Didier Schwab

    Abstract: The gap between speech and text modalities is a major challenge in speech-to-text translation (ST). Different methods have been proposed to reduce this gap, but most of them require architectural changes in ST training. In this work, we propose to mitigate this issue at the pre-training stage, requiring no change in the ST model. First, we show that the connectionist temporal classification (CTC)… ▽ More

    Submitted 5 June, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: ICML 2023 (oral presentation). This version fixed URLs, updated affiliations & acknowledgements, and improved formatting

  5. LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech

    Authors: Solene Evain, Ha Nguyen, Hang Le, Marcely Zanon Boito, Salima Mdhaffar, Sina Alisamir, Ziyi Tong, Natalia Tomashenko, Marco Dinarelli, Titouan Parcollet, Alexandre Allauzen, Yannick Esteve, Benjamin Lecouteux, Francois Portet, Solange Rossato, Fabien Ringeval, Didier Schwab, Laurent Besacier

    Abstract: Self-Supervised Learning (SSL) using huge unlabeled data has been successfully explored for image and natural language processing. Recent works also investigated SSL from speech. They were notably successful to improve performance on downstream tasks such as automatic speech recognition (ASR). While these works suggest it is possible to reduce dependence on labeled data for building efficient spee… ▽ More

    Submitted 10 June, 2021; v1 submitted 23 April, 2021; originally announced April 2021.

    Comments: Will be presented at Interspeech 2021

    Journal ref: Proc. Interspeech 2021

  6. arXiv:2005.11861  [pdf, other

    cs.CL eess.AS

    ON-TRAC Consortium for End-to-End and Simultaneous Speech Translation Challenge Tasks at IWSLT 2020

    Authors: Maha Elbayad, Ha Nguyen, Fethi Bougares, Natalia Tomashenko, Antoine Caubrière, Benjamin Lecouteux, Yannick Estève, Laurent Besacier

    Abstract: This paper describes the ON-TRAC Consortium translation systems developed for two challenge tracks featured in the Evaluation Campaign of IWSLT 2020, offline speech translation and simultaneous speech translation. ON-TRAC Consortium is composed of researchers from three French academic laboratories: LIA (Avignon Université), LIG (Université Grenoble Alpes), and LIUM (Le Mans Université). Attention… ▽ More

    Submitted 24 May, 2020; originally announced May 2020.

  7. arXiv:1912.05372  [pdf, ps, other

    cs.CL cs.LG

    FlauBERT: Unsupervised Language Model Pre-training for French

    Authors: Hang Le, Loïc Vial, Jibril Frej, Vincent Segonne, Maximin Coavoux, Benjamin Lecouteux, Alexandre Allauzen, Benoît Crabbé, Laurent Besacier, Didier Schwab

    Abstract: Language models have become a key step to achieve state-of-the art results in many different Natural Language Processing (NLP) tasks. Leveraging the huge amount of unlabeled texts nowadays available, they provide an efficient way to pre-train continuous word representations that can be fine-tuned for a downstream task, along with their contextualization at the sentence level. This has been widely… ▽ More

    Submitted 12 March, 2020; v1 submitted 11 December, 2019; originally announced December 2019.

    Comments: Accepted to LREC 2020

  8. arXiv:1911.02898  [pdf, other

    cs.CL

    The LIG system for the English-Czech Text Translation Task of IWSLT 2019

    Authors: Loïc Vial, Benjamin Lecouteux, Didier Schwab, Hang Le, Laurent Besacier

    Abstract: In this paper, we present our submission for the English to Czech Text Translation Task of IWSLT 2019. Our system aims to study how pre-trained language models, used as input embeddings, can improve a specialized machine translation system trained on few data. Therefore, we implemented a Transformer-based encoder-decoder neural system which is able to use the output of a pre-trained language model… ▽ More

    Submitted 7 November, 2019; originally announced November 2019.

    Comments: IWSLT 2019

  9. arXiv:1905.05677  [pdf, other

    cs.CL

    Sense Vocabulary Compression through the Semantic Knowledge of WordNet for Neural Word Sense Disambiguation

    Authors: Loïc Vial, Benjamin Lecouteux, Didier Schwab

    Abstract: In this article, we tackle the issue of the limited quantity of manually sense annotated corpora for the task of word sense disambiguation, by exploiting the semantic relationships between senses such as synonymy, hypernymy and hyponymy, in order to compress the sense vocabulary of Princeton WordNet, and thus reduce the number of different sense tags that must be observed to disambiguate all words… ▽ More

    Submitted 27 August, 2019; v1 submitted 14 May, 2019; originally announced May 2019.

    Comments: In proceedings of the 10th Global WordNet Conference - GWC 2019. arXiv admin note: text overlap with arXiv:1811.00960

  10. arXiv:1811.00960  [pdf, other

    cs.CL

    Improving the Coverage and the Generalization Ability of Neural Word Sense Disambiguation through Hypernymy and Hyponymy Relationships

    Authors: Loïc Vial, Benjamin Lecouteux, Didier Schwab

    Abstract: In Word Sense Disambiguation (WSD), the predominant approach generally involves a supervised system trained on sense annotated corpora. The limited quantity of such corpora however restricts the coverage and the performance of these systems. In this article, we propose a new method that solves these issues by taking advantage of the knowledge present in WordNet, and especially the hypernymy and hy… ▽ More

    Submitted 2 November, 2018; originally announced November 2018.

  11. arXiv:1808.08573  [pdf, other

    cs.CL

    Analyzing Learned Representations of a Deep ASR Performance Prediction Model

    Authors: Zied Elloumi, Laurent Besacier, Olivier Galibert, Benjamin Lecouteux

    Abstract: This paper addresses a relatively new task: prediction of ASR performance on unseen broadcast programs. In a previous paper, we presented an ASR performance prediction system using CNNs that encode both text (ASR transcript) and speech, in order to predict word error rate. This work is dedicated to the analysis of speech signal embeddings and text embeddings learnt by the CNN while training our pr… ▽ More

    Submitted 28 August, 2018; v1 submitted 26 August, 2018; originally announced August 2018.

    Comments: EMNLP 2018 Workshop

  12. arXiv:1804.08477  [pdf, other

    cs.CL

    ASR Performance Prediction on Unseen Broadcast Programs using Convolutional Neural Networks

    Authors: Zied Elloumi, Laurent Besacier, Olivier Galibert, Juliette Kahn, Benjamin Lecouteux

    Abstract: In this paper, we address a relatively new task: prediction of ASR performance on unseen broadcast programs. We first propose an heterogenous French corpus dedicated to this task. Two prediction approaches are compared: a state-of-the-art performance prediction based on regression (engineered features) and a new strategy based on convolutional neural networks (learnt features). We particularly foc… ▽ More

    Submitted 23 April, 2018; originally announced April 2018.

    Comments: IEEE ICASSP 2018

  13. arXiv:1709.00678  [pdf, other

    cs.CL

    Disentangling ASR and MT Errors in Speech Translation

    Authors: Ngoc-Tien Le, Benjamin Lecouteux, Laurent Besacier

    Abstract: The main aim of this paper is to investigate automatic quality assessment for spoken language translation (SLT). More precisely, we investigate SLT errors that can be due to transcription (ASR) or to translation (MT) modules. This paper investigates automatic detection of SLT errors using a single classifier based on joint ASR and MT features. We evaluate both 2-class (good/bad) and 3-class (good/… ▽ More

    Submitted 3 September, 2017; originally announced September 2017.

    Comments: Accepted to MT Summit 2017 (Japan)

  14. arXiv:1609.06049  [pdf, other

    cs.CL

    Automatic Quality Assessment for Speech Translation Using Joint ASR and MT Features

    Authors: Ngoc-Tien Le, Benjamin Lecouteux, Laurent Besacier

    Abstract: This paper addresses automatic quality assessment of spoken language translation (SLT). This relatively new task is defined and formalized as a sequence labeling problem where each word in the SLT hypothesis is tagged as good or bad according to a large feature set. We propose several word confidence estimators (WCE) based on our automatic evaluation of transcription (ASR) quality, translation (MT… ▽ More

    Submitted 20 September, 2016; originally announced September 2016.

    Comments: submitted to MT Journal (special issue on spoken language translation)