Skip to main content

Showing 1–7 of 7 results for author: Lenain, R

Searching in archive cs. Search in all archives.
.
  1. Obstacle crossing strategies for high-speed 4WD small-scale vehicle

    Authors: Philippe Vaslin, Denis N'Chot, Roland Lenain, Jean-Christophe Fauroux, Lama Al Bassit

    Abstract: Unmanned ground vehicle obstacle crossing generally relies on two strategies: (i) applying a wheel torque for climbing and (ii) modifying the vehicle shape by using a wheel-leg or wheel-paddle to lift the wheel on top of the obstacle. However, most of those strategies sacrifice speed in order to have a longer contact duration between the wheels and the obstacle. This paper investigates the behavio… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Journal ref: SYROM & ROBOTICS 2022, Romanian Association for the Science of Mechanisms and Machines; ARoTMM; Robotics Society of Romania; Mechanical Engineering Faculty - ''Gheorghe Asachi'' Technical University of Iasi, Nov 2022, IASI, Romania. pp.327-336

  2. arXiv:2107.08251  [pdf, other

    cs.CL cs.LG

    Generative Pretraining for Paraphrase Evaluation

    Authors: Jack Weston, Raphael Lenain, Udeepa Meepegama, Emil Fristed

    Abstract: We introduce ParaBLEU, a paraphrase representation learning model and evaluation metric for text generation. Unlike previous approaches, ParaBLEU learns to understand paraphrasis using generative conditioning as a pretraining objective. ParaBLEU correlates more strongly with human judgements than existing metrics, obtaining new state-of-the-art results on the 2017 WMT Metrics Shared Task. We show… ▽ More

    Submitted 24 July, 2021; v1 submitted 17 July, 2021; originally announced July 2021.

    Comments: Under review

  3. arXiv:2107.08248  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Learning De-identified Representations of Prosody from Raw Audio

    Authors: Jack Weston, Raphael Lenain, Udeepa Meepegama, Emil Fristed

    Abstract: We propose a method for learning de-identified prosody representations from raw audio using a contrastive self-supervised signal. Whereas prior work has relied on conditioning models on bottlenecks, we introduce a set of inductive biases that exploit the natural structure of prosody to minimize timbral information and decouple prosody from speaker representations. Despite aggressive downsampling o… ▽ More

    Submitted 17 July, 2021; originally announced July 2021.

    Comments: ICML 2021

    Journal ref: Proceedings of the 38th International Conference on Machine Learning, ICML 2021, 18-24 July 2021, Virtual Event. Proceedings of Machine Learning Research 139, PMLR 2021

  4. Phonological Features for 0-shot Multilingual Speech Synthesis

    Authors: Marlene Staib, Tian Huey Teh, Alexandra Torresquintero, Devang S Ram Mohan, Lorenzo Foglianti, Raphael Lenain, Jiameng Gao

    Abstract: Code-switching---the intra-utterance use of multiple languages---is prevalent across the world. Within text-to-speech (TTS), multilingual models have been found to enable code-switching. By modifying the linguistic input to sequence-to-sequence TTS, we show that code-switching is possible for languages unseen during training, even within monolingual models. We use a small set of phonological featu… ▽ More

    Submitted 6 August, 2020; originally announced August 2020.

    Comments: 5 pages, to be presented at INTERSPEECH 2020

  5. arXiv:2008.03096  [pdf, other

    eess.AS cs.LG cs.SD stat.ML

    Incremental Text to Speech for Neural Sequence-to-Sequence Models using Reinforcement Learning

    Authors: Devang S Ram Mohan, Raphael Lenain, Lorenzo Foglianti, Tian Huey Teh, Marlene Staib, Alexandra Torresquintero, Jiameng Gao

    Abstract: Modern approaches to text to speech require the entire input character sequence to be processed before any audio is synthesised. This latency limits the suitability of such models for time-sensitive tasks like simultaneous interpretation. Interleaving the action of reading a character with that of synthesising audio reduces this latency. However, the order of this sequence of interleaved actions v… ▽ More

    Submitted 7 August, 2020; originally announced August 2020.

    Comments: To be published in Interspeech 2020. 5 pages, 4 figures

  6. arXiv:2005.10219  [pdf, other

    cs.CL cs.LG

    BlaBla: Linguistic Feature Extraction for Clinical Analysis in Multiple Languages

    Authors: Abhishek Shivkumar, Jack Weston, Raphael Lenain, Emil Fristed

    Abstract: We introduce BlaBla, an open-source Python library for extracting linguistic features with proven clinical relevance to neurological and psychiatric diseases across many languages. BlaBla is a unifying framework for accelerating and simplifying clinical linguistic research. The library is built on state-of-the-art NLP frameworks and supports multithreaded/GPU-enabled feature extraction via both na… ▽ More

    Submitted 20 May, 2020; originally announced May 2020.

    Comments: 5 pages. 1 figure. Under review

  7. arXiv:2005.08848  [pdf, ps, other

    cs.SD cs.LG eess.AS

    Surfboard: Audio Feature Extraction for Modern Machine Learning

    Authors: Raphael Lenain, Jack Weston, Abhishek Shivkumar, Emil Fristed

    Abstract: We introduce Surfboard, an open-source Python library for extracting audio features with application to the medical domain. Surfboard is written with the aim of addressing pain points of existing libraries and facilitating joint use with modern machine learning frameworks. The package can be accessed both programmatically in Python and via its command line interface, allowing it to be easily integ… ▽ More

    Submitted 18 May, 2020; originally announced May 2020.

    Comments: 5 pages. 0 figures. Under review