Skip to main content

Showing 1–7 of 7 results for author: Bernard, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2112.05555  [pdf, other

    cs.CL cs.SD eess.AS

    Shennong: a Python toolbox for audio speech features extraction

    Authors: Mathieu Bernard, Maxime Poli, Julien Karadayi, Emmanuel Dupoux

    Abstract: We introduce Shennong, a Python toolbox and command-line utility for speech features extraction. It implements a wide range of well-established state of art algorithms including spectro-temporal filters such as Mel-Frequency Cepstral Filterbanks or Predictive Linear Filters, pre-trained neural networks, pitch estimators as well as speaker normalization methods and post-processing algorithms. Shenn… ▽ More

    Submitted 10 December, 2021; originally announced December 2021.

    Journal ref: Behavior Research Methods, 2023

  2. arXiv:2104.14700  [pdf, ps, other

    cs.CL cs.AI

    The Zero Resource Speech Challenge 2021: Spoken language modelling

    Authors: Ewan Dunbar, Mathieu Bernard, Nicolas Hamilakis, Tu Anh Nguyen, Maureen de Seyssel, Patricia Rozé, Morgane Rivière, Eugene Kharitonov, Emmanuel Dupoux

    Abstract: We present the Zero Resource Speech Challenge 2021, which asks participants to learn a language model directly from audio, without any text or labels. The challenge is based on the Libri-light dataset, which provides up to 60k hours of audio from English audio books without any associated text. We provide a pipeline baseline system consisting on an encoder based on contrastive predictive coding (C… ▽ More

    Submitted 9 August, 2021; v1 submitted 29 April, 2021; originally announced April 2021.

    Comments: Submitted to Interspeech 2021. arXiv admin note: text overlap with arXiv:2011.11588

  3. arXiv:2012.00880  [pdf, other

    math.PR cs.IT math.QA math.RT

    Asymptotic Shape of Quantum Markov Semigroups for Compact Uniform Trees

    Authors: Margarita Belova, Matthew Bernard

    Abstract: We give locally finite Markov trees in $L^p$-compact$,$ separable Hilbert$,$ supersymmetric process$:$ $[0,\infty)\!\times\!\mathbb{R}^{\lvert\mathcal{A}^{\otimes m}\rvert}/\mathcal{A}^{\otimes m}$ on quantum ${\rm U}(\lvert\mathcal{A}^{\otimes m}\rvert)$ semigroups$.$ In full automorphism group ${\rm Aut}({\rm\bf T})$ of modular subgroup$,$ asymptotic-ergodicity is entropy-worthy $\mathbb{R}$ sha… ▽ More

    Submitted 1 December, 2020; originally announced December 2020.

  4. arXiv:2010.05967  [pdf, other

    cs.CL cs.AI

    The Zero Resource Speech Challenge 2020: Discovering discrete subword and word units

    Authors: Ewan Dunbar, Julien Karadayi, Mathieu Bernard, Xuan-Nga Cao, Robin Algayres, Lucas Ondel, Laurent Besacier, Sakriani Sakti, Emmanuel Dupoux

    Abstract: We present the Zero Resource Speech Challenge 2020, which aims at learning speech representations from raw audio signals without any labels. It combines the data sets and metrics from two previous benchmarks (2017 and 2019) and features two tasks which tap into two levels of speech representation. The first task is to discover low bit-rate subword representations that optimize the quality of speec… ▽ More

    Submitted 12 October, 2020; originally announced October 2020.

    Journal ref: Proceedings of Interspeech 2020

  5. arXiv:1904.11469  [pdf, other

    cs.CL cs.SD eess.AS

    The Zero Resource Speech Challenge 2019: TTS without T

    Authors: Ewan Dunbar, Robin Algayres, Julien Karadayi, Mathieu Bernard, Juan Benjumea, Xuan-Nga Cao, Lucie Miskic, Charlotte Dugrain, Lucas Ondel, Alan W. Black, Laurent Besacier, Sakriani Sakti, Emmanuel Dupoux

    Abstract: We present the Zero Resource Speech Challenge 2019, which proposes to build a speech synthesizer without any text or phonetic labels: hence, TTS without T (text-to-speech without text). We provide raw audio for a target voice in an unknown language (the Voice dataset), but no alignment, text or labels. Participants must discover subword units in an unsupervised way (using the Unit Discovery datase… ▽ More

    Submitted 7 July, 2019; v1 submitted 25 April, 2019; originally announced April 2019.

    Comments: Interspeech 2019

  6. arXiv:1803.07616  [pdf, other

    cs.AI cs.CV

    IntPhys: A Framework and Benchmark for Visual Intuitive Physics Reasoning

    Authors: Ronan Riochet, Mario Ynocente Castro, Mathieu Bernard, Adam Lerer, Rob Fergus, Véronique Izard, Emmanuel Dupoux

    Abstract: In order to reach human performance on complexvisual tasks, artificial systems need to incorporate a sig-nificant amount of understanding of the world in termsof macroscopic objects, movements, forces, etc. Inspiredby work on intuitive physics in infants, we propose anevaluation benchmark which diagnoses how much a givensystem understands about physics by testing whether itcan tell apart well matc… ▽ More

    Submitted 11 February, 2020; v1 submitted 20 March, 2018; originally announced March 2018.

  7. arXiv:1712.04313  [pdf, ps, other

    cs.CL

    The Zero Resource Speech Challenge 2017

    Authors: Ewan Dunbar, Xuan Nga Cao, Juan Benjumea, Julien Karadayi, Mathieu Bernard, Laurent Besacier, Xavier Anguera, Emmanuel Dupoux

    Abstract: We describe a new challenge aimed at discovering subword and word units from raw speech. This challenge is the followup to the Zero Resource Speech Challenge 2015. It aims at constructing systems that generalize across languages and adapt to new speakers. The design features and evaluation metrics of the challenge are presented and the results of seventeen models are discussed.

    Submitted 12 December, 2017; originally announced December 2017.

    Comments: IEEE ASRU (Automatic Speech Recognition and Understanding) 2017. Okinawa, Japan