Skip to main content

Showing 1–4 of 4 results for author: Linarès, G

Searching in archive eess. Search in all archives.
.
  1. arXiv:1906.08043  [pdf, other

    eess.AS cs.CL cs.SD

    Real to H-space Encoder for Speech Recognition

    Authors: Titouan Parcollet, Mohamed Morchid, Georges Linarès, Renato De Mori

    Abstract: Deep neural networks (DNNs) and more precisely recurrent neural networks (RNNs) are at the core of modern automatic speech recognition systems, due to their efficiency to process input sequences. Recently, it has been shown that different input representations, based on multidimensional algebras, such as complex and quaternion numbers, are able to bring to neural networks a more natural, compressi… ▽ More

    Submitted 17 June, 2019; originally announced June 2019.

    Comments: Accepted at INTERSPEECH 2019

  2. arXiv:1811.09678  [pdf, other

    eess.AS cs.SD stat.ML

    Speech recognition with quaternion neural networks

    Authors: Titouan Parcollet, Mirco Ravanelli, Mohamed Morchid, Georges Linarès, Renato De Mori

    Abstract: Neural network architectures are at the core of powerful automatic speech recognition systems (ASR). However, while recent researches focus on novel model architectures, the acoustic input features remain almost unchanged. Traditional ASR systems rely on multidimensional acoustic features such as the Mel filter bank energies alongside with the first, and second order derivatives to characterize ti… ▽ More

    Submitted 21 November, 2018; originally announced November 2018.

    Comments: NIPS 2018 (IRASL). arXiv admin note: text overlap with arXiv:1806.04418

  3. arXiv:1811.02566  [pdf, other

    eess.AS cs.LG cs.SD eess.SP stat.ML

    Bidirectional Quaternion Long-Short Term Memory Recurrent Neural Networks for Speech Recognition

    Authors: Titouan Parcollet, Mohamed Morchid, Georges Linarès, Renato De Mori

    Abstract: Recurrent neural networks (RNN) are at the core of modern automatic speech recognition (ASR) systems. In particular, long-short term memory (LSTM) recurrent neural networks have achieved state-of-the-art results in many speech recognition tasks, due to their efficient representation of long and short term dependencies in sequences of inter-dependent features. Nonetheless, internal dependencies wit… ▽ More

    Submitted 6 November, 2018; originally announced November 2018.

    Comments: Submitted at ICASSP 2019. arXiv admin note: text overlap with arXiv:1806.04418

  4. arXiv:1806.07789  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Quaternion Convolutional Neural Networks for End-to-End Automatic Speech Recognition

    Authors: Titouan Parcollet, Ying Zhang, Mohamed Morchid, Chiheb Trabelsi, Georges Linarès, Renato De Mori, Yoshua Bengio

    Abstract: Recently, the connectionist temporal classification (CTC) model coupled with recurrent (RNN) or convolutional neural networks (CNN), made it easier to train speech recognition systems in an end-to-end fashion. However in real-valued models, time frame components such as mel-filter-bank energies and the cepstral coefficients obtained from them, together with their first and second order derivatives… ▽ More

    Submitted 20 June, 2018; originally announced June 2018.

    Comments: Accepted at INTERSPEECH 2018