Skip to main content

Showing 1–4 of 4 results for author: Senoussaoui, M

.
  1. arXiv:1907.04928  [pdf, other

    eess.AS cs.LG cs.SD stat.ML

    Bag-of-Audio-Words based on Autoencoder Codebook for Continuous Emotion Prediction

    Authors: Mohammed Senoussaoui, Patrick Cardinal, Alessandro Lameiras Koerich

    Abstract: In this paper we present a novel approach for extracting a Bag-of-Words (BoW) representation based on a Neural Network codebook. The conventional BoW model is based on a dictionary (codebook) built from elementary representations which are selected randomly or by using a clustering algorithm on a training dataset. A metric is then used to assign unseen elementary representations to the closest dic… ▽ More

    Submitted 6 July, 2019; originally announced July 2019.

  2. arXiv:1907.03196  [pdf, other

    cs.CV eess.AS eess.IV

    Multimodal Fusion with Deep Neural Networks for Audio-Video Emotion Recognition

    Authors: Juan D. S. Ortega, Mohammed Senoussaoui, Eric Granger, Marco Pedersoli, Patrick Cardinal, Alessandro L. Koerich

    Abstract: This paper presents a novel deep neural network (DNN) for multimodal fusion of audio, video and text modalities for emotion recognition. The proposed DNN architecture has independent and shared layers which aim to learn the representation for each modality, as well as the best combined representation to achieve the best prediction. Experimental results on the AVEC Sentiment Analysis in the Wild da… ▽ More

    Submitted 6 July, 2019; originally announced July 2019.

  3. arXiv:1904.11641  [pdf, other

    cs.SD cs.CL eess.AS

    Speaker Sincerity Detection based on Covariance Feature Vectors and Ensemble Methods

    Authors: Mohammed Senoussaoui, Patrick Cardinal, Najim Dehak, Alessandro Lameiras Koerich

    Abstract: Automatic measuring of speaker sincerity degree is a novel research problem in computational paralinguistics. This paper proposes covariance-based feature vectors to model speech and ensembles of support vector regressors to estimate the degree of sincerity of a speaker. The elements of each covariance vector are pairwise statistics between the short-term feature components. These features are use… ▽ More

    Submitted 25 April, 2019; originally announced April 2019.

  4. arXiv:1510.04707  [pdf, other

    cs.SD

    SRMR variants for improved blind room acoustics characterization

    Authors: M. Senoussaoui, J. F. Santos, T. H. Falk

    Abstract: Reverberation, especially in large rooms, severely degrades speech recognition performance and speech intelligibility. Since direct measurement of room characteristics is usually not possible, blind estimation of reverberation-related metrics such as the reverberation time (RT) and the direct-to-reverberant energy ratio (DRR) can be valuable information to speech recognition and enhancement algori… ▽ More

    Submitted 15 October, 2015; originally announced October 2015.

    Comments: In Proceedings of the ACE Chal- lenge Workshop - a satellite event of IEEE-WASPAA 2015 (arXiv:1510.00383)

    Report number: ACEChallenge/2015/07