Skip to main content

Showing 1–3 of 3 results for author: Porjazovski, D

Searching in archive cs. Search in all archives.
.
  1. Advancing Audio Emotion and Intent Recognition with Large Pre-Trained Models and Bayesian Inference

    Authors: Dejan Porjazovski, Yaroslav Getman, Tamás Grósz, Mikko Kurimo

    Abstract: Large pre-trained models are essential in paralinguistic systems, demonstrating effectiveness in tasks like emotion recognition and stuttering detection. In this paper, we employ large pre-trained models for the ACM Multimedia Computational Paralinguistics Challenge, addressing the Requests and Emotion Share tasks. We explore audio-only and hybrid solutions leveraging audio and text modalities. Ou… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted at ACMM 2023

  2. arXiv:2307.11450  [pdf, other

    eess.AS cs.CL

    Topic Identification For Spontaneous Speech: Enriching Audio Features With Embedded Linguistic Information

    Authors: Dejan Porjazovski, Tamás Grósz, Mikko Kurimo

    Abstract: Traditional topic identification solutions from audio rely on an automatic speech recognition system (ASR) to produce transcripts used as input to a text-based model. These approaches work well in high-resource scenarios, where there are sufficient data to train both components of the pipeline. However, in low-resource situations, the ASR system, even if available, produces low-quality transcripts… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

    Comments: Accepted to EUSIPCO 2023

  3. arXiv:2203.12906  [pdf, other

    cs.CL eess.AS

    Lahjoita puhetta -- a large-scale corpus of spoken Finnish with some benchmarks

    Authors: Anssi Moisio, Dejan Porjazovski, Aku Rouhe, Yaroslav Getman, Anja Virkkunen, Tamás Grósz, Krister Lindén, Mikko Kurimo

    Abstract: The Donate Speech campaign has so far succeeded in gathering approximately 3600 hours of ordinary, colloquial Finnish speech into the Lahjoita puhetta (Donate Speech) corpus. The corpus includes over twenty thousand speakers from all the regions of Finland and from all age brackets. The primary goals of the collection were to create a representative, large-scale resource to study spontaneous spoke… ▽ More

    Submitted 24 March, 2022; originally announced March 2022.

    Comments: Submitted to Language Resources and Evaluation