Skip to main content

Showing 1–8 of 8 results for author: Pärnamaa, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.09928  [pdf, other

    cs.SD cs.AI cs.LG eess.AS

    Personalized Speech Enhancement Without a Separate Speaker Embedding Model

    Authors: Tanel Pärnamaa, Ando Saabas

    Abstract: Personalized speech enhancement (PSE) models can improve the audio quality of teleconferencing systems by adapting to the characteristics of a speaker's voice. However, most existing methods require a separate speaker embedding model to extract a vector representation of the speaker from enrollment audio, which adds complexity to the training and deployment process. We propose to use the internal… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Accepted to Interspeech 2024

  2. arXiv:2309.12553  [pdf, other

    eess.AS cs.SD

    ICASSP 2023 Acoustic Echo Cancellation Challenge

    Authors: Ross Cutler, Ando Saabas, Tanel Parnamaa, Marju Purin, Evgenii Indenbom, Nicolae-Catalin Ristea, Jegor Gužvin, Hannes Gamper, Sebastian Braun, Robert Aichner

    Abstract: The ICASSP 2023 Acoustic Echo Cancellation Challenge is intended to stimulate research in acoustic echo cancellation (AEC), which is an important area of speech enhancement and is still a top issue in audio communication. This is the fourth AEC challenge and it is enhanced by adding a second track for personalized acoustic echo cancellation, reducing the algorithmic + buffering latency to 20ms, as… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2202.13290, arXiv:2009.04972

  3. arXiv:2306.03177  [pdf, other

    cs.SD cs.CV eess.AS

    DeepVQE: Real Time Deep Voice Quality Enhancement for Joint Acoustic Echo Cancellation, Noise Suppression and Dereverberation

    Authors: Evgenii Indenbom, Nicolae-Catalin Ristea, Ando Saabas, Tanel Parnamaa, Jegor Guzvin, Ross Cutler

    Abstract: Acoustic echo cancellation (AEC), noise suppression (NS) and dereverberation (DR) are an integral part of modern full-duplex communication systems. As the demand for teleconferencing systems increases, addressing these tasks is required for an effective and efficient online meeting experience. Most prior research proposes solutions for these tasks separately, combining them with digital signal pro… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

  4. arXiv:2211.02773  [pdf, other

    eess.AS cs.SD

    Real-Time Joint Personalized Speech Enhancement and Acoustic Echo Cancellation

    Authors: Sefik Emre Eskimez, Takuya Yoshioka, Alex Ju, Min Tang, Tanel Parnamaa, Huaming Wang

    Abstract: Personalized speech enhancement (PSE) is a real-time SE approach utilizing a speaker embedding of a target person to remove background noise, reverberation, and interfering voices. To deploy a PSE model for full duplex communications, the model must be combined with acoustic echo cancellation (AEC), although such a combination has been less explored. This paper proposes a series of methods that ar… ▽ More

    Submitted 25 May, 2023; v1 submitted 4 November, 2022; originally announced November 2022.

    Comments: Accepted to Interspeech 2023

  5. arXiv:2208.11308  [pdf, other

    cs.SD cs.CV eess.AS

    Deep model with built-in cross-attention alignment for acoustic echo cancellation

    Authors: Evgenii Indenbom, Nicolae-Cătălin Ristea, Ando Saabas, Tanel Pärnamaa, Jegor Gužvin

    Abstract: With recent research advances, deep learning models have become an attractive choice for acoustic echo cancellation (AEC) in real-time teleconferencing applications. Since acoustic echo is one of the major sources of poor audio quality, a wide variety of deep models have been proposed. However, an important but often omitted requirement for good echo cancellation quality is the synchronization of… ▽ More

    Submitted 14 March, 2023; v1 submitted 24 August, 2022; originally announced August 2022.

  6. arXiv:2202.13290  [pdf, other

    eess.AS cs.SD

    ICASSP 2022 Acoustic Echo Cancellation Challenge

    Authors: Ross Cutler, Ando Saabas, Tanel Parnamaa, Marju Purin, Hannes Gamper, Sebastian Braun, Karsten Sørensen, Robert Aichner

    Abstract: The ICASSP 2022 Acoustic Echo Cancellation Challenge is intended to stimulate research in acoustic echo cancellation (AEC), which is an important area of speech enhancement and still a top issue in audio communication. This is the third AEC challenge and it is enhanced by including mobile scenarios, adding speech recognition rate in the challenge goal metrics, and making the default sample rate 48… ▽ More

    Submitted 26 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2009.04972

  7. arXiv:2009.04972  [pdf, other

    eess.AS cs.SD

    ICASSP 2021 Acoustic Echo Cancellation Challenge: Datasets, Testing Framework, and Results

    Authors: Kusha Sridhar, Ross Cutler, Ando Saabas, Tanel Parnamaa, Markus Loide, Hannes Gamper, Sebastian Braun, Robert Aichner, Sriram Srinivasan

    Abstract: The ICASSP 2021 Acoustic Echo Cancellation Challenge is intended to stimulate research in the area of acoustic echo cancellation (AEC), which is an important part of speech enhancement and still a top issue in audio communication and conferencing systems. Many recent AEC studies report good performance on synthetic datasets where the train and test samples come from the same underlying distributio… ▽ More

    Submitted 30 October, 2020; v1 submitted 10 September, 2020; originally announced September 2020.

  8. arXiv:1608.00318  [pdf, other

    cs.CL cs.LG

    A Neural Knowledge Language Model

    Authors: Sung** Ahn, Heeyoul Choi, Tanel Pärnamaa, Yoshua Bengio

    Abstract: Current language models have a significant limitation in the ability to encode and decode factual knowledge. This is mainly because they acquire such knowledge from statistical co-occurrences although most of the knowledge words are rarely observed. In this paper, we propose a Neural Knowledge Language Model (NKLM) which combines symbolic knowledge provided by the knowledge graph with the RNN lang… ▽ More

    Submitted 2 March, 2017; v1 submitted 1 August, 2016; originally announced August 2016.