Showing 1–2 of 2 results for author: Taseska, M

  1. MYRiAD: A Multi-Array Room Acoustic Database

    Authors: Thomas Dietzen, Randall Ali, Maja Taseska, Toon van Waterschoot

    Abstract: In the development of acoustic signal processing algorithms, their evaluation in various acoustic environments is of utmost importance. In order to advance evaluation in realistic and reproducible scenarios, several high-quality acoustic databases have been developed over the years. In this paper, we present another complementary database of acoustic recordings, referred to as the Multi-arraY Room…

    Submitted 12 May, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Journal ref: EURASIP J. Audio Speech Music Process., vol. 2023, no. 17, pp. 1-14, Apr. 2023

  2. arXiv:2106.03932

    cs.CV cs.LG cs.SD eess.AS

    How to Design a Three-Stage Architecture for Audio-Visual Active Speaker Detection in the Wild

    Authors: Okan Köpüklü, Maja Taseska, Gerhard Rigoll

    Abstract: Successful active speaker detection requires a three-stage pipeline: (i) audio-visual encoding for all speakers in the clip, (ii) inter-speaker relation modeling between a reference speaker and the background speakers within each frame, and (iii) temporal modeling for the reference speaker. Each stage of this pipeline plays an important role for the final performance of the created architecture. B…

    Submitted 7 September, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: Accepted to ICCV 2021