Skip to main content

Showing 1–8 of 8 results for author: Čmejla, J

Searching in archive eess. Search in all archives.
.
  1. arXiv:2407.09259  [pdf, other

    eess.SP cs.SD eess.AS

    Informed FastICA: Semi-Blind Minimum Variance Distortionless Beamformer

    Authors: Zbyněk Koldovský, Jiří Málek, Jaroslav Čmejla, Stephen O'Regan

    Abstract: Non-Gaussianity-based Independent Vector Extraction leads to the famous one-unit FastICA/FastIVA algorithm when the likelihood function is optimized using an approximate Newton-Raphson algorithm under the orthogonality constraint. In this paper, we replace the constraint with the analytic form of the minimum variance distortionless beamformer (MVDR), by which a semi-blind variant of FastICA/FastIV… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: accepted for IWAENC 2024

  2. arXiv:2304.01778  [pdf, other

    eess.AS cs.SD eess.SP

    Independent Vector Extraction Constrained on Manifold of Half-Length Filters

    Authors: Zbyněk Koldovský, Jaroslav Čmejla, Tülay Adalı, Stephen O'Regan

    Abstract: Independent Vector Analysis (IVA) is a popular extension of Independent Component Analysis (ICA) for joint separation of a set of instantaneous linear mixtures, with a direct application in frequency-domain speaker separation or extraction. The mixtures are parameterized by mixing matrices, one matrix per mixture. This means that the IVA mixing model does not account for any relationships between… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

  3. arXiv:2212.01178  [pdf, other

    eess.SP cs.IT

    Dynamic Independent Component Extraction with Blending Mixing Vector: Lower Bound on Mean Interference-to-Signal Ratio

    Authors: Jaroslav Čmejla, Zbyněk Koldovský, Václav Kautský, Tülay Adalı

    Abstract: This paper deals with dynamic Blind Source Extraction (BSE) from where the mixing parameters characterizing the position of a source of interest (SOI) are allowed to vary over time. We present a new source extraction model called CvxCSV which is a parameter-reduced modification of the recent Constant Separation Vector (CSV) mixing model. In CvxCSV, the mixing vector evolves as a convex combination… ▽ More

    Submitted 2 December, 2022; originally announced December 2022.

    Comments: submitted to a conference

  4. Target Speech Extraction: Independent Vector Extraction Guided by Supervised Speaker Identification

    Authors: Jiri Malek, Jakub Jansky, Zbynek Koldovsky, Tomas Kounovsky, Jaroslav Cmejla, **drich Zdansky

    Abstract: This manuscript proposes a novel robust procedure for the extraction of a speaker of interest (SOI) from a mixture of audio sources. The estimation of the SOI is performed via independent vector extraction (IVE). Since the blind IVE cannot distinguish the target source by itself, it is guided towards the SOI via frame-wise speaker identification based on deep learning. Still, an incorrect speaker… ▽ More

    Submitted 8 July, 2022; v1 submitted 5 November, 2021; originally announced November 2021.

    Comments: Modified version of the article accepted for publication in IEEE/ACM Transactions on Audio Speech and Language Processing journal. Original results unchanged, additional experiments presented, refined discussion and conclusions

    Journal ref: IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 30, pp. 2295-2309, 2022

  5. arXiv:2002.12619  [pdf, other

    eess.AS

    Auxiliary Function-Based Algorithm for Blind Extraction of a Moving Speaker

    Authors: Jakub Janský, Zbyněk Koldovský, Jiří Málek, Tomáš Kounovský, Jaroslav Čmejla

    Abstract: Recently, Constant Separating Vector (CSV) mixing model has been proposed for the Blind Source Extraction (BSE) of moving sources. In this paper, we experimentally verify the applicability of CSV in the blind extraction of a moving speaker and propose a new BSE method derived by modifying the auxiliary function-based algorithm for Independent Vector Analysis. Also, a piloted variant is proposed fo… ▽ More

    Submitted 5 February, 2021; v1 submitted 28 February, 2020; originally announced February 2020.

  6. arXiv:1910.11824  [pdf, other

    eess.AS cs.SD eess.SP

    Adaptive blind audio source extraction supervised by dominant speaker identification using x-vectors

    Authors: Jakub Janský, Jiří Málek, Jaroslav Čmejla, Tomáš Kounovský, Zbyněk Koldovský, **dřich Žďánský

    Abstract: We propose a novel algorithm for adaptive blind audio source extraction. The proposed method is based on independent vector analysis and utilizes the auxiliary function optimization to achieve high convergence speed. The algorithm is partially supervised by a pilot signal related to the source of interest (SOI), which ensures that the method correctly extracts the utterance of the desired speaker.… ▽ More

    Submitted 25 October, 2019; originally announced October 2019.

  7. arXiv:1910.10242  [pdf, other

    eess.SP

    Algorithm for Independent Vector Extraction Based on Semi-Time-Variant Mixing Model

    Authors: Zbyněk Koldovský, Václav Kautský, Tomáš Kounovský, Jaroslav Čmejla

    Abstract: A new algorithm for dynamic independent vector extraction is proposed. It is based on the mixing model where mixing parameters related to the source-of-interest (SOI) are time-variant while the separating parameters are time-invariant. A contrast function based on the quasi-likelihood approach is optimized using the Newton-Raphson approach. The update is computed without imposing the orthogonal co… ▽ More

    Submitted 1 March, 2021; v1 submitted 22 October, 2019; originally announced October 2019.

  8. arXiv:1907.12421  [pdf, other

    eess.AS cs.SD

    MIRaGe: Multichannel Database Of Room Impulse Responses Measured On High-Resolution Cube-Shaped Grid In Multiple Acoustic Conditions

    Authors: Jaroslav Čmejla, Tomáš Kounovský, Sharon Gannot, Zbyněk Koldovský, Pinchas Tandeitnik

    Abstract: We introduce a database of multi-channel recordings performed in an acoustic lab with adjustable reverberation time. The recordings provide information about room impulse responses (RIR) for various positions of a loudspeaker. In particular, the main positions correspond to 4104 vertices of a cube-shaped dense grid within a 46x36x32 cm volume. The database thus provides a tool for detailed analyse… ▽ More

    Submitted 29 July, 2019; originally announced July 2019.