Showing 1–2 of 2 results for author: Karamatlı, E

Search v0.5.6 released 2020-02-24

arXiv:2202.03875 [pdf, other]

eess.AS cs.LG cs.SD eess.SP

doi 10.1109/LSP.2022.3232276

MixCycle: Unsupervised Speech Separation via Cyclic Mixture Permutation Invariant Training

Authors: Ertuğ Karamatlı, Serap Kırbız

Abstract: We introduce two unsupervised source separation methods, which involve self-supervised training from single-channel two-source speech mixtures. Our first method, mixture permutation invariant training (MixPIT), enables learning a neural network model which separates the underlying sources via a challenging proxy task without supervision from the reference sources. Our second method, cyclic mixture… ▽ More We introduce two unsupervised source separation methods, which involve self-supervised training from single-channel two-source speech mixtures. Our first method, mixture permutation invariant training (MixPIT), enables learning a neural network model which separates the underlying sources via a challenging proxy task without supervision from the reference sources. Our second method, cyclic mixture permutation invariant training (MixCycle), uses MixPIT as a building block in a cyclic fashion for continuous learning. MixCycle gradually converts the problem from separating mixtures of mixtures into separating single mixtures. We compare our methods to common supervised and unsupervised baselines: permutation invariant training with dynamic mixing (PIT-DM) and mixture invariant training (MixIT). We show that MixCycle outperforms MixIT and reaches a performance level very close to the supervised baseline (PIT-DM) while circumventing the over-separation issue of MixIT. Also, we propose a self-evaluation technique inspired by MixCycle that estimates model performance without utilizing any reference sources. We show that it yields results consistent with an evaluation on reference sources (LibriMix) and also with an informal listening test conducted on a real-life mixtures dataset (REAL-M). △ Less

Submitted 6 January, 2023; v1 submitted 8 February, 2022; originally announced February 2022.

Comments: 5 pages. This article has been accepted for publication in IEEE Signal Processing Letters

Journal ref: IEEE Signal Processing Letters 29 (2022) 2637-2641
arXiv:1810.13104 [pdf, other]

cs.SD cs.LG eess.AS

doi 10.1109/LSP.2019.2929440

Audio Source Separation Using Variational Autoencoders and Weak Class Supervision

Authors: Ertuğ Karamatlı, Ali Taylan Cemgil, Serap Kırbız

Abstract: In this paper, we propose a source separation method that is trained by observing the mixtures and the class labels of the sources present in the mixture without any access to isolated sources. Since our method does not require source class labels for every time-frequency bin but only a single label for each source constituting the mixture signal, we call this scenario as weak class supervision. W… ▽ More In this paper, we propose a source separation method that is trained by observing the mixtures and the class labels of the sources present in the mixture without any access to isolated sources. Since our method does not require source class labels for every time-frequency bin but only a single label for each source constituting the mixture signal, we call this scenario as weak class supervision. We associate a variational autoencoder (VAE) with each source class within a non-negative (compositional) model. Each VAE provides a prior model to identify the signal from its associated class in a sound mixture. After training the model on mixtures, we obtain a generative model for each source class and demonstrate our method on one-second mixtures of utterances of digits from 0 to 9. We show that the separation performance obtained by source class supervision is as good as the performance obtained by source signal supervision. △ Less

Submitted 4 August, 2019; v1 submitted 31 October, 2018; originally announced October 2018.

Comments: Accepted version

Journal ref: IEEE Signal Processing Letters 26 (2019) 1349-1353

Search v0.5.6 released 2020-02-24