Independent Low-Rank Matrix Analysis Based on Time-Variant Sub-Gaussian Source Model
Authors:
Shinichi Mogami,
Norihiro Takamune,
Daichi Kitamura,
Hiroshi Saruwatari,
Yu Takahashi,
Kazunobu Kondo,
Hiroaki Nakajima,
Nobutaka Ono
Abstract:
Independent low-rank matrix analysis (ILRMA) is a fast and stable method for blind audio source separation. Conventional ILRMAs assume time-variant (super-)Gaussian source models, which can only represent signals that follow a super-Gaussian distribution. In this paper, we focus on ILRMA based on a generalized Gaussian distribution (GGD-ILRMA) and propose a new type of GGD-ILRMA that adopts a time…
▽ More
Independent low-rank matrix analysis (ILRMA) is a fast and stable method for blind audio source separation. Conventional ILRMAs assume time-variant (super-)Gaussian source models, which can only represent signals that follow a super-Gaussian distribution. In this paper, we focus on ILRMA based on a generalized Gaussian distribution (GGD-ILRMA) and propose a new type of GGD-ILRMA that adopts a time-variant sub-Gaussian distribution for the source model. By using a new update scheme called generalized iterative projection for homogeneous source models, we obtain a convergence-guaranteed update rule for demixing spatial parameters. In the experimental evaluation, we show the versatility of the proposed method, i.e., the proposed time-variant sub-Gaussian source model can be applied to various types of source signal.
△ Less
Submitted 24 August, 2018;
originally announced August 2018.
Independent Deeply Learned Matrix Analysis for Multichannel Audio Source Separation
Authors:
Shinichi Mogami,
Hayato Sumino,
Daichi Kitamura,
Norihiro Takamune,
Shinnosuke Takamichi,
Hiroshi Saruwatari,
Nobutaka Ono
Abstract:
In this paper, we address a multichannel audio source separation task and propose a new efficient method called independent deeply learned matrix analysis (IDLMA). IDLMA estimates the demixing matrix in a blind manner and updates the time-frequency structures of each source using a pretrained deep neural network (DNN). Also, we introduce a complex Student's t-distribution as a generalized source g…
▽ More
In this paper, we address a multichannel audio source separation task and propose a new efficient method called independent deeply learned matrix analysis (IDLMA). IDLMA estimates the demixing matrix in a blind manner and updates the time-frequency structures of each source using a pretrained deep neural network (DNN). Also, we introduce a complex Student's t-distribution as a generalized source generative model including both complex Gaussian and Cauchy distributions. Experiments are conducted using music signals with a training dataset, and the results show the validity of the proposed method in terms of separation accuracy and computational cost.
△ Less
Submitted 27 June, 2018;
originally announced June 2018.