Search | arXiv e-print repository

Regularized EM algorithm

Authors: Pierre Houdouin, Esa Ollila, Frederic Pascal

Abstract: Expectation-Maximization (EM) algorithm is a widely used iterative algorithm for computing (local) maximum likelihood estimate (MLE). It can be used in an extensive range of problems, including the clustering of data based on the Gaussian mixture model (GMM). Numerical instability and convergence problems may arise in situations where the sample size is not much larger than the data dimensionality… ▽ More Expectation-Maximization (EM) algorithm is a widely used iterative algorithm for computing (local) maximum likelihood estimate (MLE). It can be used in an extensive range of problems, including the clustering of data based on the Gaussian mixture model (GMM). Numerical instability and convergence problems may arise in situations where the sample size is not much larger than the data dimensionality. In such low sample support (LSS) settings, the covariance matrix update in the EM-GMM algorithm may become singular or poorly conditioned, causing the algorithm to crash. On the other hand, in many signal processing problems, a priori information can be available indicating certain structures for different cluster covariance matrices. In this paper, we present a regularized EM algorithm for GMM-s that can make efficient use of such prior knowledge as well as cope with LSS situations. The method aims to maximize a penalized GMM likelihood where regularized estimation may be used to ensure positive definiteness of covariance matrix updates and shrink the estimators towards some structured target covariance matrices. We show that the theoretical guarantees of convergence hold, leading to better performing EM algorithm for structured covariance matrix models or with low sample settings. △ Less

Submitted 27 March, 2023; originally announced March 2023.

Comments: ICASSP Conference, 4 pages, 8 figures

arXiv:2203.07831 [pdf, other]

Graph Convolutional Neural Networks Sensitivity under Probabilistic Error Model

Authors: Xinjue Wang, Esa Ollila, Sergiy A. Vorobyov

Abstract: Graph Neural Networks (GNNs), particularly Graph Convolutional Neural Networks (GCNNs), have emerged as pivotal instruments in machine learning and signal processing for processing graph-structured data. This paper proposes an analysis framework to investigate the sensitivity of GCNNs to probabilistic graph perturbations, directly impacting the graph shift operator (GSO). Our study establishes tig… ▽ More Graph Neural Networks (GNNs), particularly Graph Convolutional Neural Networks (GCNNs), have emerged as pivotal instruments in machine learning and signal processing for processing graph-structured data. This paper proposes an analysis framework to investigate the sensitivity of GCNNs to probabilistic graph perturbations, directly impacting the graph shift operator (GSO). Our study establishes tight expected GSO error bounds, which are explicitly linked to the error model parameters, and reveals a linear relationship between GSO perturbations and the resulting output differences at each layer of GCNNs. This linearity demonstrates that a single-layer GCNN maintains stability under graph edge perturbations, provided that the GSO errors remain bounded, regardless of the perturbation scale. For multilayer GCNNs, the dependency of system's output difference on GSO perturbations is shown to be a recursion of linearity. Finally, we exemplify the framework with the Graph Isomorphism Network (GIN) and Simple Graph Convolution Network (SGCN). Experiments validate our theoretical derivations and the effectiveness of our approach. △ Less

Submitted 6 May, 2024; v1 submitted 15 March, 2022; originally announced March 2022.

arXiv:2102.08641 [pdf, other]

Coupled Feature Learning for Multimodal Medical Image Fusion

Authors: Farshad G. Veshki, Nora Ouzir, Sergiy A. Vorobyov, Esa Ollila

Abstract: Multimodal image fusion aims to combine relevant information from images acquired with different sensors. In medical imaging, fused images play an essential role in both standard and automated diagnosis. In this paper, we propose a novel multimodal image fusion method based on coupled dictionary learning. The proposed method is general and can be employed for different medical imaging modalities.… ▽ More Multimodal image fusion aims to combine relevant information from images acquired with different sensors. In medical imaging, fused images play an essential role in both standard and automated diagnosis. In this paper, we propose a novel multimodal image fusion method based on coupled dictionary learning. The proposed method is general and can be employed for different medical imaging modalities. Unlike many current medical fusion methods, the proposed approach does not suffer from intensity attenuation nor loss of critical information. Specifically, the images to be fused are decomposed into coupled and independent components estimated using sparse representations with identical supports and a Pearson correlation constraint, respectively. An alternating minimization algorithm is designed to solve the resulting optimization problem. The final fusion step uses the max-absolute-value rule. Experiments are conducted using various pairs of multimodal inputs, including real MR-CT and MR-PET images. The resulting performance and execution times show the competitiveness of the proposed method in comparison with state-of-the-art medical image fusion methods. △ Less

Submitted 17 February, 2021; originally announced February 2021.

arXiv:2011.04315 [pdf, other]

doi 10.1109/TSP.2021.3118546

Coupled regularized sample covariance matrix estimator for multiple classes

Authors: Elias Raninen, Esa Ollila

Abstract: The estimation of covariance matrices of multiple classes with limited training data is a difficult problem. The sample covariance matrix (SCM) is known to perform poorly when the number of variables is large compared to the available number of samples. In order to reduce the mean squared error (MSE) of the SCM, regularized (shrinkage) SCM estimators are often used. In this work, we consider regul… ▽ More The estimation of covariance matrices of multiple classes with limited training data is a difficult problem. The sample covariance matrix (SCM) is known to perform poorly when the number of variables is large compared to the available number of samples. In order to reduce the mean squared error (MSE) of the SCM, regularized (shrinkage) SCM estimators are often used. In this work, we consider regularized SCM (RSCM) estimators for multiclass problems that couple together two different target matrices for regularization: the pooled (average) SCM of the classes and the scaled identity matrix. Regularization toward the pooled SCM is beneficial when the population covariances are similar, whereas regularization toward the identity matrix guarantees that the estimators are positive definite. We derive the MSE optimal tuning parameters for the estimators as well as propose a method for their estimation under the assumption that the class populations follow (unspecified) elliptical distributions with finite fourth-order moments. The MSE performance of the proposed coupled RSCMs are evaluated with simulations and in a regularized discriminant analysis (RDA) classification set-up on real data. The results based on three different real data sets indicate comparable performance to cross-validation but with a significant speed-up in computation time. △ Less

Submitted 9 November, 2021; v1 submitted 9 November, 2020; originally announced November 2020.

Journal ref: IEEE Transactions on Signal Processing, vol. 69, pp. 5681-5692, 2021

arXiv:2008.10982 [pdf, other]

Block-wise Minimization-Majorization algorithm for Huber's criterion: sparse learning and applications

Authors: Esa Ollila, Ammar Mian

Abstract: Huber's criterion can be used for robust joint estimation of regression and scale parameters in the linear model. Huber's (Huber, 1981) motivation for introducing the criterion stemmed from non-convexity of the joint maximum likelihood objective function as well as non-robustness (unbounded influence function) of the associated ML-estimate of scale. In this paper, we illustrate how the original al… ▽ More Huber's criterion can be used for robust joint estimation of regression and scale parameters in the linear model. Huber's (Huber, 1981) motivation for introducing the criterion stemmed from non-convexity of the joint maximum likelihood objective function as well as non-robustness (unbounded influence function) of the associated ML-estimate of scale. In this paper, we illustrate how the original algorithm proposed by Huber can be set within the block-wise minimization majorization framework. In addition, we propose novel data-adaptive step sizes for both the location and scale, which are further improving the convergence. We then illustrate how Huber's criterion can be used for sparse learning of underdetermined linear model using the iterative hard thresholding approach. We illustrate the usefulness of the algorithms in an image denoising application and simulation studies. △ Less

Submitted 25 August, 2020; originally announced August 2020.

Comments: To appear in International Workshop on Machine Learning for Signal Processing (MLSP), 2020

arXiv:2008.10461 [pdf, other]

doi 10.1109/TSP.2021.3073226

Graph Signal Processing Meets Blind Source Separation

Authors: Jari Miettinen, Eyal Nitzan, Sergiy A. Vorobyov, Esa Ollila

Abstract: In graph signal processing (GSP), prior information on the dependencies in the signal is collected in a graph which is then used when processing or analyzing the signal. Blind source separation (BSS) techniques have been developed and analyzed in different domains, but for graph signals the research on BSS is still in its infancy. In this paper, this gap is filled with two contributions. First, a… ▽ More In graph signal processing (GSP), prior information on the dependencies in the signal is collected in a graph which is then used when processing or analyzing the signal. Blind source separation (BSS) techniques have been developed and analyzed in different domains, but for graph signals the research on BSS is still in its infancy. In this paper, this gap is filled with two contributions. First, a nonparametric BSS method, which is relevant to the GSP framework, is refined, the Cramér-Rao bound (CRB) for mixing and unmixing matrix estimators in the case of Gaussian moving average graph signals is derived, and for studying the achievability of the CRB, a new parametric method for BSS of Gaussian moving average graph signals is introduced. Second, we also consider BSS of non-Gaussian graph signals and two methods are proposed. Identifiability conditions show that utilizing both graph structure and non-Gaussianity provides a more robust approach than methods which are based on only either graph dependencies or non-Gaussianity. It is also demonstrated by numerical study that the proposed methods are more efficient in separating non-Gaussian graph signals. △ Less

Submitted 24 August, 2020; originally announced August 2020.

Comments: 31 pages, 3 figures, 1 table, submitted to IEEE Trans. Signal Processing on Aug. 2020

Journal ref: IEEE Trans. Signal Processing, vol. 69, pp. 2585-2599, May 2021

arXiv:2005.04383 [pdf, other]

A Compressive Classification Framework for High-Dimensional Data

Authors: Muhammad Naveed Tabassum, Esa Ollila

Abstract: We propose a compressive classification framework for settings where the data dimensionality is significantly higher than the sample size. The proposed method, referred to as compressive regularized discriminant analysis (CRDA) is based on linear discriminant analysis and has the ability to select significant features by using joint-sparsity promoting hard thresholding in the discriminant rule. Si… ▽ More We propose a compressive classification framework for settings where the data dimensionality is significantly higher than the sample size. The proposed method, referred to as compressive regularized discriminant analysis (CRDA) is based on linear discriminant analysis and has the ability to select significant features by using joint-sparsity promoting hard thresholding in the discriminant rule. Since the number of features is larger than the sample size, the method also uses state-of-the-art regularized sample covariance matrix estimators. Several analysis examples on real data sets, including image, speech signal and gene expression data illustrate the promising improvements offered by the proposed CRDA classifier in practise. Overall, the proposed method gives fewer misclassification errors than its competitors, while at the same time achieving accurate feature selection results. The open-source R package and MATLAB toolbox of the proposed method (named compressiveRDA) is freely available. △ Less

Submitted 12 November, 2020; v1 submitted 9 May, 2020; originally announced May 2020.

arXiv:1903.08398 [pdf, other]

Modelling Graph Errors: Towards Robust Graph Signal Processing

Authors: Jari Miettinen, Sergiy A. Vorobyov, Esa Ollila

Abstract: The first step for any graph signal processing (GSP) procedure is to learn the graph signal representation, i.e., to capture the dependence structure of the data into an adjacency matrix. Indeed, the adjacency matrix is typically not known a priori and has to be learned. However, it is learned with errors. A little attention has been paid to modelling such errors in the adjacency matrix, and study… ▽ More The first step for any graph signal processing (GSP) procedure is to learn the graph signal representation, i.e., to capture the dependence structure of the data into an adjacency matrix. Indeed, the adjacency matrix is typically not known a priori and has to be learned. However, it is learned with errors. A little attention has been paid to modelling such errors in the adjacency matrix, and studying their effects on GSP methods. However, modelling errors in the adjacency matrix will enable both to study the graph error effects in GSP and to develop robust GSP algorithms. In this paper, we therefore introduce practically justifiable graph error models. We also study, both analytically when possible and numerically, the graph error effect on the performance of GSP methods in different types of problems such as filtering of graph signals and independent component analysis of graph signals (graph decorrelation). △ Less

Submitted 25 September, 2020; v1 submitted 20 March, 2019; originally announced March 2019.

Comments: 34 pages, 6 figures, 2 tables, Submitted to IEEE journal on January 2019

Journal ref: Signal Processing, vol. 189, 108256, pp. 1-8, Dec. 2021

arXiv:1805.07575 [pdf, ps, other]

Sequential adaptive elastic net approach for single-snapshot source localization

Authors: Muhammad Naveed Tabassum, Esa Ollila

Abstract: This paper proposes efficient algorithms for accurate recovery of direction-of-arrival (DoA) of sources from single-snapshot measurements using compressed beamforming (CBF). In CBF, the conventional sensor array signal model is cast as an underdetermined complex-valued linear regression model and sparse signal recovery methods are used for solving the DoA finding problem. We develop a complex-valu… ▽ More This paper proposes efficient algorithms for accurate recovery of direction-of-arrival (DoA) of sources from single-snapshot measurements using compressed beamforming (CBF). In CBF, the conventional sensor array signal model is cast as an underdetermined complex-valued linear regression model and sparse signal recovery methods are used for solving the DoA finding problem. We develop a complex-valued pathwise weighted elastic net (c-PW-WEN) algorithm that finds solutions at knots of penalty parameter values over a path (or grid) of EN tuning parameter values. c-PW-WEN also computes Lasso or weighted Lasso in its path. We then propose a sequential adaptive EN (SAEN) method that is based on c-PW-WEN algorithm with adaptive weights that depend on the previous solution. Extensive simulation studies illustrate that SAEN improves the probability of exact recovery of true support compared to conventional sparse signal recovery approaches such as Lasso, elastic net or orthogonal matching pursuit in several challenging multiple target scenarios. The effectiveness of SAEN is more pronounced in the presence of high mutual coherence. △ Less

Submitted 19 May, 2018; originally announced May 2018.

Comments: 12 pages, 5 figures, in the publication to the Journal of the Acoustical Society of America

arXiv:1504.04184 [pdf, ps, other]

Multichannel sparse recovery of complex-valued signals using Huber's criterion

Authors: Esa Ollila

Abstract: In this paper, we generalize Huber's criterion to multichannel sparse recovery problem of complex-valued measurements where the objective is to find good recovery of jointly sparse unknown signal vectors from the given multiple measurement vectors which are different linear combinations of the same known elementary vectors. This requires careful characterization of robust complex-valued loss funct… ▽ More In this paper, we generalize Huber's criterion to multichannel sparse recovery problem of complex-valued measurements where the objective is to find good recovery of jointly sparse unknown signal vectors from the given multiple measurement vectors which are different linear combinations of the same known elementary vectors. This requires careful characterization of robust complex-valued loss functions as well as Huber's criterion function for the multivariate sparse regression problem. We devise a greedy algorithm based on simultaneous normalized iterative hard thresholding (SNIHT) algorithm. Unlike the conventional SNIHT method, our algorithm, referred to as HUB-SNIHT, is robust under heavy-tailed non-Gaussian noise conditions, yet has a negligible performance loss compared to SNIHT under Gaussian noise. Usefulness of the method is illustrated in source localization application with sensor arrays. △ Less

Submitted 16 April, 2015; originally announced April 2015.

Comments: To appear in CoSeRa'15 (Pisa, Italy, June 16-19, 2015). arXiv admin note: text overlap with arXiv:1502.02441

arXiv:1504.02382 [pdf, other]

doi 10.1109/TSP.2015.2498121

Robust, scalable and fast bootstrap method for analyzing large scale data

Authors: Shahab Basiri, Esa Ollila, Visa Koivunen

Abstract: In this paper we address the problem of performing statistical inference for large scale data sets i.e., Big Data. The volume and dimensionality of the data may be so high that it cannot be processed or stored in a single computing node. We propose a scalable, statistically robust and computationally efficient bootstrap method, compatible with distributed processing and storage systems. Bootstrap… ▽ More In this paper we address the problem of performing statistical inference for large scale data sets i.e., Big Data. The volume and dimensionality of the data may be so high that it cannot be processed or stored in a single computing node. We propose a scalable, statistically robust and computationally efficient bootstrap method, compatible with distributed processing and storage systems. Bootstrap resamples are constructed with smaller number of distinct data points on multiple disjoint subsets of data, similarly to the bag of little bootstrap method (BLB) [1]. Then significant savings in computation is achieved by avoiding the re-computation of the estimator for each bootstrap sample. Instead, a computationally efficient fixed-point estimation equation is analytically solved via a smart approximation following the Fast and Robust Bootstrap method (FRB) [2]. Our proposed bootstrap method facilitates the use of highly robust statistical methods in analyzing large scale data sets. The favorable statistical properties of the method are established analytically. Numerical examples demonstrate scalability, low complexity and robust statistical performance of the method in analyzing large data sets. △ Less

Submitted 12 April, 2015; v1 submitted 9 April, 2015; originally announced April 2015.

Comments: This paper is submitted for publication in IEEE Transactions On Signal Processing, 8 pages, 8 figures

arXiv:1502.02441 [pdf, ps, other]

Nonparametric Simultaneous Sparse Recovery: an Application to Source Localization

Authors: Esa Ollila

Abstract: We consider multichannel sparse recovery problem where the objective is to find good recovery of jointly sparse unknown signal vectors from the given multiple measurement vectors which are different linear combinations of the same known elementary vectors. Many popular greedy or convex algorithms perform poorly under non-Gaussian heavy-tailed noise conditions or in the face of outliers. In this pa… ▽ More We consider multichannel sparse recovery problem where the objective is to find good recovery of jointly sparse unknown signal vectors from the given multiple measurement vectors which are different linear combinations of the same known elementary vectors. Many popular greedy or convex algorithms perform poorly under non-Gaussian heavy-tailed noise conditions or in the face of outliers. In this paper, we propose the usage of mixed $\ell_{p,q}$ norms on data fidelity (residual matrix) term and the conventional $\ell_{0,2}$-norm constraint on the signal matrix to promote row-sparsity. We devise a greedy pursuit algorithm based on simultaneous normalized iterative hard thresholding (SNIHT) algorithm. Simulation studies highlight the effectiveness of the proposed approaches to cope with different noise environments (i.i.d., row i.i.d, etc) and outliers. Usefulness of the methods are illustrated in source localization application with sensor arrays. △ Less

Submitted 10 June, 2015; v1 submitted 9 February, 2015; originally announced February 2015.

Comments: Paper appears in Proc. European Signal Processing Conference (EUSIPCO'15), Nice, France, Aug 31 -- Sep 4, 2015

arXiv:1405.1502 [pdf, ps, other]

Robust iterative hard thresholding for compressed sensing

Authors: Esa Ollila, Hyon-Jung Kim, Visa Koivunen

Abstract: Compressed sensing (CS) or sparse signal reconstruction (SSR) is a signal processing technique that exploits the fact that acquired data can have a sparse representation in some basis. One popular technique to reconstruct or approximate the unknown sparse signal is the iterative hard thresholding (IHT) which however performs very poorly under non-Gaussian noise conditions or in the face of outlier… ▽ More Compressed sensing (CS) or sparse signal reconstruction (SSR) is a signal processing technique that exploits the fact that acquired data can have a sparse representation in some basis. One popular technique to reconstruct or approximate the unknown sparse signal is the iterative hard thresholding (IHT) which however performs very poorly under non-Gaussian noise conditions or in the face of outliers (gross errors). In this paper, we propose a robust IHT method based on ideas from $M$-estimation that estimates the sparse signal and the scale of the error distribution simultaneously. The method has a negligible performance loss compared to IHT under Gaussian noise, but superior performance under heavy-tailed non-Gaussian noise conditions. △ Less

Submitted 7 May, 2014; originally announced May 2014.

Comments: To appear in Proc. of ISCCSP 2014

Showing 1–13 of 13 results for author: Ollila, E