-
Learning from Incomplete Features by Simultaneous Training of Neural Networks and Sparse Coding
Authors:
Cesar F. Caiafa,
Ziyao Wang,
Jordi Solé-Casals,
Qibin Zhao
Abstract:
In this paper, the problem of training a classifier on a dataset with incomplete features is addressed. We assume that different subsets of features (random or structured) are available at each data instance. This situation typically occurs in applications where not all the features are collected for every data sample. A new supervised learning method is developed to train a general classifier, such as a logistic regression or a deep neural network, using only a subset of features per sample, while assuming sparse representations of data vectors on an unknown dictionary. Sufficient conditions are identified such that, if it is possible to train a classifier on incomplete observations so that their reconstructions are well separated by a hyperplane, then the same classifier also correctly separates the original (unobserved) data samples. Extensive simulation results on synthetic and well-known datasets are presented that validate our theoretical findings and demonstrate the effectiveness of the proposed method compared to traditional data imputation approaches and one state-of-the-art algorithm.
Submitted 17 April, 2021; v1 submitted 27 November, 2020;
originally announced November 2020.
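The abstract above relies on sparse representations to reconstruct samples from a subset of their features. The following is a minimal sketch of that reconstruction step only, under simplifying assumptions that are not from the paper: the dictionary `D` is known and fixed (the paper treats it as unknown and learns it together with the classifier), and the sparse code is found with a greedy matching pursuit restricted to the observed rows. The function name `masked_omp` and the toy dictionary are hypothetical.

```python
import numpy as np

def masked_omp(D, x_obs, mask, n_nonzero, tol=1e-10):
    """Greedy sparse coding that uses only the observed rows of D.

    Assumption: D is a known dictionary (features x atoms); the paper
    instead learns the dictionary jointly with the classifier.
    """
    Dm = D[mask]                          # dictionary rows at observed features
    norms = np.linalg.norm(Dm, axis=0)
    norms[norms == 0] = 1.0               # guard: atoms invisible under this mask
    residual = x_obs.astype(float).copy()
    support, coef = [], np.zeros(0)
    for _ in range(n_nonzero):
        if np.linalg.norm(residual) <= tol:
            break
        corr = np.abs(Dm.T @ residual) / norms
        if support:
            corr[support] = -np.inf       # never select the same atom twice
        support.append(int(np.argmax(corr)))
        # Least-squares refit of all selected atoms on the observed entries
        coef, *_ = np.linalg.lstsq(Dm[:, support], x_obs, rcond=None)
        residual = x_obs - Dm[:, support] @ coef
    s = np.zeros(D.shape[1])
    if support:
        s[support] = coef
    return s

# Toy example: 4 features, 3 atoms, one missing feature
D = np.array([[1, 0, 1],
              [1, 0, 0],
              [0, 1, 1],
              [0, 1, 0]]) / np.sqrt(2)
x = 2.0 * D[:, 0]                         # sample that uses a single atom
mask = np.array([True, False, True, True])
s_hat = masked_omp(D, x[mask], mask, n_nonzero=1)
x_hat = D @ s_hat                         # reconstruction, including the missing entry
```

In this toy case the missing second feature is recovered because the selected atom determines it. A classifier would then be trained on reconstructions such as `x_hat`.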
-
Brain-Computer Interface with Corrupted EEG Data: A Tensor Completion Approach
Authors:
Jordi Sole-Casals,
Cesar F. Caiafa,
Qibin Zhao,
Andrzej Cichocki
Abstract:
One of the current issues in Brain-Computer Interface is how to deal with noisy Electroencephalography measurements organized as multidimensional datasets. On the other hand, recently, significant advances have been made in multidimensional signal completion algorithms that exploit tensor decomposition models to capture the intricate relationship among entries in a multidimensional signal. We propose to use tensor completion applied to EEG data for improving the classification performance in a motor imagery BCI system with corrupted measurements. Noisy measurements are considered as unknowns that are inferred from a tensor decomposition model. We evaluate the performance of four recently proposed tensor completion algorithms plus a simple interpolation strategy, first with random missing entries and then with missing samples constrained to have a specific structure (random missing channels), which is a more realistic assumption in BCI applications. We measured the ability of these algorithms to reconstruct the tensor from observed data. Then, we tested the classification accuracy of imagined movement in a BCI experiment with missing samples. We show that for random missing entries, all tensor completion algorithms can recover missing samples, increasing the classification performance compared to a simple interpolation approach. For the random missing channels case, we show that tensor completion algorithms help to reconstruct missing channels, significantly improving the accuracy in the classification of motor imagery, although not to the level achieved with clean data. Tensor completion algorithms are useful in real BCI applications. The proposed strategy could allow using motor imagery BCI systems even when EEG data is highly affected by missing channels and/or samples, avoiding the need for new acquisitions in the calibration stage.
Submitted 26 July, 2018; v1 submitted 13 June, 2018;
originally announced June 2018.
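The abstract above treats corrupted entries as unknowns inferred from a low-rank model. The following is a minimal sketch of that idea on synthetic data, not one of the four algorithms evaluated in the paper: it is a simplified stand-in that iterates a truncated SVD on the mode-0 unfolding of the tensor. The function name `complete_low_rank`, the rank, the tensor sizes, and the 80% observation rate are all assumptions chosen for illustration.

```python
import numpy as np

def complete_low_rank(T, mask, rank=1, n_iter=200):
    """Fill missing entries by iterating a truncated SVD on the mode-0 unfolding.

    Simplified stand-in for tensor completion; entries where mask is False
    are treated as unknown and re-estimated at every iteration.
    """
    shape = T.shape
    M = T.reshape(shape[0], -1)            # mode-0 unfolding of the data
    W = mask.reshape(shape[0], -1)         # matching unfolding of the mask
    X = np.where(W, M, M[W].mean())        # start from mean imputation
    for _ in range(n_iter):
        U, s, Vt = np.linalg.svd(X, full_matrices=False)
        low_rank = (U[:, :rank] * s[:rank]) @ Vt[:rank]
        X = np.where(W, M, low_rank)       # keep observed entries, update the rest
    return X.reshape(shape)

# Synthetic rank-1 tensor standing in for (channels x time x trials) data
rng = np.random.default_rng(0)
a, b, c = (rng.uniform(0.5, 1.5, n) for n in (4, 5, 6))
clean = np.einsum("i,j,k->ijk", a, b, c)
mask = rng.random(clean.shape) < 0.8       # True where an entry is observed
corrupted = np.where(mask, clean, 0.0)

completed = complete_low_rank(corrupted, mask)
err_zero = np.linalg.norm((corrupted - clean)[~mask])       # error if gaps are left at zero
err_completed = np.linalg.norm((completed - clean)[~mask])  # error after completion
```

Comparing `err_completed` with `err_zero` mirrors, in a very reduced form, the reconstruction-quality measurement described in the abstract. A real pipeline would pass the completed tensor on to the motor imagery classifier.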
-
Sparse multiway decomposition for analysis and modeling of diffusion imaging and tractography
Authors:
Cesar F. Caiafa,
Franco Pestilli
Abstract:
The number of neuroimaging data sets publicly available is growing at a fast rate. The increase in availability and resolution of neuroimaging data requires modern approaches to signal processing for data analysis and results validation. We introduce the application of sparse multiway decomposition methods (Caiafa and Cichocki, 2012) to linearized neuroimaging models. We show that decomposed models are more compact but as accurate as full models and can be successfully used for fast data analysis. As an example, we focus on a recent model for the evaluation of white matter connectomes (Pestilli et al., 2014). We show that the multiway decomposed model achieves accuracy comparable to the full model, while requiring only a small fraction of the memory and compute time. The approach has implications for a majority of neuroimaging methods using linear approximations to measured signals.
Submitted 26 May, 2015;
originally announced May 2015.
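The memory and compute savings claimed in the abstract above come from never materializing the full linear model. The following is a generic illustration of that principle, not the decomposition used in the paper: when a large matrix has Kronecker structure, its product with a vector can be computed from the small factors alone. The function name `kron_matvec` and the factor sizes are assumptions.

```python
import numpy as np

def kron_matvec(A, B, x):
    """Compute (A kron B) @ x without forming the Kronecker product.

    Uses the row-major identity (A kron B) vec(X) = vec(A X B^T),
    where X is x reshaped to (A.shape[1], B.shape[1]).
    """
    X = x.reshape(A.shape[1], B.shape[1])
    return (A @ X @ B.T).reshape(-1)

rng = np.random.default_rng(0)
A = rng.standard_normal((6, 4))
B = rng.standard_normal((5, 3))
x = rng.standard_normal(4 * 3)

fast = kron_matvec(A, B, x)        # stores only the 6x4 and 5x3 factors
full = np.kron(A, B) @ x           # stores the full 30x12 matrix
```

Both results agree, but the factored version stores 39 numbers instead of 360, and the gap widens quickly as the factors grow.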