Search | arXiv e-print repository

doi 10.1088/2057-1976/ac6300

FBDNN: Filter Banks and Deep Neural Networks for Portable and Fast Brain-Computer Interfaces

Authors: Pedro R. A. S. Bassi, Romis Attux

Abstract: Objective: To propose novel SSVEP classification methodologies using deep neural networks (DNNs) and improve performances in single-channel and user-independent brain-computer interfaces (BCIs) with small data lengths. Approach: We propose the utilization of filter banks (creating sub-band components of the EEG signal) in conjunction with DNNs. In this context, we created three different models: a… ▽ More Objective: To propose novel SSVEP classification methodologies using deep neural networks (DNNs) and improve performances in single-channel and user-independent brain-computer interfaces (BCIs) with small data lengths. Approach: We propose the utilization of filter banks (creating sub-band components of the EEG signal) in conjunction with DNNs. In this context, we created three different models: a recurrent neural network (FBRNN) analyzing the time domain, a 2D convolutional neural network (FBCNN-2D) processing complex spectrum features and a 3D convolutional neural network (FBCNN-3D) analyzing complex spectrograms, which we introduce in this study as possible input for SSVEP classification. We tested our neural networks on three open datasets and conceived them so as not to require calibration from the final user, simulating a user-independent BCI. Results: The DNNs with the filter banks surpassed the accuracy of similar networks without this preprocessing step by considerable margins, and they outperformed common SSVEP classification methods (SVM and FBCCA) by even higher margins. Conclusion and significance: Filter banks allow different types of deep neural networks to more efficiently analyze the harmonic components of SSVEP. Complex spectrograms carry more information than complex spectrum features and the magnitude spectrum, allowing the FBCNN-3D to surpass the other CNNs. The performances obtained in the challenging classification problems indicates a strong potential for the construction of portable, economical, fast and low-latency BCIs. △ Less

Submitted 30 March, 2022; v1 submitted 5 September, 2021; originally announced September 2021.

Comments: We included additional tests of statistical significance

arXiv:2104.06176 [pdf, other]

doi 10.1007/s42600-022-00242-y

COVID-19 detection using chest X-rays: is lung segmentation important for generalization?

Authors: Pedro R. A. S. Bassi, Romis Attux

Abstract: Purpose: we evaluated the generalization capability of deep neural networks (DNNs), trained to classify chest X-rays as Covid-19, normal or pneumonia, using a relatively small and mixed dataset. Methods: we proposed a DNN to perform lung segmentation and classification, stacking a segmentation module (U-Net), an original intermediate module and a classification module (DenseNet201). To evaluate ge… ▽ More Purpose: we evaluated the generalization capability of deep neural networks (DNNs), trained to classify chest X-rays as Covid-19, normal or pneumonia, using a relatively small and mixed dataset. Methods: we proposed a DNN to perform lung segmentation and classification, stacking a segmentation module (U-Net), an original intermediate module and a classification module (DenseNet201). To evaluate generalization, we tested the DNN with an external dataset (from distinct localities) and used Bayesian inference to estimate probability distributions of performance metrics. Results: our DNN achieved 0.917 AUC on the external test dataset, and a DenseNet without segmentation, 0.906. Bayesian inference indicated mean accuracy of 76.1% and [0.695, 0.826] 95% HDI (highest density interval, which concentrates 95% of the metric's probability mass) with segmentation and, without segmentation, 71.7% and [0.646, 0.786]. Conclusion: employing a novel DNN evaluation technique, which uses LRP and Brixia scores, we discovered that areas where radiologists found strong Covid-19 symptoms are the most important for the stacked DNN classification. External validation showed smaller accuracies than internal, indicating difficulty in generalization, which is positively affected by segmentation. Finally, the performance in the external dataset and the analysis with LRP suggest that DNNs can be trained in small and mixed datasets and still successfully detect Covid-19. △ Less

Submitted 2 November, 2022; v1 submitted 12 April, 2021; originally announced April 2021.

Comments: Text and figure improvements. Results did not change. Included DOI and reference to the published article (Research on Biomedical Engineering, Springer). Link for the published paper: https://trebuchet.public.springernature.app/get_content/1ab346c8-06ea-49ed-92f3-deaec80f6988

Journal ref: Research on Biomedical Engineering, Springer (2022)

arXiv:2101.09184 [pdf, other]

Tensor-Train Networks for Learning Predictive Modeling of Multidimensional Data

Authors: M. Nazareth da Costa, R. Attux, A. Cichocki, J. M. T. Romano

Abstract: In this work, we firstly apply the Train-Tensor (TT) networks to construct a compact representation of the classical Multilayer Perceptron, representing a reduction of up to 95% of the coefficients. A comparative analysis between tensor model and standard multilayer neural networks is also carried out in the context of prediction of the Mackey-Glass noisy chaotic time series and NASDAQ index. We s… ▽ More In this work, we firstly apply the Train-Tensor (TT) networks to construct a compact representation of the classical Multilayer Perceptron, representing a reduction of up to 95% of the coefficients. A comparative analysis between tensor model and standard multilayer neural networks is also carried out in the context of prediction of the Mackey-Glass noisy chaotic time series and NASDAQ index. We show that the weights of a multidimensional regression model can be learned by means of TT network and the optimization of TT weights is a more robust to the impact of coefficient initialization and hyper-parameter setting. Furthermore, an efficient algorithm based on alternating least squares has been proposed for approximating the weights in TT-format with a reduction of computational calculus, providing a much faster convergence than the well-known adaptive learning-method algorithms, widely applied for optimizing neural networks. △ Less

Submitted 30 March, 2021; v1 submitted 22 January, 2021; originally announced January 2021.

Comments: 34 pages, 16 figures

arXiv:2010.06503 [pdf, other]

doi 10.1016/j.bspc.2021.102542

Transfer Learning and SpecAugment applied to SSVEP Based BCI Classification

Authors: Pedro R. A. S. Bassi, Willian Rampazzo, Romis Attux

Abstract: Objective: We used deep convolutional neural networks (DCNNs) to classify electroencephalography (EEG) signals in a steady-state visually evoked potentials (SSVEP) based single-channel brain-computer interface (BCI), which does not require calibration on the user. Methods: EEG signals were converted to spectrograms and served as input to train DCNNs using the transfer learning technique. We also… ▽ More Objective: We used deep convolutional neural networks (DCNNs) to classify electroencephalography (EEG) signals in a steady-state visually evoked potentials (SSVEP) based single-channel brain-computer interface (BCI), which does not require calibration on the user. Methods: EEG signals were converted to spectrograms and served as input to train DCNNs using the transfer learning technique. We also modified and applied a data augmentation method, SpecAugment, generally employed for speech recognition. Furthermore, for comparison purposes, we classified the SSVEP dataset using Support-vector machines (SVMs) and Filter Bank canonical correlation analysis (FBCCA). Results: Excluding the evaluated user's data from the fine-tuning process, we reached 82.2% mean test accuracy and 0.825 mean F1-Score on 35 subjects from an open dataset, using a small data length (0.5 s), only one electrode (Oz) and the DCNN with transfer learning, window slicing (WS) and SpecAugment's time masks. Conclusion: The DCNN results surpassed SVM and FBCCA performances, using a single electrode and a small data length. Transfer learning provided minimal accuracy change, but made training faster. SpecAugment created a small performance improvement and was successfully combined with WS, yielding higher accuracies. Significance: We present a new methodology to solve the problem of SSVEP classification using DCNNs. We also modified a speech recognition data augmentation technique and applied it to the context of BCIs. The presented methodology surpassed performances obtained with FBCCA and SVMs (more traditional SSVEP classification methods) in BCIs with small data lengths and one electrode. This type of BCI can be used to develop small and fast systems. △ Less

Submitted 18 March, 2021; v1 submitted 7 October, 2020; originally announced October 2020.

Journal ref: Biomedical Signal Processing and Control 67 (2021) 102542

arXiv:2006.15476 [pdf, other]

Frequency learning for image classification

Authors: José Augusto Stuchi, Levy Boccato, Romis Attux

Abstract: Machine learning applied to computer vision and signal processing is achieving results comparable to the human brain on specific tasks due to the great improvements brought by the deep neural networks (DNN). The majority of state-of-the-art architectures nowadays are DNN related, but only a few explore the frequency domain to extract useful information and improve the results, like in the image pr… ▽ More Machine learning applied to computer vision and signal processing is achieving results comparable to the human brain on specific tasks due to the great improvements brought by the deep neural networks (DNN). The majority of state-of-the-art architectures nowadays are DNN related, but only a few explore the frequency domain to extract useful information and improve the results, like in the image processing field. In this context, this paper presents a new approach for exploring the Fourier transform of the input images, which is composed of trainable frequency filters that boost discriminative components in the spectrum. Additionally, we propose a slicing procedure to allow the network to learn both global and local features from the frequency-domain representations of the image blocks. The proposed method proved to be competitive with respect to well-known DNN architectures in the selected experiments, with the advantage of being a simpler and lightweight model. This work also raises the discussion on how the state-of-the-art DNNs architectures can exploit not only spatial features, but also the frequency, in order to improve its performance when solving real world problems. △ Less

Submitted 27 June, 2020; originally announced June 2020.

arXiv:2006.06137 [pdf, other]

doi 10.1109/IJCNN55064.2022.9892809

Analysis of Trade-offs in Fair Principal Component Analysis Based on Multi-objective Optimization

Authors: Guilherme D. Pelegrina, Renan D. B. Brotto, Leonardo T. Duarte, Romis Attux, João M. T. Romano

Abstract: In dimensionality reduction problems, the adopted technique may produce disparities between the representation errors of different groups. For instance, in the projected space, a specific class can be better represented in comparison with another one. In some situations, this unfair result may introduce ethical concerns. Aiming at overcoming this inconvenience, a fairness measure can be considered… ▽ More In dimensionality reduction problems, the adopted technique may produce disparities between the representation errors of different groups. For instance, in the projected space, a specific class can be better represented in comparison with another one. In some situations, this unfair result may introduce ethical concerns. Aiming at overcoming this inconvenience, a fairness measure can be considered when performing dimensionality reduction through Principal Component Analysis. However, a solution that increases fairness tends to increase the overall re-construction error. In this context, this paper proposes to address this trade-off by means of a multi-objective-based approach. For this purpose, we adopt a fairness measure associated with the disparity between the representation errors of different groups. Moreover, we investigate if the solution of a classical Principal Component Analysis can be used to find a fair projection. Numerical experiments attest that a fairer result can be achieved with a very small loss in the overall reconstruction error. △ Less

Submitted 3 October, 2022; v1 submitted 10 June, 2020; originally announced June 2020.

Journal ref: IEEE 2022 International Joint Conference on Neural Networks (IJCNN), 2022, pp. 1-8

arXiv:2005.01578 [pdf, other]

doi 10.1007/s42600-021-00132-9

A Deep Convolutional Neural Network for COVID-19 Detection Using Chest X-Rays

Authors: Pedro R. A. S. Bassi, Romis Attux

Abstract: Purpose: We present image classifiers based on Dense Convolutional Networks and transfer learning to classify chest X-ray images according to three labels: COVID-19, pneumonia and normal. Methods: We fine-tuned neural networks pretrained on ImageNet and applied a twice transfer learning approach, using NIH ChestX-ray14 dataset as an intermediate step. We also suggested a novelty called output ne… ▽ More Purpose: We present image classifiers based on Dense Convolutional Networks and transfer learning to classify chest X-ray images according to three labels: COVID-19, pneumonia and normal. Methods: We fine-tuned neural networks pretrained on ImageNet and applied a twice transfer learning approach, using NIH ChestX-ray14 dataset as an intermediate step. We also suggested a novelty called output neuron kee**, which changes the twice transfer learning technique. In order to clarify the modus operandi of the models, we used Layer-wise Relevance Propagation (LRP) to generate heatmaps. Results: We were able to reach test accuracy of 100% on our test dataset. Twice transfer learning and output neuron kee** showed promising results improving performances, mainly in the beginning of the training process. Although LRP revealed that words on the X-rays can influence the networks' predictions, we discovered this had only a very small effect on accuracy. Conclusion: Although clinical studies and larger datasets are still needed to further ensure good generalization, the state-of-the-art performances we achieved show that, with the help of artificial intelligence, chest X-rays can become a cheap and accurate auxiliary method for COVID-19 diagnosis. Heatmaps generated by LRP improve the interpretability of the deep neural networks and indicate an analytical path for future research on diagnosis. Twice transfer learning with output neuron kee** improved performances. △ Less

Submitted 12 January, 2021; v1 submitted 30 April, 2020; originally announced May 2020.

arXiv:2002.02241 [pdf, other]

doi 10.1016/j.eswa.2019.04.041

Application of multi-objective optimization to blind source separation

Authors: Guilherme Dean Pelegrina, Romis Attux, Leonardo Tomazeli Duarte

Abstract: Several problems in signal processing are addressed by expert systems which take into account a set of priors on the sought signals and systems. For instance, blind source separation is often tackled by means of a mono-objective formulation which relies on a separation criterion associated with a given property of the sought signals (sources). However, in many practical situations, there are more… ▽ More Several problems in signal processing are addressed by expert systems which take into account a set of priors on the sought signals and systems. For instance, blind source separation is often tackled by means of a mono-objective formulation which relies on a separation criterion associated with a given property of the sought signals (sources). However, in many practical situations, there are more than one property to be exploited and, as a consequence, a set of separation criteria may be used to recover the original signals. In this context, this paper addresses the separation problem by means of an approach based on multi-objective optimization. Differently from the existing methods, which provide only one estimate for the original signals, our proposal leads to a set of solutions that can be utilized by the system user to take his/her decision. Results obtained through numerical experiments over a set of biomedical signals highlight the viability of the proposed approach, which provides estimations closer to the mean squared error solutions compared to the ones achieved via a mono-objective formulation. Moreover, since our proposal is quite general, this work also contributes to encourage future researches to develop expert systems that exploit the multi-objective formulation in different source separation problems. △ Less

Submitted 6 February, 2020; originally announced February 2020.

Journal ref: Expert Systems with Applications, Volume 131, Pages 60--70, October 2019

Showing 1–8 of 8 results for author: Attux, R