-
Knowledge-guided EEG Representation Learning
Authors:
Aditya Kommineni,
Kleanthis Avramidis,
Richard Leahy,
Shrikanth Narayanan
Abstract:
Self-supervised learning has produced impressive results in multimedia domains of audio, vision and speech. This paradigm is equally, if not more, relevant for the domain of biosignals, owing to the scarcity of labelled data in such scenarios. The ability to leverage large-scale unlabelled data to learn robust representations could help improve the performance of numerous inference tasks on biosig…
▽ More
Self-supervised learning has produced impressive results in multimedia domains of audio, vision and speech. This paradigm is equally, if not more, relevant for the domain of biosignals, owing to the scarcity of labelled data in such scenarios. The ability to leverage large-scale unlabelled data to learn robust representations could help improve the performance of numerous inference tasks on biosignals. Given the inherent domain differences between multimedia modalities and biosignals, the established objectives for self-supervised learning may not translate well to this domain. Hence, there is an unmet need to adapt these methods to biosignal analysis. In this work we propose a self-supervised model for EEG, which provides robust performance and remarkable parameter efficiency by using state space-based deep learning architecture. We also propose a novel knowledge-guided pre-training objective that accounts for the idiosyncrasies of the EEG signal. The results indicate improved embedding representation learning and downstream performance compared to prior works on exemplary tasks. Also, the proposed objective significantly reduces the amount of pre-training data required to obtain performance equivalent to prior works.
△ Less
Submitted 14 February, 2024;
originally announced March 2024.
-
Multi-Center Fetal Brain Tissue Annotation (FeTA) Challenge 2022 Results
Authors:
Kelly Payette,
Céline Steger,
Roxane Licandro,
Priscille de Dumast,
Hongwei Bran Li,
Matthew Barkovich,
Liu Li,
Maik Dannecker,
Chen Chen,
Cheng Ouyang,
Niccolò McConnell,
Alina Miron,
Yongmin Li,
Alena Uus,
Irina Grigorescu,
Paula Ramirez Gilliland,
Md Mahfuzur Rahman Siddiquee,
Daguang Xu,
Andriy Myronenko,
Haoyu Wang,
Ziyan Huang,
** Ye,
Mireia Alenyà,
Valentin Comte,
Oscar Camara
, et al. (42 additional authors not shown)
Abstract:
Segmentation is a critical step in analyzing the develo** human fetal brain. There have been vast improvements in automatic segmentation methods in the past several years, and the Fetal Brain Tissue Annotation (FeTA) Challenge 2021 helped to establish an excellent standard of fetal brain segmentation. However, FeTA 2021 was a single center study, and the generalizability of algorithms across dif…
▽ More
Segmentation is a critical step in analyzing the develo** human fetal brain. There have been vast improvements in automatic segmentation methods in the past several years, and the Fetal Brain Tissue Annotation (FeTA) Challenge 2021 helped to establish an excellent standard of fetal brain segmentation. However, FeTA 2021 was a single center study, and the generalizability of algorithms across different imaging centers remains unsolved, limiting real-world clinical applicability. The multi-center FeTA Challenge 2022 focuses on advancing the generalizability of fetal brain segmentation algorithms for magnetic resonance imaging (MRI). In FeTA 2022, the training dataset contained images and corresponding manually annotated multi-class labels from two imaging centers, and the testing data contained images from these two imaging centers as well as two additional unseen centers. The data from different centers varied in many aspects, including scanners used, imaging parameters, and fetal brain super-resolution algorithms applied. 16 teams participated in the challenge, and 17 algorithms were evaluated. Here, a detailed overview and analysis of the challenge results are provided, focusing on the generalizability of the submissions. Both in- and out of domain, the white matter and ventricles were segmented with the highest accuracy, while the most challenging structure remains the cerebral cortex due to anatomical complexity. The FeTA Challenge 2022 was able to successfully evaluate and advance generalizability of multi-class fetal brain tissue segmentation algorithms for MRI and it continues to benchmark new algorithms. The resulting new methods contribute to improving the analysis of brain development in utero.
△ Less
Submitted 8 February, 2024;
originally announced February 2024.
-
Meta Transfer of Self-Supervised Knowledge: Foundation Model in Action for Post-Traumatic Epilepsy Prediction
Authors:
Wenhui Cui,
Haleh Akrami,
Ganning Zhao,
Anand A. Joshi,
Richard M. Leahy
Abstract:
Despite the impressive advancements achieved using deep-learning for functional brain activity analysis, the heterogeneity of functional patterns and scarcity of imaging data still pose challenges in tasks such as prediction of future onset of Post-Traumatic Epilepsy (PTE) from data acquired shortly after traumatic brain injury (TBI). Foundation models pre-trained on separate large-scale datasets…
▽ More
Despite the impressive advancements achieved using deep-learning for functional brain activity analysis, the heterogeneity of functional patterns and scarcity of imaging data still pose challenges in tasks such as prediction of future onset of Post-Traumatic Epilepsy (PTE) from data acquired shortly after traumatic brain injury (TBI). Foundation models pre-trained on separate large-scale datasets can improve the performance from scarce and heterogeneous datasets. For functional Magnetic Resonance Imaging (fMRI), while data may be abundantly available from healthy controls, clinical data is often scarce, limiting the ability of foundation models to identify clinically-relevant features. We overcome this limitation by introducing a novel training strategy for our foundation model by integrating meta-learning with self-supervised learning to improve the generalization from normal to clinical features. In this way we enable generalization to other downstream clinical tasks, in our case prediction of PTE. To achieve this, we perform self-supervised training on the control dataset to focus on inherent features that are not limited to a particular supervised task while applying meta-learning, which strongly improves the model's generalizability using bi-level optimization. Through experiments on neurological disorder classification tasks, we demonstrate that the proposed strategy significantly improves task performance on small-scale clinical datasets. To explore the generalizability of the foundation model in downstream applications, we then apply the model to an unseen TBI dataset for prediction of PTE using zero-shot learning. Results further demonstrated the enhanced generalizability of our foundation model.
△ Less
Submitted 21 December, 2023;
originally announced December 2023.
-
Neuro-GPT: Towards A Foundation Model for EEG
Authors:
Wenhui Cui,
Woojae Jeong,
Philipp Thölke,
Takfarinas Medani,
Karim Jerbi,
Anand A. Joshi,
Richard M. Leahy
Abstract:
To handle the scarcity and heterogeneity of electroencephalography (EEG) data for Brain-Computer Interface (BCI) tasks, and to harness the power of large publicly available data sets, we propose Neuro-GPT, a foundation model consisting of an EEG encoder and a GPT model. The foundation model is pre-trained on a large-scale data set using a self-supervised task that learns how to reconstruct masked…
▽ More
To handle the scarcity and heterogeneity of electroencephalography (EEG) data for Brain-Computer Interface (BCI) tasks, and to harness the power of large publicly available data sets, we propose Neuro-GPT, a foundation model consisting of an EEG encoder and a GPT model. The foundation model is pre-trained on a large-scale data set using a self-supervised task that learns how to reconstruct masked EEG segments. We then fine-tune the model on a Motor Imagery Classification task to validate its performance in a low-data regime (9 subjects). Our experiments demonstrate that applying a foundation model can significantly improve classification performance compared to a model trained from scratch, which provides evidence for the generalizability of the foundation model and its ability to address challenges of data scarcity and heterogeneity in EEG. The code is publicly available at github.com/wenhui0206/NeuroGPT.
△ Less
Submitted 2 March, 2024; v1 submitted 7 November, 2023;
originally announced November 2023.
-
Learning A Disentangling Representation For PU Learning
Authors:
Omar Zamzam,
Haleh Akrami,
Mahdi Soltanolkotabi,
Richard Leahy
Abstract:
In this paper, we address the problem of learning a binary (positive vs. negative) classifier given Positive and Unlabeled data commonly referred to as PU learning. Although rudimentary techniques like clustering, out-of-distribution detection, or positive density estimation can be used to solve the problem in low-dimensional settings, their efficacy progressively deteriorates with higher dimensio…
▽ More
In this paper, we address the problem of learning a binary (positive vs. negative) classifier given Positive and Unlabeled data commonly referred to as PU learning. Although rudimentary techniques like clustering, out-of-distribution detection, or positive density estimation can be used to solve the problem in low-dimensional settings, their efficacy progressively deteriorates with higher dimensions due to the increasing complexities in the data distribution. In this paper we propose to learn a neural network-based data representation using a loss function that can be used to project the unlabeled data into two (positive and negative) clusters that can be easily identified using simple clustering techniques, effectively emulating the phenomenon observed in low-dimensional settings. We adopt a vector quantization technique for the learned representations to amplify the separation between the learned unlabeled data clusters. We conduct experiments on simulated PU data that demonstrate the improved performance of our proposed method compared to the current state-of-the-art approaches. We also provide some theoretical justification for our two cluster-based approach and our algorithmic choices.
△ Less
Submitted 5 October, 2023;
originally announced October 2023.
-
Beta quantile regression for robust estimation of uncertainty in the presence of outliers
Authors:
Haleh Akrami,
Omar Zamzam,
Anand Joshi,
Sergul Aydore,
Richard Leahy
Abstract:
Quantile Regression (QR) can be used to estimate aleatoric uncertainty in deep neural networks and can generate prediction intervals. Quantifying uncertainty is particularly important in critical applications such as clinical diagnosis, where a realistic assessment of uncertainty is essential in determining disease status and planning the appropriate treatment. The most common application of quant…
▽ More
Quantile Regression (QR) can be used to estimate aleatoric uncertainty in deep neural networks and can generate prediction intervals. Quantifying uncertainty is particularly important in critical applications such as clinical diagnosis, where a realistic assessment of uncertainty is essential in determining disease status and planning the appropriate treatment. The most common application of quantile regression models is in cases where the parametric likelihood cannot be specified. Although quantile regression is quite robust to outlier response observations, it can be sensitive to outlier covariate observations (features). Outlier features can compromise the performance of deep learning regression problems such as style translation, image reconstruction, and deep anomaly detection, potentially leading to misleading conclusions. To address this problem, we propose a robust solution for quantile regression that incorporates concepts from robust divergence. We compare the performance of our proposed method with (i) least trimmed quantile regression and (ii) robust regression based on the regularization of case-specific parameters in a simple real dataset in the presence of outlier. These methods have not been applied in a deep learning framework. We also demonstrate the applicability of the proposed method by applying it to a medical imaging translation task using diffusion models.
△ Less
Submitted 13 September, 2023;
originally announced September 2023.
-
Toward Improved Generalization: Meta Transfer of Self-supervised Knowledge on Graphs
Authors:
Wenhui Cui,
Haleh Akrami,
Anand A. Joshi,
Richard M. Leahy
Abstract:
Despite the remarkable success achieved by graph convolutional networks for functional brain activity analysis, the heterogeneity of functional patterns and the scarcity of imaging data still pose challenges in many tasks. Transferring knowledge from a source domain with abundant training data to a target domain is effective for improving representation learning on scarce training data. However, t…
▽ More
Despite the remarkable success achieved by graph convolutional networks for functional brain activity analysis, the heterogeneity of functional patterns and the scarcity of imaging data still pose challenges in many tasks. Transferring knowledge from a source domain with abundant training data to a target domain is effective for improving representation learning on scarce training data. However, traditional transfer learning methods often fail to generalize the pre-trained knowledge to the target task due to domain discrepancy. Self-supervised learning on graphs can increase the generalizability of graph features since self-supervision concentrates on inherent graph properties that are not limited to a particular supervised task. We propose a novel knowledge transfer strategy by integrating meta-learning with self-supervised learning to deal with the heterogeneity and scarcity of fMRI data. Specifically, we perform a self-supervised task on the source domain and apply meta-learning, which strongly improves the generalizability of the model using the bi-level optimization, to transfer the self-supervised knowledge to the target domain. Through experiments on a neurological disorder classification task, we demonstrate that the proposed strategy significantly improves target task performance by increasing the generalizability and transferability of graph-based knowledge.
△ Less
Submitted 15 December, 2022;
originally announced December 2022.
-
Learning From Positive and Unlabeled Data Using Observer-GAN
Authors:
Omar Zamzam,
Haleh Akrami,
Richard Leahy
Abstract:
The problem of learning from positive and unlabeled data (A.K.A. PU learning) has been studied in a binary (i.e., positive versus negative) classification setting, where the input data consist of (1) observations from the positive class and their corresponding labels, (2) unlabeled observations from both positive and negative classes. Generative Adversarial Networks (GANs) have been used to reduce…
▽ More
The problem of learning from positive and unlabeled data (A.K.A. PU learning) has been studied in a binary (i.e., positive versus negative) classification setting, where the input data consist of (1) observations from the positive class and their corresponding labels, (2) unlabeled observations from both positive and negative classes. Generative Adversarial Networks (GANs) have been used to reduce the problem to the supervised setting with the advantage that supervised learning has state-of-the-art accuracy in classification tasks. In order to generate \textit{pseudo}-negative observations, GANs are trained on positive and unlabeled observations with a modified loss. Using both positive and \textit{pseudo}-negative observations leads to a supervised learning setting. The generation of pseudo-negative observations that are realistic enough to replace missing negative class samples is a bottleneck for current GAN-based algorithms. By including an additional classifier into the GAN architecture, we provide a novel GAN-based approach. In our suggested method, the GAN discriminator instructs the generator only to produce samples that fall into the unlabeled data distribution, while a second classifier (observer) network monitors the GAN training to: (i) prevent the generated samples from falling into the positive distribution; and (ii) learn the features that are the key distinction between the positive and negative observations. Experiments on four image datasets demonstrate that our trained observer network performs better than existing techniques in discriminating between real unseen positive and negative samples.
△ Less
Submitted 26 August, 2022;
originally announced August 2022.
-
Learning from imperfect training data using a robust loss function: application to brain image segmentation
Authors:
Haleh Akrami,
Wenhui Cui,
Anand A Joshi,
Richard M. Leahy
Abstract:
Segmentation is one of the most important tasks in MRI medical image analysis and is often the first and the most critical step in many clinical applications. In brain MRI analysis, head segmentation is commonly used for measuring and visualizing the brain's anatomical structures and is also a necessary step for other applications such as current-source reconstruction in electroencephalography and…
▽ More
Segmentation is one of the most important tasks in MRI medical image analysis and is often the first and the most critical step in many clinical applications. In brain MRI analysis, head segmentation is commonly used for measuring and visualizing the brain's anatomical structures and is also a necessary step for other applications such as current-source reconstruction in electroencephalography and magnetoencephalography (EEG/MEG). Here we propose a deep learning framework that can segment brain, skull, and extra-cranial tissue using only T1-weighted MRI as input. In addition, we describe a robust method for training the model in the presence of noisy labels.
△ Less
Submitted 8 August, 2022;
originally announced August 2022.
-
Semi-supervised Learning using Robust Loss
Authors:
Wenhui Cui,
Haleh Akrami,
Anand A. Joshi,
Richard M. Leahy
Abstract:
The amount of manually labeled data is limited in medical applications, so semi-supervised learning and automatic labeling strategies can be an asset for training deep neural networks. However, the quality of the automatically generated labels can be uneven and inferior to manual labels. In this paper, we suggest a semi-supervised training strategy for leveraging both manually labeled data and ext…
▽ More
The amount of manually labeled data is limited in medical applications, so semi-supervised learning and automatic labeling strategies can be an asset for training deep neural networks. However, the quality of the automatically generated labels can be uneven and inferior to manual labels. In this paper, we suggest a semi-supervised training strategy for leveraging both manually labeled data and extra unlabeled data. In contrast to the existing approaches, we apply robust loss for the automated labeled data to automatically compensate for the uneven data quality using a teacher-student framework. First, we generate pseudo-labels for unlabeled data using a teacher model pre-trained on labeled data. These pseudo-labels are noisy, and using them along with labeled data for training a deep neural network can severely degrade learned feature representations and the generalization of the network. Here we mitigate the effect of these pseudo-labels by using robust loss functions. Specifically, we use three robust loss functions, namely beta cross-entropy, symmetric cross-entropy, and generalized cross-entropy. We show that our proposed strategy improves the model performance by compensating for the uneven quality of labels in image classification as well as segmentation applications.
△ Less
Submitted 3 March, 2022;
originally announced March 2022.
-
Losing the battle over best-science guidance early in a crisis: Covid-19 and beyond
Authors:
L. Illari,
N. Johnson Restrepo,
R. Leahy,
N. Velasquez,
Y. Lupu,
N. F. Johnson
Abstract:
Ensuring widespread public exposure to best-science guidance is crucial in a crisis, e.g. Covid-19, climate change. Map** the emitter-receiver dynamics of Covid-19 guidance among 87 million Facebook users, we uncover a multi-sided battle over exposure that gets lost well before the pandemic's official announcement. By the time Covid-19 vaccines emerge, the mainstream majority -- including many p…
▽ More
Ensuring widespread public exposure to best-science guidance is crucial in a crisis, e.g. Covid-19, climate change. Map** the emitter-receiver dynamics of Covid-19 guidance among 87 million Facebook users, we uncover a multi-sided battle over exposure that gets lost well before the pandemic's official announcement. By the time Covid-19 vaccines emerge, the mainstream majority -- including many parenting communities -- have moved even closer to more extreme communities. The hidden heterogeneity explains why Facebook's own promotion of best-science guidance also missed key audience segments. A simple mathematical model reproduces these exposure dynamics at the system level. Our findings can be used to tailor guidance at scale while accounting for individual diversity, and to predict tip** point behavior and system-level responses to interventions.
△ Less
Submitted 18 October, 2021;
originally announced October 2021.
-
Deep Quantile Regression for Uncertainty Estimation in Unsupervised and Supervised Lesion Detection
Authors:
Haleh Akrami,
Anand Joshi,
Sergul Aydore,
Richard Leahy
Abstract:
Despite impressive state-of-the-art performance on a wide variety of machine learning tasks, deep learning methods can produce over-confident predictions, particularly with limited training data. Therefore, quantifying uncertainty is particularly important in critical applications such as lesion detection and clinical diagnosis, where a realistic assessment of uncertainty is essential in determini…
▽ More
Despite impressive state-of-the-art performance on a wide variety of machine learning tasks, deep learning methods can produce over-confident predictions, particularly with limited training data. Therefore, quantifying uncertainty is particularly important in critical applications such as lesion detection and clinical diagnosis, where a realistic assessment of uncertainty is essential in determining surgical margins, disease status and appropriate treatment. In this work, we propose a novel approach that uses quantile regression for quantifying aleatoric uncertainty in both supervised and unsupervised lesion detection problems. The resulting confidence intervals can be used for lesion detection and segmentation. In the unsupervised setting, we combine quantile regression with the Variational AutoEncoder (VAE). Here we address the problem of quantifying uncertainty in the images that are reconstructed by the VAE as the basis for principled outlier or lesion detection. The VAE models the output as a conditionally independent Gaussian characterized by its mean and variance. Unfortunately, joint optimization of both mean and variance in the VAE leads to the well-known problem of shrinkage or underestimation of variance. Here we describe an alternative Quantile-Regression VAE (QR-VAE) that avoids this variance shrinkage problem by directly estimating conditional quantiles for the input image. Using the estimated quantiles, we compute the conditional mean and variance for the input image from which we then detect outliers by thresholding at a false-discovery-rate corrected p-value. In the supervised setting, we develop binary quantile regression (BQR) for the supervised lesion segmentation task. We show how BQR can be used to capture uncertainty in lesion boundaries in a manner that characterizes expert disagreement.
△ Less
Submitted 26 April, 2022; v1 submitted 20 September, 2021;
originally announced September 2021.
-
Mainstreaming of conspiracy theories and misinformation
Authors:
N. F. Johnson,
N. Velasquez,
N. Johnson Restrepo,
R. Leahy,
R. Sear,
N. Gabriel,
H. Larson,
Y. Lupu
Abstract:
Parents - particularly moms - increasingly consult social media for support when taking decisions about their young children, and likely also when advising other family members such as elderly relatives. Minimizing malignant online influences is therefore crucial to securing their assent for policies ranging from vaccinations, masks and social distancing against the pandemic, to household best pra…
▽ More
Parents - particularly moms - increasingly consult social media for support when taking decisions about their young children, and likely also when advising other family members such as elderly relatives. Minimizing malignant online influences is therefore crucial to securing their assent for policies ranging from vaccinations, masks and social distancing against the pandemic, to household best practices against climate change, to acceptance of future 5G towers nearby. Here we show how a strengthening of bonds across online communities during the pandemic, has led to non-Covid-19 conspiracy theories (e.g. fluoride, chemtrails, 5G) attaining heightened access to mainstream parent communities. Alternative health communities act as the critical conduits between conspiracy theorists and parents, and make the narratives more palatable to the latter. We demonstrate experimentally that these inter-community bonds can perpetually generate new misinformation, irrespective of any changes in factual information. Our findings show explicitly why Facebook's current policies have failed to stop the mainstreaming of non-Covid-19 and Covid-19 conspiracy theories and misinformation, and why targeting the largest communities will not work. A simple yet exactly solvable and empirically grounded mathematical model, shows how modest tailoring of mainstream communities' couplings could prevent them from tip** against establishment guidance. Our conclusions should also apply to other social media platforms and topics.
△ Less
Submitted 3 February, 2021;
originally announced February 2021.
-
fMRI-Kernel Regression: A Kernel-based Method for Pointwise Statistical Analysis of rs-fMRI for Population Studies
Authors:
Anand A. Joshi,
Soyoung Choi,
Haleh Akrami,
Richard M. Leahy
Abstract:
Due to the spontaneous nature of resting-state fMRI (rs-fMRI) signals, cross-subject comparison and therefore, group studies of rs-fMRI are challenging. Most existing group comparison methods use features extracted from the fMRI time series, such as connectivity features, independent component analysis (ICA), and functional connectivity density (FCD) methods. However, in group studies, especially…
▽ More
Due to the spontaneous nature of resting-state fMRI (rs-fMRI) signals, cross-subject comparison and therefore, group studies of rs-fMRI are challenging. Most existing group comparison methods use features extracted from the fMRI time series, such as connectivity features, independent component analysis (ICA), and functional connectivity density (FCD) methods. However, in group studies, especially in the case of spectrum disorders, distances to a single atlas or a representative subject do not fully reflect the differences between subjects that may lie on a multi-dimensional spectrum. Moreover, there may not exist an individual subject or even an average atlas in such cases that is representative of all subjects. Here we describe an approach that measures pairwise distances between the synchronized rs-fMRI signals of pairs of subjects instead of to a single reference point. We also present a method for fMRI data comparison that leverages this generated pairwise feature to establish a radial basis function kernel matrix. This kernel matrix is used in turn to perform kernel regression of rs-fMRI to a clinical variable such as a cognitive or neurophysiological performance score of interest. This method opens a new pointwise analysis paradigm for fMRI data. We demonstrate the application of this method by performing a pointwise analysis on the cortical surface using rs-fMRI data to identify cortical regions associated with variability in ADHD index. While pointwise analysis methods are common in anatomical studies such as cortical thickness analysis and voxel- and tensor-based morphometry and its variants, such a method is lacking for rs-fMRI and could improve the utility of rs-fMRI for group studies. The method presented in this paper is aimed at filling this gap.
△ Less
Submitted 13 December, 2020;
originally announced December 2020.
-
Realistic head modeling of electromagnetic brain activity: An integrated Brainstorm pipeline from MRI data to the FEM solution
Authors:
Takfarinas Medani,
Juan Garcia-Prieto,
Francois Tadel,
Sophie Schrader,
Anand Joshi,
Christian Engwer,
Carsten H. Wolters,
John C. Mosher,
Richard M. Leahy
Abstract:
Human brain activity generates scalp potentials (electroencephalography EEG), intracranial potentials (iEEG), and external magnetic fields (magnetoencephalography MEG), all capable of being recorded, often simultaneously, for use in research and clinical applications. The so-called forward problem is the modeling of these fields at their sensors for a given putative neural source configuration. Wh…
▽ More
Human brain activity generates scalp potentials (electroencephalography EEG), intracranial potentials (iEEG), and external magnetic fields (magnetoencephalography MEG), all capable of being recorded, often simultaneously, for use in research and clinical applications. The so-called forward problem is the modeling of these fields at their sensors for a given putative neural source configuration. While early generations modeled the head as a simple set of isotropic spheres, today s ubiquitous magnetic resonance imaging (MRI) data allows detailed descriptions of head compartments with assigned isotropic and anisotropic conductivities. In this paper, we present a complete pipeline, integrated into the Brainstorm software, that allows users to generate an individual and accurate head model from the MRI and then calculate the electromagnetic forward solution using the finite element method (FEM). The head model generation is performed by the integration of the latest tools for MRI segmentation and FEM mesh generation. The final head model is divided into five main compartments: white matter, grey matter, CSF, skull, and scalp. For the isotropic compartments, widely-used default conductivity values are assigned. For the brain tissues, we use the process of the effective medium approach (EMA) to estimate anisotropic conductivity tensors from diffusion-weighted imaging (DWI) data. The FEM electromagnetic calculations are performed by the DUNEuro library, integrated into Brainstorm and accessible with a user-friendly graphical interface. This integrated pipeline, with full tutorials and example data sets freely available on the Brainstorm website, gives the neuroscience community easy access to advanced tools for electromagnetic modeling using FEM.
△ Less
Submitted 2 November, 2020;
originally announced November 2020.
-
Addressing Variance Shrinkage in Variational Autoencoders using Quantile Regression
Authors:
Haleh Akrami,
Anand A. Joshi,
Sergul Aydore,
Richard M. Leahy
Abstract:
Estimation of uncertainty in deep learning models is of vital importance, especially in medical imaging, where reliance on inference without taking into account uncertainty could lead to misdiagnosis. Recently, the probabilistic Variational AutoEncoder (VAE) has become a popular model for anomaly detection in applications such as lesion detection in medical images. The VAE is a generative graphica…
▽ More
Estimation of uncertainty in deep learning models is of vital importance, especially in medical imaging, where reliance on inference without taking into account uncertainty could lead to misdiagnosis. Recently, the probabilistic Variational AutoEncoder (VAE) has become a popular model for anomaly detection in applications such as lesion detection in medical images. The VAE is a generative graphical model that is used to learn the data distribution from samples and then generate new samples from this distribution. By training on normal samples, the VAE can be used to detect inputs that deviate from this learned distribution. The VAE models the output as a conditionally independent Gaussian characterized by means and variances for each output dimension. VAEs can therefore use reconstruction probability instead of reconstruction error for anomaly detection. Unfortunately, joint optimization of both mean and variance in the VAE leads to the well-known problem of shrinkage or underestimation of variance. We describe an alternative approach that avoids this variance shrinkage problem by using quantile regression. Using estimated quantiles to compute mean and variance under the Gaussian assumption, we compute reconstruction probability as a principled approach to outlier or anomaly detection. Results on simulated and Fashion MNIST data demonstrate the effectiveness of our approach. We also show how our approach can be used for principled heterogeneous thresholding for lesion detection in brain images.
△ Less
Submitted 18 October, 2020;
originally announced October 2020.
-
Not sure? Handling hesitancy of COVID-19 vaccines
Authors:
N. F. Johnson,
N. Velasquez,
R. Leahy,
N. Johnson Restrepo,
O. Jha,
Y. Lupu
Abstract:
From the moment the first COVID-19 vaccines are rolled out, there will need to be a large fraction of the global population ready in line. It is therefore crucial to start managing the growing global hesitancy to any such COVID-19 vaccine. The current approach of trying to convince the "no"s cannot work quickly enough, nor can the current policy of trying to find, remove and/or rebut all the indiv…
▽ More
From the moment the first COVID-19 vaccines are rolled out, there will need to be a large fraction of the global population ready in line. It is therefore crucial to start managing the growing global hesitancy to any such COVID-19 vaccine. The current approach of trying to convince the "no"s cannot work quickly enough, nor can the current policy of trying to find, remove and/or rebut all the individual pieces of COVID and vaccine misinformation. Instead, we show how this can be done in a simpler way by moving away from chasing misinformation content and focusing instead on managing the "yes--no--not-sure" hesitancy ecosystem.
△ Less
Submitted 17 September, 2020;
originally announced September 2020.
-
Covid-19 infodemic reveals new tip** point epidemiology and a revised $R$ formula
Authors:
N. F. Johnson,
N. Velasquez,
O. K. Jha,
H. Niyazi,
R. Leahy,
N. Johnson Restrepo,
R. Sear,
P. Manrique,
Y. Lupu,
P. Devkota,
S. Wuchty
Abstract:
Many governments have managed to control their COVID-19 outbreak with a simple message: keep the effective '$R$ number' $R<1$ to prevent widespread contagion and flatten the curve. This raises the question whether a similar policy could control dangerous online 'infodemics' of information, misinformation and disinformation. Here we show, using multi-platform data from the COVID-19 infodemic, that…
▽ More
Many governments have managed to control their COVID-19 outbreak with a simple message: keep the effective '$R$ number' $R<1$ to prevent widespread contagion and flatten the curve. This raises the question whether a similar policy could control dangerous online 'infodemics' of information, misinformation and disinformation. Here we show, using multi-platform data from the COVID-19 infodemic, that its online spreading instead encompasses a different dynamical regime where communities and users within and across independent platforms, sporadically form temporary active links on similar timescales to the viral spreading. This allows material that might have died out, to evolve and even mutate. This has enabled niche networks that were already successfully spreading hate and anti-vaccination material, to rapidly become global super-spreaders of narratives featuring fake COVID-19 treatments, anti-Asian sentiment and conspiracy theories. We derive new tools that incorporate these coupled social-viral dynamics, including an online $R$, to help prevent infodemic spreading at all scales: from spreading across platforms (e.g. Facebook, 4Chan) to spreading within a given subpopulation, or community, or topic. By accounting for similar social and viral timescales, the same mathematical theory also offers a quantitative description of other unconventional infection profiles such as rumors spreading in financial markets and colds spreading in schools.
△ Less
Submitted 19 August, 2020;
originally announced August 2020.
-
Hidden order in online extremism and its disruption by nudging collective chemistry
Authors:
N. F. Johnson,
N. Velasquez,
P. Manrique,
R. Sear,
R. Leahy,
N. Johnson Restrepo,
L. Illari,
Y. Lupu
Abstract:
We show that the eclectic "Boogaloo" extremist movement that is now rising to prominence in the U.S., has a hidden online mathematical order that is identical to ISIS during its early development, despite their stark ideological, geographical and cultural differences. The evolution of each across scales follows a single shockwave equation that accounts for individual heterogeneity in online intera…
▽ More
We show that the eclectic "Boogaloo" extremist movement that is now rising to prominence in the U.S., has a hidden online mathematical order that is identical to ISIS during its early development, despite their stark ideological, geographical and cultural differences. The evolution of each across scales follows a single shockwave equation that accounts for individual heterogeneity in online interactions. This equation predicts how to disrupt the onset and 'flatten the curve' of such online extremism by nudging its collective chemistry.
△ Less
Submitted 17 August, 2020;
originally announced August 2020.
-
Robust Variational Autoencoder for Tabular Data with Beta Divergence
Authors:
Haleh Akrami,
Sergul Aydore,
Richard M. Leahy,
Anand A. Joshi
Abstract:
We propose a robust variational autoencoder with $β$ divergence for tabular data (RTVAE) with mixed categorical and continuous features. Variational autoencoders (VAE) and their variations are popular frameworks for anomaly detection problems. The primary assumption is that we can learn representations for normal patterns via VAEs and any deviation from that can indicate anomalies. However, the tr…
▽ More
We propose a robust variational autoencoder with $β$ divergence for tabular data (RTVAE) with mixed categorical and continuous features. Variational autoencoders (VAE) and their variations are popular frameworks for anomaly detection problems. The primary assumption is that we can learn representations for normal patterns via VAEs and any deviation from that can indicate anomalies. However, the training data itself can contain outliers. The source of outliers in training data include the data collection process itself (random noise) or a malicious attacker (data poisoning) who may target to degrade the performance of the machine learning model. In either case, these outliers can disproportionately affect the training process of VAEs and may lead to wrong conclusions about what the normal behavior is. In this work, we derive a novel form of a variational autoencoder for tabular data sets with categorical and continuous features that is robust to outliers in training data. Our results on the anomaly detection application for network traffic datasets demonstrate the effectiveness of our approach.
△ Less
Submitted 15 June, 2020; v1 submitted 15 June, 2020;
originally announced June 2020.
-
Hate multiverse spreads malicious COVID-19 content online beyond individual platform control
Authors:
N. Velásquez,
R. Leahy,
N. Johnson Restrepo,
Y. Lupu,
R. Sear,
N. Gabriel,
O. Jha,
B. Goldberg,
N. F. Johnson
Abstract:
We show that malicious COVID-19 content, including hate speech, disinformation, and misinformation, exploits the multiverse of online hate to spread quickly beyond the control of any individual social media platform. Machine learning topic analysis shows quantitatively how online hate communities are weaponizing COVID-19, with topics evolving rapidly and content becoming increasingly coherent. Our…
▽ More
We show that malicious COVID-19 content, including hate speech, disinformation, and misinformation, exploits the multiverse of online hate to spread quickly beyond the control of any individual social media platform. Machine learning topic analysis shows quantitatively how online hate communities are weaponizing COVID-19, with topics evolving rapidly and content becoming increasingly coherent. Our mathematical analysis provides a generalized form of the public health R0 predicting the tip** point for multiverse-wide viral spreading, which suggests new policy options to mitigate the global spread of malicious COVID-19 content without relying on future coordination between all online platforms.
△ Less
Submitted 21 April, 2020; v1 submitted 1 April, 2020;
originally announced April 2020.
-
3D Phase Retrieval at Nano-Scale via Accelerated Wirtinger Flow
Authors:
Zalan Fabian,
Justin Haldar,
Richard Leahy,
Mahdi Soltanolkotabi
Abstract:
Imaging 3D nano-structures at very high resolution is crucial in a variety of scientific fields. However, due to fundamental limitations of light propagation we can only measure the object indirectly via 2D intensity measurements of the 3D specimen through highly nonlinear projection map**s where a variety of information (including phase) is lost. Reconstruction therefore involves inverting high…
▽ More
Imaging 3D nano-structures at very high resolution is crucial in a variety of scientific fields. However, due to fundamental limitations of light propagation we can only measure the object indirectly via 2D intensity measurements of the 3D specimen through highly nonlinear projection map**s where a variety of information (including phase) is lost. Reconstruction therefore involves inverting highly non-linear and seemingly non-invertible map**s. In this paper, we introduce a novel technique where the 3D object is directly reconstructed from an accurate non-linear propagation model. Furthermore, we characterize the ambiguities of this model and leverage a priori knowledge to mitigate their effect and also significantly reduce the required number of measurements and hence the acquisition time. We demonstrate the performance of our algorithm via numerical experiments aimed at nano-scale reconstruction of 3D integrated circuits. Moreover, we provide rigorous theoretical guarantees for convergence to stationarity.
△ Less
Submitted 26 February, 2020;
originally announced February 2020.
-
Health Wars and Beyond: The Rapidly Expanding and Efficient Network Insurgency Interlinking Local and Global Online Crowds of Distrust
Authors:
N. F. Johnson,
N. Velasquez,
N. Johnson Restrepo,
R. Leahy,
N. Gabriel,
S. Wuchty,
D. Broniatowski
Abstract:
We present preliminary results on the online war surrounding distrust of expertise in medical science -- specifically, the issue of vaccinations. While distrust and misinformation in politics can damage democratic elections, in the medical context it may also endanger lives through missed vaccinations and DIY cancer cures. We find that this online health war has evolved into a highly efficient net…
▽ More
We present preliminary results on the online war surrounding distrust of expertise in medical science -- specifically, the issue of vaccinations. While distrust and misinformation in politics can damage democratic elections, in the medical context it may also endanger lives through missed vaccinations and DIY cancer cures. We find that this online health war has evolved into a highly efficient network insurgency with direct inter-crowd links across countries, continents and cultures. The online anti-vax crowds (referred to as Red) now appear better positioned to groom new recruits (Green) than those supporting established expertise (Blue). We also present preliminary results from a mathematically-grounded, crowd-based analysis of the war's evolution, which offers an explanation for how Red seems to be turning the tide on Blue.
△ Less
Submitted 4 October, 2019;
originally announced October 2019.
-
Robust Variational Autoencoder
Authors:
Haleh Akrami,
Anand A. Joshi,
Jian Li,
Sergul Aydore,
Richard M. Leahy
Abstract:
Machine learning methods often need a large amount of labeled training data. Since the training data is assumed to be the ground truth, outliers can severely degrade learned representations and performance of trained models. Here we apply concepts from robust statistics to derive a novel variational autoencoder that is robust to outliers in the training data. Variational autoencoders (VAEs) extrac…
▽ More
Machine learning methods often need a large amount of labeled training data. Since the training data is assumed to be the ground truth, outliers can severely degrade learned representations and performance of trained models. Here we apply concepts from robust statistics to derive a novel variational autoencoder that is robust to outliers in the training data. Variational autoencoders (VAEs) extract a lower-dimensional encoded feature representation from which we can generate new data samples. Robustness of autoencoders to outliers is critical for generating a reliable representation of particular data types in the encoded space when using corrupted training data. Our robust VAE is based on beta-divergence rather than the standard Kullback-Leibler (KL) divergence. Our proposed lower bound lead to a RVAE model that has the same computational complexity as the VAE and contains a single tuning parameter to control the degree of robustness. We demonstrate the performance of our $β$-divergence based autoencoder for a range of image datasets, showing improved robustness to outliers both qualitatively and quantitatively. We also illustrate the use of our robust VAE for outlier detection.
△ Less
Submitted 21 December, 2019; v1 submitted 23 May, 2019;
originally announced May 2019.
-
Social media cluster dynamics create resilient global hate highways
Authors:
N. F. Johnson,
R. Leahy,
N. Johnson Restrepo,
N. Velasquez,
M. Zheng,
P. Manrique
Abstract:
Online social media allows individuals to cluster around common interests - including hate. We show that tight-knit social clusters interlink to form resilient 'global hate highways' that bridge independent social network platforms, countries, languages and ideologies, and can quickly self-repair and rewire. We provide a mathematical theory that reveals a hidden resilience in the global axis of ha…
▽ More
Online social media allows individuals to cluster around common interests - including hate. We show that tight-knit social clusters interlink to form resilient 'global hate highways' that bridge independent social network platforms, countries, languages and ideologies, and can quickly self-repair and rewire. We provide a mathematical theory that reveals a hidden resilience in the global axis of hate; explains a likely ineffectiveness of current control methods; and offers improvements. Our results reveal new science for networks-of-networks driven by bipartite dynamics, and should apply more broadly to illicit networks.
△ Less
Submitted 8 November, 2018;
originally announced November 2018.
-
Accelerated Wirtinger Flow: A fast algorithm for ptychography
Authors:
Rui Xu,
Mahdi Soltanolkotabi,
Justin P. Haldar,
Walter Unglaub,
Joshua Zusman,
Anthony F. J. Levi,
Richard M. Leahy
Abstract:
This paper presents a new algorithm, Accelerated Wirtinger Flow (AWF), for ptychographic image reconstruction from phaseless diffraction pattern measurements. AWF is based on combining Nesterov's acceleration approach with Wirtinger gradient descent. Theoretical results enable prespecification of all AWF algorithm parameters, with no need for computationally-expensive line searches and no need for…
▽ More
This paper presents a new algorithm, Accelerated Wirtinger Flow (AWF), for ptychographic image reconstruction from phaseless diffraction pattern measurements. AWF is based on combining Nesterov's acceleration approach with Wirtinger gradient descent. Theoretical results enable prespecification of all AWF algorithm parameters, with no need for computationally-expensive line searches and no need for manual parameter tuning. AWF is evaluated in the context of simulated X-ray ptychography, where we demonstrate fast convergence and low per-iteration computational complexity. We also show examples where AWF reaches higher image quality with less computation than classical algorithms. AWF is also shown to have robustness to noise and probe misalignment.
△ Less
Submitted 13 June, 2018;
originally announced June 2018.