Robust Automatic Whole Brain Extraction on Magnetic Resonance Imaging of Brain Tumor Patients using Dense-Vnet
Authors:
Sara Ranjbar,
Kyle W. Singleton,
Lee Curtin,
Cassandra R. Rickertsen,
Lisa E. Paulson,
Leland S. Hu,
J. Ross Mitchell,
Kristin R. Swanson
Abstract:
Whole brain extraction, also known as skull stripping, is a process in neuroimaging in which non-brain tissue such as skull, eyeballs, and skin is removed from neuroimages. Skull stripping is a preliminary step in presurgical planning, cortical reconstruction, and automatic tumor segmentation. Despite a plethora of skull stripping approaches in the literature, few are sufficiently accurate for processing pathology-presenting MRIs, especially MRIs with brain tumors. In this work we propose a deep learning approach for skull stripping common MRI sequences in oncology, such as T1-weighted with gadolinium contrast (T1Gd) and T2-weighted fluid attenuated inversion recovery (FLAIR), in patients with brain tumors. We automatically created gray matter, white matter, and CSF probability masks using SPM12 software and merged them into a single whole-brain mask for model training. Dice agreement, sensitivity, and specificity of the model (referred to herein as DeepBrain) were tested against manual brain masks. To assess data efficiency, we retrained our models using progressively fewer training examples and calculated average Dice scores on the test set for the models trained in each round. Further, we tested our model against MRI of healthy brains from the LPBA40 dataset. Overall, DeepBrain yielded an average Dice score of 94.5%, sensitivity of 96.4%, and specificity of 98.5% on brain tumor data. For healthy brains, model performance improved to a Dice score of 96.2%, sensitivity of 96.6%, and specificity of 99.2%. The data efficiency experiment showed that, for this specific task, comparable levels of accuracy could have been achieved with as few as 50 training samples. In conclusion, this study demonstrated that a deep learning model trained on minimally processed, automatically generated labels can generate more accurate brain masks on MRI of brain tumor patients within seconds.
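The mask merging and the three reported metrics can be illustrated with a minimal sketch. It is not the authors' pipeline: the function names, the flat-list voxel representation, and the 0.5 threshold on the summed tissue probabilities are all assumptions made for illustration, since the abstract does not state how the SPM12 masks were combined.

```python
def merge_probability_masks(gm, wm, csf, threshold=0.5):
    """Sum per-voxel tissue probabilities and binarize.

    The threshold of 0.5 is an assumed value, not taken from the abstract.
    """
    return [1 if (g + w + c) >= threshold else 0 for g, w, c in zip(gm, wm, csf)]


def mask_metrics(pred, truth):
    """Return (dice, sensitivity, specificity) for two binary masks."""
    tp = sum(1 for p, t in zip(pred, truth) if p == 1 and t == 1)
    tn = sum(1 for p, t in zip(pred, truth) if p == 0 and t == 0)
    fp = sum(1 for p, t in zip(pred, truth) if p == 1 and t == 0)
    fn = sum(1 for p, t in zip(pred, truth) if p == 0 and t == 1)
    # Guard against empty denominators (e.g. two all-background masks).
    dice = 2 * tp / (2 * tp + fp + fn) if (2 * tp + fp + fn) else 1.0
    sensitivity = tp / (tp + fn) if (tp + fn) else 1.0
    specificity = tn / (tn + fp) if (tn + fp) else 1.0
    return dice, sensitivity, specificity


# Toy example with four voxels (values are made up).
gm = [0.6, 0.1, 0.0, 0.2]
wm = [0.3, 0.1, 0.0, 0.5]
csf = [0.0, 0.4, 0.1, 0.1]
brain_mask = merge_probability_masks(gm, wm, csf)
manual_mask = [1, 0, 0, 1]
dice, sensitivity, specificity = mask_metrics(brain_mask, manual_mask)
```

In practice the masks would be 3D volumes rather than flat lists, but the voxel-wise counts behind Dice, sensitivity, and specificity are the same.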
Submitted 3 June, 2020;
originally announced June 2020.
Sex differences in predicting fluid intelligence of adolescent brain from T1-weighted MRIs
Authors:
Sara Ranjbar,
Kyle W. Singleton,
Lee Curtin,
Susan Christine Massey,
Andrea Hawkins-Daarud,
Pamela R. Jackson,
Kristin R. Swanson
Abstract:
Fluid intelligence (Gf) has been defined as the ability to reason and solve previously unseen problems. Links to Gf have been found in magnetic resonance imaging (MRI) sequences such as functional MRI and diffusion tensor imaging. As part of the Adolescent Brain Cognitive Development Neurocognitive Prediction Challenge 2019, we sought to predict Gf in children aged 9-10 from T1-weighted (T1W) MRIs. The data included atlas-aligned volumetric T1W images, atlas-defined segmented regions, age, and sex for 3739 subjects used for training and internal validation and 415 subjects used for external validation. We trained sex-specific convolutional neural network (CNN) and random forest models to predict Gf. For the convolutional model, skull-stripped volumetric T1W images aligned to the SRI24 brain atlas were used for training. Volumes of segmented atlas regions, along with each subject's age, were used to train the random forest regressor models. Performance was measured using the mean squared error (MSE) of the predictions. Random forest models achieved lower MSEs than CNNs. Further, on the external validation data, MSE was lower for females than for males (60.68 vs. 80.74), with a combined MSE of 70.83. Our results suggest that predictive models of Gf from volumetric T1W MRI features alone may perform better when trained separately on male and female data. However, the performance of our models indicates that more information is necessary beyond the available data to make accurate predictions of Gf.
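As a rough illustration of how per-group and combined MSE relate, the sketch below computes MSE separately for two groups and then pools them. The function names and the toy values are hypothetical. The abstract does not give the number of male and female subjects in the external validation set, so the reported figures cannot be reproduced here.

```python
def mse(y_true, y_pred):
    """Mean squared error between two equal-length sequences."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def combined_mse(groups):
    """Pool several (y_true, y_pred) groups into one MSE.

    Equivalent to averaging the per-group MSEs weighted by group size.
    """
    total_squared_error = sum(mse(t, p) * len(t) for t, p in groups)
    n = sum(len(t) for t, _ in groups)
    return total_squared_error / n


# Toy scores (made up): two "female" and four "male" subjects.
female_true, female_pred = [1.0, 2.0], [1.0, 4.0]
male_true, male_pred = [0.0, 0.0, 3.0, 3.0], [3.0, 0.0, 3.0, 3.0]

female_mse = mse(female_true, female_pred)
male_mse = mse(male_true, male_pred)
pooled = combined_mse([(female_true, female_pred), (male_true, male_pred)])
```

Because the pooled value is a size-weighted average, it always lies between the two per-group MSEs, which is consistent with the combined figure quoted in the abstract.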
Submitted 6 August, 2019;
originally announced August 2019.