-
Improving Pediatric Low-Grade Neuroepithelial Tumors Molecular Subtype Identification Using a Novel AUROC Loss Function for Convolutional Neural Networks
Authors:
Khashayar Namdar,
Matthias W. Wagner,
Cynthia Hawkins,
Uri Tabori,
Birgit B. Ertl-Wagner,
Farzad Khalvati
Abstract:
Pediatric Low-Grade Neuroepithelial Tumors (PLGNT) are the most common pediatric cancer type, accounting for 40% of brain tumors in children, and identifying PLGNT molecular subtype is crucial for treatment planning. However, the gold standard to determine the PLGNT subtype is biopsy, which can be impractical or dangerous for patients. This research improves the performance of Convolutional Neural…
▽ More
Pediatric Low-Grade Neuroepithelial Tumors (PLGNT) are the most common pediatric cancer type, accounting for 40% of brain tumors in children, and identifying PLGNT molecular subtype is crucial for treatment planning. However, the gold standard to determine the PLGNT subtype is biopsy, which can be impractical or dangerous for patients. This research improves the performance of Convolutional Neural Networks (CNNs) in classifying PLGNT subtypes through MRI scans by introducing a loss function that specifically improves the model's Area Under the Receiver Operating Characteristic (ROC) Curve (AUROC), offering a non-invasive diagnostic alternative. In this study, a retrospective dataset of 339 children with PLGNT (143 BRAF fusion, 71 with BRAF V600E mutation, and 125 non-BRAF) was curated. We employed a CNN model with Monte Carlo random data splitting. The baseline model was trained using binary cross entropy (BCE), and achieved an AUROC of 86.11% for differentiating BRAF fusion and BRAF V600E mutations, which was improved to 87.71% using our proposed AUROC loss function (p-value 0.045). With multiclass classification, the AUROC improved from 74.42% to 76. 59% (p-value 0.0016).
△ Less
Submitted 5 February, 2024;
originally announced February 2024.
-
Mid-Infrared Photothermal-Fluorescence in Situ Hybridization for Functional Analysis and Genetic Identification of Single Cells
Authors:
Yeran Bai,
Zhongyue Guo,
Fátima C. Pereira,
Michael Wagner,
Ji-Xin Cheng
Abstract:
Simultaneous identification and metabolic analysis of microbes with single-cell resolution and high throughput is necessary to answer the question of "who eats what, when, and where" in complex microbial communities. Here, we present a mid-infrared photothermal-fluorescence in situ hybridization (MIP-FISH) platform that enables direct bridging of genotype and phenotype. Through multiple improvemen…
▽ More
Simultaneous identification and metabolic analysis of microbes with single-cell resolution and high throughput is necessary to answer the question of "who eats what, when, and where" in complex microbial communities. Here, we present a mid-infrared photothermal-fluorescence in situ hybridization (MIP-FISH) platform that enables direct bridging of genotype and phenotype. Through multiple improvements of MIP imaging, the sensitive detection of isotopically-labelled compounds incorporated into proteins of individual bacterial cells became possible, while simultaneous detection of FISH labelling with rRNA-targeted probes enabled the identification of the analyzed cells. In proof-of-concept experiments, we showed that the clear spectral red shift in the protein amide I region due to incorporation of $^{13}$C atoms originating from $^{13}$C-labelled-glucose can be exploited by MIP-FISH to discriminate and identify $^{13}$C-labelled bacterial cells within a complex human gut microbiome sample. The presented methods open new opportunities for single-cell structure-function analyses for microbiology.
△ Less
Submitted 6 September, 2022;
originally announced September 2022.
-
Open-radiomics: A Collection of Standardized Datasets and a Technical Protocol for Reproducible Radiomics Machine Learning Pipelines
Authors:
Khashayar Namdar,
Matthias W. Wagner,
Birgit B. Ertl-Wagner,
Farzad Khalvati
Abstract:
Purpose: As an important branch of machine learning pipelines in medical imaging, radiomics faces two major challenges namely reproducibility and accessibility. In this work, we introduce open-radiomics, a set of radiomics datasets along with a comprehensive radiomics pipeline based on our proposed technical protocol to investigate the effects of radiomics feature extraction on the reproducibility…
▽ More
Purpose: As an important branch of machine learning pipelines in medical imaging, radiomics faces two major challenges namely reproducibility and accessibility. In this work, we introduce open-radiomics, a set of radiomics datasets along with a comprehensive radiomics pipeline based on our proposed technical protocol to investigate the effects of radiomics feature extraction on the reproducibility of the results.
Materials and Methods: Experiments are conducted on BraTS 2020 open-source Magnetic Resonance Imaging (MRI) dataset that includes 369 adult patients with brain tumors (76 low-grade glioma (LGG), and 293 high-grade glioma (HGG)). Using PyRadiomics library for LGG vs. HGG classification, 288 radiomics datasets are formed; the combinations of 4 MRI sequences, 3 binWidths, 6 image normalization methods, and 4 tumor subregions.
Random Forest classifiers were used, and for each radiomics dataset the training-validation-test (60%/20%/20%) experiment with different data splits and model random states was repeated 100 times (28,800 test results) and Area Under Receiver Operating Characteristic Curve (AUC) was calculated.
Results: Unlike binWidth and image normalization, tumor subregion and imaging sequence significantly affected performance of the models. T1 contrast-enhanced sequence and the union of necrotic and the non-enhancing tumor core subregions resulted in the highest AUCs (average test AUC 0.951, 95% confidence interval of (0.949, 0.952)). Although 28 settings and data splits yielded test AUC of 1, they were irreproducible.
Conclusion: Our experiments demonstrate the sources of variability in radiomics pipelines (e.g., tumor subregion) can have a significant impact on the results, which may lead to superficial perfect performances that are irreproducible.
△ Less
Submitted 24 October, 2023; v1 submitted 29 July, 2022;
originally announced July 2022.