-
STREAMLINE: An Automated Machine Learning Pipeline for Biomedicine Applied to Examine the Utility of Photography-Based Phenotypes for OSA Prediction Across International Sleep Centers
Authors:
Ryan J. Urbanowicz,
Harsh Bandhey,
Brendan T. Keenan,
Greg Maislin,
Sy Hwang,
Danielle L. Mowery,
Shannon M. Lynch,
Diego R. Mazzotti,
Fang Han,
Qing Yun Li,
Thomas Penzel,
Sergio Tufik,
Lia Bittencourt,
Thorarinn Gislason,
Philip de Chazal,
Bhajan Singh,
Nigel McArdle,
Ning-Hung Chen,
Allan Pack,
Richard J. Schwab,
Peter A. Cistulli,
Ulysses J. Magalang
Abstract:
While machine learning (ML) includes a valuable array of tools for analyzing biomedical data, significant time and expertise are required to assemble effective, rigorous, and unbiased pipelines. Automated ML (AutoML) tools seek to facilitate ML application by automating a subset of analysis pipeline elements. In this study we develop and validate a Simple, Transparent, End-to-end Automated Machine Learning Pipeline (STREAMLINE) and apply it to investigate the added utility of photography-based phenotypes for predicting obstructive sleep apnea (OSA), a common and underdiagnosed condition associated with a variety of health, economic, and safety consequences. STREAMLINE is designed to tackle biomedical binary classification tasks while adhering to best practices and accommodating complexity, scalability, reproducibility, customization, and model interpretation. Benchmarking analyses validated the efficacy of STREAMLINE across data simulations with increasingly complex patterns of association. Then we applied STREAMLINE to evaluate the utility of demographics (DEM), self-reported comorbidities (DX), symptoms (SYM), and photography-based craniofacial (CF) and intraoral (IO) anatomy measures in predicting any OSA or moderate/severe OSA using 3,111 participants from the Sleep Apnea Global Interdisciplinary Consortium (SAGIC). OSA analyses identified a significant increase in ROC-AUC when adding CF to DEM+DX+SYM to predict moderate/severe OSA. A consistent but non-significant increase in PRC-AUC was observed with the addition of each subsequent feature set to predict any OSA, with CF and IO yielding minimal improvements. Application of STREAMLINE to OSA data suggests that CF features provide additional value in predicting moderate/severe OSA, but neither CF nor IO features meaningfully improved the prediction of any OSA beyond established demographics, comorbidity and symptom characteristics.
Submitted 8 December, 2023;
originally announced December 2023.
-
Ascle: A Python Natural Language Processing Toolkit for Medical Text Generation
Authors:
Rui Yang,
Qingcheng Zeng,
Keen You,
Yujie Qiao,
Lucas Huang,
Chia-Chun Hsieh,
Benjamin Rosand,
Jeremy Goldwasser,
Amisha D Dave,
Tiarnan D. L. Keenan,
Emily Y Chew,
Dragomir Radev,
Zhiyong Lu,
Hua Xu,
Qingyu Chen,
Irene Li
Abstract:
This study introduces Ascle, a pioneering natural language processing (NLP) toolkit designed for medical text generation. Ascle is tailored for biomedical researchers and healthcare professionals with an easy-to-use, all-in-one solution that requires minimal programming expertise. For the first time, Ascle evaluates and provides interfaces for the latest pre-trained language models, encompassing four advanced and challenging generative functions: question-answering, text summarization, text simplification, and machine translation. In addition, Ascle integrates 12 essential NLP functions, along with query and search capabilities for clinical databases. The toolkit, its models, and associated data are publicly available via https://github.com/Yale-LILY/MedGen.
Submitted 9 December, 2023; v1 submitted 28 November, 2023;
originally announced November 2023.
-
Kinetic inductance and voltage response dependence on temperature: Asymmetric dc SQUID case study
Authors:
M. A. Gali Labarias,
O. A. Nieves,
S. T. Keenan,
E. E. Mitchell
Abstract:
Inductance plays a crucial role in the design and optimization of superconducting quantum interference devices (SQUIDs) for quantum sensing applications, since it dictates the sensitivity and coupling ratio with other circuit elements. In high-temperature superconductors the kinetic inductance, which depends on both geometry and temperature, becomes a dominant part of the device's total self-inductance, since their London penetration depth is considerably larger compared to low-temperature superconductors. In this work, we use an asymmetric SQUID to investigate the kinetic self-inductance ratio and voltage modulation depth at different operating temperatures, device geometries and bias currents. We first validate our approach by comparing our modelled data with experimental measurements. Then, through numerical simulations, we show: (i) kinetic inductance dominates for thin superconducting films, while for thicker films the inductance is less sensitive to temperature changes; (ii) the voltage modulation depth decreases exponentially with the total inductance independent of the asymmetry ratio; (iii) narrower superconducting tracks lead to a broader temperature operation range, $ΔT \sim 30 K$, while wider tracks operate in a smaller temperature range, $ΔT \sim 10 K$, but are more sensitive to temperature changes; and (iv) the device performance versus temperature strongly depends on the bias current used.
Submitted 4 July, 2023;
originally announced July 2023.
-
Upscaling Global Hourly GPP with Temporal Fusion Transformer (TFT)
Authors:
Rumi Nakagawa,
Mary Chau,
John Calzaretta,
Trevor Keenan,
Puya Vahabi,
Alberto Todeschini,
Maoya Bassiouni,
Yanghui Kang
Abstract:
Reliable estimates of Gross Primary Productivity (GPP), crucial for evaluating climate change initiatives, are currently only available from sparsely distributed eddy covariance tower sites. This limitation hampers access to reliable GPP quantification at regional to global scales. Prior machine learning studies on upscaling in situ GPP to global wall-to-wall maps at sub-daily time steps faced limitations such as lack of input features at higher temporal resolutions and significant missing values. This research explored a novel upscaling solution using a Temporal Fusion Transformer (TFT) without relying on past GPP time series. Model development was supplemented by Random Forest Regressor (RFR) and XGBoost, followed by the hybrid model of TFT and tree algorithms. The best-performing model achieved an NSE of 0.704 and an RMSE of 3.54. Another contribution of the study was the breakdown analysis of encoder feature importance based on time and flux tower sites. Such analysis enhanced the interpretability of the multi-head attention layer as well as the visual understanding of temporal dynamics of influential features.
Submitted 23 June, 2023;
originally announced June 2023.
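The GPP upscaling abstract above reports model skill as NSE and RMSE. As a rough illustration of what those two numbers measure, the sketch below computes the Nash-Sutcliffe efficiency and the root-mean-square error from paired observed and predicted values. The function names and the sample values are hypothetical and are not taken from the paper; the formulas are the standard textbook definitions, which the authors are assumed, not confirmed, to have used.

```python
import math


def nse(observed, predicted):
    """Nash-Sutcliffe efficiency: 1 - SS_res / SS_tot (1.0 is a perfect fit)."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


def rmse(observed, predicted):
    """Root-mean-square error, in the same units as the observations."""
    squared_errors = [(o - p) ** 2 for o, p in zip(observed, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Made-up values for illustration only (not data from the study).
obs = [1.0, 2.0, 3.0]
pred_mean_only = [2.0, 2.0, 2.0]  # always predicting the observed mean

print(nse(obs, pred_mean_only))   # a mean-only predictor scores NSE = 0
print(rmse(obs, pred_mean_only))
```

An NSE of 0 corresponds to a model that is no better than predicting the observed mean, so a value such as 0.704 indicates that most of the observed variance is captured.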
-
Multi-modal, multi-task, multi-attention (M3) deep learning detection of reticular pseudodrusen: towards automated and accessible classification of age-related macular degeneration
Authors:
Qingyu Chen,
Tiarnan D. L. Keenan,
Alexis Allot,
Yifan Peng,
Elvira Agrón,
Amitha Domalpally,
Caroline C. W. Klaver,
Daniel T. Luttikhuizen,
Marcus H. Colyer,
Catherine A. Cukras,
Henry E. Wiley,
M. Teresa Magone,
Chantal Cousineau-Krieger,
Wai T. Wong,
Yingying Zhu,
Emily Y. Chew,
Zhiyong Lu
Abstract:
Objective: Reticular pseudodrusen (RPD), a key feature of age-related macular degeneration (AMD), are poorly detected by human experts on standard color fundus photography (CFP) and typically require advanced imaging modalities such as fundus autofluorescence (FAF). The objective was to develop and evaluate the performance of a novel 'M3' deep learning framework on RPD detection. Materials and Methods: A deep learning framework M3 was developed to detect RPD presence accurately using CFP alone, FAF alone, or both, employing >8000 CFP-FAF image pairs obtained prospectively (Age-Related Eye Disease Study 2). The M3 framework includes multi-modal (detection from single or multiple image modalities), multi-task (training different tasks simultaneously to improve generalizability), and multi-attention (improving ensembled feature representation) operation. Performance on RPD detection was compared with state-of-the-art deep learning models and 13 ophthalmologists; performance on detection of two other AMD features (geographic atrophy and pigmentary abnormalities) was also evaluated. Results: For RPD detection, M3 achieved area under receiver operating characteristic (AUROC) 0.832, 0.931, and 0.933 for CFP alone, FAF alone, and both, respectively. M3 performance on CFP was very substantially superior to human retinal specialists (median F1-score 0.644 versus 0.350). External validation (on Rotterdam Study, Netherlands) demonstrated high accuracy on CFP alone (AUROC 0.965). The M3 framework also accurately detected geographic atrophy and pigmentary abnormalities (AUROC 0.909 and 0.912, respectively), demonstrating its generalizability. Conclusion: This study demonstrates the successful development, robust evaluation, and external validation of a novel deep learning framework that enables accessible, accurate, and automated AMD diagnosis and prognosis.
Submitted 11 November, 2020; v1 submitted 8 November, 2020;
originally announced November 2020.
-
Predicting risk of late age-related macular degeneration using deep learning
Authors:
Yifan Peng,
Tiarnan D. Keenan,
Qingyu Chen,
Elvira Agrón,
Alexis Allot,
Wai T. Wong,
Emily Y. Chew,
Zhiyong Lu
Abstract:
By 2040, age-related macular degeneration (AMD) will affect approximately 288 million people worldwide. Identifying individuals at high risk of progression to late AMD, the sight-threatening stage, is critical for clinical actions, including medical interventions and timely monitoring. Although deep learning has shown promise in diagnosing/screening AMD using color fundus photographs, it remains difficult to predict individuals' risks of late AMD accurately. For both tasks, these initial deep learning attempts have remained largely unvalidated in independent cohorts. Here, we demonstrate how deep learning and survival analysis can predict the probability of progression to late AMD using 3,298 participants (over 80,000 images) from the Age-Related Eye Disease Studies AREDS and AREDS2, the largest longitudinal clinical trials in AMD. When validated against an independent test dataset of 601 participants, our model achieved high prognostic accuracy (five-year C-statistic 86.4 (95% confidence interval 86.2-86.6)) that substantially exceeded that of retinal specialists using two existing clinical standards (81.3 (81.1-81.5) and 82.0 (81.8-82.3), respectively). Interestingly, our approach offers additional strengths over the existing clinical standards in AMD prognosis (e.g., risk ascertainment above 50%) and is likely to be highly generalizable, given the breadth of training data from 82 US retinal specialty clinics. Indeed, during external validation through training on AREDS and testing on AREDS2 as an independent cohort, our model retained substantially higher prognostic accuracy than existing clinical standards. These results highlight the potential of deep learning systems to enhance clinical decision-making in AMD patients.
Submitted 18 July, 2020;
originally announced July 2020.
-
A deep learning approach for automated detection of geographic atrophy from color fundus photographs
Authors:
Tiarnan D. Keenan,
Shazia Dharssi,
Yifan Peng,
Qingyu Chen,
Elvira Agrón,
Wai T. Wong,
Zhiyong Lu,
Emily Y. Chew
Abstract:
Purpose: To assess the utility of deep learning in the detection of geographic atrophy (GA) from color fundus photographs; secondary aim to explore potential utility in detecting central GA (CGA). Design: A deep learning model was developed to detect the presence of GA in color fundus photographs, and two additional models to detect CGA in different scenarios. Participants: 59,812 color fundus photographs from longitudinal follow up of 4,582 participants in the AREDS dataset. Gold standard labels were from human expert reading center graders using a standardized protocol. Methods: A deep learning model was trained to use color fundus photographs to predict GA presence from a population of eyes with no AMD to advanced AMD. A second model was trained to predict CGA presence from the same population. A third model was trained to predict CGA presence from the subset of eyes with GA. For training and testing, 5-fold cross-validation was employed. For comparison with human clinician performance, model performance was compared with that of 88 retinal specialists. Results: The deep learning models (GA detection, CGA detection from all eyes, and centrality detection from GA eyes) had AUC of 0.933-0.976, 0.939-0.976, and 0.827-0.888, respectively. The GA detection model had accuracy, sensitivity, specificity, and precision of 0.965, 0.692, 0.978, and 0.584, respectively. The CGA detection model had equivalent values of 0.966, 0.763, 0.971, and 0.394. The centrality detection model had equivalent values of 0.762, 0.782, 0.729, and 0.799. Conclusions: A deep learning model demonstrated high accuracy for the automated detection of GA. The AUC was non-inferior to that of human retinal specialists. Deep learning approaches may also be applied to the identification of CGA. The code and pretrained models are publicly available at https://github.com/ncbi-nlp/DeepSeeNet.
Submitted 7 June, 2019;
originally announced June 2019.
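The geographic atrophy abstract above reports accuracy, sensitivity, specificity, and precision together. A minimal sketch of how these four quantities follow from a binary confusion matrix is given below. The function name and the counts are hypothetical, chosen only to show that accuracy and specificity can be high while precision stays lower when positives are rare; they are not the study's data.

```python
def binary_metrics(tp, fp, tn, fn):
    """Standard binary classification metrics from confusion-matrix counts."""
    total = tp + fp + tn + fn
    return {
        "accuracy": (tp + tn) / total,     # fraction of all cases classified correctly
        "sensitivity": tp / (tp + fn),     # recall on truly positive cases
        "specificity": tn / (tn + fp),     # recall on truly negative cases
        "precision": tp / (tp + fp),       # fraction of positive calls that are correct
    }


# Made-up counts with few positives (illustration only).
metrics = binary_metrics(tp=30, fp=20, tn=940, fn=10)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

With few true positives, a modest number of false positives is enough to pull precision well below accuracy, which is one plausible reading of the pattern reported in the abstract.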
-
A multi-task deep learning model for the classification of Age-related Macular Degeneration
Authors:
Qingyu Chen,
Yifan Peng,
Tiarnan Keenan,
Shazia Dharssi,
Elvira Agron,
Wai T. Wong,
Emily Y. Chew,
Zhiyong Lu
Abstract:
Age-related Macular Degeneration (AMD) is a leading cause of blindness. Although the Age-Related Eye Disease Study group previously developed a 9-step AMD severity scale for manual classification of AMD severity from color fundus images, manual grading of images is time-consuming and expensive. Built on our previous work DeepSeeNet, we developed a novel deep learning model for automated classification of images into the 9-step scale. Instead of predicting the 9-step score directly, our approach simulates the reading center grading process. It first detects four AMD characteristics (drusen area, geographic atrophy, increased pigment, and depigmentation), then combines these to derive the overall 9-step score. Importantly, we applied multi-task learning techniques, which allowed us to train classification of the four characteristics in parallel, share representation, and prevent overfitting. Evaluation on two image datasets showed that the accuracy of the model exceeded the current state-of-the-art model by > 10%.
Submitted 2 December, 2018;
originally announced December 2018.
-
DeepSeeNet: A deep learning model for automated classification of patient-based age-related macular degeneration severity from color fundus photographs
Authors:
Yifan Peng,
Shazia Dharssi,
Qingyu Chen,
Tiarnan D. Keenan,
Elvira Agrón,
Wai T. Wong,
Emily Y. Chew,
Zhiyong Lu
Abstract:
In assessing the severity of age-related macular degeneration (AMD), the Age-Related Eye Disease Study (AREDS) Simplified Severity Scale predicts the risk of progression to late AMD. However, its manual use requires the time-consuming participation of expert practitioners. Although several automated deep learning systems have been developed for classifying color fundus photographs (CFP) of individual eyes by AREDS severity score, none to date has used a patient-based scoring system that uses images from both eyes to assign a severity score. DeepSeeNet, a deep learning model, was developed to classify patients automatically by the AREDS Simplified Severity Scale (score 0-5) using bilateral CFP. DeepSeeNet was trained on 58,402 and tested on 900 images from the longitudinal follow-up of 4549 participants from AREDS. Gold standard labels were obtained using reading center grades. DeepSeeNet simulates the human grading process by first detecting individual AMD risk factors (drusen size, pigmentary abnormalities) for each eye and then calculating a patient-based AMD severity score using the AREDS Simplified Severity Scale. DeepSeeNet performed better on patient-based classification (accuracy = 0.671; kappa = 0.558) than retinal specialists (accuracy = 0.599; kappa = 0.467) with high AUC in the detection of large drusen (0.94), pigmentary abnormalities (0.93), and late AMD (0.97). DeepSeeNet demonstrated high accuracy with increased transparency in the automated assignment of individual patients to AMD risk categories based on the AREDS Simplified Severity Scale. These results highlight the potential of deep learning to assist and enhance clinical decision-making in patients with AMD, such as early AMD detection and risk prediction for developing late AMD. DeepSeeNet is publicly available on https://github.com/ncbi-nlp/DeepSeeNet.
Submitted 26 January, 2019; v1 submitted 18 November, 2018;
originally announced November 2018.
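The DeepSeeNet abstract above describes a two-stage process: per-eye risk factors are detected first, then combined into a patient-level score from 0 to 5. The sketch below illustrates only the second, rule-based stage. It is not the DeepSeeNet code. The class and function names are hypothetical, and the scoring rules (one point per eye for large drusen, one point per eye for pigmentary abnormalities, one point for bilateral intermediate drusen when neither eye has large drusen, and a score of 5 for late AMD in either eye) are an assumption based on the commonly described form of the AREDS Simplified Severity Scale, which the abstract does not spell out.

```python
from dataclasses import dataclass


@dataclass
class EyeFindings:
    """Per-eye findings; in the paper these would come from image classifiers."""
    large_drusen: bool = False
    pigmentary_abnormality: bool = False
    intermediate_drusen: bool = False  # assumption: used only for the bilateral rule
    late_amd: bool = False


def simplified_severity_score(left: EyeFindings, right: EyeFindings) -> int:
    """Combine findings from both eyes into a patient-level score from 0 to 5."""
    # Assumption: late AMD in either eye maps directly to the top score.
    if left.late_amd or right.late_amd:
        return 5

    # One point per risk factor per eye, giving 0 to 4.
    score = sum(
        int(eye.large_drusen) + int(eye.pigmentary_abnormality)
        for eye in (left, right)
    )

    # Assumption: bilateral intermediate drusen without any large drusen adds one point.
    no_large_drusen = not (left.large_drusen or right.large_drusen)
    if no_large_drusen and left.intermediate_drusen and right.intermediate_drusen:
        score += 1

    return score


# Hypothetical patient: large drusen in both eyes, pigment change in one eye.
patient_score = simplified_severity_score(
    EyeFindings(large_drusen=True, pigmentary_abnormality=True),
    EyeFindings(large_drusen=True),
)
print(patient_score)
```

Keeping the final score as an explicit rule over named findings, rather than a single end-to-end prediction, is what gives this kind of design the transparency the abstract mentions.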
-
Gas Analyses of First complete JET Cryopump Regeneration with ITER-Like Wall
Authors:
S. Grunhagen Romanelli,
S. Brezinsek,
B. Butler,
J. P. Coad,
A. Drenik,
C. Giroud,
S. Jachmich,
T. Keenan,
U. Kruezi,
M. Mozetic,
M. Oberkofler,
A. Parracho,
M. Romanelli,
R. Smith,
JET EFDA contributors
Abstract:
Analytical results of a complete JET cryopump regeneration, including the nitrogen panel, following the first ITER-Like Wall campaign are presented along with the in-situ analyses of residual gas. H/D mixtures and impurities such as nitrogen and neon were injected during plasma operation in the vessel to study radiation cooling in the scrape-off-layer and divertor region. The global gas inventory over the campaign is incomplete, suggesting that residual volatile impurities remain on the cryogenic panel. This paper presents results on (a) residual deuterium on the panel, which is very low relative to the campaign, (b) impurities such as nitrogen that stick to the panel, and (c) ammonia production, which can be observed in the RGA spectrum.
Submitted 13 June, 2014;
originally announced June 2014.