STREAMLINE: An Automated Machine Learning Pipeline for Biomedicine Applied to Examine the Utility of Photography-Based Phenotypes for OSA Prediction Across International Sleep Centers
Authors:
Ryan J. Urbanowicz,
Harsh Bandhey,
Brendan T. Keenan,
Greg Maislin,
Sy Hwang,
Danielle L. Mowery,
Shannon M. Lynch,
Diego R. Mazzotti,
Fang Han,
Qing Yun Li,
Thomas Penzel,
Sergio Tufik,
Lia Bittencourt,
Thorarinn Gislason,
Philip de Chazal,
Bhajan Singh,
Nigel McArdle,
Ning-Hung Chen,
Allan Pack,
Richard J. Schwab,
Peter A. Cistulli,
Ulysses J. Magalang
Abstract:
While machine learning (ML) includes a valuable array of tools for analyzing biomedical data, significant time and expertise is required to assemble effective, rigorous, and unbiased pipelines. Automated ML (AutoML) tools seek to facilitate ML application by automating a subset of analysis pipeline elements. In this study we develop and validate a Simple, Transparent, End-to-end Automated Machine…
▽ More
While machine learning (ML) includes a valuable array of tools for analyzing biomedical data, significant time and expertise is required to assemble effective, rigorous, and unbiased pipelines. Automated ML (AutoML) tools seek to facilitate ML application by automating a subset of analysis pipeline elements. In this study we develop and validate a Simple, Transparent, End-to-end Automated Machine Learning Pipeline (STREAMLINE) and apply it to investigate the added utility of photography-based phenotypes for predicting obstructive sleep apnea (OSA); a common and underdiagnosed condition associated with a variety of health, economic, and safety consequences. STREAMLINE is designed to tackle biomedical binary classification tasks while adhering to best practices and accommodating complexity, scalability, reproducibility, customization, and model interpretation. Benchmarking analyses validated the efficacy of STREAMLINE across data simulations with increasingly complex patterns of association. Then we applied STREAMLINE to evaluate the utility of demographics (DEM), self-reported comorbidities (DX), symptoms (SYM), and photography-based craniofacial (CF) and intraoral (IO) anatomy measures in predicting any OSA or moderate/severe OSA using 3,111 participants from Sleep Apnea Global Interdisciplinary Consortium (SAGIC). OSA analyses identified a significant increase in ROC-AUC when adding CF to DEM+DX+SYM to predict moderate/severe OSA. A consistent but non-significant increase in PRC-AUC was observed with the addition of each subsequent feature set to predict any OSA, with CF and IO yielding minimal improvements. Application of STREAMLINE to OSA data suggests that CF features provide additional value in predicting moderate/severe OSA, but neither CF nor IO features meaningfully improved the prediction of any OSA beyond established demographics, comorbidity and symptom characteristics.
△ Less
Submitted 8 December, 2023;
originally announced December 2023.
A Machine Learning Application for Raising WASH Awareness in the Times of COVID-19 Pandemic
Authors:
Rohan Pandey,
Vaibhav Gautam,
Ridam Pal,
Harsh Bandhey,
Lovedeep Singh Dhingra,
Himanshu Sharma,
Chirag Jain,
Kanav Bhagat,
Arushi,
Lajjaben Patel,
Mudit Agarwal,
Samprati Agrawal,
Rishabh Jalan,
Akshat Wadhwa,
Ayush Garg,
Vihaan Misra,
Yashwin Agrawal,
Bhavika Rana,
Ponnurangam Kumaraguru,
Tavpritesh Sethi
Abstract:
Background: The COVID-19 pandemic has uncovered the potential of digital misinformation in sha** the health of nations. The deluge of unverified information that spreads faster than the epidemic itself is an unprecedented phenomenon that has put millions of lives in danger. Mitigating this Infodemic requires strong health messaging systems that are engaging, vernacular, scalable, effective and c…
▽ More
Background: The COVID-19 pandemic has uncovered the potential of digital misinformation in sha** the health of nations. The deluge of unverified information that spreads faster than the epidemic itself is an unprecedented phenomenon that has put millions of lives in danger. Mitigating this Infodemic requires strong health messaging systems that are engaging, vernacular, scalable, effective and continuously learn the new patterns of misinformation.
Objective: We created WashKaro, a multi-pronged intervention for mitigating misinformation through conversational AI, machine translation and natural language processing. WashKaro provides the right information matched against WHO guidelines through AI, and delivers it in the right format in local languages.
Methods: We theorize (i) an NLP based AI engine that could continuously incorporate user feedback to improve relevance of information, (ii) bite sized audio in the local language to improve penetrance in a country with skewed gender literacy ratios, and (iii) conversational but interactive AI engagement with users towards an increased health awareness in the community. Results: A total of 5026 people who downloaded the app during the study window, among those 1545 were active users. Our study shows that 3.4 times more females engaged with the App in Hindi as compared to males, the relevance of AI-filtered news content doubled within 45 days of continuous machine learning, and the prudence of integrated AI chatbot Satya increased thus proving the usefulness of an mHealth platform to mitigate health misinformation.
Conclusion: We conclude that a multi-pronged machine learning application delivering vernacular bite-sized audios and conversational AI is an effective approach to mitigate health misinformation.
△ Less
Submitted 30 October, 2020; v1 submitted 16 March, 2020;
originally announced March 2020.