-
Learning domain-invariant classifiers for infant cry sounds
Authors:
Charles C. Onu,
Hemanth K. Sheetha,
Arsenii Gorin,
Doina Precup
Abstract:
The issue of domain shift remains a problematic phenomenon in most real-world datasets and clinical audio is no exception. In this work, we study the nature of domain shift in a clinical database of infant cry sounds acquired across different geographies. We find that though the pitches of infant cries are similarly distributed regardless of the place of birth, other characteristics introduce pecu…
▽ More
The issue of domain shift remains a problematic phenomenon in most real-world datasets and clinical audio is no exception. In this work, we study the nature of domain shift in a clinical database of infant cry sounds acquired across different geographies. We find that though the pitches of infant cries are similarly distributed regardless of the place of birth, other characteristics introduce peculiar biases into the data. We explore methodologies for mitigating the impact of domain shift in a model for identifying neurological injury from cry sounds. We adapt unsupervised domain adaptation methods from computer vision which learn an audio representation that is domain-invariant to hospitals and is task discriminative. We also propose a new approach, target noise injection (TNI), for unsupervised domain adaptation which requires neither labels nor training data from the target domain. Our best-performing model significantly improves target accuracy by 7.2%, without negatively affecting the source domain.
△ Less
Submitted 30 November, 2023;
originally announced December 2023.
-
A cry for help: Early detection of brain injury in newborns
Authors:
Charles C. Onu,
Samantha Latremouille,
Arsenii Gorin,
Junhao Wang,
Innocent Udeogu,
Uchenna Ekwochi,
Peter O. Ubuane,
Omolara A. Kehinde,
Muhammad A. Salisu,
Datonye Briggs,
Yoshua Bengio,
Doina Precup
Abstract:
Since the 1960s, neonatal clinicians have known that newborns suffering from certain neurological conditions exhibit altered crying patterns such as the high-pitched cry in birth asphyxia. Despite an annual burden of over 1.5 million infant deaths and disabilities, early detection of neonatal brain injuries due to asphyxia remains a challenge, particularly in develo** countries where the majorit…
▽ More
Since the 1960s, neonatal clinicians have known that newborns suffering from certain neurological conditions exhibit altered crying patterns such as the high-pitched cry in birth asphyxia. Despite an annual burden of over 1.5 million infant deaths and disabilities, early detection of neonatal brain injuries due to asphyxia remains a challenge, particularly in develo** countries where the majority of births are not attended by a trained physician. Here we report on the first inter-continental clinical study to demonstrate that neonatal brain injury can be reliably determined from recorded infant cries using an AI algorithm we call Roseline. Previous and recent work has been limited by the lack of a large, high-quality clinical database of cry recordings, constraining the application of state-of-the-art machine learning. We develop a new training methodology for audio-based pathology detection models and evaluate this system on a large database of newborn cry sounds acquired from geographically diverse settings -- 5 hospitals across 3 continents. Our system extracts interpretable acoustic biomarkers that support clinical decisions and is able to accurately detect neurological injury from newborns' cries with an AUC of 92.5% (88.7% sensitivity at 80% specificity). Cry-based neurological monitoring opens the door for low-cost, easy-to-use, non-invasive and contact-free screening of at-risk babies, especially when integrated into simple devices like smartphones or neonatal ICU monitors. This would provide a reliable tool where there are no alternatives, but also curtail the need to regularly exert newborns to physically-exhausting or radiation-exposing assessments such as brain CT scans. This work sets the stage for embracing the infant cry as a vital sign and indicates the potential of AI-driven sound monitoring for the future of affordable healthcare.
△ Less
Submitted 3 November, 2023; v1 submitted 12 October, 2023;
originally announced October 2023.
-
Self-supervised learning for infant cry analysis
Authors:
Arsenii Gorin,
Cem Subakan,
Sajjad Abdoli,
Junhao Wang,
Samantha Latremouille,
Charles Onu
Abstract:
In this paper, we explore self-supervised learning (SSL) for analyzing a first-of-its-kind database of cry recordings containing clinical indications of more than a thousand newborns. Specifically, we target cry-based detection of neurological injury as well as identification of cry triggers such as pain, hunger, and discomfort. Annotating a large database in the medical setting is expensive and t…
▽ More
In this paper, we explore self-supervised learning (SSL) for analyzing a first-of-its-kind database of cry recordings containing clinical indications of more than a thousand newborns. Specifically, we target cry-based detection of neurological injury as well as identification of cry triggers such as pain, hunger, and discomfort. Annotating a large database in the medical setting is expensive and time-consuming, typically requiring the collaboration of several experts over years. Leveraging large amounts of unlabeled audio data to learn useful representations can lower the cost of building robust models and, ultimately, clinical solutions. In this work, we experiment with self-supervised pre-training of a convolutional neural network on large audio datasets. We show that pre-training with SSL contrastive loss (SimCLR) performs significantly better than supervised pre-training for both neuro injury and cry triggers. In addition, we demonstrate further performance gains through SSL-based domain adaptation using unlabeled infant cries. We also show that using such SSL-based pre-training for adaptation to cry sounds decreases the need for labeled data of the overall system.
△ Less
Submitted 2 May, 2023;
originally announced May 2023.
-
CryCeleb: A Speaker Verification Dataset Based on Infant Cry Sounds
Authors:
David Budaghyan,
Charles C. Onu,
Arsenii Gorin,
Cem Subakan,
Doina Precup
Abstract:
This paper describes the Ubenwa CryCeleb dataset - a labeled collection of infant cries - and the accompanying CryCeleb 2023 task, which is a public speaker verification challenge based on cry sounds. We released more than 6 hours of manually segmented cry sounds from 786 newborns for academic use, aiming to encourage research in infant cry analysis. The inaugural public competition attracted 59 p…
▽ More
This paper describes the Ubenwa CryCeleb dataset - a labeled collection of infant cries - and the accompanying CryCeleb 2023 task, which is a public speaker verification challenge based on cry sounds. We released more than 6 hours of manually segmented cry sounds from 786 newborns for academic use, aiming to encourage research in infant cry analysis. The inaugural public competition attracted 59 participants, 11 of whom improved the baseline performance. The top-performing system achieved a significant improvement scoring 25.8% equal error rate, which is still far from the performance of state-of-the-art adult speaker verification systems. Therefore, we believe there is room for further research on this dataset, potentially extending beyond the verification task.
△ Less
Submitted 21 March, 2024; v1 submitted 1 May, 2023;
originally announced May 2023.
-
Neural Transfer Learning for Cry-based Diagnosis of Perinatal Asphyxia
Authors:
Charles C. Onu,
Jonathan Lebensold,
William L. Hamilton,
Doina Precup
Abstract:
Despite continuing medical advances, the rate of newborn morbidity and mortality globally remains high, with over 6 million casualties every year. The prediction of pathologies affecting newborns based on their cry is thus of significant clinical interest, as it would facilitate the development of accessible, low-cost diagnostic tools\cut{ based on wearables and smartphones}. However, the inadequa…
▽ More
Despite continuing medical advances, the rate of newborn morbidity and mortality globally remains high, with over 6 million casualties every year. The prediction of pathologies affecting newborns based on their cry is thus of significant clinical interest, as it would facilitate the development of accessible, low-cost diagnostic tools\cut{ based on wearables and smartphones}. However, the inadequacy of clinically annotated datasets of infant cries limits progress on this task. This study explores a neural transfer learning approach to develo** accurate and robust models for identifying infants that have suffered from perinatal asphyxia. In particular, we explore the hypothesis that representations learned from adult speech could inform and improve performance of models developed on infant speech. Our experiments show that models based on such representation transfer are resilient to different types and degrees of noise, as well as to signal loss in time and frequency domains.
△ Less
Submitted 19 March, 2020; v1 submitted 24 June, 2019;
originally announced June 2019.
-
Harnessing Infant Cry for swift, cost-effective Diagnosis of Perinatal Asphyxia in low-resource settings
Authors:
Charles C. Onu
Abstract:
Perinatal Asphyxia is one of the top three causes of infant mortality in develo** countries, resulting to the death of about 1.2 million newborns every year. At its early stages, the presence of asphyxia cannot be conclusively determined visually or via physical examination, but by medical diagnosis. In resource-poor settings, where skilled attendance at birth is a luxury, most cases only get de…
▽ More
Perinatal Asphyxia is one of the top three causes of infant mortality in develo** countries, resulting to the death of about 1.2 million newborns every year. At its early stages, the presence of asphyxia cannot be conclusively determined visually or via physical examination, but by medical diagnosis. In resource-poor settings, where skilled attendance at birth is a luxury, most cases only get detected when the damaging consequences begin to manifest or worse still, after death of the affected infant. In this project, we explored the approach of machine learning in develo** a low-cost diagnostic solution. We designed a support vector machine-based pattern recognition system that models patterns in the cries of known asphyxiating infants (and normal infants) and then uses the developed model for classification of `new' infants as having asphyxia or not. Our prototype has been tested in a laboratory setting to give prediction accuracy of up to 88.85%. If higher accuracies can be obtained, this research may be a key contributor to the 4th Millennium Development Goal (MDG) of reducing mortality in under-five children.
△ Less
Submitted 24 August, 2018;
originally announced August 2018.
-
Undersampling and Bagging of Decision Trees in the Analysis of Cardiorespiratory Behavior for the Prediction of Extubation Readiness in Extremely Preterm Infants
Authors:
Lara J. Kanbar,
Charles C. Onu,
Wissam Shalish,
Karen A. Brown,
Guilherme M. Sant'Anna,
Robert E. Kearney,
Doina Precup
Abstract:
Extremely preterm infants often require endotracheal intubation and mechanical ventilation during the first days of life. Due to the detrimental effects of prolonged invasive mechanical ventilation (IMV), clinicians aim to extubate infants as soon as they deem them ready. Unfortunately, existing strategies for prediction of extubation readiness vary across clinicians and institutions, and lead to…
▽ More
Extremely preterm infants often require endotracheal intubation and mechanical ventilation during the first days of life. Due to the detrimental effects of prolonged invasive mechanical ventilation (IMV), clinicians aim to extubate infants as soon as they deem them ready. Unfortunately, existing strategies for prediction of extubation readiness vary across clinicians and institutions, and lead to high reintubation rates. We present an approach using Random Forest classifiers for the analysis of cardiorespiratory variability to predict extubation readiness. We address the issue of data imbalance by employing random undersampling of examples from the majority class before training each Decision Tree in a bag. By incorporating clinical domain knowledge, we further demonstrate that our classifier could have identified 71% of infants who failed extubation, while maintaining a success detection rate of 78%.
△ Less
Submitted 23 August, 2018;
originally announced August 2018.
-
Predicting Extubation Readiness in Extreme Preterm Infants based on Patterns of Breathing
Authors:
Charles C. Onu,
Lara J. Kanbar,
Wissam Shalish,
Karen A. Brown,
Guilherme M. Sant'Anna,
Robert E. Kearney,
Doina Precup
Abstract:
Extremely preterm infants commonly require intubation and invasive mechanical ventilation after birth. While the duration of mechanical ventilation should be minimized in order to avoid complications, extubation failure is associated with increases in morbidities and mortality. As part of a prospective observational study aimed at develo** an accurate predictor of extubation readiness, Markov an…
▽ More
Extremely preterm infants commonly require intubation and invasive mechanical ventilation after birth. While the duration of mechanical ventilation should be minimized in order to avoid complications, extubation failure is associated with increases in morbidities and mortality. As part of a prospective observational study aimed at develo** an accurate predictor of extubation readiness, Markov and semi-Markov chain models were applied to gain insight into the respiratory patterns of these infants, with more robust time-series modeling using semi-Markov models. This model revealed interesting similarities and differences between newborns who succeeded extubation and those who failed. The parameters of the model were further applied to predict extubation readiness via generative (joint likelihood) and discriminative (support vector machine) approaches. Results showed that up to 84\% of infants who failed extubation could have been accurately identified prior to extubation.
△ Less
Submitted 23 August, 2018;
originally announced August 2018.
-
A Semi-Markov Chain Approach to Modeling Respiratory Patterns Prior to Extubation in Preterm Infants
Authors:
Charles C. Onu,
Lara J. Kanbar,
Wissam Shalish,
Karen A. Brown,
Guilherme M. Sant'Anna,
Robert E. Kearney,
Doina Precup
Abstract:
After birth, extremely preterm infants often require specialized respiratory management in the form of invasive mechanical ventilation (IMV). Protracted IMV is associated with detrimental outcomes and morbidities. Premature extubation, on the other hand, would necessitate reintubation which is risky, technically challenging and could further lead to lung injury or disease. We present an approach t…
▽ More
After birth, extremely preterm infants often require specialized respiratory management in the form of invasive mechanical ventilation (IMV). Protracted IMV is associated with detrimental outcomes and morbidities. Premature extubation, on the other hand, would necessitate reintubation which is risky, technically challenging and could further lead to lung injury or disease. We present an approach to modeling respiratory patterns of infants who succeeded extubation and those who required reintubation which relies on Markov models. We compare the use of traditional Markov chains to semi-Markov models which emphasize cross-pattern transitions and timing information, and to multi-chain Markov models which can concisely represent non-stationarity in respiratory behavior over time. The models we developed expose specific, unique similarities as well as vital differences between the two populations.
△ Less
Submitted 23 August, 2018;
originally announced August 2018.