-
Machine learning for detection of stenoses and aneurysms: application in a physiologically realistic virtual patient database
Authors:
Gareth Jones,
Jim Parr,
Perumal Nithiarasu,
Sanjay Pant
Abstract:
This study presents an application of machine learning (ML) methods for detecting the presence of stenoses and aneurysms in the human arterial system. Four major forms of arterial disease -- carotid artery stenosis (CAS), subclavian artery stenosis (SAC), peripheral arterial disease (PAD), and abdominal aortic aneurysms (AAA) -- are considered. The ML methods are trained and tested on a physiologi…
▽ More
This study presents an application of machine learning (ML) methods for detecting the presence of stenoses and aneurysms in the human arterial system. Four major forms of arterial disease -- carotid artery stenosis (CAS), subclavian artery stenosis (SAC), peripheral arterial disease (PAD), and abdominal aortic aneurysms (AAA) -- are considered. The ML methods are trained and tested on a physiologically realistic virtual patient database (VPD) containing 28,868 healthy subjects, which is adapted from the authors previous work and augmented to include the four disease forms. Six ML methods -- Naive Bayes, Logistic Regression, Support Vector Machine, Multi-layer Perceptron, Random Forests, and Gradient Boosting -- are compared with respect to classification accuracies and it is found that the tree-based methods of Random Forest and Gradient Boosting outperform other approaches. The performance of ML methods is quantified through the F1 score and computation of sensitivities and specificities. When using all the six measurements, it is found that maximum F1 scores larger than 0.9 are achieved for CAS and PAD, larger than 0.85 for SAS, and larger than 0.98 for both low- and high-severity AAAs. Corresponding sensitivities and specificities are larger than 90% for CAS and PAD, larger than 85% for SAS, and larger than 98% for both low- and high-severity AAAs. When reducing the number of measurements, it is found that the performance is degraded by less than 5% when three measurements are used, and less than 10% when only two measurements are used for classification. For AAA, it is shown that F1 scores larger than 0.85 and corresponding sensitivities and specificities larger than 85% are achievable when using only a single measurement. The results are encouraging to pursue AAA monitoring and screening through wearable devices which can reliably measure pressure or flow-rates
△ Less
Submitted 11 March, 2021; v1 submitted 28 February, 2021;
originally announced March 2021.
-
A physiologically realistic virtual patient database for the study of arterial haemodynamics
Authors:
Gareth Jones,
Jim Parr,
Perumal Nithiarasu,
Sanjay Pant
Abstract:
This study creates a physiologically realistic virtual patient database (VPD), representing the human arterial system, for the primary purpose of studying the affects of arterial disease on haemodynamics. A low dimensional representation of an anatomically detailed arterial network is outlined, and a physiologically realistic posterior distribution for its parameters is constructed through a Bayes…
▽ More
This study creates a physiologically realistic virtual patient database (VPD), representing the human arterial system, for the primary purpose of studying the affects of arterial disease on haemodynamics. A low dimensional representation of an anatomically detailed arterial network is outlined, and a physiologically realistic posterior distribution for its parameters is constructed through a Bayesian approach. This approach combines both physiological/geometrical constraints and the available measurements reported in the literature. A key contribution of this work is to present a framework for including all such available information for the creation of virtual patients (VPs). The Markov Chain Monte Carlo (MCMC) method is used to sample random VPs from this posterior distribution, and the pressure and flow-rate profiles associated with the VPs are computed through a model of pulse wave propagation. This combination of the arterial network parameters (representing the VPs) and the haemodynamics waveforms of pressure and flow-rates at various locations (representing functional response of the VPs) makes up the VPD. While 75,000 VPs are sampled from the posterior distribution, 10,000 are discarded as the initial burn-in period. A further 12,857 VPs are subsequently removed due to the presence of negative average flow-rate. Due to an undesirable behaviour observed in some VPs -- asymmetric under- and over-damped pressure and flow-rate profiles in the left and right sides of the arterial system -- a filter is proposed for their removal. The final VPD has 28,868 subjects. It is shown that the methodology is appropriate by comparing the VPD statistics to those reported in literature across real populations. A good agreement between the two is found while respecting physiological/geometrical constraints. The pre-filter database is made available at https://doi.org/10.5281/zenodo.4549764.
△ Less
Submitted 21 February, 2021;
originally announced February 2021.
-
A proof of concept study for machine learning application to stenosis detection
Authors:
Gareth Jones,
Jim Parr,
Perumal Nithiarasu,
Sanjay Pant
Abstract:
This proof of concept (PoC) assesses the ability of machine learning (ML) classifiers to predict the presence of a stenosis in a three vessel arterial system consisting of the abdominal aorta bifurcating into the two common iliacs. A virtual patient database (VPD) is created using one-dimensional pulse wave propagation model of haemodynamics. Four different machine learning (ML) methods are used t…
▽ More
This proof of concept (PoC) assesses the ability of machine learning (ML) classifiers to predict the presence of a stenosis in a three vessel arterial system consisting of the abdominal aorta bifurcating into the two common iliacs. A virtual patient database (VPD) is created using one-dimensional pulse wave propagation model of haemodynamics. Four different machine learning (ML) methods are used to train and test a series of classifiers -- both binary and multiclass -- to distinguish between healthy and unhealthy virtual patients (VPs) using different combinations of pressure and flow-rate measurements. It is found that the ML classifiers achieve specificities larger than 80% and sensitivities ranging from 50-75%. The most balanced classifier also achieves an area under the receiver operative characteristic curve of 0.75, outperforming approximately 20 methods used in clinical practice, and thus placing the method as moderately accurate. Other important observations from this study are that: i) few measurements can provide similar classification accuracies compared to the case when more/all the measurements are used; ii) some measurements are more informative than others for classification; and iii) a modification of standard methods can result in detection of not only the presence of stenosis, but also the stenosed vessel.
△ Less
Submitted 11 February, 2021;
originally announced February 2021.
-
Technology Readiness Levels for Machine Learning Systems
Authors:
Alexander Lavin,
Ciarán M. Gilligan-Lee,
Alessya Visnjic,
Siddha Ganju,
Dava Newman,
Atılım Güneş Baydin,
Sujoy Ganguly,
Danny Lange,
Amit Sharma,
Stephan Zheng,
Eric P. Xing,
Adam Gibson,
James Parr,
Chris Mattmann,
Yarin Gal
Abstract:
The development and deployment of machine learning (ML) systems can be executed easily with modern tools, but the process is typically rushed and means-to-an-end. The lack of diligence can lead to technical debt, scope creep and misaligned objectives, model misuse and failures, and expensive consequences. Engineering systems, on the other hand, follow well-defined processes and testing standards t…
▽ More
The development and deployment of machine learning (ML) systems can be executed easily with modern tools, but the process is typically rushed and means-to-an-end. The lack of diligence can lead to technical debt, scope creep and misaligned objectives, model misuse and failures, and expensive consequences. Engineering systems, on the other hand, follow well-defined processes and testing standards to streamline development for high-quality, reliable results. The extreme is spacecraft systems, where mission critical measures and robustness are ingrained in the development process. Drawing on experience in both spacecraft engineering and ML (from research through product across domain areas), we have developed a proven systems engineering approach for machine learning development and deployment. Our "Machine Learning Technology Readiness Levels" (MLTRL) framework defines a principled process to ensure robust, reliable, and responsible systems while being streamlined for ML workflows, including key distinctions from traditional software engineering. Even more, MLTRL defines a lingua franca for people across teams and organizations to work collaboratively on artificial intelligence and machine learning technologies. Here we describe the framework and elucidate it with several real world use-cases of develo** ML methods from basic research through productization and deployment, in areas such as medical diagnostics, consumer computer vision, satellite imagery, and particle physics.
△ Less
Submitted 29 November, 2021; v1 submitted 11 January, 2021;
originally announced January 2021.
-
SpaceML: Distributed Open-source Research with Citizen Scientists for the Advancement of Space Technology for NASA
Authors:
Anirudh Koul,
Siddha Ganju,
Meher Kasam,
James Parr
Abstract:
Traditionally, academic labs conduct open-ended research with the primary focus on discoveries with long-term value, rather than direct products that can be deployed in the real world. On the other hand, research in the industry is driven by its expected commercial return on investment, and hence focuses on a real world product with short-term timelines. In both cases, opportunity is selective, of…
▽ More
Traditionally, academic labs conduct open-ended research with the primary focus on discoveries with long-term value, rather than direct products that can be deployed in the real world. On the other hand, research in the industry is driven by its expected commercial return on investment, and hence focuses on a real world product with short-term timelines. In both cases, opportunity is selective, often available to researchers with advanced educational backgrounds. Research often happens behind closed doors and may be kept confidential until either its publication or product release, exacerbating the problem of AI reproducibility and slowing down future research by others in the field. As many research organizations tend to exclusively focus on specific areas, opportunities for interdisciplinary research reduce. Undertaking long-term bold research in unexplored fields with non-commercial yet great public value is hard due to factors including the high upfront risk, budgetary constraints, and a lack of availability of data and experts in niche fields. Only a few companies or well-funded research labs can afford to do such long-term research. With research organizations focused on an exploding array of fields and resources spread thin, opportunities for the maturation of interdisciplinary research reduce. Apart from these exigencies, there is also a need to engage citizen scientists through open-source contributors to play an active part in the research dialogue. We present a short case study of SpaceML, an extension of the Frontier Development Lab, an AI accelerator for NASA. SpaceML distributes open-source research and invites volunteer citizen scientists to partake in development and deployment of high social value products at the intersection of space and AI.
△ Less
Submitted 16 February, 2021; v1 submitted 19 December, 2020;
originally announced December 2020.
-
Learnings from Frontier Development Lab and SpaceML -- AI Accelerators for NASA and ESA
Authors:
Siddha Ganju,
Anirudh Koul,
Alexander Lavin,
Josh Veitch-Michaelis,
Meher Kasam,
James Parr
Abstract:
Research with AI and ML technologies lives in a variety of settings with often asynchronous goals and timelines: academic labs and government organizations pursue open-ended research focusing on discoveries with long-term value, while research in industry is driven by commercial pursuits and hence focuses on short-term timelines and return on investment. The journey from research to product is oft…
▽ More
Research with AI and ML technologies lives in a variety of settings with often asynchronous goals and timelines: academic labs and government organizations pursue open-ended research focusing on discoveries with long-term value, while research in industry is driven by commercial pursuits and hence focuses on short-term timelines and return on investment. The journey from research to product is often tacit or ad hoc, resulting in technology transition failures, further exacerbated when research and development is interorganizational and interdisciplinary. Even more, much of the ability to produce results remains locked in the private repositories and know-how of the individual researcher, slowing the impact on future research by others and contributing to the ML community's challenges in reproducibility. With research organizations focused on an exploding array of fields, opportunities for the handover and maturation of interdisciplinary research reduce. With these tensions, we see an emerging need to measure the correctness, impact, and relevance of research during its development to enable better collaboration, improved reproducibility, faster progress, and more trusted outcomes. We perform a case study of the Frontier Development Lab (FDL), an AI accelerator under a public-private partnership from NASA and ESA. FDL research follows principled practices that are grounded in responsible development, conduct, and dissemination of AI research, enabling FDL to churn successful interdisciplinary and interorganizational research projects, measured through NASA's Technology Readiness Levels. We also take a look at the SpaceML Open Source Research Program, which helps accelerate and transition FDL's research to deployable projects with wide spread adoption amongst citizen scientists.
△ Less
Submitted 9 November, 2020;
originally announced November 2020.
-
The X-ray Emissivity of Low-Density Stellar Populations
Authors:
C. O. Heinke,
M. G. Ivanov,
E. W. Koch,
R. Andrews,
L. Chomiuk,
H. N. Cohn,
S. Crothers,
T. de Boer,
N. Ivanova,
A. K. H. Kong,
N. W. C. Leigh,
P. M. Lugger,
L. Nelson,
C. J. Parr,
E. W. Rosolowsky,
A. J. Ruiter,
C. L. Sarazin,
A. W. Shaw,
G. R. Sivakoff,
M. van den Berg
Abstract:
The dynamical production of low-mass X-ray binaries and brighter cataclysmic variables (CVs) in dense globular clusters is well-established. We investigate how the X-ray emissivity of fainter X-ray binaries (principally CVs and coronally active binaries) varies between different environments. We compile calculations (largely from the literature) of the X-ray emissivity of old stellar populations,…
▽ More
The dynamical production of low-mass X-ray binaries and brighter cataclysmic variables (CVs) in dense globular clusters is well-established. We investigate how the X-ray emissivity of fainter X-ray binaries (principally CVs and coronally active binaries) varies between different environments. We compile calculations (largely from the literature) of the X-ray emissivity of old stellar populations, including open and globular clusters and several galaxies. We investigate three literature claims of unusual X-ray sources in low-density stellar populations. We show that a suggested quiescent neutron star in the open cluster NGC 6819 is a foreground M dwarf. We show that the suggested diffuse X-ray emission from an old nova shell in the globular cluster NGC 6366 is actually a background galaxy cluster. And we show that a suggested population of quiescent X-ray binaries in the Sculptor Dwarf Galaxy is mostly (perhaps entirely) background galaxies. We find that above densities of $10^4$ M$_{\odot}$/pc$^3$, the X-ray emissivity of globular clusters increases, due to dynamical production of X-ray emitting systems. Below this density, globular clusters have lower X-ray emissivity than the other populations, and we do not see a strong dependence of X-ray emissivity due to density effects. We find significant correlations between X-ray emissivity and binary fraction, metallicity, and density. Sampling these fits via bootstrap techniques gives less significant correlations, but confirms the effect of metallicity on low-density populations, and that of density on the full globular cluster sample.
△ Less
Submitted 26 January, 2020;
originally announced January 2020.
-
NASA's Asteroid Grand Challenge: Strategy, Results and Lessons Learned
Authors:
Jennifer L Gustetic,
Victoria Friedensen,
Jason L Kessler,
Shanessa Jackson,
James Parr
Abstract:
Beginning in 2012, NASA utilized a strategic process to identify broad societal questions, or grand challenges, that are well suited to the aerospace sector and align with national priorities. This effort generated NASA's first grand challenge, the Asteroid Grand Challenge, a large scale effort using multidisciplinary collaborations and innovative engagement mechanisms focused on finding and addre…
▽ More
Beginning in 2012, NASA utilized a strategic process to identify broad societal questions, or grand challenges, that are well suited to the aerospace sector and align with national priorities. This effort generated NASA's first grand challenge, the Asteroid Grand Challenge, a large scale effort using multidisciplinary collaborations and innovative engagement mechanisms focused on finding and addressing asteroid threats to human populations. In April 2010, President Barack Obama announced a mission to send humans to an asteroid by 2025. This resulted in the agency's Asteroid Redirect Mission to leverage and maximize existing robotic and human efforts to capture and reroute an asteroid, with the goal of eventual human exploration. The AGC, initiated in 2013, complemented ARM by expanding public participation, partnerships, and other approaches to find, understand, and overcome these potentially harmful asteroids. This paper describes a selection of AGC activities implemented from 2013 to 2017 and their results, excluding those conducted by NASA's Near Earth Object Observations Program and other organizations. The strategic development of the initiative is outlined as well as initial successes, strengths, and weaknesses resulting from the first four years of AGC activities and approaches. Finally, we describe lesson learned and areas for continued work and study. The AGC lessons learned and strategies could inform the work of other agencies and organizations seeking to conduct a global scientific investigation with matrixed organizational support, multiple strategic partners, and numerous internal and external open innovation approaches and audiences.
△ Less
Submitted 12 March, 2018;
originally announced March 2018.