-
Machine Learning Approaches to Automated Flow Cytometry Diagnosis of Chronic Lymphocytic Leukemia
Authors:
Akum S. Kang,
Loveleen C. Kang,
Stephen M. Mastorides,
Philip R. Foulis,
Lauren A. DeLand,
Robert P. Seifert,
Andrew A. Borkowski
Abstract:
Flow cytometry is a technique that measures multiple fluorescence and light scatter-associated parameters from individual cells as they flow in single file through an excitation light source. These cells are labeled with antibodies to detect various antigens, and the fluorescence signals reflect antigen expression. Interpretation of multiparameter flow cytometry data is laborious, time-consuming, and expensive. It involves manual interpretation of cell distribution and pattern recognition on two-dimensional plots by highly trained medical technologists and pathologists. Using various machine learning algorithms, we attempted to develop an automated analysis for clinical flow cytometry cases that would classify normal and chronic lymphocytic leukemia cases. We achieved the best results with gradient boosting. The XGBoost classifier achieved a specificity of 1.00, a sensitivity of 0.67, a negative predictive value of 0.75, a positive predictive value of 1.00, and an overall accuracy of 0.83 in prospectively classifying cases with malignancies.
Submitted 22 July, 2021; v1 submitted 20 July, 2021;
originally announced July 2021.
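The five figures quoted in the abstract above (sensitivity, specificity, positive and negative predictive value, accuracy) all derive from the same four confusion-matrix counts. The following is a minimal sketch of those standard definitions. The counts in `example` are hypothetical, chosen only because they round to the reported values; the listing does not give the study's actual case counts, and `ConfusionCounts` and `diagnostic_metrics` are illustrative names, not the authors' code.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Counts for a binary classifier (positive = malignant case)."""
    tp: int  # malignant cases flagged as malignant
    fn: int  # malignant cases missed
    tn: int  # normal cases classified as normal
    fp: int  # normal cases flagged as malignant


def diagnostic_metrics(c: ConfusionCounts) -> dict[str, float]:
    """Return the standard diagnostic metrics for the given counts."""
    total = c.tp + c.fn + c.tn + c.fp
    return {
        "sensitivity": c.tp / (c.tp + c.fn),  # recall on malignant cases
        "specificity": c.tn / (c.tn + c.fp),  # recall on normal cases
        "ppv": c.tp / (c.tp + c.fp),          # positive predictive value
        "npv": c.tn / (c.tn + c.fn),          # negative predictive value
        "accuracy": (c.tp + c.tn) / total,
    }


# Hypothetical counts (an assumption, not data from the paper).
example = ConfusionCounts(tp=2, fn=1, tn=3, fp=0)
metrics = {name: round(value, 2) for name, value in diagnostic_metrics(example).items()}
print(metrics)
```

With no false positives, specificity and positive predictive value are both 1.00, while each missed malignant case lowers sensitivity and negative predictive value; this is the pattern the abstract reports.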
-
Lung and Colon Cancer Histopathological Image Dataset (LC25000)
Authors:
Andrew A. Borkowski,
Marilyn M. Bui,
L. Brannon Thomas,
Catherine P. Wilson,
Lauren A. DeLand,
Stephen M. Mastorides
Abstract:
The field of Machine Learning, a subset of Artificial Intelligence, has led to remarkable advancements in many areas, including medicine. Machine Learning algorithms require large datasets to train computer models successfully. Although there are medical image datasets available, more image datasets are needed from a variety of medical entities, especially cancer pathology. Even more scarce are ML-ready image datasets. To address this need, we created an image dataset (LC25000) with 25,000 color images in 5 classes. Each class contains 5,000 images of the following histologic entities: colon adenocarcinoma, benign colonic tissue, lung adenocarcinoma, lung squamous cell carcinoma, and benign lung tissue. All images are de-identified, HIPAA compliant, validated, and freely available for download to AI researchers.
Submitted 16 December, 2019;
originally announced December 2019.
-
Google Auto ML versus Apple Create ML for Histopathologic Cancer Diagnosis; Which Algorithms Are Better?
Authors:
Andrew A. Borkowski,
Catherine P. Wilson,
Steven A. Borkowski,
L. Brannon Thomas,
Lauren A. Deland,
Stefanie J. Grewe,
Stephen M. Mastorides
Abstract:
Artificial Intelligence is set to revolutionize multiple fields in the coming years. One subset of AI, machine learning, shows immense potential for application in a diverse set of medical specialties, including diagnostic pathology. In this study, we investigate the utility of two machine learning platforms, Apple Create ML and Google Cloud Auto ML, in a variety of pathological scenarios involving lung and colon pathology. First, we evaluate the ability of the platforms to differentiate normal lung tissue from cancerous lung tissue. We also examine and compare their ability to accurately distinguish two subtypes of lung cancer (adenocarcinoma and squamous cell carcinoma). Similarly, we assess the ability of the two platforms to differentiate colon adenocarcinoma from normal colon, as was done with lung tissue. Cases of colon adenocarcinoma are also evaluated for the presence or absence of a KRAS gene mutation. Finally, our last experiment examines the ability of the Apple and Google platforms to differentiate between adenocarcinomas of lung origin and those of colon origin. In our trained models for lung and colon cancer diagnosis, both Apple and Google machine learning algorithms performed very well individually, with no statistically significant differences found between the two platforms. However, some critical factors set them apart. Apple Create ML can be used on local computers but is limited to the Apple ecosystem. Google Auto ML is not platform specific but runs only in Google Cloud, with associated computational fees. In the end, both are excellent machine learning tools that have great potential in the field of diagnostic pathology, and which one to choose would depend on personal preference, programming experience, and available storage space.
Submitted 19 March, 2019;
originally announced March 2019.
-
Apple Machine Learning Algorithms Successfully Detect Colon Cancer but Fail to Predict KRAS Mutation Status
Authors:
Andrew A. Borkowski,
Catherine P. Wilson,
Steven A. Borkowski,
L. Brannon Thomas,
Lauren A. Deland,
Stephen M. Mastorides
Abstract:
Colon cancer is the second leading cause of cancer-related death in the United States of America. Its prognosis has significantly improved with the advancement of targeted therapies based on underlying molecular changes. The KRAS mutation is one of the most frequent molecular alterations seen in colon cancer, and its presence can affect treatment selection. We attempted to use Apple machine learning algorithms to diagnose colon cancer and predict the KRAS mutation status from histopathological images. We captured 250 colon cancer images and 250 benign colon tissue images. Half of the colon cancer images were captured from KRAS mutation-positive tumors and the other half from KRAS mutation-negative tumors. Next, we created an Image Classifier model using the Apple CreateML machine learning module. The trained and validated model was able to successfully differentiate between colon cancer and benign colon tissue images with 98% recall and 98% precision. However, our model failed to reliably identify KRAS mutations, with the highest realized accuracy of 66%. Although not yet perfected, in the near future Apple CreateML modules can be used in diagnostic smartphone-based applications and potentially alleviate shortages of medical professionals in understaffed parts of the world.
Submitted 15 January, 2019; v1 submitted 11 December, 2018;
originally announced December 2018.
-
Using Apple Machine Learning Algorithms to Detect and Subclassify Non-Small Cell Lung Cancer
Authors:
Andrew A. Borkowski,
Catherine P. Wilson,
Steven A. Borkowski,
Lauren A. Deland,
Stephen M. Mastorides
Abstract:
Lung cancer continues to be a major healthcare challenge with high morbidity and mortality rates among both men and women worldwide. The majority of lung cancer cases are of the non-small cell lung cancer type. With the advent of targeted cancer therapy, it is imperative not only to properly diagnose but also to sub-classify non-small cell lung cancer. In our study, we evaluated the utility of using the Apple Create ML module to detect and sub-classify non-small cell carcinomas based on histopathological images. After module optimization, the program detected 100% of non-small cell lung cancer images and successfully sub-classified the majority of the images. Trained modules, such as ours, can be utilized in diagnostic smartphone-based applications, augmenting diagnostic services in understaffed areas of the world.
Submitted 18 January, 2019; v1 submitted 24 August, 2018;
originally announced August 2018.