A General-Purpose Self-Supervised Model for Computational Pathology
Authors:
Richard J. Chen,
Tong Ding,
Ming Y. Lu,
Drew F. K. Williamson,
Guillaume Jaume,
Bowen Chen,
Andrew Zhang,
Daniel Shao,
Andrew H. Song,
Muhammad Shaban,
Mane Williams,
Anurag Vaidya,
Sharifa Sahai,
Lukas Oldenburg,
Luca L. Weishaupt,
Judy J. Wang,
Walt Williams,
Long Phi Le,
Georg Gerber,
Faisal Mahmood
Abstract:
Tissue phenoty** is a fundamental computational pathology (CPath) task in learning objective characterizations of histopathologic biomarkers in anatomic pathology. However, whole-slide imaging (WSI) poses a complex computer vision problem in which the large-scale image resolutions of WSIs and the enormous diversity of morphological phenotypes preclude large-scale data annotation. Current efforts…
▽ More
Tissue phenoty** is a fundamental computational pathology (CPath) task in learning objective characterizations of histopathologic biomarkers in anatomic pathology. However, whole-slide imaging (WSI) poses a complex computer vision problem in which the large-scale image resolutions of WSIs and the enormous diversity of morphological phenotypes preclude large-scale data annotation. Current efforts have proposed using pretrained image encoders with either transfer learning from natural image datasets or self-supervised pretraining on publicly-available histopathology datasets, but have not been extensively developed and evaluated across diverse tissue types at scale. We introduce UNI, a general-purpose self-supervised model for pathology, pretrained using over 100 million tissue patches from over 100,000 diagnostic haematoxylin and eosin-stained WSIs across 20 major tissue types, and evaluated on 33 representative CPath clinical tasks in CPath of varying diagnostic difficulties. In addition to outperforming previous state-of-the-art models, we demonstrate new modeling capabilities in CPath such as resolution-agnostic tissue classification, slide classification using few-shot class prototypes, and disease subty** generalization in classifying up to 108 cancer types in the OncoTree code classification system. UNI advances unsupervised representation learning at scale in CPath in terms of both pretraining data and downstream evaluation, enabling data-efficient AI models that can generalize and transfer to a gamut of diagnostically-challenging tasks and clinical workflows in anatomic pathology.
△ Less
Submitted 29 August, 2023;
originally announced August 2023.
Deep learning-based tumor segmentation on digital images of histopathology slides for microdosimetry applications
Authors:
Luca L. Weishaupt,
Jose Torres,
Sophie Camilleri-Broët,
Roni F. Rayes,
Jonathan D. Spicer,
Sabrina Côté Maldonado,
Shirin A. Enger
Abstract:
$\bf{Purpose:}…
▽ More
$\bf{Purpose:}$ The goal of this study was (i) to use artificial intelligence to automate the traditionally labor-intensive process of manual segmentation of tumor regions in pathology slides performed by a pathologist and (ii) to validate the use of a well-known and readily available deep learning architecture. Automation will reduce the human error involved in manual delineation, increase efficiency, and result in accurate and reproducible segmentation. This advancement will alleviate the bottleneck in the workflow in clinical and research applications due to a lack of pathologist time. Our application is patient-specific microdosimetry and radiobiological modeling, which builds on the contoured pathology slides.
$\bf{Methods:}$ A U-Net architecture was used to segment tumor regions in pathology core biopsies of lung tissue with adenocarcinoma stained using hematoxylin and eosin. A pathologist manually contoured the tumor regions in 56 images with binary masks for training. Overlap** patch extraction with various patch sizes and image downsampling were investigated individually. Data augmentation and 8-fold cross-validation were used.
$\bf{Results:}$ The U-Net achieved accuracy of 0.91$\pm$0.06, specificity of 0.90$\pm$0.08, sensitivity of 0.92$\pm$0.07, and precision of 0.8$\pm$0.1. The F1/DICE score was 0.85$\pm$0.07, with a segmentation time of 3.24$\pm$0.03 seconds per image, achieving a 370$\pm$3 times increased efficiency over manual segmentation. In some cases, the U-Net correctly delineated the tumor's stroma from its epithelial component in regions that were classified as tumor by the pathologist.
$\bf{Conclusion:}$ The U-Net architecture can segment images with a level of efficiency and accuracy that makes it suitable for tumor segmentation of histopathological images in fields such as radiotherapy dosimetry, specifically in the subfields of microdosimetry.
△ Less
Submitted 4 May, 2021;
originally announced May 2021.