[1,2]B. A. Schreiber [1,3]E. J. Soilleux

[1]\orgdivDepartment of Pathology, \orgnameUniversity of Cambridge, \orgaddress\streetTennis Court Road, \cityCambridge, \postcodeCB2 1QP, \stateCambridgeshire, \countryUnited Kingdom 2]\orgdivDepartment of Applied Mathematics and Theoretical Physics, \orgnameUniversity of Cambridge, \orgaddress\streetWilberforce Road, \cityCambridge, \postcodeCB3 0WA, \stateCambridgeshire, \countryUnited Kingdom 3]\orgnameLyzeum ltd, \orgaddress\cityCambridge, \postcodeCB1 2LA, \stateCambridgeshire, \countryUnited Kingdom 4]\orgdivEdinburgh Pathology, Institute of Genetics & Cancer, \orgnameUniversity of Edinburgh, \orgaddress\streetCrewe Road, \cityEdinburgh, \postcodeEH4 2XR, \countryUK 5]\orgdivArtificial Intelligence and Machine Learning, \orgnameGSK plc, \orgaddress\streetGreat West Road, \cityBrentford, \postcodeTW8 9GS, \stateMiddlesex, \countryUK

Rapid Artefact Removal and H&E-Stained Tissue Segmentation

[email protected] J. Denholm F. Jaeckle M. J. Arends K. M. Branson C.-B. Schönlieb [email protected] * [ [ [ [

Abstract

We present an innovative method for rapidly segmenting hematoxylin and eosin (H&E)-stained tissue in whole-slide images (WSIs) that eliminates a wide range of undesirable artefacts such as pen marks and scanning artefacts. Our method involves taking a single-channel representation of a low-magnification RGB overview of the WSI in which the pixel values are bimodally distributed such that H&E-stained tissue is easily distinguished from both background and a wide variety of artefacts. We demonstrate our method on 30 WSIs prepared from a wide range of institutions and WSI digital scanners, each containing substantial artefacts, and compare it to segmentations provided by Otsu thresholding and Histolab tissue segmentation and pen filtering tools. We found that our method segmented the tissue and fully removed all artefacts in 29 out of 30 WSIs, whereas Otsu thresholding failed to remove any artefacts, and the Histolab pen filtering tools only partially removed the pen marks. The beauty of our approach lies in its simplicity: manipulating RGB colour space and using Otsu thresholding allows for the segmentation of H&E-stained tissue and the rapid removal of artefacts without the need for machine learning or parameter tuning.

keywords:

Artefact Removal, Tissue Segmentation, Whole-Slide Imaging, Haematoxylin and Eosin, Machine Learning

1 Introduction

Otsu thresholding [1] is often applied to the luminance of whole-slide images (WSI) of haematoxylin and eosin (H&E)-stained tissue for the purposes of segmentation [2, 3, 4, 5, 6, 7, 8, 9, 10, 11] (see Figure 1), including in popular histopathological analysis tools Histolab [12] and PyHist [13]. However, Otsu thresholding only successfully segments the tissue from the background when the tissue and background pixels are well-separated in a greyscale representation of the WSI. While this is often the case in artefact-free WSIs, WSIs often contain artefacts such as pen marks and dark scanning artefacts, which cause this assumption to fail, thus resulting in artefacts wrongly identified as tissue, tissue rejected as background, or both (see Figure 1). While there are a large and diverse range of artefacts that can occur on a WSI, in the context of this paper artefacts will refer only to pen marks (see Figures 3a, b, c, d, and g), bounding boxes added by the scanners (see Figures 3e, f and g) scanning artefacts such as dark blobs or text (see Figures 3e and f).

The exclusion of pen marks in particular is a crucial first step for any machine learning-based automated WSI analysis pipeline; pathologists often use pen marks to highlight areas of interest which, if observed by a machine learning algorithm, could result in deleterious bias, spurious classifications or even data leakage, thus reducing the confidence in the performance metrics and the generalizability of the algorithm [14].

In this paper, we propose a new tissue segmenting algorithm for H&E-stained tissue which can segment tissue in the presence of artefacts. We tested our method on WSIs of H&E-stained duodenal biopsies prepared at multiple different institutions, scanned using multiple different scanners, and containing a large range of artefacts of different types, shapes and colours.

2 Method

Our method improves on Otsu thresholding by selecting a representation of the WSI data that better separates H&E-stained tissue from background and artefacts than luminance. Given a three channel image $I=[I_{R},I_{B},I_{G}]$ , the channels are normalized so that the channels of each pixel are represented by floats ranging from 0 to 1. Then, the following representation of the data is calculated:

T=\textrm{ReLU}(I_{R}-I_{G})\odot\textrm{ReLU}(I_{B}-I_{G})

(1)

where $\textrm{ReLU}(x)=\max(x,0)$ is the rectifier linear unit and $\odot$ is the Hadamard product, both of which act element-wise. Otsu thresholding is then used to separate tissue and non-tissue pixels [1]. Note that this calculation requires no parameter training or tuning. A Python implementation of this previously unreported algorithm can be found here https://gitlab.developers.cam.ac.uk/bas43/h_and_e_otsu_thresholding in accordance with the Guidelines for Authors Submitting Code and & Software presented in Nature Research https://www.nature.com/nature-portfolio/editorial-policies/reporting-standard#reporting-requirements. All relevant guidelines were followed in the development and testing of this algorithm.

RGB Image:

[I_{R},I_{G},I_{B}]

Normalize:

I_{R},I_{G},I_{B}\leftarrow I_{R}/255,I_{G}/255,I_{B}/255

R - G Representation:

I_{R-G}\leftarrow\textrm{ReLU}[I_{R}-I_{G}]

B - G Representation:

I_{B-G}\leftarrow\textrm{ReLU}[I_{B}-I_{G}]

Tissue Representation:

T\leftarrow I_{R-G}\odot I_{B-G}

Otsu threshold:

\gamma\leftarrow\mathrm{Otsu}[T]

if $T[p]>\gamma$ then

Pixel

p

is segmented as tissue

else

Pixel

p

is rejected

end if

Algorithm 1 Our method for segmenting H&E stained tissue

The assumption made by Otsu thresholding is that tissue and non-tissue pixels can be separated by their grey-scale values, which is not the case when artefacts are present. However, our method, which is described in Equation 1 and Algorithm 1, is based on the assumption that the tissue pixels can be identified by being both more blue than green and more red than green as compared to non-tissue pixels. The advantage of our method is that all shades of grey have approximately the same value in the red channel as the green channel, so their difference is 0, while pixels of H&E-stained tissue have higher values in the blue and red channels than the green. Setting all negative values in both representations to zero ensures that artefacts with high green channels compared to blue or red channels do not adversely influence the threshold calculation, and are thus considered as background. Thus, this representation results in a bimodal distribution that separates pixels that are the most “purple-pink” from others, so pen marks (which are often black, blue, green or red) are also excluded, independent of the pixel’s light intensity. Pixels on an RGB colour cube that have a non-zero value in this representation are shown in figure 2 and comparisons between Otsu thresholding and our method on an RGB colour cube can be seen in the supplementary material.

We compared the performance of our method against Otsu thresholding and Histolab’s pen filtering tools by applying these methods to a dataset of WSIs and assessing the resulting tissue segmentations qualitatively.

2.1 Data

To compare the performances of the Otsu thresholding, Histolab and our method, we applied both methods to a selection of 60 WSIs of H&E stained duodenal biopsies. Of the 60 WSIs selected:

•

15 contained pen marks
•

15 contained scanning artefacts
•

30 contained no significant artefacts

The WSIs were hand-picked so that they contained a wide range of artefacts of different types, shapes and colours. The WSIs were scanned with a wide range of digital scanners (Ventana, Aperio, Hamamatsu and Philips), and the 30 WSIs with no significant artefacts were selected at random and matched for scanner type of the 30 WSIs with pen marks or artefact.

2.2 Ethical Statement

All fully anonymized slide scans (and patient data) were obtained with full ethical approval from the Oxfordshire Research Ethics Committee A (IRAS: 162057; PI: Prof. E. Soilleux), and the method was performed in accordance with their guidelines and regulations. Informed consent was obtained from all subjects and/or their legal guardian(s).

3 Results

Otsu thresholding, Histolab and our method were used to segment the tissue from the 60 WSIs described above. Examples of the WSIs selected and the tissue segmentation of these methods can be viewed in Figure 3. Examples of the tissue segmentation masks provided by Otsu thresholding and our method, and the Sørensen–Dice coefficient’s between the segmentations and a manually segmented tissue mask are displayed in the supplementary material. The tissue segmentations were assessed by a single observer, and considered “successful” if all the following were true:

•

All tissue was segmented
•

All background was rejected from the segmentation
•

All bounding boxes were rejected from the segmentation
•

All artefacts were rejected from the segmentation

Otsu thresholding rejected pen and scanning artefacts from the tissue segmentation in 0/30 WSIs containing artefacts. In 2/30 WSIs containing artefacts, the influence the artefacts had on the threshold was so great that the tissue was not segmented as tissue (see Figure 3e).

The Histolab pen filtering tool only partially removed pen marks in Figures 3a-d, and removed no pen marks in Figure 3g. Other artefacts such as scanning artefacts were not effected by the Histolab tools.

Our method segmented the tissue in all 60/60 WSIs and rejected all artefacts in 29/30 WSIs containing pen and scanning artefacts. The only WSI where pen marks were included in the tissue segmentation can be seen in Figure 3g. Here our method failed to reject the pen marks because tissue and non-tissue pixels could not be separated through their “pinkness”, when the pen marks were also pink.

4 Discussion

While Otsu thresholding segmented the tissue in all artefact-free WSIs and most WSIs with artefacts, it identified all artefacts as tissue as well. In 2 out of 30 WSIs with artefacts, the presence of artefacts caused the threshold to ignore tissue and include background in the tissue segmentation as seen in Figure 3e.

The Histolab pen filtering tools were applied to the tissue segmentations in order to remove the remaining pen marks. The filtering tools performed best on blue pen marks, as seen in Figures 3a and d. However, the tools did not remove the edges of pen marks of all colours, and dailed to detect the majority of all green and orange pen marks, as seen in Figures 3b and c respectively. The pink pen marks presented in Figure 3g remained untouched. Additionally, the Histolab tools were not designed to remove scanning artefacts and bounding boxes so these features remained.

Our method, on the other hand, segmented the tissue in all WSIs and rejected artefacts in all WSIs containing artefacts but one. The only exception can be seen in Figure 3g, which contained pink pen marks that caused all methods to fail.

The thresholding algorithm presented here is a rapid, reliable and easily implementable tissue segmentation and artefact removal tool for WSIs of H&E-stained tissue. In machine learning tasks especially, this tool can be used as a preprocessing step that ensures artefacts do not cause the machine learning algorithm to train on irrelevant patches or patches that contain data leaking pen marks.

It should be noted that this method is built to segment H&E-stained tissue only, and will not perform as intended on tissue which has been stained with stains that do not appear pink/purple. However, this method should be relatively simple to generalize to other stains by using representations of the WSI data that uniquely differentiate the stained tissue from background and artefacts, and will be studied in future research.

Funding

This work was supported by the Pathological Society [PKAG/924] and GlaxoSmithKline [LEAG/781]

Author Contributions

B. A. Schreiber devised the thresholding algorithm and wrote the manuscript. J. Denholm and F. Jaeckle independently tested and compared Otsu thresholding, Histolab, and the thresholding algorithm presented here. Histological expertise was provided by M. J. Arends and E. J. Soilleux. The project was initialized by E. J. Soilleux and supervised by K. M. Branson, C.-B. Schönlieb and E. J. Soilleux. All authors were given the opportunity to review and comment on the manuscript.

Data Availability

The datasets of WSIs analysed during this current study have not publicly available due to the large size of the WSIs and legal considerations. However, low-level representations of the WSIs used in the study have been made available at https://gitlab.developers.cam.ac.uk/bas43/h_and_e_otsu_thresholding.

References

Otsu [1979] N. Otsu, A Threshold Selection Method from Gray-Level Histograms, IEEE Transactions on Systems, Man, and Cybernetics 9 (1979) 62–66. doi:10.1109/TSMC.1979.4310076.
Wang et al. [2020] X. Wang, H. Chen, C. Gan, H. Lin, Q. Dou, E. Tsougenis, Q. Huang, M. Cai, P.-A. Heng, Weakly Supervised Deep Learning for Whole Slide Lung Cancer Image Analysis, IEEE Transactions on Cybernetics 50 (2020) 3950–3962. doi:10.1109/TCYB.2019.2935141.
Denholm et al. [2022] J. Denholm, B. A. Schreiber, S. C. Evans, O. M. Crook, A. Sharma, J. L. Watson, H. Bancroft, G. Langman, J. D. Gilbey, C. B. Schönlieb, M. J. Arends, E. J. Soilleux, Multiple-instance-learning-based detection of coeliac disease in histological whole-slide images, Journal of Pathology Informatics 13 (2022) 100151. doi:10.1016/j.jpi.2022.100151.
Campanella et al. [2019] G. Campanella, M. G. Hanna, L. Geneslaw, A. Miraflor, V. Werneck Krauss Silva, K. J. Busam, E. Brogi, V. E. Reuter, D. S. Klimstra, T. J. Fuchs, Clinical-grade computational pathology using weakly supervised deep learning on whole slide images, Nature Medicine 25 (2019) 1301–1309. doi:10.1038/s41591-019-0508-1.
Khened et al. [2021] M. Khened, A. Kori, H. Rajkumar, G. Krishnamurthi, B. Srinivasan, A generalized deep learning framework for whole-slide image segmentation and analysis, Scientific Reports 11 (2021) 11579. URL: https://www.nature.com/articles/s41598-021-90444-8. doi:10.1038/s41598-021-90444-8, number: 1 Publisher: Nature Publishing Group.
Anghel et al. [2019] A. Anghel, M. Stanisavljevic, S. Andani, N. Papandreou, J. H. Rüschoff, P. Wild, M. Gabrani, H. Pozidis, A High-Performance System for Robust Stain Normalization of Whole-Slide Images in Histopathology, Frontiers in Medicine 6 (2019) 193. doi:10.3389/fmed.2019.00193.
Haghighat et al. [2022] M. Haghighat, L. Browning, K. Sirinukunwattana, S. Malacrino, N. Khalid Alham, R. Colling, Y. Cui, E. Rakha, F. C. Hamdy, C. Verrill, J. Rittscher, Automated quality assessment of large digitised histology cohorts by artificial intelligence, Scientific Reports 12 (2022) 5002. URL: https://www.nature.com/articles/s41598-022-08351-5. doi:10.1038/s41598-022-08351-5, number: 1 Publisher: Nature Publishing Group.
Smith et al. [2021] B. Smith, M. Hermsen, E. Lesser, D. Ravichandar, W. Kremers, Develo** image analysis pipelines of whole-slide images: Pre- and post-processing, Journal of Clinical and Translational Science 5 (2021) e38. doi:10.1017/cts.2020.531, publisher: Cambridge University Press.
Veta et al. [2019] M. Veta, Y. J. Heng, N. Stathonikos, B. E. Bejnordi, F. Beca, T. Wollmann, K. Rohr, M. A. Shah, D. Wang, M. Rousson, M. Hedlund, D. Tellez, F. Ciompi, E. Zerhouni, D. Lanyi, M. Viana, V. Kovalev, V. Liauchuk, H. A. Phoulady, T. Qaiser, S. Graham, N. Rajpoot, E. Sjöblom, J. Molin, K. Paeng, S. Hwang, S. Park, Z. Jia, E. I.-C. Chang, Y. Xu, A. H. Beck, P. J. van Diest, J. P. W. Pluim, Predicting breast tumor proliferation from whole-slide images: The TUPAC16 challenge, Medical Image Analysis 54 (2019) 111–121. doi:10.1016/j.media.2019.02.012.
Schmauch et al. [2019] B. Schmauch, A. Romagnoni, E. Pronier, C. Saillard, P. Maillé, J. Calderaro, M. Sefta, S. Toldo, M. Zaslavskiy, T. Clozel, M. Moarii, P. Courtiol, G. Wainrib, Transcriptomic learning for digital pathology, bioRxiv (2019) 760173. doi:10.1101/760173.
Zhang et al. [2022] H. Zhang, Y. Meng, Y. Zhao, Y. Qiao, X. Yang, S. E. Coupland, Y. Zheng, DTFD-MIL: Double-tier feature distillation multiple instance learning for histopathology whole slide image classification, in: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, pp. 18780–18790. doi:10.1109/CVPR52688.2022.01824, ISSN: 2575-7075.
Marcolini et al. [2022] A. Marcolini, N. Bussola, E. Arbitrio, M. Amgad, G. Jurman, C. Furlanello, histolab: A python library for reproducible digital pathology preprocessing with automated testing, SoftwareX 20 (2022) 101237. URL: https://www.sciencedirect.com/science/article/pii/S2352711022001558. doi:10.1016/j.softx.2022.101237.
Muñoz-Aguirre et al. [2020] M. Muñoz-Aguirre, V. F. Ntasis, S. Rojas, R. Guigó, PyHIST: A histological image segmentation tool, PLOS Computational Biology 16 (2020) e1008349. URL: https://doi.org/10.1371/journal.pcbi.1008349. doi:10.1371/journal.pcbi.1008349.
Kaufman et al. [2023] S. Kaufman, S. Rosset, C. Perlich, O. Stitelman, Leakage in data mining: Formulation, detection, and avoidance, ACM Transactions on Knowledge Discovery from Data 6 (2023) 15:1–15:21. URL: https://dl.acm.org/doi/10.1145/2382577.2382579. doi:10.1145/2382577.2382579.