These authors contributed equally to this work.
@Correspondence: [email protected]
Supplementary Material
Unsupervised Latent Stain Adaptation
for Computational Pathology
1 Test Datasets
The following tables serve as overview of the number patches used for each staining and tissue combination in the segmentation task.
Staining | Tissue class | #Images |
---|---|---|
PAS | Glomerulus | 329 |
Glomerular Tuft | 352 | |
Tubule | 231 | |
Artery | 264 | |
SIL | Glomerulus | 248 |
Glomerular Tuft | 217 | |
Artery | 223 | |
TRI | Glomerulus | 260 |
Glomerular Tuft | 233 | |
Artery | 324 | |
H&E | Artery | 402 |
Staining | Tissue class | #Images |
---|---|---|
PAS | Glomerulus | 2670 |
2 Hyperparameter search
All hyperparameter searches were performed via grid search on validation sets only. In the following we detail hyperparameter selections.
2.0.1 cGAN.
We performed a careful selection of hyperparameters to ensure that the images were perfectly translated into the target stainings. The number of epochs for the adversarial model was searched within the interval and set to , while the learning rate was adjusted to , within the search range . The momentum term of Adam was set to , within the interval . The buffer for storing artifical images was fixed at , and the batch size was fixed at , which achieved the best results within the interval of . Finally, the number of unlabeled training data was set to , within the interval .
2.0.2 ULSA.
We performed grid search for finding the weighting between and in the range of with step size . We used the overall batch sizes between labeled and unlabeled samples with where we tried for different . We tested several noise injection approaches including salt and pepper, gaussian blurring and gaussian noise. Best results were archived with gaussian blurring (kernel size: , intensity: ). Other augmentation methods like color jitter and random sharpness adjustments were tried, but did not show promising results. We further tried to replace Reinhard with Macenko, which was not possible, due to computational overload (ULSA with Reinhard: 7-10h, ULSA with Macenko: at least 48-60h). Also translating the stains offline and storing them locally would not be possible, because of the huge amount of images needed to store: .
2.0.3 Comparable methods.
Reinhard and Macenko. For each image in the mini-batch we used a random target stained image as reference for transformation. Thus each image was translated multiple times into different target stains during training. UDA. We used various combinations such as color jitter and gaussian blurring (also see augmentations for ULSA) for data augmentation in the semi-supervised part. Other augmentations lead to worst results. The batch size factor for unlabeled data in the unsupervised data augmentation procedure was set to as proposed by the authors within the interval . FixMatch. We used a confidence threshold of , which was the same the authors used for their implementations. All other parameters were obtained as in UDA.