Search | arXiv e-print repository

doi 10.1117/12.3006049

Benchmarking Image Transformers for Prostate Cancer Detection from Ultrasound Data

Authors: Mohamed Harmanani, Paul F. R. Wilson, Fahimeh Fooladgar, Amoon Jamzad, Mahdi Gilany, Minh Nguyen Nhat To, Brian Wodlinger, Purang Abolmaesumi, Parvin Mousavi

Abstract: PURPOSE: Deep learning methods for classifying prostate cancer (PCa) in ultrasound images typically employ convolutional networks (CNNs) to detect cancer in small regions of interest (ROI) along a needle trace region. However, this approach suffers from weak labelling, since the ground-truth histopathology labels do not describe the properties of individual ROIs. Recently, multi-scale approaches have sought to mitigate this issue by combining the context awareness of transformers with a CNN feature extractor to detect cancer from multiple ROIs using multiple-instance learning (MIL). In this work, we present a detailed study of several image transformer architectures for both ROI-scale and multi-scale classification, and a comparison of the performance of CNNs and transformers for ultrasound-based prostate cancer classification. We also design a novel multi-objective learning strategy that combines both ROI and core predictions to further mitigate label noise. METHODS: We evaluate 3 image transformers on ROI-scale cancer classification, then use the strongest model to tune a multi-scale classifier with MIL. We train our MIL models using our novel multi-objective learning strategy and compare our results to existing baselines. RESULTS: We find that for both ROI-scale and multi-scale PCa detection, image transformer backbones lag behind their CNN counterparts. This deficit in performance is even more noticeable for larger models. When using multi-objective learning, we can improve performance of MIL, with a 77.9% AUROC, a sensitivity of 75.9%, and a specificity of 66.3%. CONCLUSION: Convolutional networks are better suited for modelling sparse datasets of prostate ultrasounds, producing more robust features than transformers in PCa detection. Multi-scale methods remain the best architecture for this task, with multi-objective learning presenting an effective way to improve performance.

Submitted 26 March, 2024; originally announced March 2024.

Comments: early draft, 7 pages; Accepted to SPIE Medical Imaging 2024

Journal ref: Proc. SPIE 12928, Medical Imaging 2024: Image-Guided Procedures, Robotic Interventions, and Modeling, 1292815 (29 March 2024)

arXiv:2308.06861 [pdf, other]

Manifold DivideMix: A Semi-Supervised Contrastive Learning Framework for Severe Label Noise

Authors: Fahimeh Fooladgar, Minh Nguyen Nhat To, Parvin Mousavi, Purang Abolmaesumi

Abstract: Deep neural networks have proven to be highly effective when large amounts of data with clean labels are available. However, their performance degrades when training data contains noisy labels, leading to poor generalization on the test set. Real-world datasets contain noisy label samples that either have similar visual semantics to other classes (in-distribution) or have no semantic relevance to any class (out-of-distribution) in the dataset. Most state-of-the-art methods leverage ID labeled noisy samples as unlabeled data for semi-supervised learning, but OOD labeled noisy samples cannot be used in this way because they do not belong to any class within the dataset. Hence, in this paper, we propose incorporating the information from all the training data by leveraging the benefits of self-supervised training. Our method aims to extract a meaningful and generalizable embedding space for each sample regardless of its label. Then, we employ a simple yet effective K-nearest neighbor method to remove portions of out-of-distribution samples. By discarding these samples, we propose an iterative "Manifold DivideMix" algorithm to find clean and noisy samples, and train our model in a semi-supervised way. In addition, we propose "MixEMatch", a new algorithm for the semi-supervised step that involves mixup augmentation at the input and final hidden representations of the model. This will extract better representations by interpolating both in the input and manifold spaces. Extensive experiments on multiple synthetic-noise image benchmarks and real-world web-crawled datasets demonstrate the effectiveness of our proposed framework. Code is available at https://github.com/Fahim-F/ManifoldDivideMix.

Submitted 13 August, 2023; originally announced August 2023.

arXiv:2303.02128 [pdf, other]

TRUSformer: Improving Prostate Cancer Detection from Micro-Ultrasound Using Attention and Self-Supervision

Authors: Mahdi Gilany, Paul Wilson, Andrea Perera-Ortega, Amoon Jamzad, Minh Nguyen Nhat To, Fahimeh Fooladgar, Brian Wodlinger, Purang Abolmaesumi, Parvin Mousavi

Abstract: A large body of previous machine learning methods for ultrasound-based prostate cancer detection classify small regions of interest (ROIs) of ultrasound signals that lie within a larger needle trace corresponding to a prostate tissue biopsy (called biopsy core). These ROI-scale models suffer from weak labeling as histopathology results available for biopsy cores only approximate the distribution of cancer in the ROIs. ROI-scale models do not take advantage of contextual information that is normally considered by pathologists, i.e. they do not consider information about surrounding tissue and larger-scale trends when identifying cancer. We aim to improve cancer detection by taking a multi-scale, i.e. ROI-scale and biopsy core-scale, approach. Methods: Our multi-scale approach combines (i) an "ROI-scale" model trained using self-supervised learning to extract features from small ROIs and (ii) a "core-scale" transformer model that processes a collection of extracted features from multiple ROIs in the needle trace region to predict the tissue type of the corresponding core. Attention maps, as a byproduct, allow us to localize cancer at the ROI scale. We analyze this method using a dataset of micro-ultrasound acquired from 578 patients who underwent prostate biopsy, and compare our model to baseline models and other large-scale studies in the literature. Results and Conclusions: Our model shows consistent and substantial performance improvements compared to ROI-scale-only models. It achieves 80.3% AUROC, a statistically significant improvement over ROI-scale classification. We also compare our method to large studies on prostate cancer detection, using other imaging modalities. Our code is publicly available at www.github.com/med-i-lab/TRUSFormer

Submitted 3 March, 2023; originally announced March 2023.

arXiv:2211.00527 [pdf, other]

Self-Supervised Learning with Limited Labeled Data for Prostate Cancer Detection in High Frequency Ultrasound

Authors: Paul F. R. Wilson, Mahdi Gilany, Amoon Jamzad, Fahimeh Fooladgar, Minh Nguyen Nhat To, Brian Wodlinger, Purang Abolmaesumi, Parvin Mousavi

Abstract: Deep learning-based analysis of high-frequency, high-resolution micro-ultrasound data shows great promise for prostate cancer detection. Previous approaches to analysis of ultrasound data largely follow a supervised learning paradigm. Ground truth labels for ultrasound images used for training deep networks often include coarse annotations generated from the histopathological analysis of tissue samples obtained via biopsy. This creates inherent limitations on the availability and quality of labeled data, posing major challenges to the success of supervised learning methods. On the other hand, unlabeled prostate ultrasound data are more abundant. In this work, we successfully apply self-supervised representation learning to micro-ultrasound data. Using ultrasound data from 1028 biopsy cores of 391 subjects obtained in two clinical centres, we demonstrate that feature representations learnt with this method can be used to classify cancer from non-cancer tissue, obtaining an AUROC score of 91% on an independent test set. To the best of our knowledge, this is the first successful end-to-end self-supervised learning approach for prostate cancer detection using ultrasound data. Our method outperforms baseline supervised learning approaches, generalizes well between different data centers, and scales well in performance as more unlabeled data are added, making it a promising approach for future research using large volumes of unlabeled data.

Submitted 1 November, 2022; originally announced November 2022.

arXiv:2207.10485 [pdf, other]

Towards Confident Detection of Prostate Cancer using High Resolution Micro-ultrasound

Authors: Mahdi Gilany, Paul Wilson, Amoon Jamzad, Fahimeh Fooladgar, Minh Nguyen Nhat To, Brian Wodlinger, Purang Abolmaesumi, Parvin Mousavi

Abstract: MOTIVATION: Detection of prostate cancer during transrectal ultrasound-guided biopsy is challenging. The highly heterogeneous appearance of cancer, presence of ultrasound artefacts, and noise all contribute to these difficulties. Recent advancements in high-frequency ultrasound imaging - micro-ultrasound - have drastically increased the capability of tissue imaging at high resolution. Our aim is to investigate the development of a robust deep learning model specifically for micro-ultrasound-guided prostate cancer biopsy. For the model to be clinically adopted, a key challenge is to design a solution that can confidently identify the cancer, while learning from coarse histopathology measurements of biopsy samples that introduce weak labels. METHODS: We use a dataset of micro-ultrasound images acquired from 194 patients, who underwent prostate biopsy. We train a deep model using a co-teaching paradigm to handle noise in labels, together with an evidential deep learning method for uncertainty estimation. We evaluate the performance of our model using the clinically relevant metric of accuracy vs. confidence. RESULTS: Our model achieves a well-calibrated estimation of predictive uncertainty with area under the curve of 88%. The use of co-teaching and evidential deep learning in combination yields significantly better uncertainty estimation than either alone. We also provide a detailed comparison against state-of-the-art in uncertainty estimation.

Submitted 21 July, 2022; originally announced July 2022.

arXiv:2012.08895 [pdf, other]

ReINTEL: A Multimodal Data Challenge for Responsible Information Identification on Social Network Sites

Authors: Duc-Trong Le, Xuan-Son Vu, Nhu-Dung To, Huu-Quang Nguyen, Thuy-Trinh Nguyen, Linh Le, Anh-Tuan Nguyen, Minh-Duc Hoang, Nghia Le, Huyen Nguyen, Hoang D. Nguyen

Abstract: This paper reports on the ReINTEL Shared Task for Responsible Information Identification on social network sites, which is hosted at the seventh annual workshop on Vietnamese Language and Speech Processing (VLSP 2020). Given a piece of news with respective textual, visual content and metadata, participants are required to classify whether the news is 'reliable' or 'unreliable'. In order to generate a fair benchmark, we introduce a novel human-annotated dataset of over 10,000 news items collected from a social network in Vietnam. All models will be evaluated in terms of AUC-ROC score, a typical evaluation metric for classification. The competition was run on the Codalab platform. Within two months, the challenge has attracted over 60 participants and recorded nearly 1,000 submission entries.

Submitted 16 December, 2020; originally announced December 2020.

arXiv:2003.10080 [pdf, other]

Lithography-free Kirchhoff's Metasurfaces

Authors: Takuhiro Kumagai, Naoki To, Armandas Balcytis, Gediminas Seniutinas, Saulius Juodkazis, Yoshiaki Nishijima

Abstract: Lithography-free metasurfaces composed of a nano-layered stack of materials are attractive not only due to their optical properties but also by virtue of fabrication simplicity and cost reduction of devices based on such structures. We demonstrate a multi-layer metasurface with engineered electromagnetic absorption in the mid-infrared (MIR) wavelength range. Characterisation of thin SiO$_2$ and Si films sandwiched between two Au layers by way of experimental absorption and thermal radiation measurements as well as finite difference time domain (FDTD) numerical simulations is presented. Comparison of experimental and simulation data of optical properties of multilayer metasurfaces shows guidelines for the absorber/emitter applications.

Submitted 23 March, 2020; originally announced March 2020.

Comments: 4 figures

arXiv:1810.13230 [pdf]

Methods for Segmentation and Classification of Digital Microscopy Tissue Images

Authors: Quoc Dang Vu, Simon Graham, Minh Nguyen Nhat To, Muhammad Shaban, Talha Qaiser, Navid Alemi Koohbanani, Syed Ali Khurram, Tahsin Kurc, Keyvan Farahani, Tianhao Zhao, Rajarsi Gupta, ** Tae Kwak, Nasir Rajpoot, Joel Saltz

Abstract: High-resolution microscopy images of tissue specimens provide detailed information about the morphology of normal and diseased tissue. Image analysis of tissue morphology can help cancer researchers develop a better understanding of cancer biology. Segmentation of nuclei and classification of tissue images are two common tasks in tissue image analysis. Development of accurate and efficient algorithms for these tasks is a challenging problem because of the complexity of tissue morphology and tumor heterogeneity. In this paper we present two computer algorithms: one designed for segmentation of nuclei and the other for classification of whole slide tissue images. The segmentation algorithm implements a multiscale deep residual aggregation network to accurately segment nuclear material and then separate clumped nuclei into individual nuclei. The classification algorithm initially carries out patch-level classification via a deep learning method, then patch-level statistical and morphological features are used as input to a random forest regression model for whole slide image classification. The segmentation and classification algorithms were evaluated in the MICCAI 2017 Digital Pathology challenge. The segmentation algorithm achieved an accuracy score of 0.78. The classification algorithm achieved an accuracy score of 0.81.

Submitted 16 November, 2018; v1 submitted 31 October, 2018; originally announced October 2018.

arXiv:1808.04277 [pdf, other]

doi 10.1016/j.media.2019.05.010

BACH: Grand Challenge on Breast Cancer Histology Images

Authors: Guilherme Aresta, Teresa Araújo, Scotty Kwok, Sai Saketh Chennamsetty, Mohammed Safwan, Varghese Alex, Bahram Marami, Marcel Prastawa, Monica Chan, Michael Donovan, Gerardo Fernandez, Jack Zeineh, Matthias Kohl, Christoph Walz, Florian Ludwig, Stefan Braunewell, Maximilian Baust, Quoc Dang Vu, Minh Nguyen Nhat To, Eal Kim, ** Tae Kwak, Sameh Galal, Veronica Sanchez-Freire, Nadia Brancati, Maria Frucci, et al. (11 additional authors not shown)

Abstract: Breast cancer is the most common invasive cancer in women, affecting more than 10% of women worldwide. Microscopic analysis of a biopsy remains one of the most important methods to diagnose the type of breast cancer. This requires specialized analysis by pathologists, in a task that i) is highly time- and cost-consuming and ii) often leads to nonconsensual results. The relevance and potential of automatic classification algorithms using hematoxylin-eosin stained histopathological images has already been demonstrated, but the reported results are still sub-optimal for clinical use. With the goal of advancing the state-of-the-art in automatic classification, the Grand Challenge on BreAst Cancer Histology images (BACH) was organized in conjunction with the 15th International Conference on Image Analysis and Recognition (ICIAR 2018). A large annotated dataset, composed of both microscopy and whole-slide images, was specifically compiled and made publicly available for the BACH challenge. Following a positive response from the scientific community, a total of 64 submissions, out of 677 registrations, effectively entered the competition. From the submitted algorithms it was possible to push forward the state-of-the-art in terms of accuracy (87%) in automatic classification of breast cancer with histopathological images. Convolutional neuronal networks were the most successful methodology in the BACH challenge. Detailed analysis of the collective results allowed the identification of remaining challenges in the field and recommendations for future developments. The BACH dataset remains publicly available so as to promote further improvements to the field of automatic classification in digital pathology.

Submitted 17 June, 2019; v1 submitted 13 August, 2018; originally announced August 2018.

Comments: Accepted for publication at Medical Image Analysis (Elsevier). Publication licensed under the Creative Commons CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/

Journal ref: Medical Image Analysis, 2019

arXiv:1502.05050 [pdf, other]

doi 10.1093/mnras/stv356

The LMC geometry and outer stellar populations from early DES data

Authors: Eduardo Balbinot, B. X. Santiago, L. Girardi, A. Pieres, L. N. da Costa, M. A. G. Maia, R. A. Gruendl, A. R. Walker, B. Yanny, A. Drlica-Wagner, A. Benoit-Levy, T. M. C. Abbott, S. S. Allam, J. Annis, J. P. Bernstein, R. A. Bernstein, E. Bertin, D. Brooks, E. Buckley-Geer, A. Carnero Rosell, C. E. Cunha, D. L. DePoy, S. Desai, H. T. Diehl, P. Doel, J. Estrada, et al. (28 additional authors not shown)

Abstract: The Dark Energy Camera has captured a large set of images as part of Science Verification (SV) for the Dark Energy Survey. The SV footprint covers a large portion of the outer Large Magellanic Cloud (LMC), providing photometry 1.5 magnitudes fainter than the main sequence turn-off of the oldest LMC stellar population. We derive geometrical and structural parameters for various stellar populations in the LMC disk. For the distribution of all LMC stars, we find an inclination of $i=-38.14^{\circ}\pm0.08^{\circ}$ (near side in the North) and a position angle for the line of nodes of $\theta_0=129.51^{\circ}\pm0.17^{\circ}$. We find that stars younger than $\sim 4$ Gyr are more centrally concentrated than older stars. Fitting a projected exponential disk shows that the scale radius of the old populations is $R_{>4 Gyr}=1.41\pm0.01$ kpc, while the younger population has $R_{<4 Gyr}=0.72\pm0.01$ kpc. However, the spatial distribution of the younger population deviates significantly from the projected exponential disk model. The distribution of old stars suggests a large truncation radius of $R_{t}=13.5\pm0.8$ kpc. If this truncation is dominated by the tidal field of the Galaxy, we find that the LMC is $\simeq 24^{+9}_{-6}$ times less massive than the encircled Galactic mass. By measuring the Red Clump peak magnitude and comparing with the best-fit LMC disk model, we find that the LMC disk is warped and thicker in the outer regions north of the LMC centre. Our findings may either be interpreted as a warped and flared disk in the LMC outskirts, or as evidence of a spheroidal halo component.

Submitted 17 February, 2015; originally announced February 2015.

Comments: 18 pages, 13 figures, 3 tables; accepted for publication in MNRAS

Showing 1–10 of 10 results for author: To, N