Search | arXiv e-print repository

doi 10.1142/S0219649224500370

Grammatical vs Spelling Error Correction: An Investigation into the Responsiveness of Transformer-based Language Models using BART and MarianMT

Authors: Rohit Raju, Peeta Basa Pati, SA Gandheesh, Gayatri Sanjana Sannala, Suriya KS

Abstract: Text continues to remain a relevant form of representation for information. Text documents are created either in digital native platforms or through the conversion of other media files such as images and speech. While the digital native text is invariably obtained through physical or virtual keyboards, technologies such as OCR and speech recognition are utilized to transform the images and speech signals into text content. All of these text-generation mechanisms also introduce errors into the captured text. This project aims to analyze the different kinds of errors that occur in text documents. The work employs two advanced deep neural network-based language models, namely BART and MarianMT, to rectify the anomalies present in the text. Transfer learning of these models with an available dataset is performed to finetune their capacity for error correction. A comparative study is conducted to investigate the effectiveness of these models in handling each of the defined error categories. It is observed that while both models can reduce the number of erroneous sentences by more than 20%, BART handles spelling errors far better (24.6%) than grammatical errors (8.8%).

Submitted 25 March, 2024; originally announced March 2024.

Journal ref: Journal of Information & Knowledge Management, 2024, World Scientific

arXiv:2401.02879 [pdf, other]

Efficient Parameter Optimisation for Quantum Kernel Alignment: A Sub-sampling Approach in Variational Training

Authors: M. Emre Sahin, Benjamin C. B. Symons, Pushpak Pati, Fayyaz Minhas, Declan Millar, Maria Gabrani, Jan Lukas Robertus, Stefano Mensa

Abstract: Quantum machine learning with quantum kernels for classification problems is a growing area of research. Recently, quantum kernel alignment techniques that parameterise the kernel have been developed, allowing the kernel to be trained and therefore aligned with a specific dataset. While quantum kernel alignment is a promising technique, it has been hampered by considerable training costs because the full kernel matrix must be constructed at every training iteration. Addressing this challenge, we introduce a novel method that seeks to balance efficiency and performance. We present a sub-sampling training approach that uses a subset of the kernel matrix at each training step, thereby reducing the overall computational cost of the training. In this work, we apply the sub-sampling method to synthetic datasets and a real-world breast cancer dataset and demonstrate considerable reductions in the number of circuits required to train the quantum kernel while maintaining classification accuracy.

Submitted 5 January, 2024; originally announced January 2024.

arXiv:2312.15010 [pdf, other]

SI-MIL: Taming Deep MIL for Self-Interpretability in Gigapixel Histopathology

Authors: Saarthak Kapse, Pushpak Pati, Srijan Das, Jingwei Zhang, Chao Chen, Maria Vakalopoulou, Joel Saltz, Dimitris Samaras, Rajarsi R. Gupta, Prateek Prasanna

Abstract: Introducing interpretability and reasoning into Multiple Instance Learning (MIL) methods for Whole Slide Image (WSI) analysis is challenging, given the complexity of gigapixel slides. Traditionally, MIL interpretability is limited to identifying salient regions deemed pertinent for downstream tasks, offering little insight to the end-user (pathologist) regarding the rationale behind these selections. To address this, we propose Self-Interpretable MIL (SI-MIL), a method intrinsically designed for interpretability from the very outset. SI-MIL employs a deep MIL framework to guide an interpretable branch grounded on handcrafted pathological features, facilitating linear predictions. Beyond identifying salient regions, SI-MIL uniquely provides feature-level interpretations rooted in pathological insights for WSIs. Notably, SI-MIL, with its linear prediction constraints, challenges the prevalent myth of an inevitable trade-off between model interpretability and performance, demonstrating competitive results compared to state-of-the-art methods on WSI-level prediction tasks across three cancer types. In addition, we thoroughly benchmark the local and global-interpretability of SI-MIL in terms of statistical analysis, a domain expert study, and desiderata of interpretability, namely, user-friendliness and faithfulness.

Submitted 18 May, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

arXiv:2310.11353 [pdf, other]

Hybrid quantum-classical graph neural networks for tumor classification in digital pathology

Authors: Anupama Ray, Dhiraj Madan, Srushti Patil, Maria Anna Rapsomaniki, Pushpak Pati

Abstract: Advances in classical machine learning and single-cell technologies have paved the way to understand interactions between disease cells and tumor microenvironments to accelerate therapeutic discovery. However, challenges in these machine learning methods and NP-hard problems in spatial biology create an opportunity for quantum computing algorithms. We create a hybrid quantum-classical graph neural network (GNN) that combines GNN with a Variational Quantum Classifier (VQC) for classifying binary sub-tasks in breast cancer subtyping. We explore two variants of the same, the first with fixed pretrained GNN parameters and the second with end-to-end training of GNN+VQC. The results demonstrate that the hybrid quantum neural network (QNN) is on par with the state-of-the-art classical graph neural networks (GNN) in terms of weighted precision, recall and F1-score. We also show that by means of amplitude encoding, we can compress information in a logarithmic number of qubits and attain better performance than using classical compression (which leads to information loss while keeping the number of qubits required constant in both regimes). Finally, we show that end-to-end training improves over fixed GNN parameters and also slightly improves over a vanilla GNN with the same number of dimensions.

Submitted 17 October, 2023; originally announced October 2023.

Comments: submitted to ICASSP 2023

arXiv:2307.13331 [pdf]

Unexpected magnetism explained in Cu/Cu2O-rGO nanocomposite

Authors: Rajarshi Roy, Kaustav Bhattacharjee, Satya Prakash Pati, Korak Biswas, Kalyan Kumar Chattopadhyay

Abstract: The observation of room temperature ferromagnetism along with a low temperature paramagnetic counterpart in undoped Cu-Cu2O-rGO nanocomposite was demonstrated. A phenomenological approach was taken to explain the observations based on 3D Ising model for arbitrary spins generated due to Cu vacancy in the Cu2O system preferably at the interface.

Submitted 25 July, 2023; originally announced July 2023.

arXiv:2307.05734 [pdf, other]

Towards quantum-enabled cell-centric therapeutics

Authors: Saugata Basu, Jannis Born, Aritra Bose, Sara Capponi, Dimitra Chalkia, Timothy A Chan, Hakan Doga, Frederik F. Flother, Gad Getz, Mark Goldsmith, Tanvi Gujarati, Aldo Guzman-Saenz, Dimitrios Iliopoulos, Gavin O. Jones, Stefan Knecht, Dhiraj Madan, Sabrina Maniscalco, Nicola Mariella, Joseph A. Morrone, Khadijeh Najafi, Pushpak Pati, Daniel Platt, Maria Anna Rapsomaniki, Anupama Ray, Kahn Rhrissorrakrai, et al. (8 additional authors not shown)

Abstract: In recent years, there has been tremendous progress in the development of quantum computing hardware, algorithms and services leading to the expectation that in the near future quantum computers will be capable of performing simulations for natural science applications, operations research, and machine learning at scales mostly inaccessible to classical computers. Whereas the impact of quantum computing has already started to be recognized in fields such as cryptanalysis, natural science simulations, and optimization among others, very little is known about the full potential of quantum computing simulations and machine learning in the realm of healthcare and life science (HCLS). Herein, we discuss the transformational changes we expect from the use of quantum computation for HCLS research, more specifically in the field of cell-centric therapeutics. Moreover, we identify and elaborate open problems in cell engineering, tissue modeling, perturbation modeling, and bio-topology while discussing candidate quantum algorithms for research on these topics and their potential advantages over classical computational approaches.

Submitted 1 August, 2023; v1 submitted 11 July, 2023; originally announced July 2023.

Comments: 6 figures

arXiv:2302.01287 [pdf, other]

Multi-scale Feature Alignment for Continual Learning of Unlabeled Domains

Authors: Kevin Thandiackal, Luigi Piccinelli, Pushpak Pati, Orcun Goksel

Abstract: Methods for unsupervised domain adaptation (UDA) help to improve the performance of deep neural networks on unseen domains without any labeled data. Especially in medical disciplines such as histopathology, this is crucial since large datasets with detailed annotations are scarce. While the majority of existing UDA methods focus on the adaptation from a labeled source to a single unlabeled target domain, many real-world applications with a long life cycle involve more than one target domain. Thus, the ability to sequentially adapt to multiple target domains becomes essential. In settings where the data from previously seen domains cannot be stored, e.g., due to data protection regulations, the above becomes a challenging continual learning problem. To this end, we propose to use generative feature-driven image replay in conjunction with a dual-purpose discriminator that not only enables the generation of images with realistic features for replay, but also promotes feature alignment during domain adaptation. We evaluate our approach extensively on a sequence of three histopathological datasets for tissue-type classification, achieving state-of-the-art results. We present detailed ablation experiments studying our proposed method components and demonstrate a possible use-case of our continual UDA method for an unsupervised patch-based segmentation task given high-resolution tissue images.

Submitted 2 February, 2023; originally announced February 2023.

arXiv:2301.02933 [pdf, other]

Weakly Supervised Joint Whole-Slide Segmentation and Classification in Prostate Cancer

Authors: Pushpak Pati, Guillaume Jaume, Zeineb Ayadi, Kevin Thandiackal, Behzad Bozorgtabar, Maria Gabrani, Orcun Goksel

Abstract: The segmentation and automatic identification of histological regions of diagnostic interest offer a valuable aid to pathologists. However, segmentation methods are hampered by the difficulty of obtaining pixel-level annotations, which are tedious and expensive to obtain for Whole-Slide images (WSI). To remedy this, weakly supervised methods have been developed to exploit the annotations directly available at the image level. However, to our knowledge, none of these techniques is adapted to deal with WSIs. In this paper, we propose WholeSIGHT, a weakly-supervised method, to simultaneously segment and classify WSIs of arbitrary shapes and sizes. Formally, WholeSIGHT first constructs a tissue-graph representation of the WSI, where the nodes and edges depict tissue regions and their interactions, respectively. During training, a graph classification head classifies the WSI and produces node-level pseudo labels via post-hoc feature attribution. These pseudo labels are then used to train a node classification head for WSI segmentation. During testing, both heads simultaneously render class prediction and segmentation for an input WSI. We evaluated WholeSIGHT on three public prostate cancer WSI datasets. Our method achieved state-of-the-art weakly-supervised segmentation performance on all datasets while resulting in better or comparable classification with respect to state-of-the-art weakly-supervised WSI classification methods. Additionally, we quantify the generalization capability of our method in terms of segmentation and classification performance, uncertainty estimation, and model calibration.

Submitted 7 January, 2023; originally announced January 2023.

arXiv:2301.01211 [pdf, other]

Generative appearance replay for continual unsupervised domain adaptation

Authors: Boqi Chen, Kevin Thandiackal, Pushpak Pati, Orcun Goksel

Abstract: Deep learning models can achieve high accuracy when trained on large amounts of labeled data. However, real-world scenarios often involve several challenges: Training data may become available in installments, may originate from multiple different domains, and may not contain labels for training. Certain settings, for instance medical applications, often involve further restrictions that prohibit retention of previously seen data due to privacy regulations. In this work, to address such challenges, we study unsupervised segmentation in continual learning scenarios that involve domain shift. To that end, we introduce GarDA (Generative Appearance Replay for continual Domain Adaptation), a generative-replay based approach that can adapt a segmentation model sequentially to new domains with unlabeled data. In contrast to single-step unsupervised domain adaptation (UDA), continual adaptation to a sequence of domains enables leveraging and consolidation of information from multiple domains. Unlike previous approaches in incremental UDA, our method does not require access to previously seen data, making it applicable in many practical scenarios. We evaluate GarDA on two datasets with different organs and modalities, where it substantially outperforms existing techniques.

Submitted 13 February, 2023; v1 submitted 3 January, 2023; originally announced January 2023.

Comments: Fixed typos

arXiv:2204.12454 [pdf, other]

Differentiable Zooming for Multiple Instance Learning on Whole-Slide Images

Authors: Kevin Thandiackal, Boqi Chen, Pushpak Pati, Guillaume Jaume, Drew F. K. Williamson, Maria Gabrani, Orcun Goksel

Abstract: Multiple Instance Learning (MIL) methods have become increasingly popular for classifying giga-pixel sized Whole-Slide Images (WSIs) in digital pathology. Most MIL methods operate at a single WSI magnification, by processing all the tissue patches. Such a formulation induces high computational requirements, and constrains the contextualization of the WSI-level representation to a single scale. A few MIL methods extend to multiple scales, but are computationally more demanding. In this paper, inspired by the pathological diagnostic process, we propose ZoomMIL, a method that learns to perform multi-level zooming in an end-to-end manner. ZoomMIL builds WSI representations by aggregating tissue-context information from multiple magnifications. The proposed method outperforms the state-of-the-art MIL methods in WSI classification on two large datasets, while significantly reducing the computational demands with regard to Floating-Point Operations (FLOPs) and processing time by up to 40x.

Submitted 26 July, 2022; v1 submitted 26 April, 2022; originally announced April 2022.

Comments: Typos corrected; Changed dataset name from INSEC to CRC upon dataset creators' request; Update affiliation and fix typos;

arXiv:2111.04740 [pdf, other]

BRACS: A Dataset for BReAst Carcinoma Subtyping in H&E Histology Images

Authors: Nadia Brancati, Anna Maria Anniciello, Pushpak Pati, Daniel Riccio, Giosuè Scognamiglio, Guillaume Jaume, Giuseppe De Pietro, Maurizio Di Bonito, Antonio Foncubierta, Gerardo Botti, Maria Gabrani, Florinda Feroce, Maria Frucci

Abstract: Breast cancer is the most commonly diagnosed cancer and registers the highest number of deaths for women with cancer. Recent advancements in diagnostic activities combined with large-scale screening policies have significantly lowered the mortality rates for breast cancer patients. However, the manual inspection of tissue slides by the pathologists is cumbersome, time-consuming, and is subject to significant inter- and intra-observer variability. Recently, the advent of whole-slide scanning systems has empowered the rapid digitization of pathology slides, and enabled the development of digital workflows. These advances further make it possible to leverage Artificial Intelligence (AI) to assist, automate, and augment pathological diagnosis. But the AI techniques, especially Deep Learning (DL), require a large amount of high-quality annotated data to learn from. Constructing such task-specific datasets poses several challenges, such as data-acquisition level constraints, time-consuming and expensive annotations, and anonymization of private information. In this paper, we introduce the BReAst Carcinoma Subtyping (BRACS) dataset, a large cohort of annotated Hematoxylin & Eosin (H&E)-stained images to facilitate the characterization of breast lesions. BRACS contains 547 Whole-Slide Images (WSIs), and 4539 Regions of Interest (ROIs) extracted from the WSIs. Each WSI, and respective ROIs, are annotated by the consensus of three board-certified pathologists into different lesion categories. Specifically, BRACS includes three lesion types, i.e., benign, malignant and atypical, which are further subtyped into seven categories. It is, to the best of our knowledge, the largest annotated dataset for breast cancer subtyping both at WSI- and ROI-level. Further, by including the understudied atypical lesions, BRACS offers a unique opportunity for leveraging AI to better understand their characteristics.

Submitted 8 November, 2021; originally announced November 2021.

Comments: 10 pages, 3 figures, 8 tables, 30 references

arXiv:2107.10073 [pdf, other]

HistoCartography: A Toolkit for Graph Analytics in Digital Pathology

Authors: Guillaume Jaume, Pushpak Pati, Valentin Anklin, Antonio Foncubierta, Maria Gabrani

Abstract: Advances in entity-graph based analysis of histopathology images have brought in a new paradigm to describe tissue composition, and learn the tissue structure-to-function relationship. Entity-graphs offer flexible and scalable representations to characterize tissue organization, while allowing the incorporation of prior pathological knowledge to further support model interpretability and explainability. However, entity-graph analysis requires prerequisites for image-to-graph translation and knowledge of state-of-the-art machine learning algorithms applied to graph-structured data, which can potentially hinder their adoption. In this work, we aim to alleviate these issues by developing HistoCartography, a standardized python API with necessary preprocessing, machine learning and explainability tools to facilitate graph-analytics in computational pathology. Further, we have benchmarked the computational time and performance on multiple datasets across different imaging types and histopathology tasks to highlight the applicability of the API for building computational pathology workflows.

Submitted 21 July, 2021; originally announced July 2021.

arXiv:2103.03129 [pdf, other]

Learning Whole-Slide Segmentation from Inexact and Incomplete Labels using Tissue Graphs

Authors: Valentin Anklin, Pushpak Pati, Guillaume Jaume, Behzad Bozorgtabar, Antonio Foncubierta-Rodríguez, Jean-Philippe Thiran, Mathilde Sibony, Maria Gabrani, Orcun Goksel

Abstract: Segmenting histology images into diagnostically relevant regions is imperative to support timely and reliable decisions by pathologists. To this end, computer-aided techniques have been proposed to delineate relevant regions in scanned histology slides. However, the techniques necessitate task-specific large datasets of annotated pixels, which is tedious, time-consuming, expensive, and infeasible to acquire for many histology tasks. Thus, weakly-supervised semantic segmentation techniques are proposed to utilize weak supervision that is cheaper and quicker to acquire. In this paper, we propose SegGini, a weakly supervised segmentation method using graphs, that can utilize weak multiplex annotations, i.e. inexact and incomplete annotations, to segment arbitrary and large images, scaling from tissue microarray (TMA) to whole slide image (WSI). Formally, SegGini constructs a tissue-graph representation for an input histology image, where the graph nodes depict tissue regions. Then, it performs weakly-supervised segmentation via node classification by using inexact image-level labels, incomplete scribbles, or both. We evaluated SegGini on two public prostate cancer datasets containing TMAs and WSIs. Our method achieved state-of-the-art segmentation performance on both datasets for various annotation settings while being comparable to a pathologist baseline.

Submitted 4 March, 2021; originally announced March 2021.

Comments: 10 pages, 5 figures

arXiv:2102.11057 [pdf, other]

Hierarchical Graph Representations in Digital Pathology

Authors: Pushpak Pati, Guillaume Jaume, Antonio Foncubierta, Florinda Feroce, Anna Maria Anniciello, Giosuè Scognamiglio, Nadia Brancati, Maryse Fiche, Estelle Dubruc, Daniel Riccio, Maurizio Di Bonito, Giuseppe De Pietro, Gerardo Botti, Jean-Philippe Thiran, Maria Frucci, Orcun Goksel, Maria Gabrani

Abstract: Cancer diagnosis, prognosis, and therapy response predictions from tissue specimens highly depend on the phenotype and topological distribution of constituting histological entities. Thus, adequate tissue representations for encoding histological entities is imperative for computer aided cancer patient care. To this end, several approaches have leveraged cell-graphs that encode cell morphology and organization to denote the tissue information. These allow for utilizing machine learning to map tissue representations to tissue functionality to help quantify their relationship. Though cellular information is crucial, it is incomplete alone to comprehensively characterize complex tissue structure. We herein treat the tissue as a hierarchical composition of multiple types of histological entities from fine to coarse level, capturing multivariate tissue information at multiple levels. We propose a novel multi-level hierarchical entity-graph representation of tissue specimens to model hierarchical compositions that encode histological entities as well as their intra- and inter-entity level interactions. Subsequently, a graph neural network is proposed to operate on the hierarchical entity-graph representation to map the tissue structure to tissue functionality. Specifically, for input histology images we utilize well-defined cells and tissue regions to build HierArchical Cell-to-Tissue (HACT) graph representations, and devise HACT-Net, a graph neural network, to classify such HACT representations. As part of this work, we introduce the BReAst Carcinoma Subtyping (BRACS) dataset, a large cohort of H&E stained breast tumor images, to evaluate our proposed methodology against pathologists and state-of-the-art approaches. Through comparative assessment and ablation studies, our method is demonstrated to yield superior classification results compared to alternative methods as well as pathologists.

Submitted 17 March, 2021; v1 submitted 22 February, 2021; originally announced February 2021.

arXiv:2011.12646 [pdf, other]

Quantifying Explainers of Graph Neural Networks in Computational Pathology

Authors: Guillaume Jaume, Pushpak Pati, Behzad Bozorgtabar, Antonio Foncubierta-Rodríguez, Florinda Feroce, Anna Maria Anniciello, Tilman Rau, Jean-Philippe Thiran, Maria Gabrani, Orcun Goksel

Abstract: Explainability of deep learning methods is imperative to facilitate their clinical adoption in digital pathology. However, popular deep learning methods and explainability techniques (explainers) based on pixel-wise processing disregard biological entities' notion, thus complicating comprehension by pathologists. In this work, we address this by adopting biological entity-based graph processing and graph explainers enabling explanations accessible to pathologists. In this context, a major challenge becomes to discern meaningful explainers, particularly in a standardized and quantifiable fashion. To this end, we propose herein a set of novel quantitative metrics based on statistics of class separability using pathologically measurable concepts to characterize graph explainers. We employ the proposed metrics to evaluate three types of graph explainers, namely the layer-wise relevance propagation, gradient-based saliency, and graph pruning approaches, to explain Cell-Graph representations for Breast Cancer Subtyping. The proposed metrics are also applicable in other domains by using domain-specific intuitive concepts. We validate the qualitative and quantitative findings on the BRACS dataset, a large cohort of breast cancer RoIs, by expert pathologists.

Submitted 14 May, 2021; v1 submitted 25 November, 2020; originally announced November 2020.

Comments: CVPR 2021

arXiv:2007.00584 [pdf, other]

HACT-Net: A Hierarchical Cell-to-Tissue Graph Neural Network for Histopathological Image Classification

Authors: Pushpak Pati, Guillaume Jaume, Lauren Alisha Fernandes, Antonio Foncubierta, Florinda Feroce, Anna Maria Anniciello, Giosue Scognamiglio, Nadia Brancati, Daniel Riccio, Maurizio Do Bonito, Giuseppe De Pietro, Gerardo Botti, Orcun Goksel, Jean-Philippe Thiran, Maria Frucci, Maria Gabrani

Abstract: Cancer diagnosis, prognosis, and therapeutic response prediction are heavily influenced by the relationship between the histopathological structures and the function of the tissue. Recent approaches acknowledging the structure-function relationship have linked the structural and spatial patterns of cell organization in tissue via cell-graphs to tumor grades. Though cell organization is imperative, it is insufficient to entirely represent the histopathological structure. We propose a novel hierarchical cell-to-tissue-graph (HACT) representation to improve the structural depiction of the tissue. It consists of a low-level cell-graph, capturing cell morphology and interactions, a high-level tissue-graph, capturing morphology and spatial distribution of tissue parts, and cells-to-tissue hierarchies, encoding the relative spatial distribution of the cells with respect to the tissue distribution. Further, a hierarchical graph neural network (HACT-Net) is proposed to efficiently map the HACT representations to histopathological breast cancer subtypes. We assess the methodology on a large set of annotated tissue regions of interest from H&E stained breast carcinoma whole-slides. Upon evaluation, the proposed method outperformed recent convolutional neural network and graph neural network approaches for breast cancer multi-class subtyping. The proposed entity-based topological analysis is more in line with the pathological diagnostic procedure of the tissue. It provides more command over the tissue modelling, and therefore encourages the further inclusion of pathological priors into task-specific tissue representation.

Submitted 1 July, 2020; originally announced July 2020.

arXiv:2007.00311 [pdf, other]

Towards Explainable Graph Representations in Digital Pathology

Authors: Guillaume Jaume, Pushpak Pati, Antonio Foncubierta-Rodriguez, Florinda Feroce, Giosue Scognamiglio, Anna Maria Anniciello, Jean-Philippe Thiran, Orcun Goksel, Maria Gabrani

Abstract: Explainability of machine learning (ML) techniques in digital pathology (DP) is of great significance to facilitate their wide adoption in clinics. Recently, graph techniques encoding relevant biological entities have been employed to represent and assess DP images. Such a paradigm shift from pixel-wise to entity-wise analysis provides more control over concept representation. In this paper, we introduce a post-hoc explainer to derive compact per-instance explanations emphasizing diagnostically important entities in the graph. Although we focus our analyses on cells and cellular interactions in breast cancer subtyping, the proposed explainer is generic enough to be extended to other topological representations in DP. Qualitative and quantitative analyses demonstrate the efficacy of the explainer in generating comprehensive and compact explanations.

Submitted 1 July, 2020; originally announced July 2020.

Comments: ICML'20 workshop on Computational Biology

arXiv:2006.13556 [pdf, other]

NINEPINS: Nuclei Instance Segmentation with Point Annotations

Authors: Ting-An Yen, Hung-Chun Hsu, Pushpak Pati, Maria Gabrani, Antonio Foncubierta-Rodríguez, Pau-Choo Chung

Abstract: Deep learning-based methods are gaining traction in digital pathology, with an increasing number of publications and challenges that aim at easing the work of systematically and exhaustively analyzing tissue slides. These methods often achieve very high accuracies, at the cost of requiring large annotated datasets to train. This requirement is especially difficult to fulfill in the medical field, where expert knowledge is essential. In this paper we focus on nuclei segmentation, which generally requires experienced pathologists to annotate the nuclear areas in gigapixel histological images. We propose an algorithm for instance segmentation that uses pseudo-label segmentations generated automatically from point annotations, as a method to reduce the burden for pathologists. With the generated segmentation masks, the proposed method trains a modified version of the HoVer-Net model to achieve instance segmentation. Experimental results show that the proposed method is robust to inaccuracies in point annotations, and comparison with HoVer-Net trained with fully annotated instance masks shows that a degradation in segmentation performance does not always imply a degradation in higher-order tasks such as tissue classification.

Submitted 24 June, 2020; originally announced June 2020.

arXiv:2006.09772 [pdf, other]

doi 10.1109/ISBI45749.2020.9098431

Mitosis Detection Under Limited Annotation: A Joint Learning Approach

Authors: Pushpak Pati, Antonio Foncubierta-Rodriguez, Orcun Goksel, Maria Gabrani

Abstract: Mitotic counting is a vital prognostic marker of tumor proliferation in breast cancer. Deep learning-based mitotic detection is on par with pathologists, but it requires large labeled data for training. We propose a deep classification framework for enhancing mitosis detection by leveraging class label information, via softmax loss, and spatial distribution information among samples, via distance metric learning. We also investigate strategies towards steadily providing informative samples to boost the learning. The efficacy of the proposed framework is established through evaluation on ICPR 2012 and AMIDA 2013 mitotic data. Our framework significantly improves the detection with small training data and achieves on par or superior performance compared to state-of-the-art methods when using the entire training data.

Submitted 2 July, 2020; v1 submitted 17 June, 2020; originally announced June 2020.

Comments: 2020 IEEE 17th International Symposium on Biomedical Imaging (ISBI)

arXiv:1701.04252 [pdf]

Lateral ferromagnetic domain control in Cr2O3/Pt/Co positive exchange bias system

Authors: T. Nozaki, M. Al-Mahdawi, S. P. Pati, S. Ye, M. Sahashi

Abstract: We investigated the perpendicular exchange bias (PEB) switching from the negative- to the positive-exchange bias state for a Cr2O3/Pt/Co exchange-coupled thin film system exhibiting positive exchange bias phenomena. By changing the Pt spacer layer thickness or the measurement temperature, we demonstrated the control of two kinds of intermediate states of the switching: the double hysteresis loop indicating local, non-averaged PEB, and the single hysteresis loop indicating averaged PEB. We proposed a way to control the lateral ferromagnetic domain through the control of PEB magnitude.

Submitted 16 January, 2017; originally announced January 2017.

arXiv:1608.04531 [pdf, other]

doi 10.1103/PhysRevB.94.224417

Finite-size scaling effect on Néel temperature of antiferromagnetic Cr$_2$O$_3$-(0001) films in an exchange-coupled heterostructure

Authors: Satya Prakash Pati, Muftah Al-Mahdawi, Shujun Ye, Yohei Shiokawa, Tomohiro Nozaki, Masashi Sahashi

Abstract: The scaling of the antiferromagnetic ordering temperature of corundum-type chromia films has been investigated. Néel temperature $T_N$ was determined from the effect of perpendicular exchange-bias on the magnetization of a weakly-coupled adjacent ferromagnet. For a thick-film case, the validity of detection is confirmed by a susceptibility measurement. Detection of $T_N$ was possible down to 1-nm-thin chromia films. The scaling of ordering temperature with thickness was studied using different buffering materials, and compared with Monte-Carlo simulations. The spin-correlation length and the corresponding critical exponent were estimated, and they were consistent between experimental and simulation results. The spin-correlation length is an order of magnitude less than that of cubic antiferromagnets. We propose that the difference is from the change of number of exchange-coupling links in the two crystal systems.

Submitted 4 January, 2017; v1 submitted 16 August, 2016; originally announced August 2016.

Journal ref: Physical Review B, vol. 94, no. 22, p. 224417, Dec. 2016

arXiv:1608.02390 [pdf, other]

doi 10.1103/PhysRevB.95.144423

Low-energy magnetoelectric control of domain states in exchange-coupled heterostructures

Authors: Muftah Al-Mahdawi, Satya Prakash Pati, Yohei Shiokawa, Shujun Ye, Tomohiro Nozaki, Masashi Sahashi

Abstract: The electric manipulation of antiferromagnets has become an area of great interest recently for zero-stray-field spintronic devices, and for their rich spin dynamics. Generally, the application of antiferromagnetic media for information memories and storage requires a heterostructure with a ferromagnetic layer for readout through the exchange-bias field. In magnetoelectric and multiferroic antiferromagnets, the exchange coupling exerts an additional impediment (energy barrier) to magnetization reversal by the applied magnetoelectric energy. We proposed and verified a method to overcome this barrier. We controlled the energy required for switching the magnetic domains in magnetoelectric Cr$_2$O$_3$ films by compensating the exchange-coupling energy from the ferromagnetic layer with the Zeeman energy of a small volumetric spontaneous magnetization found for the sputtered Cr$_2$O$_3$ films. Based on a simplified phenomenological model of the field-cooling process, the magnetic and electric fields required for switching could be tuned. As an example, the switching of antiferromagnetic domains around a zero-threshold electric field was demonstrated at a magnetic field of 2.6 kOe.

Submitted 10 January, 2017; v1 submitted 8 August, 2016; originally announced August 2016.

arXiv:1605.03680 [pdf]

Enhancing the blocking temperature of perpendicular-exchange biased Cr2O3 thin films using spacer and buffer layers

Authors: Naoki Shimomura, Satya Prakash Pati, Tomohiro Nozaki, Tatsuo Shibata, Masashi Sahashi

Abstract: In this study, we investigated the effect of spacer and buffer layers on the blocking temperature TB of the perpendicular exchange bias of thin Cr2O3 films, and revealed a high TB of 260 K for 20-nm-thick Cr2O3 thin films. By inserting a Ru spacer layer between the Cr2O3 and Co films and changing the spacer thickness, we controlled the magnitude of the exchange bias and TB. By comparing the TB values of the 20-nm-thick Cr2O3 films on Pt and alpha-Fe2O3 buffers, we investigated the lattice strain effect on the TB. We show that a higher TB value can be obtained using an alpha-Fe2O3 buffer, which is likely because of the lattice-strain-induced increase of Cr2O3 magnetic anisotropy.

Submitted 12 May, 2016; originally announced May 2016.

arXiv:1602.07547 [pdf, other]

doi 10.1088/1361-6463/aa623e

Critical behavior of sputter-deposited magnetoelectric antiferromagnetic Cr$_2$O$_3$ films near Néel temperature

Authors: Muftah Al-Mahdawi, Yohei Shiokawa, Satya Prakash Pati, Shujun Ye, Tomohiro Nozaki, Masashi Sahashi

Abstract: Chromium(III) oxide is a classical collinear antiferromagnet with a linear magnetoelectric effect. We present measurements of the magnetoelectric susceptibility $α$ of a sputter-deposited 500-nm film and a bulk single-crystal of Cr$_\mathrm{2}$O$_\mathrm{3}$. We investigated the magnetic phase-transition and the critical exponent $β$ of the sublattice magnetization near the Néel temperature. For the films, an exponent of 0.49(1) was found below 293 K, and changed to 1.06(4) near the Néel temperature of 298 K. For the single-crystal, the exponent was constant at 0.324(4). We investigated the reversal probability of antiferromagnetic domains during magnetoelectric field cooling. For the sputtered films, reversal probability was zero above 298 K and stabilized only below 293 K. We attribute this behavior to the formation of grains during film growth, which gives different intergrain and intragrain exchange-coupling energies. For the single-crystal, reversal probability was stabilized immediately at the Néel temperature of 307.6 K.

Submitted 20 May, 2016; v1 submitted 24 February, 2016; originally announced February 2016.

Journal ref: Journal of Physics D: Applied Physics 50, 155004 (2017)
