-
PLUTO: Pathology-Universal Transformer
Authors:
Dinkar Juyal,
Harshith Padigela,
Chintan Shah,
Daniel Shenker,
Natalia Harguindeguy,
Yi Liu,
Blake Martin,
Yibo Zhang,
Michael Nercessian,
Miles Markey,
Isaac Finberg,
Kelsey Luu,
Daniel Borders,
Syed Ashar Javed,
Emma Krause,
Raymond Biju,
Aashish Sood,
Allen Ma,
Jackson Nyman,
John Shamshoian,
Guillaume Chhor,
Darpan Sanghavi,
Marc Thibault,
Limin Yu,
Fedaa Najdawi
, et al. (8 additional authors not shown)
Abstract:
Pathology is the study of microscopic inspection of tissue, and a pathology diagnosis is often the medical gold standard to diagnose disease. Pathology images provide a unique challenge for computer-vision-based analysis: a single pathology Whole Slide Image (WSI) is gigapixel-sized and often contains hundreds of thousands to millions of objects of interest across multiple resolutions. In this wor…
▽ More
Pathology is the study of microscopic inspection of tissue, and a pathology diagnosis is often the medical gold standard to diagnose disease. Pathology images provide a unique challenge for computer-vision-based analysis: a single pathology Whole Slide Image (WSI) is gigapixel-sized and often contains hundreds of thousands to millions of objects of interest across multiple resolutions. In this work, we propose PathoLogy Universal TransfOrmer (PLUTO): a light-weight pathology FM that is pre-trained on a diverse dataset of 195 million image tiles collected from multiple sites and extracts meaningful representations across multiple WSI scales that enable a large variety of downstream pathology tasks. In particular, we design task-specific adaptation heads that utilize PLUTO's output embeddings for tasks which span pathology scales ranging from subcellular to slide-scale, including instance segmentation, tile classification, and slide-level prediction. We compare PLUTO's performance to other state-of-the-art methods on a diverse set of external and internal benchmarks covering multiple biologically relevant tasks, tissue types, resolutions, stains, and scanners. We find that PLUTO matches or outperforms existing task-specific baselines and pathology-specific foundation models, some of which use orders-of-magnitude larger datasets and model sizes when compared to PLUTO. Our findings present a path towards a universal embedding to power pathology image analysis, and motivate further exploration around pathology foundation models in terms of data diversity, architectural improvements, sample efficiency, and practical deployability in real-world applications.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
SPIRE-SIES: A Spontaneous Indian English Speech Corpus
Authors:
Abhayjeet Singh,
Charu Shah,
Rajashri Varadaraj,
Sonakshi Chauhan,
Prasanta Kumar Ghosh
Abstract:
In this paper, we present a 170.83 hour Indian English spontaneous speech dataset. Lack of Indian English speech data is one of the major hindrances in develo** robust speech systems which are adapted to the Indian speech style. Moreover this scarcity is even more for spontaneous speech. This corpus is crowd sourced over varied Indian nativities, genders and age groups. Traditional spontaneous s…
▽ More
In this paper, we present a 170.83 hour Indian English spontaneous speech dataset. Lack of Indian English speech data is one of the major hindrances in develo** robust speech systems which are adapted to the Indian speech style. Moreover this scarcity is even more for spontaneous speech. This corpus is crowd sourced over varied Indian nativities, genders and age groups. Traditional spontaneous speech collection strategies involve capturing of speech during interviewing or conversations. In this study, we use images as stimuli to induce spontaneity in speech. Transcripts for 23 hours is generated and validated which can serve as a spontaneous speech ASR benchmark. Quality of the corpus is validated with voice activity detection based segmentation, gender verification and image semantic correlation. Which determines a relationship between image stimulus and recorded speech using caption keywords derived from Image2Text model and high occurring words derived from whisper ASR generated transcripts.
△ Less
Submitted 1 December, 2023;
originally announced December 2023.
-
ContriMix: Scalable stain color augmentation for domain generalization without domain labels in digital pathology
Authors:
Tan H. Nguyen,
Dinkar Juyal,
** Li,
Aaditya Prakash,
Shima Nofallah,
Chintan Shah,
Sai Chowdary Gullapally,
Limin Yu,
Michael Griffin,
Anand Sampat,
John Abel,
Justin Lee,
Amaro Taylor-Weiner
Abstract:
Differences in staining and imaging procedures can cause significant color variations in histopathology images, leading to poor generalization when deploying deep-learning models trained from a different data source. Various color augmentation methods have been proposed to generate synthetic images during training to make models more robust, eliminating the need for stain normalization during test…
▽ More
Differences in staining and imaging procedures can cause significant color variations in histopathology images, leading to poor generalization when deploying deep-learning models trained from a different data source. Various color augmentation methods have been proposed to generate synthetic images during training to make models more robust, eliminating the need for stain normalization during test time. Many color augmentation methods leverage domain labels to generate synthetic images. This approach causes three significant challenges to scaling such a model. Firstly, incorporating data from a new domain into deep-learning models trained on existing domain labels is not straightforward. Secondly, dependency on domain labels prevents the use of pathology images without domain labels to improve model performance. Finally, implementation of these methods becomes complicated when multiple domain labels (e.g., patient identification, medical center, etc) are associated with a single image. We introduce ContriMix, a novel domain label free stain color augmentation method based on DRIT++, a style-transfer method. Contrimix leverages sample stain color variation within a training minibatch and random mixing to extract content and attribute information from pathology images. This information can be used by a trained ContriMix model to create synthetic images to improve the performance of existing classifiers. ContriMix outperforms competing methods on the Camelyon17-WILDS dataset. Its performance is consistent across different slides in the test set while being robust to the color variation from rare substances in pathology images. We make our code and trained ContriMix models available for research use. The code for ContriMix can be found at https://gitlab.com/huutan86/contrimix
△ Less
Submitted 8 March, 2024; v1 submitted 7 June, 2023;
originally announced June 2023.
-
High-Fidelity Model of Stand-Alone Diesel Electric Generator with Hybrid Turbine-Governor Configuration for Microgrid Studies
Authors:
Chinmay Shah,
Mariko Shirazi,
Richard Wies,
Phylicia Cicilio,
Timothy Hansen,
Reinaldo Tonkoski
Abstract:
Diesel electric generators are an inherent part of remote hybrid microgrids found in remote regions of the world that provide primary frequency response (PFR) to restore system frequency during load or generation changes. However, with inverter-based resources (IBR) integration into microgrids, the IBR control provides a fast frequency response (FFR) to restore the system frequency. Hence, supplem…
▽ More
Diesel electric generators are an inherent part of remote hybrid microgrids found in remote regions of the world that provide primary frequency response (PFR) to restore system frequency during load or generation changes. However, with inverter-based resources (IBR) integration into microgrids, the IBR control provides a fast frequency response (FFR) to restore the system frequency. Hence, supplementing PFR with FFR requires a sophisticated control system and a high fidelity diesel electric generator model to design these control systems. In this work, a high-fidelity model of a diesel electric generator is developed. Its parameters are tuned using a surrogate optimization algorithm by emulating its response during a load change to a 400 kVA Caterpillar C-15 diesel generator, similar to those found in remote microgrids. The diesel electric generator model consists of a synchronous machine, DC4B excitation with V/Hz limiter, and a proposed modified IEEE GGOV1 engine-governor model (GGOV1D). The performance of the GGOV1D is compared with simple, Woodward DEGOV, and a standard IEEE GGOV1 engine-governor model. Results show that error in the diesel electric generator's response to load changes using the GGOV1D model is lower with an improved frequency response during the arresting and rebound period than the other engine-governor models.
△ Less
Submitted 29 April, 2022;
originally announced April 2022.