Search | arXiv e-print repository

PRISM: A Multi-Modal Generative Foundation Model for Slide-Level Histopathology

Authors: George Shaikovski, Adam Casson, Kristen Severson, Eric Zimmermann, Yi Kan Wang, Jeremy D. Kunz, Juan A. Retamero, Gerard Oakley, David Klimstra, Christopher Kanan, Matthew Hanna, Michal Zelechowski, Julian Viret, Neil Tenenholtz, James Hall, Nicolo Fusi, Razik Yousfi, Peter Hamilton, William A. Moye, Eugene Vorontsov, Siqi Liu, Thomas J. Fuchs

Abstract: Foundation models in computational pathology promise to unlock the development of new clinical decision support systems and models for precision medicine. However, there is a mismatch between most clinical analysis, which is defined at the level of one or more whole slide images, and foundation models to date, which process the thousands of image tiles contained in a whole slide image separately. The requirement to train a network to aggregate information across a large number of tiles in multiple whole slide images limits these models' impact. In this work, we present a slide-level foundation model for H&E-stained histopathology, PRISM, that builds on Virchow tile embeddings and leverages clinical report text for pre-training. Using the tile embeddings, PRISM produces slide-level embeddings with the ability to generate clinical reports, resulting in several modes of use. Using text prompts, PRISM achieves zero-shot cancer detection and sub-typing performance approaching and surpassing that of a supervised aggregator model. Using the slide embeddings with linear classifiers, PRISM surpasses supervised aggregator models. Furthermore, we demonstrate that fine-tuning of the PRISM slide encoder yields label-efficient training for biomarker prediction, a task that typically suffers from low availability of training data; an aggregator initialized with PRISM and trained on as little as 10% of the training data can outperform a supervised baseline that uses all of the data.

Submitted 22 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

arXiv:2309.07778 [pdf, other]

Virchow: A Million-Slide Digital Pathology Foundation Model

Authors: Eugene Vorontsov, Alican Bozkurt, Adam Casson, George Shaikovski, Michal Zelechowski, Siqi Liu, Kristen Severson, Eric Zimmermann, James Hall, Neil Tenenholtz, Nicolo Fusi, Philippe Mathieu, Alexander van Eck, Donghun Lee, Julian Viret, Eric Robert, Yi Kan Wang, Jeremy D. Kunz, Matthew C. H. Lee, Jan Bernhard, Ran A. Godrich, Gerard Oakley, Ewan Millar, Matthew Hanna, Juan Retamero, et al. (6 additional authors not shown)

Abstract: The use of artificial intelligence to enable precision medicine and decision support systems through the analysis of pathology images has the potential to revolutionize the diagnosis and treatment of cancer. Such applications will depend on models' abilities to capture the diverse patterns observed in pathology images. To address this challenge, we present Virchow, a foundation model for computational pathology. Using self-supervised learning empowered by the DINOv2 algorithm, Virchow is a vision transformer model with 632 million parameters trained on 1.5 million hematoxylin and eosin stained whole slide images from diverse tissue and specimen types, which is orders of magnitude more data than previous works. The Virchow model enables the development of a pan-cancer detection system with 0.949 overall specimen-level AUC across 17 different cancer types, while also achieving 0.937 AUC on 7 rare cancer types. The Virchow model sets the state-of-the-art on the internal and external image tile level benchmarks and slide level biomarker prediction tasks. The gains in performance highlight the importance of training on massive pathology image datasets, suggesting scaling up the data and network architecture can improve the accuracy for many high-impact computational pathology applications where limited amounts of training data are available.

Submitted 17 January, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

arXiv:2208.13690 [pdf, other]

doi 10.1145/3555077.3556470

Terahertz Communications Can Work in Rain and Snow: Impact of Adverse Weather Conditions on Channels at 140 GHz

Authors: Priyangshu Sen, Jacob Hall, Michele Polese, Vitaly Petrov, Duschia Bodet, Francesco Restuccia, Tommaso Melodia, Josep M. Jornet

Abstract: Next-generation wireless networks will leverage the spectrum above 100 GHz to enable ultra-high data rate communications over multi-GHz-wide bandwidths. The propagation environment at such high frequencies, however, introduces challenges throughout the whole protocol stack design, from physical layer signal processing to application design. Therefore, it is fundamental to develop a holistic understanding of the channel propagation and fading characteristics over realistic deployment scenarios and ultra-wide bands. In this paper, we conduct an extensive measurement campaign to evaluate the impact of weather conditions on a wireless link in the 130-150 GHz band through a channel sounding campaign with clear weather, rain, and snow in a typical urban backhaul scenario. We present a novel channel sounder design that captures signals with -82 dBm sensitivity and 20 GHz of bandwidth. We analyze link budget, capacity, as well as channel parameters such as the delay spread and the K-factor. Our experimental results indicate that in the considered context the adverse weather does not interrupt the link, but introduces some additional constraints (e.g., high delay spread and increase in path loss in snow conditions) that need to be accounted for in the design of reliable Sixth Generation (6G) communication links above 100 GHz.

Submitted 29 August, 2022; originally announced August 2022.

Comments: P. Sen, J. Hall, M. Polese, V. Petrov, D. Bodet, F. Restuccia, T. Melodia, J. M. Jornet. 2022. Terahertz Communications Can Work in Rain and Snow: Impact of Adverse Weather Conditions on Channels at 140 GHz. In 6th ACM Workshop on Millimeter-Wave and Terahertz Networks and Sensing Systems (mmNets'22), October 17, 2022, Sydney, NSW, Australia. ACM, New York, NY, USA, 6 pages

arXiv:2203.16398 [pdf, other]

Incorporating Gradient Similarity for Robust Time Delay Estimation in Ultrasound Elastography

Authors: Md Ashikuzzaman, Timothy J. Hall, Hassan Rivaz

Abstract: Energy-based ultrasound elastography techniques minimize a regularized cost function consisting of data and continuity terms to obtain local displacement estimates based on the local time-delay estimation (TDE) between radio-frequency (RF) frames. The data term associated with the existing techniques takes only the amplitude similarity into account and hence is not sufficiently robust to the outlier samples present in the RF frames under consideration. This drawback creates noticeable artifacts in the strain image. To resolve this issue, we propose to formulate the data function as a linear combination of the amplitude and gradient similarity constraints. We estimate the adaptive weight concerning each similarity term following an iterative scheme. Finally, we optimize the non-linear cost function in an efficient manner to convert the problem to a sparse system of linear equations which are solved for millions of variables. We call our technique rGLUE: robust data term in GLobal Ultrasound Elastography. rGLUE has been validated using simulation, phantom, in vivo liver, and breast datasets. In all of our experiments, rGLUE substantially outperforms the recent elastography methods both visually and quantitatively. For simulated, phantom, and in vivo datasets, respectively, rGLUE achieves 107%, 18%, and 23% improvements of signal-to-noise ratio (SNR) and 61%, 19%, and 25% improvements of contrast-to-noise ratio (CNR) over GLUE, a recently-published elastography algorithm.

Submitted 30 March, 2022; originally announced March 2022.

arXiv:2203.10678 [pdf, other]

SweiNet: Deep Learning Based Uncertainty Quantification for Ultrasound Shear Wave Elasticity Imaging

Authors: Felix Q. Jin, Lindsey C. Carlson, Helen Feltovich, Timothy J. Hall, Mark L. Palmeri

Abstract: In ultrasound shear wave elasticity (SWE) imaging, a number of algorithms exist for estimating the shear wave speed (SWS) from spatiotemporal displacement data. However, no method provides a well-calibrated and practical uncertainty metric, hindering SWE's clinical adoption and utility in downstream decision-making. Here, we designed a deep learning SWS estimator that simultaneously outputs a quantitative and well-calibrated uncertainty value for each estimate. Our deep neural network (DNN) takes as input a single 2D spatiotemporal plane of tracked displacement data and outputs the two parameters $m$ and $σ$ of a log-normal probability distribution. For training and testing, we used in vivo 2D-SWE data of the cervix collected from 30 pregnant subjects, totaling 551 acquisitions and >2 million space-time plots. Points were grouped by uncertainty into bins to assess uncertainty calibration: the predicted uncertainty closely matched the root-mean-square estimation error, with an average absolute percent deviation of 3.84%. We created a leave-one-out ensemble model that estimated uncertainty with better calibration (1.45%) than any individual ensemble member on a held-out patient's data. Lastly, we applied the DNN to an external dataset to evaluate its generalizability. We have made the trained model, SweiNet, openly available to provide the research community with a fast SWS estimator that also outputs a well-calibrated estimate of the predictive uncertainty.

Submitted 20 March, 2022; originally announced March 2022.

Comments: 9 pages, 8 figures

arXiv:2109.09900 [pdf]

Estimation of the Scatterer Size Distributions in Quantitative Ultrasound Using Constrained Optimization

Authors: Noushin Jafarpisheh, Ivan M. Rosado-Mendez, Timothy J. Hall, Hassan Rivaz

Abstract: Quantitative ultrasound (QUS) parameters such as the effective scatterer diameter (ESD) reveal tissue properties by analyzing ultrasound backscattered echo signal. ESD can be attained through parametrizing backscatter coefficient using form factor models. However, reporting a single scatterer size cannot accurately characterize a tissue, particularly when the media contains scattering sources with a broad range of sizes. Here we estimate the probability of contribution of each scatterer size by modeling the measured form factor as a linear combination of form factors from individual scatterer sizes. We perform the estimation using two novel techniques. In the first technique, we cast scatterer size distribution as an optimization problem, and efficiently solve it using a linear system of equations. In the second technique, we use the solution of this system of equations to constrain the optimization function, and solve the constrained problem. The methods are evaluated in simulated backscattered coefficients using Faran theory. We evaluate the robustness of the proposed techniques by adding Gaussian noise. The results show that both methods can accurately estimate the scatterer size distribution, and that the second method outperforms the first one.

Submitted 20 September, 2021; originally announced September 2021.

arXiv:2109.03184 [pdf, other]

The Convergence of Blockchain, IoT and 6G: Potential, Opportunities, Challenges and Research Roadmap

Authors: Abu Jahid, Mohammed H. Alsharif, Trevor J. Hall

Abstract: The world is undergoing a profound transformation with the advent of intelligent information era. 6G networks envisioned being the game changer in next generation wireless communication systems that will address the challenges of limited information speed escalated with the augmentation of billions of data applications encountered by the current fifth generation (5G) networks. Some key radical technologies in 6G together with existing 5G candidate schemes will guarantee the expected quality of experience (QoE) to attain ubiquitous wireless connectivity for the Internet of Everything (IoE) ranging from the telecom industry to digital smart industries. Blockchain technology (BCT) has gained significant attention due to undertake the decentralization, transparency, spectrum resource scarcity, inherent privacy and security, poor interoperability, confidentiality, and emerging smart applications domains including Industrial IoT and Industry 4.0. The mismatch between the requirements of many data intensive disruptive IoT applications and 5G network capabilities steered the demand of decentralized BCT based 6G architecture. Inspired by these facts, this paper studies an extensive survey to draw a new direction of blockchain integration into 6G mobile networks, IoT technologies, and smart industries focusing the potential merits and challenges in terms of infrastructure sharing, computational loads, latency, bandwidth overhead, business model, sustainability goals, and edge intelligence. We highlighted the convergence of IoT in blockchain to enable intelligent distribution in future industrial IoT and the technical model of 6G networks to realize the successful deployment of BCT schemes. This paper pointed out the current intriguing challenges, canvassed the mitigation techniques, and plausible future research opportunities that may benefit the pursuit of this vision.

Submitted 7 September, 2021; originally announced September 2021.

arXiv:2012.02738 [pdf, other]

Ultrasound Scatterer Density Classification Using Convolutional Neural Networks by Exploiting Patch Statistics

Authors: Ali K. Z. Tehrani, Mina Amiri, Ivan M. Rosado-Mendez, Timothy J. Hall, Hassan Rivaz

Abstract: Quantitative ultrasound (QUS) can reveal crucial information on tissue properties such as scatterer density. If the scatterer density per resolution cell is above or below 10, the tissue is considered as fully developed speckle (FDS) or low-density scatterers (LDS), respectively. Conventionally, the scatterer density has been classified using estimated statistical parameters of the amplitude of backscattered echoes. However, if the patch size is small, the estimation is not accurate. These parameters are also highly dependent on imaging settings. In this paper, we propose a convolutional neural network (CNN) architecture for QUS, and train it using simulation data. We further improve the network performance by utilizing patch statistics as additional input channels. We evaluate the network using simulation data, experimental phantoms and in vivo data. We also compare our proposed network with different classic and deep learning models, and demonstrate its superior performance in classification of tissues with different scatterer density values. The results also show that the proposed network is able to work with different imaging parameters with no need for a reference phantom. This work demonstrates the potential of CNNs in classifying scatterer density in ultrasound images.

Submitted 4 December, 2020; originally announced December 2020.

Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

arXiv:2012.00155 [pdf, other]

A Contemporary Survey on Free Space Optical Communication: Potential, Technical Challenges, Recent Advances and Research Direction

Authors: Abu Jahid, Mohammed H. Alsharif, Trevor J. Hall

Abstract: Optical wireless communication (OWC) covering an ultra-wide range of unlicensed spectrum has emerged as an extent efficient solution to mitigate conventional RF spectrum scarcity ranging from communication distances from nm to several kilometers. Free space optical (FSO) systems operating near IR (NIR) band in OWC links has received substantial attention for enormous data transmission between fixed transceivers covering few kilometers path distance due to high optical bandwidth and higher bit rate as well. Despite the potential benefits of FSO technology, its widespread link reliability suffers especially in the long-range deployment due to atmospheric turbulence, cloud induced fading, some other environmental factors such as fog, aerosol, temperature variations, storms, heavy rain, cloud, pointing error, and scintillation. FSO has the potential to offloading massive traffic demands from RF networks, consequently the combined application of FSO/RF and radio over FSO (RoFSO) systems is regarded as an excellent solution to support 5G and beyond for improving the limitations of an individual system. This survey presents the overview of several key technologies and implications of state-of-the-art criteria in terms of spectrum reuse, classification, architecture and applications are described for understanding FSO. This paper provides principle, significance, demonstration, and recent technological development of FSO technology among different appealing optical wireless technologies. The opportunities in the near future, the potential challenges that need to be addressed to realize the successful deployment of FSO schemes are outlined.

Submitted 30 November, 2020; originally announced December 2020.

Comments: 59 pages, 14 figures

Showing 1–9 of 9 results for author: Hall, J