Skip to main content

Showing 1–23 of 23 results for author: Chen, R J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00224  [pdf, other

    cs.CV stat.AP

    Multimodal Prototy** for cancer survival prediction

    Authors: Andrew H. Song, Richard J. Chen, Guillaume Jaume, Anurag J. Vaidya, Alexander S. Baras, Faisal Mahmood

    Abstract: Multimodal survival methods combining gigapixel histology whole-slide images (WSIs) and transcriptomic profiles are particularly promising for patient prognostication and stratification. Current approaches involve tokenizing the WSIs into smaller patches (>10,000 patches) and transcriptomics into gene groups, which are then integrated using a Transformer for predicting outcomes. However, this proc… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

    Comments: ICML 2024

  2. arXiv:2406.16192  [pdf, other

    cs.CV

    HEST-1k: A Dataset for Spatial Transcriptomics and Histology Image Analysis

    Authors: Guillaume Jaume, Paul Doucet, Andrew H. Song, Ming Y. Lu, Cristina Almagro-Pérez, Sophia J. Wagner, Anurag J. Vaidya, Richard J. Chen, Drew F. K. Williamson, Ahrong Kim, Faisal Mahmood

    Abstract: Spatial transcriptomics (ST) enables interrogating the molecular composition of tissue with ever-increasing resolution, depth, and sensitivity. However, costs, rapidly evolving technology, and lack of standards have constrained computational methods in ST to narrow tasks and small cohorts. In addition, the underlying tissue morphology as reflected by H&E-stained whole slide images (WSIs) encodes r… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: Under review

  3. arXiv:2405.11643  [pdf, other

    cs.CV cs.LG stat.AP

    Morphological Prototy** for Unsupervised Slide Representation Learning in Computational Pathology

    Authors: Andrew H. Song, Richard J. Chen, Tong Ding, Drew F. K. Williamson, Guillaume Jaume, Faisal Mahmood

    Abstract: Representation learning of pathology whole-slide images (WSIs) has been has primarily relied on weak supervision with Multiple Instance Learning (MIL). However, the slide representations resulting from this approach are highly tailored to specific clinical tasks, which limits their expressivity and generalization, particularly in scenarios with limited data. Instead, we hypothesize that morphologi… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: CVPR 2024

  4. arXiv:2405.11618  [pdf, other

    cs.CV cs.AI

    Transcriptomics-guided Slide Representation Learning in Computational Pathology

    Authors: Guillaume Jaume, Lukas Oldenburg, Anurag Vaidya, Richard J. Chen, Drew F. K. Williamson, Thomas Peeters, Andrew H. Song, Faisal Mahmood

    Abstract: Self-supervised learning (SSL) has been successful in building patch embeddings of small histology images (e.g., 224x224 pixels), but scaling these models to learn slide embeddings from the entirety of giga-pixel whole-slide images (WSIs) remains challenging. Here, we leverage complementary information from gene expression profiles to guide slide representation learning using multimodal pre-traini… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: CVPR'24, Oral

  5. arXiv:2405.04434  [pdf, other

    cs.CL cs.AI

    DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

    Authors: DeepSeek-AI, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Dengr, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding , et al. (132 additional authors not shown)

    Abstract: We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of which 21B are activated for each token, and supports a context length of 128K tokens. DeepSeek-V2 adopts innovative architectures including Multi-head Latent Attention (MLA) and DeepSeekMoE. MLA guarantees efficient inference… ▽ More

    Submitted 19 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  6. arXiv:2312.07814  [pdf, other

    cs.CV cs.AI

    A Foundational Multimodal Vision Language AI Assistant for Human Pathology

    Authors: Ming Y. Lu, Bowen Chen, Drew F. K. Williamson, Richard J. Chen, Kenji Ikamura, Georg Gerber, Ivy Liang, Long Phi Le, Tong Ding, Anil V Parwani, Faisal Mahmood

    Abstract: The field of computational pathology has witnessed remarkable progress in the development of both task-specific predictive models and task-agnostic self-supervised vision encoders. However, despite the explosive growth of generative artificial intelligence (AI), there has been limited study on building general purpose, multimodal AI assistants tailored to pathology. Here we present PathChat, a vis… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  7. arXiv:2308.15474  [pdf, other

    cs.CV cs.AI q-bio.TO

    A General-Purpose Self-Supervised Model for Computational Pathology

    Authors: Richard J. Chen, Tong Ding, Ming Y. Lu, Drew F. K. Williamson, Guillaume Jaume, Bowen Chen, Andrew Zhang, Daniel Shao, Andrew H. Song, Muhammad Shaban, Mane Williams, Anurag Vaidya, Sharifa Sahai, Lukas Oldenburg, Luca L. Weishaupt, Judy J. Wang, Walt Williams, Long Phi Le, Georg Gerber, Faisal Mahmood

    Abstract: Tissue phenoty** is a fundamental computational pathology (CPath) task in learning objective characterizations of histopathologic biomarkers in anatomic pathology. However, whole-slide imaging (WSI) poses a complex computer vision problem in which the large-scale image resolutions of WSIs and the enormous diversity of morphological phenotypes preclude large-scale data annotation. Current efforts… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

  8. arXiv:2307.12914  [pdf, other

    cs.CV cs.AI

    Towards a Visual-Language Foundation Model for Computational Pathology

    Authors: Ming Y. Lu, Bowen Chen, Drew F. K. Williamson, Richard J. Chen, Ivy Liang, Tong Ding, Guillaume Jaume, Igor Odintsov, Andrew Zhang, Long Phi Le, Georg Gerber, Anil V Parwani, Faisal Mahmood

    Abstract: The accelerated adoption of digital pathology and advances in deep learning have enabled the development of powerful models for various pathology tasks across a diverse array of diseases and patient cohorts. However, model training is often difficult due to label scarcity in the medical domain and the model's usage is limited by the specific task and disease for which it is trained. Additionally,… ▽ More

    Submitted 25 July, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

  9. arXiv:2306.07831  [pdf, other

    cs.CV

    Visual Language Pretrained Multiple Instance Zero-Shot Transfer for Histopathology Images

    Authors: Ming Y. Lu, Bowen Chen, Andrew Zhang, Drew F. K. Williamson, Richard J. Chen, Tong Ding, Long Phi Le, Yung-Sung Chuang, Faisal Mahmood

    Abstract: Contrastive visual language pretraining has emerged as a powerful method for either training new language-aware image encoders or augmenting existing pretrained models with zero-shot visual recognition capabilities. However, existing works typically train on large datasets of image-text pairs and have been designed to perform downstream tasks involving only small to medium sized-images, neither of… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

    Comments: Accepted to CVPR 2023

  10. arXiv:2206.08885  [pdf, other

    eess.IV cs.CV cs.LG stat.ME

    Incorporating intratumoral heterogeneity into weakly-supervised deep learning models via variance pooling

    Authors: Iain Carmichael, Andrew H. Song, Richard J. Chen, Drew F. K. Williamson, Tiffany Y. Chen, Faisal Mahmood

    Abstract: Supervised learning tasks such as cancer survival prediction from gigapixel whole slide images (WSIs) are a critical challenge in computational pathology that requires modeling complex features of the tumor microenvironment. These learning tasks are often solved with deep multi-instance learning (MIL) models that do not explicitly capture intratumoral heterogeneity. We develop a novel variance poo… ▽ More

    Submitted 19 November, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

    Comments: MICCAI 2022

  11. arXiv:2206.02647  [pdf, other

    cs.CV

    Scaling Vision Transformers to Gigapixel Images via Hierarchical Self-Supervised Learning

    Authors: Richard J. Chen, Chengkuan Chen, Yicong Li, Tiffany Y. Chen, Andrew D. Trister, Rahul G. Krishnan, Faisal Mahmood

    Abstract: Vision Transformers (ViTs) and their multi-scale and hierarchical variations have been successful at capturing image representations but their use has been generally studied for low-resolution images (e.g. - 256x256, 384384). For gigapixel whole-slide imaging (WSI) in computational pathology, WSIs can be as large as 150000x150000 pixels at 20X magnification and exhibit a hierarchical structure of… ▽ More

    Submitted 6 June, 2022; originally announced June 2022.

    Comments: Accepted to CVPR 2022 (Oral)

  12. arXiv:2203.00585  [pdf, other

    cs.CV q-bio.TO

    Self-Supervised Vision Transformers Learn Visual Concepts in Histopathology

    Authors: Richard J. Chen, Rahul G. Krishnan

    Abstract: Tissue phenoty** is a fundamental task in learning objective characterizations of histopathologic biomarkers within the tumor-immune microenvironment in cancer pathology. However, whole-slide imaging (WSI) is a complex computer vision in which: 1) WSIs have enormous image resolutions with precludes large-scale pixel-level efforts in data curation, and 2) diversity of morphological phenotypes res… ▽ More

    Submitted 1 March, 2022; originally announced March 2022.

    Comments: Learning Meaningful Representations of Life (NeurIPS 2021)

  13. arXiv:2110.00603  [pdf, other

    cs.CV cs.LG

    Algorithm Fairness in AI for Medicine and Healthcare

    Authors: Richard J. Chen, Tiffany Y. Chen, Jana Lipkova, Judy J. Wang, Drew F. K. Williamson, Ming Y. Lu, Sharifa Sahai, Faisal Mahmood

    Abstract: In the current development and deployment of many artificial intelligence (AI) systems in healthcare, algorithm fairness is a challenging problem in delivering equitable care. Recent evaluation of AI models stratified across race sub-populations have revealed inequalities in how patients are diagnosed, given treatments, and billed for healthcare costs. In this perspective article, we summarize the… ▽ More

    Submitted 23 March, 2022; v1 submitted 1 October, 2021; originally announced October 2021.

  14. arXiv:2108.02278  [pdf, other

    cs.CV cs.AI q-bio.GN q-bio.QM q-bio.TO

    Pan-Cancer Integrative Histology-Genomic Analysis via Interpretable Multimodal Deep Learning

    Authors: Richard J. Chen, Ming Y. Lu, Drew F. K. Williamson, Tiffany Y. Chen, Jana Lipkova, Muhammad Shaban, Maha Shady, Mane Williams, Bum** Joo, Zahra Noor, Faisal Mahmood

    Abstract: The rapidly emerging field of deep learning-based computational pathology has demonstrated promise in develo** objective prognostic models from histology whole slide images. However, most prognostic models are either based on histology or genomics alone and do not address how histology and genomics can be integrated to develop joint image-omic prognostic models. Additionally identifying explaina… ▽ More

    Submitted 4 August, 2021; originally announced August 2021.

    Comments: Demo: http://pancancer.mahmoodlab.org

  15. arXiv:2107.13048  [pdf, other

    eess.IV cs.CV q-bio.TO

    Whole Slide Images are 2D Point Clouds: Context-Aware Survival Prediction using Patch-based Graph Convolutional Networks

    Authors: Richard J. Chen, Ming Y. Lu, Muhammad Shaban, Chengkuan Chen, Tiffany Y. Chen, Drew F. K. Williamson, Faisal Mahmood

    Abstract: Cancer prognostication is a challenging task in computational pathology that requires context-aware representations of histology features to adequately infer patient survival. Despite the advancements made in weakly-supervised deep learning, many approaches are not context-aware and are unable to model important morphological feature interactions between cell identities and tissue types that are p… ▽ More

    Submitted 27 July, 2021; originally announced July 2021.

    Comments: MICCAI 2021

  16. arXiv:2009.10190  [pdf, other

    eess.IV cs.CV cs.LG q-bio.TO

    Federated Learning for Computational Pathology on Gigapixel Whole Slide Images

    Authors: Ming Y. Lu, Dehan Kong, Jana Lipkova, Richard J. Chen, Rajendra Singh, Drew F. K. Williamson, Tiffany Y. Chen, Faisal Mahmood

    Abstract: Deep Learning-based computational pathology algorithms have demonstrated profound ability to excel in a wide array of tasks that range from characterization of well known morphological phenotypes to predicting non-human-identifiable features from histology such as molecular alterations. However, the development of robust, adaptable, and accurate deep learning-based models often rely on the collect… ▽ More

    Submitted 22 September, 2020; v1 submitted 21 September, 2020; originally announced September 2020.

  17. arXiv:2008.12949  [pdf, other

    cs.CV cs.LG

    VR-Caps: A Virtual Environment for Capsule Endoscopy

    Authors: Kagan Incetan, Ibrahim Omer Celik, Abdulhamid Obeid, Guliz Irem Gokceler, Kutsev Bengisu Ozyoruk, Yasin Almalioglu, Richard J. Chen, Faisal Mahmood, Hunter Gilbert, Nicholas J. Durr, Mehmet Turan

    Abstract: Current capsule endoscopes and next-generation robotic capsules for diagnosis and treatment of gastrointestinal diseases are complex cyber-physical platforms that must orchestrate complex software and hardware functions. The desired tasks for these systems include visual localization, depth estimation, 3D map**, disease detection and segmentation, automated navigation, active control, path reali… ▽ More

    Submitted 14 January, 2021; v1 submitted 29 August, 2020; originally announced August 2020.

    Comments: 18 pages, 14 figures

  18. arXiv:2004.09666  [pdf, other

    eess.IV cs.CV cs.LG q-bio.TO

    Data Efficient and Weakly Supervised Computational Pathology on Whole Slide Images

    Authors: Ming Y. Lu, Drew F. K. Williamson, Tiffany Y. Chen, Richard J. Chen, Matteo Barbieri, Faisal Mahmood

    Abstract: The rapidly emerging field of computational pathology has the potential to enable objective diagnosis, therapeutic response prediction and identification of new morphological features of clinical relevance. However, deep learning-based computational pathology approaches either require manual annotation of gigapixel whole slide images (WSIs) in fully-supervised settings or thousands of WSIs with sl… ▽ More

    Submitted 21 May, 2020; v1 submitted 20 April, 2020; originally announced April 2020.

  19. arXiv:2002.05459  [pdf, other

    cs.CV cs.LG eess.IV

    EndoL2H: Deep Super-Resolution for Capsule Endoscopy

    Authors: Yasin Almalioglu, Kutsev Bengisu Ozyoruk, Abdulkadir Gokce, Kagan Incetan, Guliz Irem Gokceler, Muhammed Ali Simsek, Kivanc Ararat, Richard J. Chen, Nicholas J. Durr, Faisal Mahmood, Mehmet Turan

    Abstract: Although wireless capsule endoscopy is the preferred modality for diagnosis and assessment of small bowel diseases, the poor camera resolution is a substantial limitation for both subjective and automated diagnostics. Enhanced-resolution endoscopy has shown to improve adenoma detection rate for conventional endoscopy and is likely to do the same for capsule endoscopy. In this work, we propose and… ▽ More

    Submitted 22 June, 2020; v1 submitted 13 February, 2020; originally announced February 2020.

    Comments: 23 pages, submitted to IEEE Transactions on Medical Imaging, corresponding Author: Mehmet Turan

  20. arXiv:1912.08937  [pdf, other

    cs.CV q-bio.GN q-bio.TO

    Pathomic Fusion: An Integrated Framework for Fusing Histopathology and Genomic Features for Cancer Diagnosis and Prognosis

    Authors: Richard J. Chen, Ming Y. Lu, **gwen Wang, Drew F. K. Williamson, Scott J. Rodig, Neal I. Lindeman, Faisal Mahmood

    Abstract: Cancer diagnosis, prognosis, and therapeutic response predictions are based on morphological information from histology slides and molecular profiles from genomic data. However, most deep learning-based objective outcome prediction and grading paradigms are based on histology or genomics alone and do not make use of the complementary information in an intuitive manner. In this work, we propose Pat… ▽ More

    Submitted 3 September, 2020; v1 submitted 18 December, 2019; originally announced December 2019.

    Comments: Code and trained models are made available at: https://github.com/mahmoodlab/PathomicFusion

  21. arXiv:1910.13328  [pdf, other

    cs.CV cs.LG eess.IV q-bio.TO

    Weakly Supervised Prostate TMA Classification via Graph Convolutional Networks

    Authors: **gwen Wang, Richard J. Chen, Ming Y. Lu, Alexander Baras, Faisal Mahmood

    Abstract: Histology-based grade classification is clinically important for many cancer types in stratifying patients distinct treatment groups. In prostate cancer, the Gleason score is a grading system used to measure the aggressiveness of prostate cancer from the spatial organization of cells and the distribution of glands. However, the subjective interpretation of Gleason score often suffers from large in… ▽ More

    Submitted 6 November, 2019; v1 submitted 29 October, 2019; originally announced October 2019.

  22. arXiv:1910.10825  [pdf, other

    cs.CV q-bio.TO

    Semi-Supervised Histology Classification using Deep Multiple Instance Learning and Contrastive Predictive Coding

    Authors: Ming Y. Lu, Richard J. Chen, **gwen Wang, Debora Dillon, Faisal Mahmood

    Abstract: Convolutional neural networks can be trained to perform histology slide classification using weak annotations with multiple instance learning (MIL). However, given the paucity of labeled histology data, direct application of MIL can easily suffer from overfitting and the network is unable to learn rich feature representations due to the weak supervisory signal. We propose to overcome such limitati… ▽ More

    Submitted 2 November, 2019; v1 submitted 23 October, 2019; originally announced October 2019.

  23. arXiv:1907.00283  [pdf, other

    eess.IV cs.CV cs.RO

    SLAM Endoscopy enhanced by adversarial depth prediction

    Authors: Richard J. Chen, Taylor L. Bobrow, Thomas Athey, Faisal Mahmood, Nicholas J. Durr

    Abstract: Medical endoscopy remains a challenging application for simultaneous localization and map** (SLAM) due to the sparsity of image features and size constraints that prevent direct depth-sensing. We present a SLAM approach that incorporates depth predictions made by an adversarially-trained convolutional neural network (CNN) applied to monocular endoscopy images. The depth network is trained with s… ▽ More

    Submitted 29 June, 2019; originally announced July 2019.

    Report number: KDD'19 Workshop on Applied Data Science for Healthcare