Showing 1–19 of 19 results for author: Lasko, T A

  1. arXiv:2405.10993  [pdf]

    q-bio.QM

    No winners: Performance of lung cancer prediction models depends on screening-detected, incidental, and biopsied pulmonary nodule use cases

    Authors: Thomas Z. Li, Kaiwen Xu, Aravind Krishnan, Riqiang Gao, Michael N. Kammer, Sanja Antic, David Xiao, Michael Knight, Yency Martinez, Rafael Paez, Robert J. Lentz, Stephen Deppen, Eric L. Grogan, Thomas A. Lasko, Kim L. Sandler, Fabien Maldonado, Bennett A. Landman

    Abstract: Statistical models for predicting lung cancer have the potential to facilitate earlier diagnosis of malignancy and avoid invasive workup of benign disease. Many models have been published, but comparative studies of their utility in different clinical settings in which patients would arguably most benefit are scarce. This study retrospectively evaluated promising predictive models for lung cancer…

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: Submitted to Radiology: AI

  2. arXiv:2402.05802  [pdf, other]

    cs.LG stat.AP stat.ML

    Unsupervised Discovery of Clinical Disease Signatures Using Probabilistic Independence

    Authors: Thomas A. Lasko, John M. Still, Thomas Z. Li, Marco Barbero Mota, William W. Stead, Eric V. Strobl, Bennett A. Landman, Fabien Maldonado

    Abstract: Insufficiently precise diagnosis of clinical disease is likely responsible for many treatment failures, even for common conditions and treatments. With a large enough dataset, it may be possible to use unsupervised machine learning to define clinical disease patterns more precisely. We present an approach to learning these patterns by using probabilistic independence to disentangle the imprint on…

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 29 Pages, 8 figures

    ACM Class: I.2.6; I.2.1; J.3

  3. arXiv:2311.04787  [pdf]

    cs.LG cs.PF stat.ML

    Why Do Probabilistic Clinical Models Fail To Transport Between Sites?

    Authors: Thomas A. Lasko, Eric V. Strobl, William W. Stead

    Abstract: The rising popularity of artificial intelligence in healthcare is highlighting the problem that a computational model achieving super-human clinical performance at its training sites may perform substantially worse at new sites. In this perspective, we present common sources for this failure to transport, which we divide into sources under the control of the experimenter and sources inherent to th…

    Submitted 28 December, 2023; v1 submitted 8 November, 2023; originally announced November 2023.

    Comments: 20 pages, 3 figures

  4. arXiv:2304.02836  [pdf, other]

    eess.IV cs.CV cs.LG

    Longitudinal Multimodal Transformer Integrating Imaging and Latent Clinical Signatures From Routine EHRs for Pulmonary Nodule Classification

    Authors: Thomas Z. Li, John M. Still, Kaiwen Xu, Ho Hin Lee, Leon Y. Cai, Aravind R. Krishnan, Riqiang Gao, Mirza S. Khan, Sanja Antic, Michael Kammer, Kim L. Sandler, Fabien Maldonado, Bennett A. Landman, Thomas A. Lasko

    Abstract: The accuracy of predictive models for solitary pulmonary nodule (SPN) diagnosis can be greatly increased by incorporating repeat imaging and medical context, such as electronic health records (EHRs). However, clinically routine modalities such as imaging and diagnostic codes can be asynchronous and irregularly sampled over different time scales which are obstacles to longitudinal multimodal learni…

    Submitted 29 June, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

    Comments: Accepted to MICCAI 2023

  5. arXiv:2210.15340  [pdf, other]

    stat.ML cs.LG stat.AP

    Sample-Specific Root Causal Inference with Latent Variables

    Authors: Eric V. Strobl, Thomas A. Lasko

    Abstract: Root causal analysis seeks to identify the set of initial perturbations that induce an unwanted outcome. In prior work, we defined sample-specific root causes of disease using exogenous error terms that predict a diagnosis in a structural equation model. We rigorously quantified predictivity using Shapley values. However, the associated algorithms for inferring root causes assume no latent confoun…

    Submitted 27 October, 2022; originally announced October 2022.

  6. arXiv:2209.14378  [pdf, other]

    eess.IV cs.CV

    UNesT: Local Spatial Representation Learning with Hierarchical Transformer for Efficient Medical Segmentation

    Authors: Xin Yu, Qi Yang, Yinchi Zhou, Leon Y. Cai, Riqiang Gao, Ho Hin Lee, Thomas Li, Shunxing Bao, Zhoubing Xu, Thomas A. Lasko, Richard G. Abramson, Zizhao Zhang, Yuankai Huo, Bennett A. Landman, Yucheng Tang

    Abstract: Transformer-based models, capable of learning better global dependencies, have recently demonstrated exceptional representation learning capabilities in computer vision and medical image analysis. Transformer reformats the image into separate patches and realizes global communication via the self-attention mechanism. However, positional information between patches is hard to preserve in such 1D se…

    Submitted 7 September, 2023; v1 submitted 28 September, 2022; originally announced September 2022.

    Comments: 19 pages, 17 figures. arXiv admin note: text overlap with arXiv:2203.02430

  7. arXiv:2209.01676  [pdf]

    eess.IV cs.CV q-bio.QM

    Time-distance vision transformers in lung cancer diagnosis from longitudinal computed tomography

    Authors: Thomas Z. Li, Kaiwen Xu, Riqiang Gao, Yucheng Tang, Thomas A. Lasko, Fabien Maldonado, Kim Sandler, Bennett A. Landman

    Abstract: Features learned from single radiologic images are unable to provide information about whether and how much a lesion may be changing over time. Time-dependent features computed from repeated images can capture those changes and help identify malignant lesions by their temporal behavior. However, longitudinal medical imaging presents the unique challenge of sparse, irregular time intervals in data…

    Submitted 4 September, 2022; originally announced September 2022.

    Comments: Submitted to SPIE 2023 - Medical Imaging. 10 pages

  8. arXiv:2206.08833  [pdf]

    cs.CV

    A Comparative Study of Confidence Calibration in Deep Learning: From Computer Vision to Medical Imaging

    Authors: Riqiang Gao, Thomas Li, Yucheng Tang, Zhoubing Xu, Michael Kammer, Sanja L. Antic, Kim Sandler, Fabien Maldonado, Thomas A. Lasko, Bennett Landman

    Abstract: Although deep learning prediction models have been successful in the discrimination of different classes, they can often suffer from poor calibration across challenging domains including healthcare. Moreover, the long-tail distribution poses great challenges in deep learning classification problems including clinical disease prediction. There are approaches proposed recently to calibrate deep pred…

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: 17 pages, 6 figures

  9. arXiv:2205.13085  [pdf, other]

    stat.ML cs.LG stat.AP stat.ME

    Identifying Patient-Specific Root Causes with the Heteroscedastic Noise Model

    Authors: Eric V. Strobl, Thomas A. Lasko

    Abstract: Complex diseases are caused by a multitude of factors that may differ between patients even within the same diagnostic category. A few underlying root causes may nevertheless initiate the development of disease within each patient. We therefore focus on identifying patient-specific root causes of disease, which we equate to the sample-specific predictivity of the exogenous error terms in a structu…

    Submitted 6 July, 2023; v1 submitted 25 May, 2022; originally announced May 2022.

  10. arXiv:2205.11627  [pdf, other]

    stat.ML cs.LG stat.AP stat.ME

    Identifying Patient-Specific Root Causes of Disease

    Authors: Eric V. Strobl, Thomas A. Lasko

    Abstract: Complex diseases are caused by a multitude of factors that may differ between patients. As a result, hypothesis tests comparing all patients to all healthy controls can detect many significant variables with inconsequential effect sizes. A few highly predictive root causes may nevertheless generate disease within each patient. In this paper, we define patient-specific root causes as variables subj…

    Submitted 23 May, 2022; originally announced May 2022.

  11. arXiv:2203.02430  [pdf, other]

    eess.IV cs.CV

    Characterizing Renal Structures with 3D Block Aggregate Transformers

    Authors: Xin Yu, Yucheng Tang, Yinchi Zhou, Riqiang Gao, Qi Yang, Ho Hin Lee, Thomas Li, Shunxing Bao, Yuankai Huo, Zhoubing Xu, Thomas A. Lasko, Richard G. Abramson, Bennett A. Landman

    Abstract: Efficiently quantifying renal structures can provide distinct spatial context and facilitate biomarker discovery for kidney morphology. However, the development and evaluation of the transformer model to segment the renal cortex, medulla, and collecting system remains challenging due to data inefficiency. Inspired by the hierarchical structures in vision transformer, we propose a novel method usin…

    Submitted 4 March, 2022; originally announced March 2022.

  12. arXiv:2111.13229  [pdf, other]

    stat.ML cs.LG stat.ME

    Generalizing Clinical Trials with Convex Hulls

    Authors: Eric V. Strobl, Thomas A. Lasko

    Abstract: Randomized clinical trials eliminate confounding but impose strict exclusion criteria that limit recruitment to a subset of the population. Observational datasets are more inclusive but suffer from confounding -- often providing overly optimistic estimates of treatment response over time due to partially optimized physician prescribing patterns. We therefore assume that the unconfounded treatment…

    Submitted 27 October, 2022; v1 submitted 25 November, 2021; originally announced November 2021.

  13. arXiv:2107.11882  [pdf, other]

    eess.IV cs.CV cs.LG

    Lung Cancer Risk Estimation with Incomplete Data: A Joint Missing Imputation Perspective

    Authors: Riqiang Gao, Yucheng Tang, Kaiwen Xu, Ho Hin Lee, Steve Deppen, Kim Sandler, Pierre Massion, Thomas A. Lasko, Yuankai Huo, Bennett A. Landman

    Abstract: Data from multi-modality provide complementary information in clinical prediction, but missing data in clinical cohorts limits the number of subjects in multi-modal learning context. Multi-modal missing imputation is challenging with existing methods when 1) the missing data span across heterogeneous modalities (e.g., image vs. non-image); or 2) one modality is largely missing. In this paper, we a…

    Submitted 25 July, 2021; originally announced July 2021.

    Comments: Early Accepted by MICCAI 2021. Traveling Award

  14. arXiv:2105.00455  [pdf, other]

    stat.ML cs.LG stat.ME

    Synthesized Difference in Differences

    Authors: Eric V. Strobl, Thomas A. Lasko

    Abstract: We consider estimating the conditional average treatment effect for everyone by eliminating confounding and selection bias. Unfortunately, randomized clinical trials (RCTs) eliminate confounding but impose strict exclusion criteria that prevent sampling of the entire clinical population. Observational datasets are more inclusive but suffer from confounding. We therefore analyze RCT and observation…

    Submitted 11 June, 2021; v1 submitted 2 May, 2021; originally announced May 2021.

    Comments: Accepted to ACM BCB 2021

  15. arXiv:2003.07921  [pdf, other]

    cs.LG stat.ML

    Semi-supervised Contrastive Learning Using Partial Label Information

    Authors: Colin B. Hansen, Vishwesh Nath, Diego A. Mesa, Yuankai Huo, Bennett A. Landman, Thomas A. Lasko

    Abstract: In semi-supervised learning, information from unlabeled examples is used to improve the model learned from labeled examples. In some learning problems, partial label information can be inferred from otherwise unlabeled examples and used to further improve the model. In particular, partial label information exists when subsets of training examples are known to have the same label, even though the l…

    Submitted 3 June, 2024; v1 submitted 17 March, 2020; originally announced March 2020.

  16. arXiv:1907.11051  [pdf, other]

    stat.AP

    Computational Phenotype Discovery via Probabilistic Independence

    Authors: Thomas A. Lasko, Diego A. Mesa

    Abstract: Computational Phenotype Discovery research has taken various pragmatic approaches to disentangling phenotypes from the episodic observations in Electronic Health Records. In this work, we use transformation into continuous, longitudinal curves to abstract away the sparse irregularity of the data, and we introduce probabilistic independence as a guiding principle for disentangling phenotypes into p…

    Submitted 25 July, 2019; originally announced July 2019.

    Comments: Presented at KDD Workshop on Applied Data Science for Healthcare 2019

  17. arXiv:1906.09549  [pdf]

    eess.IV cs.CV

    Fully Automatic Liver Attenuation Estimation Combing CNN Segmentation and Morphological Operations

    Authors: Yuankai Huo, James G. Terry, Jiachen Wang, Sangeeta Nair, Thomas A. Lasko, Barry I. Freedman, J. Jeffery Carr, Bennett A. Landman

    Abstract: Manually tracing regions of interest (ROIs) within the liver is the de facto standard method for measuring liver attenuation on computed tomography (CT) in diagnosing nonalcoholic fatty liver disease (NAFLD). However, manual tracing is resource intensive. To address these limitations and to expand the availability of a quantitative CT measure of hepatic steatosis, we propose the automatic liver at…

    Submitted 29 June, 2019; v1 submitted 23 June, 2019; originally announced June 2019.

    Comments: Medical Physics

  18. arXiv:1802.04233  [pdf, other]

    stat.AP

    Embedding Complexity In the Data Representation Instead of In the Model: A Case Study Using Heterogeneous Medical Data

    Authors: Jacek M. Bajor, Diego A. Mesa, Travis J. Osterman, Thomas A. Lasko

    Abstract: Electronic Health Records have become popular sources of data for secondary research, but their use is hampered by the amount of effort it takes to overcome the sparsity, irregularity, and noise that they contain. Modern learning architectures can remove the need for expert-driven feature engineering, but not the need for expert-driven preprocessing to abstract away the inherent messiness of clini…

    Submitted 12 February, 2018; originally announced February 2018.

  19. arXiv:1402.4732  [pdf, other]

    stat.ML cs.LG stat.AP

    Efficient Inference of Gaussian Process Modulated Renewal Processes with Application to Medical Event Data

    Authors: Thomas A. Lasko

    Abstract: The episodic, irregular and asynchronous nature of medical data render them difficult substrates for standard machine learning algorithms. We would like to abstract away this difficulty for the class of time-stamped categorical variables (or events) by modeling them as a renewal process and inferring a probability density over continuous, longitudinal, nonparametric intensity functions modulating…

    Submitted 19 February, 2014; originally announced February 2014.

    Comments: 8 pages, 4 figures

    Report number: VU-DBMI-2014-01-001

    ACM Class: G.3; I.2.1; I.5.1; I.5.2; I.5.4; J.3