Skip to main content

Showing 1–12 of 12 results for author: Kohane, I

.
  1. arXiv:2407.08874  [pdf

    cs.DB

    Implications of map**s between ICD clinical diagnosis codes and Human Phenotype Ontology terms

    Authors: Amelia LM Tan, Rafael S Gonçalves, William Yuan, Gabriel A Brat, The Consortium for Clinical Characterization of COVID-19 by EHR, Robert Gentleman, Isaac S Kohane

    Abstract: Objective: Integrating EHR data with other resources is essential in rare disease research due to low disease prevalence. Such integration is dependent on the alignment of ontologies used for data annotation. The International Classification of Diseases (ICD) is used to annotate clinical diagnoses; the Human Phenotype Ontology (HPO) to annotate phenotypes. Although these ontologies overlap in biom… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  2. arXiv:2306.11547  [pdf, other

    cs.LG

    Event Stream GPT: A Data Pre-processing and Modeling Library for Generative, Pre-trained Transformers over Continuous-time Sequences of Complex Events

    Authors: Matthew B. A. McDermott, Bret Nestor, Peniel Argaw, Isaac Kohane

    Abstract: Generative, pre-trained transformers (GPTs, a.k.a. "Foundation Models") have reshaped natural language processing (NLP) through their versatility in diverse downstream tasks. However, their potential extends far beyond NLP. This paper provides a software utility to help realize this potential, extending the applicability of GPTs to continuous-time sequences of complex events with internal dependen… ▽ More

    Submitted 21 June, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

  3. arXiv:2212.10320  [pdf

    cs.AI q-bio.QM

    Construction of extra-large scale screening tools for risks of severe mental illnesses using real world healthcare data

    Authors: Dianbo Liu, Karmel W. Choi, Paulo Lizano, William Yuan, Kun-Hsing Yu, Jordan W. Smoller, Isaac Kohane

    Abstract: Importance: The prevalence of severe mental illnesses (SMIs) in the United States is approximately 3% of the whole population. The ability to conduct risk screening of SMIs at large scale could inform early prevention and treatment. Objective: A scalable machine learning based tool was developed to conduct population-level risk screening for SMIs, including schizophrenia, schizoaffective disorde… ▽ More

    Submitted 12 January, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

  4. arXiv:2212.01437  [pdf, other

    cs.LG

    Identifying Heterogeneous Treatment Effects in Multiple Outcomes using Joint Confidence Intervals

    Authors: Peniel N. Argaw, Elizabeth Healey, Isaac S. Kohane

    Abstract: Heterogeneous treatment effects (HTEs) are commonly identified during randomized controlled trials (RCTs). Identifying subgroups of patients with similar treatment effects is of high interest in clinical research to advance precision medicine. Often, multiple clinical outcomes are measured during an RCT, each having a potentially heterogeneous effect. Recently there has been high interest in ident… ▽ More

    Submitted 2 December, 2022; originally announced December 2022.

    Comments: Accepted to ML4H 2022. Available at https://proceedings.mlr.press/v193/argaw22a.html

    Journal ref: Proceedings of Machine Learning Research 193 (2022) 141-170

  5. arXiv:1911.10241  [pdf, other

    q-bio.QM cs.LG stat.ML

    Cross-modal representation alignment of molecular structure and perturbation-induced transcriptional profiles

    Authors: Samuel G. Finlayson, Matthew B. A. McDermott, Alex V. Pickering, Scott L. Lipnick, Isaac S. Kohane

    Abstract: Modeling the relationship between chemical structure and molecular activity is a key goal in drug development. Many benchmark tasks have been proposed for molecular property prediction, but these tasks are generally aimed at specific, isolated biomedical properties. In this work, we propose a new cross-modal small molecule retrieval task, designed to force a model to learn to associate the structu… ▽ More

    Submitted 1 October, 2020; v1 submitted 22 November, 2019; originally announced November 2019.

    Comments: Accepted for oral presentation at the Pacific Symposium of Biocomputing, 2021

  6. arXiv:1812.01547  [pdf, other

    cs.CV

    Towards generative adversarial networks as a new paradigm for radiology education

    Authors: Samuel G. Finlayson, Hyunkwang Lee, Isaac S. Kohane, Luke Oakden-Rayner

    Abstract: Medical students and radiology trainees typically view thousands of images in order to "train their eye" to detect the subtle visual patterns necessary for diagnosis. Nevertheless, infrastructural and legal constraints often make it difficult to access and quickly query an abundance of images with a user-specified feature set. In this paper, we use a conditional generative adversarial network (GAN… ▽ More

    Submitted 4 December, 2018; originally announced December 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:cs/0101200

    Report number: ML4H/2018/224

  7. arXiv:1811.01294  [pdf, other

    q-bio.QM cs.CL

    Learning Contextual Hierarchical Structure of Medical Concepts with Poincairé Embeddings to Clarify Phenotypes

    Authors: Brett K. Beaulieu-Jones, Isaac S. Kohane, Andrew L. Beam

    Abstract: Biomedical association studies are increasingly done using clinical concepts, and in particular diagnostic codes from clinical data repositories as phenotypes. Clinical concepts can be represented in a meaningful, vector space using word embedding models. These embeddings allow for comparison between clinical concepts or for straightforward input to machine learning models. Using traditional appro… ▽ More

    Submitted 3 November, 2018; originally announced November 2018.

    Comments: To appear in 2019 Pacific Symposium on Biocomputing

  8. arXiv:1804.05296  [pdf, other

    cs.CR cs.CY cs.LG stat.ML

    Adversarial Attacks Against Medical Deep Learning Systems

    Authors: Samuel G. Finlayson, Hyung Won Chung, Isaac S. Kohane, Andrew L. Beam

    Abstract: The discovery of adversarial examples has raised concerns about the practical deployment of deep learning systems. In this paper, we demonstrate that adversarial examples are capable of manipulating deep learning systems across three clinical domains. For each of our representative medical deep learning classifiers, both white and black box attacks were highly successful. Our models are representa… ▽ More

    Submitted 4 February, 2019; v1 submitted 14 April, 2018; originally announced April 2018.

  9. arXiv:1804.02097  [pdf, other

    stat.ME stat.AP stat.ML

    Multi-view Banded Spectral Clustering with Application to ICD9 Clustering

    Authors: Luwan Zhang, Katherine Liao, Issac Kohane, Tianxi Cai

    Abstract: Despite recent development in methodology, community detection remains a challenging problem. Existing literature largely focuses on the standard setting where a network is learned using an observed adjacency matrix from a single data source. Constructing a shared network from multiple data sources is more challenging due to the heterogeneity across populations. Additionally, no existing method le… ▽ More

    Submitted 20 June, 2018; v1 submitted 5 April, 2018; originally announced April 2018.

  10. arXiv:1804.01486  [pdf, other

    cs.CL cs.AI stat.ML

    Clinical Concept Embeddings Learned from Massive Sources of Multimodal Medical Data

    Authors: Andrew L. Beam, Benjamin Kompa, Allen Schmaltz, Inbar Fried, Griffin Weber, Nathan P. Palmer, Xu Shi, Tianxi Cai, Isaac S. Kohane

    Abstract: Word embeddings are a popular approach to unsupervised learning of word relationships that are widely used in natural language processing. In this article, we present a new set of embeddings for medical concepts learned using an extremely large collection of multimodal medical data. Leaning on recent theoretical insights, we demonstrate how an insurance claims database of 60 million members, a col… ▽ More

    Submitted 19 August, 2019; v1 submitted 4 April, 2018; originally announced April 2018.

  11. arXiv:1804.00735  [pdf, other

    stat.CO stat.AP

    A Fast Divide-and-Conquer Sparse Cox Regression

    Authors: Yan Wang, Nathan Palmer, Qian Di, Joel Schwartz, Isaac Kohane, Tianxi Cai

    Abstract: We propose a computationally and statistically efficient divide-and-conquer (DAC) algorithm to fit sparse Cox regression to massive datasets where the sample size $n_0$ is exceedingly large and the covariate dimension $p$ is not small but $n_0\gg p$. The proposed algorithm achieves computational efficiency through a one-step linear approximation followed by a least square approximation to the part… ▽ More

    Submitted 2 April, 2018; originally announced April 2018.

  12. arXiv:1710.03613  [pdf

    q-bio.NC

    Auditory Brainstem Response in Infants and Children with Autism: A Meta-Analysis

    Authors: Oren Miron, Andrew L. Beam, Isaac S. Kohane

    Abstract: Infants with autism were recently found to have prolonged Auditory Brainstem Response (ABR); however, at older ages, findings are contradictory. We compared ABR differences between participants with autism and controls with respect to age using a meta-analysis. Data sources included MEDLINE, EMBASE, Web of Science, Google Scholar, HOLLIS and ScienceDirect from their inception to June 2016. The 25… ▽ More

    Submitted 10 October, 2017; originally announced October 2017.