Skip to main content

Showing 1–8 of 8 results for author: Gallifant, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00242  [pdf, other

    cs.CL

    EHRmonize: A Framework for Medical Concept Abstraction from Electronic Health Records using Large Language Models

    Authors: João Matos, Jack Gallifant, Jian Pei, A. Ian Wong

    Abstract: Electronic health records (EHRs) contain vast amounts of complex data, but harmonizing and processing this information remains a challenging and costly task requiring significant clinical expertise. While large language models (LLMs) have shown promise in various healthcare applications, their potential for abstracting medical concepts from EHRs remains largely unexplored. We introduce EHRmonize,… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

    Comments: submitted for review, total of 10 pages

  2. arXiv:2406.19653  [pdf, other

    cs.LG cs.AI

    ACES: Automatic Cohort Extraction System for Event-Stream Datasets

    Authors: Justin Xu, Jack Gallifant, Alistair E. W. Johnson, Matthew B. A. McDermott

    Abstract: Reproducibility remains a significant challenge in machine learning (ML) for healthcare. In this field, datasets, model pipelines, and even task/cohort definitions are often private, leading to a significant barrier in sharing, iterating, and understanding ML results on electronic health record (EHR) datasets. In this paper, we address a significant part of this problem by introducing the Automati… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: For ACES Online Documentation, see https://eventstreamaces.readthedocs.io/en/latest/

  3. arXiv:2406.13152  [pdf, other

    cs.CL

    Analyzing Diversity in Healthcare LLM Research: A Scientometric Perspective

    Authors: David Restrepo, Chenwei Wu, Constanza Vásquez-Venegas, João Matos, Jack Gallifant, Luis Filipe

    Abstract: The deployment of large language models (LLMs) in healthcare has demonstrated substantial potential for enhancing clinical decision-making, administrative efficiency, and patient outcomes. However, the underrepresentation of diverse groups in the development and application of these models can perpetuate biases, leading to inequitable healthcare delivery. This paper presents a comprehensive scient… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  4. arXiv:2406.12066  [pdf, other

    cs.CL

    Language Models are Surprisingly Fragile to Drug Names in Biomedical Benchmarks

    Authors: Jack Gallifant, Shan Chen, Pedro Moreira, Nikolaj Munch, Mingye Gao, Jackson Pond, Leo Anthony Celi, Hugo Aerts, Thomas Hartvigsen, Danielle Bitterman

    Abstract: Medical knowledge is context-dependent and requires consistent reasoning across various natural language expressions of semantically equivalent phrases. This is particularly crucial for drug names, where patients often use brand names like Advil or Tylenol instead of their generic equivalents. To study this, we create a new robustness dataset, RABBITS, to evaluate performance differences on medica… ▽ More

    Submitted 18 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: submitted for review, total 15 pages

  5. arXiv:2405.05506  [pdf, other

    cs.CL

    Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias

    Authors: Shan Chen, Jack Gallifant, Mingye Gao, Pedro Moreira, Nikolaj Munch, Ajay Muthukkumar, Arvind Rajan, Jaya Kolluri, Amelia Fiske, Janna Hastings, Hugo Aerts, Brian Anthony, Leo Anthony Celi, William G. La Cava, Danielle S. Bitterman

    Abstract: Large language models (LLMs) are increasingly essential in processing natural languages, yet their application is frequently compromised by biases and inaccuracies originating in their training data. In this study, we introduce Cross-Care, the first benchmark framework dedicated to assessing biases and real world knowledge in LLMs, specifically focusing on the representation of disease prevalence… ▽ More

    Submitted 24 June, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: Submitted for review, data visualization tool available at: www.crosscare.net

  6. arXiv:2405.05049  [pdf

    cs.CL

    Seeds of Stereotypes: A Large-Scale Textual Analysis of Race and Gender Associations with Diseases in Online Sources

    Authors: Lasse Hyldig Hansen, Nikolaj Andersen, Jack Gallifant, Liam G. McCoy, James K Stone, Nura Izath, Marcela Aguirre-Jerez, Danielle S Bitterman, Judy Gichoya, Leo Anthony Celi

    Abstract: Background Advancements in Large Language Models (LLMs) hold transformative potential in healthcare, however, recent work has raised concern about the tendency of these models to produce outputs that display racial or gender biases. Although training data is a likely source of such biases, exploration of disease and demographic associations in text data at scale has been limited. Methods We cond… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  7. arXiv:2403.19511  [pdf

    cs.CL

    Improving Clinical NLP Performance through Language Model-Generated Synthetic Clinical Data

    Authors: Shan Chen, Jack Gallifant, Marco Guevara, Yanjun Gao, Majid Afshar, Timothy Miller, Dmitriy Dligach, Danielle S. Bitterman

    Abstract: Generative models have been showing potential for producing data in mass. This study explores the enhancement of clinical natural language processing performance by utilizing synthetic data generated from advanced language models. Promising results show feasible applications in such a high-stakes domain.

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: submitted to review

  8. arXiv:2401.06091  [pdf, other

    cs.LG stat.ME

    A Closer Look at AUROC and AUPRC under Class Imbalance

    Authors: Matthew B. A. McDermott, Lasse Hyldig Hansen, Haoran Zhang, Giovanni Angelotti, Jack Gallifant

    Abstract: In machine learning (ML), a widespread adage is that the area under the precision-recall curve (AUPRC) is a superior metric for model comparison to the area under the receiver operating characteristic (AUROC) for binary classification tasks with class imbalance. This paper challenges this notion through novel mathematical analysis, illustrating that AUROC and AUPRC can be concisely related in prob… ▽ More

    Submitted 18 April, 2024; v1 submitted 11 January, 2024; originally announced January 2024.