Skip to main content

Showing 1–23 of 23 results for author: Yetisgen, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.00826  [pdf, other

    cs.CL

    Extracting Social Determinants of Health from Pediatric Patient Notes Using Large Language Models: Novel Corpus and Methods

    Authors: Yujuan Fu, Giridhar Kaushik Ramachandran, Nicholas J Dobbins, Namu Park, Michael Leu, Abby R. Rosenberg, Kevin Lybarger, Fei Xia, Ozlem Uzuner, Meliha Yetisgen

    Abstract: Social determinants of health (SDoH) play a critical role in sha** health outcomes, particularly in pediatric populations where interventions can have long-term implications. SDoH are frequently studied in the Electronic Health Record (EHR), which provides a rich repository for diverse patient data. In this work, we present a novel annotated corpus, the Pediatric Social History Annotation Corpus… ▽ More

    Submitted 4 April, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

    Comments: 12 pages, 2 figures and 3 tables. Accepted by LREC-COLING 2024

  2. arXiv:2403.18975  [pdf, other

    cs.CL

    A Novel Corpus of Annotated Medical Imaging Reports and Information Extraction Results Using BERT-based Language Models

    Authors: Namu Park, Kevin Lybarger, Giridhar Kaushik Ramachandran, Spencer Lewis, Aashka Damani, Ozlem Uzuner, Martin Gunn, Meliha Yetisgen

    Abstract: Medical imaging is critical to the diagnosis, surveillance, and treatment of many health conditions, including oncological, neurological, cardiovascular, and musculoskeletal disorders, among others. Radiologists interpret these complex, unstructured images and articulate their assessments through narrative reports that remain largely unstructured. This unstructured narrative must be converted into… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted at LREC-COLING 2024

  3. arXiv:2401.01620  [pdf, other

    cs.AI cs.CL

    Large Language Model Capabilities in Perioperative Risk Prediction and Prognostication

    Authors: Philip Chung, Christine T Fong, Andrew M Walters, Nima Aghaeepour, Meliha Yetisgen, Vikas N O'Reilly-Shah

    Abstract: We investigate whether general-domain large language models such as GPT-4 Turbo can perform risk stratification and predict post-operative outcome measures using a description of the procedure and a patient's clinical notes derived from the electronic health record. We examine predictive performance on 8 different tasks: prediction of ASA Physical Status Classification, hospital admission, ICU adm… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

  4. arXiv:2310.02451  [pdf, other

    cs.CL

    Backdoor Adjustment of Confounding by Provenance for Robust Text Classification of Multi-institutional Clinical Notes

    Authors: Xiruo Ding, Zhecheng Sheng, Meliha Yetişgen, Serguei Pakhomov, Trevor Cohen

    Abstract: Natural Language Processing (NLP) methods have been broadly applied to clinical tasks. Machine learning and deep learning approaches have been used to improve the performance of clinical NLP. However, these approaches require sufficiently large datasets for training, and trained models have been shown to transfer poorly across sites. These issues have led to the promotion of data collection and in… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: Accepted in AMIA 2023 Annual Symposium

  5. arXiv:2306.09544  [pdf, other

    cs.CL

    Building blocks for complex tasks: Robust generative event extraction for radiology reports under domain shifts

    Authors: Sitong Zhou, Meliha Yetisgen, Mari Ostendorf

    Abstract: This paper explores methods for extracting information from radiology reports that generalize across exam modalities to reduce requirements for annotated data. We demonstrate that multi-pass T5-based text-to-text generative models exhibit better generalization across exam modalities compared to approaches that employ BERT-based task-specific classification layers. We then develop methods that redu… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Journal ref: The 5th Clinical Natural Language Processing Workshop. At ACL 2023

  6. arXiv:2306.07170  [pdf, other

    cs.CL

    Prompt-based Extraction of Social Determinants of Health Using Few-shot Learning

    Authors: Giridhar Kaushik Ramachandran, Yujuan Fu, Bin Han, Kevin Lybarger, Nicholas J Dobbins, Özlem Uzuner, Meliha Yetisgen

    Abstract: Social determinants of health (SDOH) documented in the electronic health record through unstructured text are increasingly being studied to understand how SDOH impacts patient health outcomes. In this work, we utilize the Social History Annotation Corpus (SHAC), a multi-institutional corpus of de-identified social history sections annotated for SDOH, including substance use, employment, and living… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

  7. arXiv:2306.02022  [pdf, other

    cs.CL

    ACI-BENCH: a Novel Ambient Clinical Intelligence Dataset for Benchmarking Automatic Visit Note Generation

    Authors: Wen-wai Yim, Yujuan Fu, Asma Ben Abacha, Neal Snider, Thomas Lin, Meliha Yetisgen

    Abstract: Recent immense breakthroughs in generative models such as in GPT4 have precipitated re-imagined ubiquitous usage of these models in all applications. One area that can benefit by improvements in artificial intelligence (AI) is healthcare. The note generation task from doctor-patient encounters, and its associated electronic medical record documentation, is one of the most arduous time-consuming ta… ▽ More

    Submitted 3 June, 2023; originally announced June 2023.

  8. LeafAI: query generator for clinical cohort discovery rivaling a human programmer

    Authors: Nicholas J Dobbins, Bin Han, Weipeng Zhou, Kristine Lan, H. Nina Kim, Robert Harrington, Ozlem Uzuner, Meliha Yetisgen

    Abstract: Objective: Identifying study-eligible patients within clinical databases is a critical step in clinical research. However, accurate query design typically requires extensive technical and biomedical expertise. We sought to create a system capable of generating data model-agnostic queries while also providing novel logical reasoning capabilities for complex clinical trial eligibility criteria. Ma… ▽ More

    Submitted 14 August, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

    Journal ref: Journal of the American Medical Informatics Association, 2023;, ocad149

  9. The 2022 n2c2/UW Shared Task on Extracting Social Determinants of Health

    Authors: Kevin Lybarger, Meliha Yetisgen, Özlem Uzuner

    Abstract: Objective: The n2c2/UW SDOH Challenge explores the extraction of social determinant of health (SDOH) information from clinical notes. The objectives include the advancement of natural language processing (NLP) information extraction techniques for SDOH and clinical information more broadly. This paper presents the shared task, data, participating teams, performance results, and considerations for… ▽ More

    Submitted 13 February, 2023; v1 submitted 13 January, 2023; originally announced January 2023.

    ACM Class: I.2.7

    Journal ref: Journal of the American Medical Informatics Association (2023)

  10. arXiv:2212.07538  [pdf, other

    cs.CL

    Leveraging Natural Language Processing to Augment Structured Social Determinants of Health Data in the Electronic Health Record

    Authors: Kevin Lybarger, Nicholas J Dobbins, Ritche Long, Angad Singh, Patrick Wedgeworth, Ozlem Ozuner, Meliha Yetisgen

    Abstract: Objective: Social determinants of health (SDOH) impact health outcomes and are documented in the electronic health record (EHR) through structured data and unstructured clinical notes. However, clinical notes often contain more comprehensive SDOH information, detailing aspects such as status, severity, and temporality. This work has two primary objectives: i) develop a natural language processing… ▽ More

    Submitted 14 April, 2023; v1 submitted 14 December, 2022; originally announced December 2022.

  11. arXiv:2209.09485  [pdf, other

    cs.CL

    Generalizing through Forgetting -- Domain Generalization for Symptom Event Extraction in Clinical Notes

    Authors: Sitong Zhou, Kevin Lybarger, Meliha Yetisgen, Mari Ostendorf

    Abstract: Symptom information is primarily documented in free-text clinical notes and is not directly accessible for downstream applications. To address this challenge, information extraction approaches that can handle clinical language variation across different institutions and specialties are needed. In this paper, we present domain generalization for symptom extraction using pretraining and fine-tuning… ▽ More

    Submitted 23 February, 2023; v1 submitted 20 September, 2022; originally announced September 2022.

    Journal ref: AMIA 2023 Informatics Summit

  12. Extracting Medication Changes in Clinical Narratives using Pre-trained Language Models

    Authors: Giridhar Kaushik Ramachandran, Kevin Lybarger, Yaya Liu, Diwakar Mahajan, Jennifer J. Liang, Ching-Huei Tsou, Meliha Yetisgen, Özlem Uzuner

    Abstract: An accurate and detailed account of patient medications, including medication changes within the patient timeline, is essential for healthcare providers to provide appropriate patient care. Healthcare providers or the patients themselves may initiate changes to patient medication. Medication changes take many forms, including prescribed medication and associated dosage modification. These changes… ▽ More

    Submitted 12 January, 2023; v1 submitted 17 August, 2022; originally announced August 2022.

    Journal ref: Journal of Biomedical Informatics.139.2023.104302.1532-0464

  13. The Leaf Clinical Trials Corpus: a new resource for query generation from clinical trial eligibility criteria

    Authors: Nicholas J Dobbins, Tony Mullen, Ozlem Uzuner, Meliha Yetisgen

    Abstract: Identifying cohorts of patients based on eligibility criteria such as medical conditions, procedures, and medication use is critical to recruitment for clinical trials. Such criteria are often most naturally described in free-text, using language familiar to clinicians and researchers. In order to identify potential participants at scale, these criteria must first be translated into queries on cli… ▽ More

    Submitted 27 July, 2022; originally announced July 2022.

  14. arXiv:2112.13512  [pdf

    cs.CL

    Event-based clinical findings extraction from radiology reports with pre-trained language model

    Authors: Wilson Lau, Kevin Lybarger, Martin L. Gunn, Meliha Yetisgen

    Abstract: Radiology reports contain a diverse and rich set of clinical abnormalities documented by radiologists during their interpretation of the images. Comprehensive semantic representations of radiological findings would enable a wide range of secondary use applications to support diagnosis, triage, outcomes prediction, and clinical research. In this paper, we present a new corpus of radiology reports a… ▽ More

    Submitted 27 December, 2021; originally announced December 2021.

  15. arXiv:2108.09211  [pdf, other

    cs.CL

    Extracting Radiological Findings With Normalized Anatomical Information Using a Span-Based BERT Relation Extraction Model

    Authors: Kevin Lybarger, Aashka Damani, Martin Gunn, Ozlem Uzuner, Meliha Yetisgen

    Abstract: Medical imaging is critical to the diagnosis and treatment of numerous medical problems, including many forms of cancer. Medical imaging reports distill the findings and observations of radiologists, creating an unstructured textual representation of unstructured medical images. Large-scale use of this text-encoded information requires converting the unstructured text to a structured, semantic rep… ▽ More

    Submitted 20 August, 2021; originally announced August 2021.

  16. arXiv:2103.06352  [pdf, other

    cs.CL

    Identifying ARDS using the Hierarchical Attention Network with Sentence Objectives Framework

    Authors: Kevin Lybarger, Linzee Mabrey, Matthew Thau, Pavan K. Bhatraju, Mark Wurfel, Meliha Yetisgen

    Abstract: Acute respiratory distress syndrome (ARDS) is a life-threatening condition that is often undiagnosed or diagnosed late. ARDS is especially prominent in those infected with COVID-19. We explore the automatic identification of ARDS indicators and confounding factors in free-text chest radiograph reports. We present a new annotated corpus of chest radiograph reports and introduce the Hierarchical Att… ▽ More

    Submitted 10 March, 2021; originally announced March 2021.

  17. arXiv:2102.11032  [pdf

    cs.CL

    Performance of Automatic De-identification Across Different Note Types

    Authors: Nicholas Dobbins, David Wayne, Kahyun Lee, Özlem Uzuner, Meliha Yetisgen

    Abstract: Free-text clinical notes detail all aspects of patient care and have great potential to facilitate quality improvement and assurance initiatives as well as advance clinical research. However, concerns about patient privacy and confidentiality limit the use of clinical notes for research. As a result, the information documented in these notes remains unavailable for most researchers. De-identificat… ▽ More

    Submitted 16 February, 2021; originally announced February 2021.

    Journal ref: AMIA Virtual Summits 2021

  18. arXiv:2102.11031  [pdf

    cs.CL cs.LG

    Jointly Learning Clinical Entities and Relations with Contextual Language Models and Explicit Context

    Authors: Paul Barry, Sam Henry, Meliha Yetisgen, Bridget McInnes, Ozlem Uzuner

    Abstract: We hypothesize that explicit integration of contextual information into an Multi-task Learning framework would emphasize the significance of context for boosting performance in jointly learning Named Entity Recognition (NER) and Relation Extraction (RE). Our work proves this hypothesis by segmenting entities from their surrounding context and by building contextual representations using each indep… ▽ More

    Submitted 16 February, 2021; originally announced February 2021.

  19. arXiv:2102.08517  [pdf

    cs.CL cs.CR cs.LG

    Transferability of Neural Network Clinical De-identification Systems

    Authors: Kahyun Lee, Nicholas J. Dobbins, Bridget McInnes, Meliha Yetisgen, Ozlem Uzuner

    Abstract: Objective: Neural network de-identification studies have focused on individual datasets. These studies assume the availability of a sufficient amount of human-annotated data to train models that can generalize to corresponding test data. In real-world situations, however, researchers often have limited or no in-house training data. Existing systems and external data can help jump-start de-identifi… ▽ More

    Submitted 17 August, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

  20. arXiv:2012.00974  [pdf, other

    cs.CL cs.LG

    Extracting COVID-19 Diagnoses and Symptoms From Clinical Text: A New Annotated Corpus and Neural Event Extraction Framework

    Authors: Kevin Lybarger, Mari Ostendorf, Matthew Thompson, Meliha Yetisgen

    Abstract: Coronavirus disease 2019 (COVID-19) is a global pandemic. Although much has been learned about the novel coronavirus since its emergence, there are many open questions related to tracking its spread, describing symptomology, predicting the severity of infection, and forecasting healthcare utilization. Free-text clinical notes contain critical information for resolving these questions. Data-driven,… ▽ More

    Submitted 10 March, 2021; v1 submitted 2 December, 2020; originally announced December 2020.

  21. arXiv:2009.00694  [pdf

    cs.CL

    Automatic Assignment of Radiology Examination Protocols Using Pre-trained Language Models with Knowledge Distillation

    Authors: Wilson Lau, Laura Aaltonen, Martin Gunn, Meliha Yetisgen

    Abstract: Selecting radiology examination protocol is a repetitive, and time-consuming process. In this paper, we present a deep learning approach to automatically assign protocols to computer tomography examinations, by pre-training a domain-specific BERT model ($BERT_{rad}$). To handle the high data imbalance across exam protocols, we used a knowledge distillation approach that up-sampled the minority cla… ▽ More

    Submitted 6 July, 2021; v1 submitted 1 September, 2020; originally announced September 2020.

    Comments: accepted at American Medical Informatics Association symposium 2021

  22. Annotating Social Determinants of Health Using Active Learning, and Characterizing Determinants Using Neural Event Extraction

    Authors: Kevin Lybarger, Mari Ostendorf, Meliha Yetisgen

    Abstract: Social determinants of health (SDOH) affect health outcomes, and knowledge of SDOH can inform clinical decision-making. Automatically extracting SDOH information from clinical text requires data-driven information extraction models trained on annotated corpora that are heterogeneous and frequently include critical SDOH. This work presents a new corpus with SDOH annotations, a novel active learning… ▽ More

    Submitted 2 December, 2020; v1 submitted 11 April, 2020; originally announced April 2020.

    Journal ref: Journal of Biomedical Informatics 113 (2021) 103631

  23. arXiv:1905.05877  [pdf

    cs.CL

    Extraction and Analysis of Clinically Important Follow-up Recommendations in a Large Radiology Dataset

    Authors: Wilson Lau, Thomas H Payne, Ozlem Uzuner, Meliha Yetisgen

    Abstract: Communication of follow-up recommendations when abnormalities are identified on imaging studies is prone to error. In this paper, we present a natural language processing approach based on deep learning to automatically identify clinically important recommendations in radiology reports. Our approach first identifies the recommendation sentences and then extracts reason, test, and time frame of the… ▽ More

    Submitted 14 May, 2019; originally announced May 2019.

    Comments: Under Review at American Medical Informatics Association Fall Symposium'2019