Showing 1–5 of 5 results for author: Stremmel, J

  1. arXiv:2404.07376  [pdf, other]

    cs.CL

    LLMs in Biomedicine: A study on clinical Named Entity Recognition

    Authors: Masoud Monajatipoor, Jiaxin Yang, Joel Stremmel, Melika Emami, Fazlolah Mohaghegh, Mozhdeh Rouhsedaghat, Kai-Wei Chang

    Abstract: Large Language Models (LLMs) demonstrate remarkable versatility in various NLP tasks but encounter distinct challenges in biomedicine due to medical language complexities and data scarcity. This paper investigates the application of LLMs in the medical domain by exploring strategies to enhance their performance for the Named-Entity Recognition (NER) task. Specifically, our study reveals the import…

    Submitted 10 April, 2024; originally announced April 2024.

  2. arXiv:2312.03567  [pdf, other]

    cs.CL

    XAIQA: Explainer-Based Data Augmentation for Extractive Question Answering

    Authors: Joel Stremmel, Ardavan Saeedi, Hamid Hassanzadeh, Sanjit Batra, Jeffrey Hertzberg, Jaime Murillo, Eran Halperin

    Abstract: Extractive question answering (QA) systems can enable physicians and researchers to query medical records, a foundational capability for designing clinical studies and understanding patient medical history. However, building these systems typically requires expert-annotated QA pairs. Large language models (LLMs), which can perform extractive QA, depend on high quality data in their prompts, specia…

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2023, December 10th, 2023, New Orleans, United States, 8 pages

    ACM Class: I.2.7

  3. arXiv:2311.13735  [pdf, other]

    cs.CL

    Surpassing GPT-4 Medical Coding with a Two-Stage Approach

    Authors: Zhichao Yang, Sanjit Singh Batra, Joel Stremmel, Eran Halperin

    Abstract: Recent advances in large language models (LLMs) show potential for clinical applications, such as clinical decision support and trial recommendations. However, the GPT-4 LLM predicts an excessive number of ICD codes for medical coding tasks, leading to high recall but low precision. To tackle this challenge, we introduce LLM-codex, a two-stage approach to predict ICD codes that first generates evi…

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2023, December 10th, 2023, New Orleans, United States, 19 pages

  4. arXiv:2209.01174  [pdf, other]

    cs.CL

    Extend and Explain: Interpreting Very Long Language Models

    Authors: Joel Stremmel, Brian L. Hill, Jeffrey Hertzberg, Jaime Murillo, Llewelyn Allotey, Eran Halperin

    Abstract: While Transformer language models (LMs) are state-of-the-art for information extraction, long text introduces computational challenges requiring suboptimal preprocessing steps or alternative model architectures. Sparse attention LMs can represent longer sequences, overcoming performance hurdles. However, it remains unclear how to explain predictions from these models, as not all tokens attend to e…

    Submitted 28 November, 2022; v1 submitted 2 September, 2022; originally announced September 2022.

    Comments: 11 pages

    ACM Class: I.2.7

    Journal ref: Proceedings of the 2nd Machine Learning for Health symposium, PMLR 193:218-258, 2022

  5. arXiv:2005.04828  [pdf, other]

    cs.LG stat.ML

    Pretraining Federated Text Models for Next Word Prediction

    Authors: Joel Stremmel, Arjun Singh

    Abstract: Federated learning is a decentralized approach for training models on distributed devices, by summarizing local changes and sending aggregate parameters from local models to the cloud rather than the data itself. In this research we employ the idea of transfer learning to federated training for next word prediction (NWP) and conduct a number of experiments demonstrating enhancements to current bas…

    Submitted 17 August, 2020; v1 submitted 10 May, 2020; originally announced May 2020.