Skip to main content

Showing 1–2 of 2 results for author: Shyr, C

Searching in archive cs. Search in all archives.
.
  1. Identifying and Extracting Rare Disease Phenotypes with Large Language Models

    Authors: Cathy Shyr, Yan Hu, Paul A. Harris, Hua Xu

    Abstract: Rare diseases (RDs) are collectively common and affect 300 million people worldwide. Accurate phenoty** is critical for informing diagnosis and treatment, but RD phenotypes are often embedded in unstructured text and time-consuming to extract manually. While natural language processing (NLP) models can perform named entity recognition (NER) to automate extraction, a major bottleneck is the devel… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

    Journal ref: J Healthc Inform Res 8, 438-461 (2024)

  2. arXiv:2207.04588  [pdf, other

    stat.ML cs.LG

    Multi-Study Boosting: Theoretical Considerations for Merging vs. Ensembling

    Authors: Cathy Shyr, Pragya Sur, Giovanni Parmigiani, Prasad Patil

    Abstract: Cross-study replicability is a powerful model evaluation criterion that emphasizes generalizability of predictions. When training cross-study replicable prediction models, it is critical to decide between merging and treating the studies separately. We study boosting algorithms in the presence of potential heterogeneity in predictor-outcome relationships across studies and compare two multi-study… ▽ More

    Submitted 12 July, 2022; v1 submitted 10 July, 2022; originally announced July 2022.