Skip to main content

Showing 1–3 of 3 results for author: Manghnani, K

.
  1. arXiv:2312.06457  [pdf, other

    cs.AI cs.CL cs.IR

    Large Language Models with Retrieval-Augmented Generation for Zero-Shot Disease Phenoty**

    Authors: Will E. Thompson, David M. Vidmar, Jessica K. De Freitas, John M. Pfeifer, Brandon K. Fornwalt, Ruijun Chen, Gabriel Altay, Kabir Manghnani, Andrew C. Nelsen, Kellie Morland, Martin C. Stumpe, Riccardo Miotto

    Abstract: Identifying disease phenotypes from electronic health records (EHRs) is critical for numerous secondary uses. Manually encoding physician knowledge into rules is particularly challenging for rare diseases due to inadequate EHR coding, necessitating review of clinical notes. Large language models (LLMs) offer promise in text understanding but may not efficiently handle real-world clinical documenta… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Deep Generative Models for Health Workshop NeurIPS 2023

    ACM Class: I.2.7

  2. arXiv:1904.06100  [pdf, other

    cs.CL cs.AI cs.LG

    Adapting Sequence to Sequence models for Text Normalization in Social Media

    Authors: Ismini Lourentzou, Kabir Manghnani, ChengXiang Zhai

    Abstract: Social media offer an abundant source of valuable raw data, however informal writing can quickly become a bottleneck for many natural language processing (NLP) tasks. Off-the-shelf tools are usually trained on formal text and cannot explicitly handle noise found in short online posts. Moreover, the variety of frequently occurring linguistic variations presents several challenges, even for humans w… ▽ More

    Submitted 12 April, 2019; originally announced April 2019.

    Comments: Accepted at the 13th International AAAI Conference on Web and Social Media (ICWSM 2019)

  3. arXiv:1812.03188  [pdf, other

    cs.LG stat.ML

    METCC: METric learning for Confounder Control Making distance matter in high dimensional biological analysis

    Authors: Kabir Manghnani, Adam Drake, Nathan Wan, Imran Haque

    Abstract: High-dimensional data acquired from biological experiments such as next generation sequencing are subject to a number of confounding effects. These effects include both technical effects, such as variation across batches from instrument noise or sample processing, or institution-specific differences in sample acquisition and physical handling, as well as biological effects arising from true but ir… ▽ More

    Submitted 7 December, 2018; originally announced December 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018

    Report number: ML4H/2018/211