Skip to main content

Showing 1–18 of 18 results for author: Kavuluru, R

.
  1. arXiv:2405.13546  [pdf, other

    cs.CL cs.IR

    Knowledge-Driven Cross-Document Relation Extraction

    Authors: Monika Jain, Raghava Mutharaju, Kuldeep Singh, Ramakanth Kavuluru

    Abstract: Relation extraction (RE) is a well-known NLP application often treated as a sentence- or document-level task. However, a handful of recent efforts explore it across documents or in the cross-document setting (CrossDocRE). This is distinct from the single document case because different documents often focus on disparate themes, while text within a document tends to have a single goal. Linking find… ▽ More

    Submitted 18 June, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: Accepted in ACL 2024 Findings

  2. arXiv:2402.13470  [pdf, other

    cs.CL

    How Important is Domain Specificity in Language Models and Instruction Finetuning for Biomedical Relation Extraction?

    Authors: Aviv Brokman, Ramakanth Kavuluru

    Abstract: Cutting edge techniques developed in the general NLP domain are often subsequently applied to the high-value, data-rich biomedical domain. The past few years have seen generative language models (LMs), instruction finetuning, and few-shot learning become foci of NLP research. As such, generative LMs pretrained on biomedical corpora have proliferated and biomedical instruction finetuning has been a… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  3. arXiv:2401.11800  [pdf, other

    cs.IR

    Revisiting Document-Level Relation Extraction with Context-Guided Link Prediction

    Authors: Monika Jain, Raghava Mutharaju, Ramakanth Kavuluru, Kuldeep Singh

    Abstract: Document-level relation extraction (DocRE) poses the challenge of identifying relationships between entities within a document as opposed to the traditional RE setting where a single sentence is input. Existing approaches rely on logical reasoning or contextual cues from entities. This paper reframes document-level RE as link prediction over a knowledge graph with distinct benefits: 1) Our approac… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: Accepted in AAAI 2024

  4. arXiv:2311.13729  [pdf, other

    cs.CL

    Comparison of pipeline, sequence-to-sequence, and GPT models for end-to-end relation extraction: experiments with the rare disease use-case

    Authors: Shashank Gupta, Xuguang Ai, Ramakanth Kavuluru

    Abstract: End-to-end relation extraction (E2ERE) is an important and realistic application of natural language processing (NLP) in biomedicine. In this paper, we aim to compare three prevailing paradigms for E2ERE using a complex dataset focused on rare diseases involving discontinuous and nested entities. We use the RareDis information extraction dataset to evaluate three competing approaches (for E2ERE):… ▽ More

    Submitted 9 March, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

    Comments: In V2 we added new experiments with T5 models. The dataset and code for all our experiments are publicly available: https://github.com/shashank140195/Raredis

  5. arXiv:2304.01344  [pdf, other

    cs.CL

    End-to-End Models for Chemical-Protein Interaction Extraction: Better Tokenization and Span-Based Pipeline Strategies

    Authors: Xuguang Ai, Ramakanth Kavuluru

    Abstract: End-to-end relation extraction (E2ERE) is an important task in information extraction, more so for biomedicine as scientific literature continues to grow exponentially. E2ERE typically involves identifying entities (or named entity recognition (NER)) and associated relations, while most RE tasks simply assume that the entities are provided upfront and end up performing relation classification. E2E… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: Accepted to appear in IEEE ICHI 2023 (HealthNLP workshop). Tokenized dataset and code: https://github.com/bionlproc/end-to-end-ChemProt

  6. arXiv:2303.16886  [pdf, other

    cs.CL

    End-to-End $n$-ary Relation Extraction for Combination Drug Therapies

    Authors: Yuhang Jiang, Ramakanth Kavuluru

    Abstract: Combination drug therapies are treatment regimens that involve two or more drugs, administered more commonly for patients with cancer, HIV, malaria, or tuberculosis. Currently there are over 350K articles in PubMed that use the "combination drug therapy" MeSH heading with at least 10K articles published per year over the past two decades. Extracting combination therapies from scientific literature… ▽ More

    Submitted 29 March, 2023; originally announced March 2023.

    Comments: Accepted to appear in IEEE ICHI 2023. Code: https://github.com/bionlproc/end-to-end-CombDrugExt

  7. arXiv:2303.10659  [pdf

    cs.CL

    COVID-19 event extraction from Twitter via extractive question answering with continuous prompts

    Authors: Yuhang Jiang, Ramakanth Kavuluru

    Abstract: As COVID-19 ravages the world, social media analytics could augment traditional surveys in assessing how the pandemic evolves and capturing consumer chatter that could help healthcare agencies in addressing it. This typically involves mining disclosure events that mention testing positive for the disease or discussions surrounding perceptions and beliefs in preventative or treatment options. The 2… ▽ More

    Submitted 22 March, 2023; v1 submitted 19 March, 2023; originally announced March 2023.

    Comments: Accepted to appear in MEDINFO 2023. Code: https://github.com/bionlproc/twitter-covid-QA-extraction

  8. arXiv:2202.12448  [pdf

    cs.CL

    Deep neural networks for fine-grained surveillance of overdose mortality

    Authors: Patrick J. Ward, April M. Young, Svetla Slavova, Madison Liford, Lara Daniels, Ripley Lucas, Ramakanth Kavuluru

    Abstract: Surveillance of drug overdose deaths relies on death certificates for identification of the substances that caused death. Drugs and drug classes can be identified through the International Classification of Diseases, 10th Revision (ICD-10) codes present on death certificates. However, ICD-10 codes do not always provide high levels of specificity in drug identification. To achieve more fine-grained… ▽ More

    Submitted 6 June, 2022; v1 submitted 24 February, 2022; originally announced February 2022.

    Comments: Accepted to appear in the American Journal of Epidemiology

  9. arXiv:2110.10780  [pdf

    cs.CL cs.IR

    An Open Natural Language Processing Development Framework for EHR-based Clinical Research: A case demonstration using the National COVID Cohort Collaborative (N3C)

    Authors: Sijia Liu, Andrew Wen, Liwei Wang, Huan He, Sunyang Fu, Robert Miller, Andrew Williams, Daniel Harris, Ramakanth Kavuluru, Mei Liu, Noor Abu-el-rub, Dalton Schutte, Rui Zhang, Masoud Rouhizadeh, John D. Osborne, Yongqun He, Umit Topaloglu, Stephanie S Hong, Joel H Saltz, Thomas Schaffter, Emily Pfaff, Christopher G. Chute, Tim Duong, Melissa A. Haendel, Rafael Fuentes , et al. (7 additional authors not shown)

    Abstract: While we pay attention to the latest advances in clinical natural language processing (NLP), we can notice some resistance in the clinical and translational research community to adopt NLP models due to limited transparency, interpretability, and usability. In this study, we proposed an open natural language processing development framework. We evaluated it through the implementation of NLP algori… ▽ More

    Submitted 21 March, 2022; v1 submitted 20 October, 2021; originally announced October 2021.

    Comments: update on contents

  10. arXiv:2103.08800  [pdf, other

    cs.LG cs.AI

    Predicting Opioid Use Disorder from Longitudinal Healthcare Data using Multi-stream Transformer

    Authors: Sajjad Fouladvand, Jeffery Talbert, Linda P. Dwoskin, Heather Bush, Amy Lynn Meadows, Lars E. Peterson, Ramakanth Kavuluru, ** Chen

    Abstract: Opioid Use Disorder (OUD) is a public health crisis costing the US billions of dollars annually in healthcare, lost workplace productivity, and crime. Analyzing longitudinal healthcare data is critical in addressing many real-world problems in healthcare. Leveraging the real-world longitudinal healthcare data, we propose a novel multi-stream transformer model called MUPOD for OUD identification. M… ▽ More

    Submitted 7 July, 2021; v1 submitted 15 March, 2021; originally announced March 2021.

    Comments: This manuscript has been accepted by AMIA 2021 for oral presentation on November 1, 2021

  11. Improved Biomedical Word Embeddings in the Transformer Era

    Authors: Jiho Noh, Ramakanth Kavuluru

    Abstract: Biomedical word embeddings are usually pre-trained on free text corpora with neural methods that capture local and global distributional properties. They are leveraged in downstream tasks using various neural architectures that are designed to optimize task-specific objectives that might further tune such embeddings. Since 2018, however, there is a marked shift from these static embeddings to cont… ▽ More

    Submitted 23 July, 2021; v1 submitted 21 December, 2020; originally announced December 2020.

    Comments: This paper has been accepted for publication in the Journal of Biomedical Informatics

  12. Literature Retrieval for Precision Medicine with Neural Matching and Faceted Summarization

    Authors: Jiho Noh, Ramakanth Kavuluru

    Abstract: Information retrieval (IR) for precision medicine (PM) often involves looking for multiple pieces of evidence that characterize a patient case. This typically includes at least the name of a condition and a genetic variation that applies to the patient. Other factors such as demographic attributes, comorbidities, and social determinants may also be pertinent. As such, the retrieval problem is ofte… ▽ More

    Submitted 16 December, 2020; originally announced December 2020.

    Comments: Accepted to EMNLP 2020 Findings as Long Paper (11 page, 4 figures)

  13. arXiv:2010.03060  [pdf, other

    cs.LG cs.CL cs.CV eess.IV

    Contrastive Cross-Modal Pre-Training: A General Strategy for Small Sample Medical Imaging

    Authors: Gongbo Liang, Connor Greenwell, Yu Zhang, Xiaoqin Wang, Ramakanth Kavuluru, Nathan Jacobs

    Abstract: A key challenge in training neural networks for a given medical imaging task is often the difficulty of obtaining a sufficient number of manually labeled examples. In contrast, textual imaging reports, which are often readily available in medical records, contain rich but unstructured interpretations written by experts as part of standard clinical practice. We propose using these textual reports a… ▽ More

    Submitted 8 September, 2021; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: This work is accepted to the IEEE Journal of Biomedical and Health Informatics

  14. arXiv:1910.12419  [pdf, other

    cs.CL

    Attention-Gated Graph Convolutions for Extracting Drug Interaction Information from Drug Labels

    Authors: Tung Tran, Ramakanth Kavuluru, Halil Kilicoglu

    Abstract: Preventable adverse events as a result of medical errors present a growing concern in the healthcare system. As drug-drug interactions (DDIs) may lead to preventable adverse events, being able to extract DDIs from drug labels into a machine-processable form is an important step toward effective dissemination of drug safety information. In this study, we tackle the problem of jointly extracting dru… ▽ More

    Submitted 4 November, 2019; v1 submitted 27 October, 2019; originally announced October 2019.

  15. arXiv:1905.07464  [pdf, other

    cs.CL

    A Multi-Task Learning Framework for Extracting Drugs and Their Interactions from Drug Labels

    Authors: Tung Tran, Ramakanth Kavuluru, Halil Kilicoglu

    Abstract: Preventable adverse drug reactions as a result of medical errors present a growing concern in modern medicine. As drug-drug interactions (DDIs) may cause adverse reactions, being able to extracting DDIs from drug labels into machine-readable form is an important effort in effectively deploying drug safety information. The DDI track of TAC 2018 introduces two large hand-annotated test sets for the… ▽ More

    Submitted 17 May, 2019; originally announced May 2019.

    Comments: To appear in TAC 2018 Proceedings

  16. arXiv:1905.07458  [pdf, other

    cs.CL

    Neural Metric Learning for Fast End-to-End Relation Extraction

    Authors: Tung Tran, Ramakanth Kavuluru

    Abstract: Relation extraction (RE) is an indispensable information extraction task in several disciplines. RE models typically assume that named entity recognition (NER) is already performed in a previous step by another independent model. Several recent efforts, under the theme of end-to-end RE, seek to exploit inter-task correlations by modeling both NER and RE tasks jointly. Earlier work in this area com… ▽ More

    Submitted 27 August, 2019; v1 submitted 17 May, 2019; originally announced May 2019.

  17. arXiv:1802.01255  [pdf

    cs.CL

    Chemical-protein relation extraction with ensembles of SVM, CNN, and RNN models

    Authors: Yifan Peng, Anthony Rios, Ramakanth Kavuluru, Zhiyong Lu

    Abstract: Text mining the relations between chemicals and proteins is an increasingly important task. The CHEMPROT track at BioCreative VI aims to promote the development and evaluation of systems that can automatically detect the chemical-protein relations in running text (PubMed abstracts). This manuscript describes our submission, which is an ensemble of three systems, including a Support Vector Machine,… ▽ More

    Submitted 4 February, 2018; originally announced February 2018.

    Comments: Accepted in Proceedings of the BioCreative VI Workshop

  18. arXiv:1610.08557  [pdf, other

    cs.CL

    Knowledge-Based Biomedical Word Sense Disambiguation with Neural Concept Embeddings

    Authors: A. K. M. Sabbir, Antonio Jimeno Yepes, Ramakanth Kavuluru

    Abstract: Biomedical word sense disambiguation (WSD) is an important intermediate task in many natural language processing applications such as named entity recognition, syntactic parsing, and relation extraction. In this paper, we employ knowledge-based approaches that also exploit recent advances in neural word/concept embeddings to improve over the state-of-the-art in biomedical WSD using the MSH WSD dat… ▽ More

    Submitted 29 September, 2017; v1 submitted 26 October, 2016; originally announced October 2016.

    Comments: 8 pages, accepted to appear in proceedings of IEEE BIBE 2017