
Showing 1–12 of 12 results for author: Devarakonda, M

  1. arXiv:2405.03726  [pdf]

    q-bio.GN cs.LG

    sc-OTGM: Single-Cell Perturbation Modeling by Solving Optimal Mass Transport on the Manifold of Gaussian Mixtures

    Authors: Andac Demir, Elizaveta Solovyeva, James Boylan, Mei Xiao, Fabrizio Serluca, Sebastian Hoersch, Jeremy Jenkins, Murthy Devarakonda, Bulent Kiziltan

    Abstract: Influenced by breakthroughs in LLMs, single-cell foundation models are emerging. While these models show successful performance in cell type clustering, phenotype classification, and gene perturbation response prediction, it remains to be seen if a simpler model could achieve comparable or better results, especially with limited data. This is important, as the quantity and quality of single-cell d…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: ICLR 2024, Machine Learning for Genomics Explorations Workshop

  2. arXiv:2309.15979  [pdf]

    cs.AI q-bio.QM

    Clinical Trial Recommendations Using Semantics-Based Inductive Inference and Knowledge Graph Embeddings

    Authors: Murthy V. Devarakonda, Smita Mohanty, Raja Rao Sunkishala, Nag Mallampalli, Xiong Liu

    Abstract: Designing a new clinical trial entails many decisions, such as defining a cohort and setting the study objectives to name a few, and therefore can benefit from recommendations based on exhaustive mining of past clinical trial records. Here, we propose a novel recommendation methodology, based on neural embeddings trained on a first-of-a-kind knowledge graph of clinical trials. We addressed several…

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: 13 pages (w/o bibliography), 4 Figures, 6 Tables

  3. arXiv:2212.14102  [pdf, other]

    cs.LG cs.AI cs.CL cs.CY q-bio.QM

    Customizing Knowledge Graph Embedding to Improve Clinical Study Recommendation

    Authors: Xiong Liu, Iya Khalil, Murthy Devarakonda

    Abstract: Inferring knowledge from clinical trials using knowledge graph embedding is an emerging area. However, customizing graph embeddings for different use cases remains a significant challenge. We propose custom2vec, an algorithmic framework to customize graph embeddings by incorporating user preferences in training the embeddings. It captures user preferences by adding custom nodes and links derived f…

    Submitted 28 December, 2022; originally announced December 2022.

  4. arXiv:2110.10027  [pdf]

    q-bio.QM cs.CL cs.LG

    Clinical Trial Information Extraction with BERT

    Authors: Xiong Liu, Greg L. Hersch, Iya Khalil, Murthy Devarakonda

    Abstract: Natural language processing (NLP) of clinical trial documents can be useful in new trial design. Here we identify entity types relevant to clinical trial design and propose a framework called CT-BERT for information extraction from clinical trial text. We trained named entity recognition (NER) models to extract eligibility criteria entities by fine-tuning a set of pre-trained BERT models. We then…

    Submitted 11 September, 2021; originally announced October 2021.

    Comments: HealthNLP 2021, IEEE International Conference on Healthcare Informatics (ICHI 2021)

  5. arXiv:2109.02808  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG q-bio.QM

    A Scalable AI Approach for Clinical Trial Cohort Optimization

    Authors: Xiong Liu, Cheng Shi, Uday Deore, Yingbo Wang, Myah Tran, Iya Khalil, Murthy Devarakonda

    Abstract: FDA has been promoting enrollment practices that could enhance the diversity of clinical trial populations, through broadening eligibility criteria. However, how to broaden eligibility remains a significant challenge. We propose an AI approach to Cohort Optimization (AICO) through transformer-based natural language processing of the eligibility criteria and evaluation of the criteria using real-wo…

    Submitted 6 September, 2021; originally announced September 2021.

    Comments: PharML 2021 (Machine Learning for Pharma and Healthcare Applications) at the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2021)

  6. arXiv:2101.02017  [pdf, ps, other]

    cs.IR cs.LG

    COVID-19: Comparative Analysis of Methods for Identifying Articles Related to Therapeutics and Vaccines without Using Labeled Data

    Authors: Mihir Parmar, Ashwin Karthik Ambalavanan, Hong Guan, Rishab Banerjee, Jitesh Pabla, Murthy Devarakonda

    Abstract: Here we proposed an approach to analyze text classification methods based on the presence or absence of task-specific terms (and their synonyms) in the text. We applied this approach to study six different transfer-learning and unsupervised methods for screening articles relevant to COVID-19 vaccines and therapeutics. The analysis revealed that while a BERT model trained on search-engine results g…

    Submitted 5 January, 2021; originally announced January 2021.

    Comments: 6 pages, 3 Tables, Appendix

  7. arXiv:2004.06222  [pdf]

    cs.CL cs.IR cs.LG

    Cascade Neural Ensemble for Identifying Scientifically Sound Articles

    Authors: Ashwin Karthik Ambalavanan, Murthy Devarakonda

    Abstract: Background: A significant barrier to conducting systematic reviews and meta-analysis is efficiently finding scientifically sound relevant articles. Typically, less than 1% of articles match this requirement which leads to a highly imbalanced task. Although feature-engineered and early neural networks models were studied for this task, there is an opportunity to improve the results. Methods: We f…

    Submitted 13 April, 2020; originally announced April 2020.

    Comments: 11 pages, 4 figures, and 9 tables

  8. arXiv:2004.06216  [pdf]

    cs.CL cs.LG

    Robustly Pre-trained Neural Model for Direct Temporal Relation Extraction

    Authors: Hong Guan, Jianfu Li, Hua Xu, Murthy Devarakonda

    Abstract: Background: Identifying relationships between clinical events and temporal expressions is a key challenge in meaningfully analyzing clinical text for use in advanced AI applications. While previous studies exist, the state-of-the-art performance has significant room for improvement. Methods: We studied several variants of BERT (Bidirectional Encoder Representations using Transformers) some invol…

    Submitted 13 April, 2020; originally announced April 2020.

    Comments: 10 pages, 1 Figure, 7 Tables

  9. arXiv:1911.03869  [pdf, other]

    cs.CL cs.IR cs.LG

    Knowledge Guided Named Entity Recognition for BioMedical Text

    Authors: Pratyay Banerjee, Kuntal Kumar Pal, Murthy Devarakonda, Chitta Baral

    Abstract: In this work, we formulate the NER task as a multi-answer knowledge guided QA task (KGQA) which helps to predict entities only by assigning B, I and O tags without associating entity types with the tags. We provide different knowledge contexts, such as, entity types, questions, definitions and examples along with the text and train on a combined dataset of 18 biomedical corpora. This formulation (…

    Submitted 18 September, 2020; v1 submitted 10 November, 2019; originally announced November 2019.

    Comments: 6 pages, 2 figures, 5 tables, WIP

  10. arXiv:1906.11930  [pdf]

    cs.CL

    Training Models to Extract Treatment Plans from Clinical Notes Using Contents of Sections with Headings

    Authors: Ananya Poddar, Bharath Dandala, Murthy Devarakonda

    Abstract: Objective: Using natural language processing (NLP) to find sentences that state treatment plans in a clinical note, would automate plan extraction and would further enable their use in tools that help providers and care managers. However, as in the most NLP tasks on clinical text, creating gold standard to train and test NLP models is tedious and expensive. Fortuitously, sometimes but not always c…

    Submitted 27 June, 2019; originally announced June 2019.

    Comments: 15 pages, 4 Figures, and 4 Tables

  11. arXiv:1902.09674  [pdf]

    cs.CL

    Developing and Using Special-Purpose Lexicons for Cohort Selection from Clinical Notes

    Authors: Samarth Rawal, Ashok Prakash, Soumya Adhya, Sidharth Kulkarni, Saadat Anwar, Chitta Baral, Murthy Devarakonda

    Abstract: Background and Significance: Selecting cohorts for a clinical trial typically requires costly and time-consuming manual chart reviews resulting in poor participation. To help automate the process, National NLP Clinical Challenges (N2C2) conducted a shared challenge by defining 13 criteria for clinical trial cohort selection and by providing training and test datasets. This research was motivated b…

    Submitted 25 February, 2019; originally announced February 2019.

    Comments: 13 pages, paper describing the NLP system built for N2C2 Task 1 2018 shared challenge in biomedical NLP

  12. arXiv:1805.06816  [pdf]

    cs.CL cs.CY

    Annotating Electronic Medical Records for Question Answering

    Authors: Preethi Raghavan, Siddharth Patwardhan, Jennifer J. Liang, Murthy V. Devarakonda

    Abstract: Our research is in the relatively unexplored area of question answering technologies for patient-specific questions over their electronic health records. A large dataset of human expert curated question and answer pairs is an important pre-requisite for developing, training and evaluating any question answering system that is powered by machine learning. In this paper, we describe a process for cr…

    Submitted 17 May, 2018; originally announced May 2018.

    Comments: 10 pages, 2016