Showing 1–8 of 8 results for author: Caufield, J H

  1. arXiv:2404.03044  [pdf]

    cs.LG cs.AI

    The Artificial Intelligence Ontology: LLM-assisted construction of AI concept hierarchies

    Authors: Marcin P. Joachimiak, Mark A. Miller, J. Harry Caufield, Ryan Ly, Nomi L. Harris, Andrew Tritt, Christopher J. Mungall, Kristofer E. Bouchard

    Abstract: The Artificial Intelligence Ontology (AIO) is a systematization of artificial intelligence (AI) concepts, methodologies, and their interrelations. Developed via manual curation, with the additional assistance of large language models (LLMs), AIO aims to address the rapidly evolving landscape of AI by providing a comprehensive framework that encompasses both technical and ethical aspects of AI tech…

    Submitted 3 April, 2024; originally announced April 2024.

  2. arXiv:2310.03666  [pdf]

    cs.CL cs.AI

    MapperGPT: Large Language Models for Linking and Mapping Entities

    Authors: Nicolas Matentzoglu, J. Harry Caufield, Harshad B. Hegde, Justin T. Reese, Sierra Moxon, Hyeongsik Kim, Nomi L. Harris, Melissa A Haendel, Christopher J. Mungall

    Abstract: Aligning terminological resources, including ontologies, controlled vocabularies, taxonomies, and value sets is a critical part of data integration in many domains such as healthcare, chemistry, and biomedical research. Entity mapping is the process of determining correspondences between entities across these resources, such as gene identifiers, disease concepts, or chemical entity identifiers. Ma…

    Submitted 5 October, 2023; originally announced October 2023.

  3. arXiv:2305.13338  [pdf]

    q-bio.GN cs.AI cs.CL q-bio.QM

    Gene Set Summarization using Large Language Models

    Authors: Marcin P. Joachimiak, J. Harry Caufield, Nomi L. Harris, Hyeongsik Kim, Christopher J. Mungall

    Abstract: Molecular biologists frequently interpret gene lists derived from high-throughput experiments and computational analysis. This is typically done as a statistical enrichment analysis that measures the over- or under-representation of biological function terms associated with genes or their properties, based on curated assertions from a knowledge base (KB) such as the Gene Ontology (GO). Interpretin…

    Submitted 25 May, 2023; v1 submitted 20 May, 2023; originally announced May 2023.

  4. arXiv:2304.02711  [pdf, other]

    cs.AI cs.LG

    Structured prompt interrogation and recursive extraction of semantics (SPIRES): A method for populating knowledge bases using zero-shot learning

    Authors: J. Harry Caufield, Harshad Hegde, Vincent Emonet, Nomi L. Harris, Marcin P. Joachimiak, Nicolas Matentzoglu, HyeongSik Kim, Sierra A. T. Moxon, Justin T. Reese, Melissa A. Haendel, Peter N. Robinson, Christopher J. Mungall

    Abstract: Creating knowledge bases and ontologies is a time consuming task that relies on a manual curation. AI/NLP approaches can assist expert curators in populating these knowledge bases, but current approaches rely on extensive training data, and are not able to populate arbitrary complex nested knowledge schemas. Here we present Structured Prompt Interrogation and Recursive Extraction of Semantics (S…

    Submitted 22 December, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

    Comments: Updated 2023-12-22

  5. arXiv:2302.10800  [pdf]

    q-bio.QM cs.AI cs.LG

    KG-Hub -- Building and Exchanging Biological Knowledge Graphs

    Authors: J Harry Caufield, Tim Putman, Kevin Schaper, Deepak R Unni, Harshad Hegde, Tiffany J Callahan, Luca Cappelletti, Sierra AT Moxon, Vida Ravanmehr, Seth Carbon, Lauren E Chan, Katherina Cortes, Kent A Shefchek, Glass Elsarboukh, James P Balhoff, Tommaso Fontana, Nicolas Matentzoglu, Richard M Bruskiewich, Anne E Thessen, Nomi L Harris, Monica C Munoz-Torres, Melissa A Haendel, Peter N Robinson, Marcin P Joachimiak, Christopher J Mungall, et al. (1 additional author not shown)

    Abstract: Knowledge graphs (KGs) are a powerful approach for integrating heterogeneous data and making inferences in biology and many other domains, but a coherent solution for constructing, exchanging, and facilitating the downstream use of knowledge graphs is lacking. Here we present KG-Hub, a platform that enables standardized construction, exchange, and reuse of knowledge graphs. Features include a simp…

    Submitted 31 January, 2023; originally announced February 2023.

  6. arXiv:2106.12608  [pdf, other]

    cs.CL q-bio.QM

    Clinical Named Entity Recognition using Contextualized Token Representations

    Authors: Yichao Zhou, Chelsea Ju, J. Harry Caufield, Kevin Shih, Calvin Chen, Yizhou Sun, Kai-Wei Chang, Peipei Ping, Wei Wang

    Abstract: The clinical named entity recognition (CNER) task seeks to locate and classify clinical terminologies into predefined categories, such as diagnostic procedure, disease disorder, severity, medication, medication dosage, and sign symptom. CNER facilitates the study of side-effect on medications including identification of novel phenomena and human-focused information extraction. Existing approaches…

    Submitted 23 June, 2021; originally announced June 2021.

    Comments: 1 figure, 6 tables

  7. arXiv:2103.00562  [pdf, other]

    cs.CL cs.LG

    CREATe: Clinical Report Extraction and Annotation Technology

    Authors: Yichao Zhou, Wei-Ting Chen, Bowen Zhang, David Lee, J. Harry Caufield, Kai-Wei Chang, Yizhou Sun, Peipei Ping, Wei Wang

    Abstract: Clinical case reports are written descriptions of the unique aspects of a particular clinical case, playing an essential role in sharing clinical experiences about atypical disease phenotypes and new therapies. However, to our knowledge, there has been no attempt to develop an end-to-end system to annotate, index, or otherwise curate these reports. In this paper, we propose a novel computational r…

    Submitted 28 February, 2021; originally announced March 2021.

    Comments: 7 Figures, ICDE 2021 Demo

  8. arXiv:2012.08790  [pdf, other]

    cs.CL cs.LG

    Clinical Temporal Relation Extraction with Probabilistic Soft Logic Regularization and Global Inference

    Authors: Yichao Zhou, Yu Yan, Rujun Han, J. Harry Caufield, Kai-Wei Chang, Yizhou Sun, Peipei Ping, Wei Wang

    Abstract: There has been a steady need in the medical community to precisely extract the temporal relations between clinical events. In particular, temporal information can facilitate a variety of downstream applications such as case report retrieval and medical question answering. Existing methods either require expensive feature engineering or are incapable of modeling the global relational dependencies a…

    Submitted 16 December, 2020; originally announced December 2020.

    Comments: 10 pages, 4 figures, 7 tables, accepted by AAAI 2021