Skip to main content

Showing 1–14 of 14 results for author: Hoyt, C T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2307.05727  [pdf

    cs.AI cs.CE

    An Open-Source Knowledge Graph Ecosystem for the Life Sciences

    Authors: Tiffany J. Callahan, Ignacio J. Tripodi, Adrianne L. Stefanski, Luca Cappelletti, Sanya B. Taneja, Jordan M. Wyrwa, Elena Casiraghi, Nicolas A. Matentzoglu, Justin Reese, Jonathan C. Silverstein, Charles Tapley Hoyt, Richard D. Boyce, Scott A. Malec, Deepak R. Unni, Marcin P. Joachimiak, Peter N. Robinson, Christopher J. Mungall, Emanuele Cavalleri, Tommaso Fontana, Giorgio Valentini, Marco Mesiti, Lucas A. Gillenwater, Brook Santangelo, Nicole A. Vasilevsky, Robert Hoehndorf , et al. (7 additional authors not shown)

    Abstract: Translational research requires data at multiple scales of biological organization. Advancements in sequencing and multi-omics technologies have increased the availability of these data, but researchers face significant integration challenges. Knowledge graphs (KGs) are used to model complex phenomena, and methods exist to construct them automatically. However, tackling complex biomedical integrat… ▽ More

    Submitted 30 January, 2024; v1 submitted 11 July, 2023; originally announced July 2023.

  2. Ontology Development Kit: a toolkit for building, maintaining, and standardising biomedical ontologies

    Authors: Nicolas Matentzoglu, Damien Goutte-Gattat, Shawn Zheng Kai Tan, James P. Balhoff, Seth Carbon, Anita R. Caron, William D. Duncan, Joe E. Flack, Melissa Haendel, Nomi L. Harris, William R Hogan, Charles Tapley Hoyt, Rebecca C. Jackson, HyeongSik Kim, Huseyin Kir, Martin Larralde, Julie A. McMurry, James A. Overton, Bjoern Peters, Clare Pilgrim, Ray Stefancsik, Sofia MC Robb, Sabrina Toro, Nicole A Vasilevsky, Ramona Walls , et al. (2 additional authors not shown)

    Abstract: Similar to managing software packages, managing the ontology life cycle involves multiple complex workflows such as preparing releases, continuous quality control checking, and dependency management. To manage these processes, a diverse set of tools is required, from command line utilities to powerful ontology engineering environments such as ROBOT. Particularly in the biomedical domain, which has… ▽ More

    Submitted 5 July, 2022; originally announced July 2022.

    Comments: 19 pages, 2 supplementary tables, 1 supplementary figure

  3. arXiv:2203.07544  [pdf, other

    cs.LG cs.AI

    A Unified Framework for Rank-based Evaluation Metrics for Link Prediction in Knowledge Graphs

    Authors: Charles Tapley Hoyt, Max Berrendorf, Mikhail Galkin, Volker Tresp, Benjamin M. Gyori

    Abstract: The link prediction task on knowledge graphs without explicit negative triples in the training data motivates the usage of rank-based metrics. Here, we review existing rank-based metrics and propose desiderata for improved metrics to address lack of interpretability and comparability of existing metrics to datasets of different sizes and properties. We introduce a simple theoretical framework for… ▽ More

    Submitted 19 April, 2022; v1 submitted 14 March, 2022; originally announced March 2022.

    Comments: Accepted at the Workshop on Graph Learning Benchmarks @ The WebConf 2022

  4. arXiv:2203.01520  [pdf, other

    cs.LG cs.AI

    An Open Challenge for Inductive Link Prediction on Knowledge Graphs

    Authors: Mikhail Galkin, Max Berrendorf, Charles Tapley Hoyt

    Abstract: An emerging trend in representation learning over knowledge graphs (KGs) moves beyond transductive link prediction tasks over a fixed set of known entities in favor of inductive tasks that imply training on one graph and performing inference over a new graph with unseen entities. In inductive setups, node features are often not available and training shallow entity embedding matrices is meaningles… ▽ More

    Submitted 18 April, 2022; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: Accepted at the Workshop on Graph Learning Benchmarks @ The WebConf 2022

  5. arXiv:2202.05240  [pdf, other

    cs.LG cs.AI

    ChemicalX: A Deep Learning Library for Drug Pair Scoring

    Authors: Benedek Rozemberczki, Charles Tapley Hoyt, Anna Gogleva, Piotr Grabowski, Klas Karis, Andrej Lamov, Andriy Nikolov, Sebastian Nilsson, Michael Ughetto, Yu Wang, Tyler Derr, Benjamin M Gyori

    Abstract: In this paper, we introduce ChemicalX, a PyTorch-based deep learning library designed for providing a range of state of the art models to solve the drug pair scoring task. The primary objective of the library is to make deep drug pair scoring models accessible to machine learning researchers and practitioners in a streamlined framework.The design of ChemicalX reuses existing high level model train… ▽ More

    Submitted 26 May, 2022; v1 submitted 10 February, 2022; originally announced February 2022.

    Comments: https://github.com/AstraZeneca/chemicalx

  6. A Simple Standard for Sharing Ontological Map**s (SSSOM)

    Authors: Nicolas Matentzoglu, James P. Balhoff, Susan M. Bello, Chris Bizon, Matthew Brush, Tiffany J. Callahan, Christopher G Chute, William D. Duncan, Chris T. Evelo, Davera Gabriel, John Graybeal, Alasdair Gray, Benjamin M. Gyori, Melissa Haendel, Henriette Harmse, Nomi L. Harris, Ian Harrow, Harshad Hegde, Amelia L. Hoyt, Charles T. Hoyt, Dazhi Jiao, Ernesto Jiménez-Ruiz, Simon Jupp, Hyeongsik Kim, Sebastian Koehler , et al. (19 additional authors not shown)

    Abstract: Despite progress in the development of standards for describing and exchanging scientific information, the lack of easy-to-use standards for map** between different representations of the same or similar objects in different databases poses a major impediment to data integration and interoperability. Map**s often lack the metadata needed to be correctly interpreted and applied. For example, ar… ▽ More

    Submitted 13 December, 2021; originally announced December 2021.

    Comments: Corresponding author: Christopher J. Mungall <[email protected]>

  7. arXiv:2105.10488  [pdf, other

    q-bio.BM cs.AI cs.LG

    Understanding the Performance of Knowledge Graph Embeddings in Drug Discovery

    Authors: Stephen Bonner, Ian P Barrett, Cheng Ye, Rowan Swiers, Ola Engkvist, Charles Tapley Hoyt, William L Hamilton

    Abstract: Knowledge Graphs (KG) and associated Knowledge Graph Embedding (KGE) models have recently begun to be explored in the context of drug discovery and have the potential to assist in key challenges such as target identification. In the drug discovery domain, KGs can be employed as part of a process which can result in lab-based experiments being performed, or impact on other decisions, incurring sign… ▽ More

    Submitted 23 May, 2022; v1 submitted 17 May, 2021; originally announced May 2021.

    Journal ref: Artificial Intelligence in the Life Sciences (2022): 100036

  8. A Review of Biomedical Datasets Relating to Drug Discovery: A Knowledge Graph Perspective

    Authors: Stephen Bonner, Ian P Barrett, Cheng Ye, Rowan Swiers, Ola Engkvist, Andreas Bender, Charles Tapley Hoyt, William L Hamilton

    Abstract: Drug discovery and development is a complex and costly process. Machine learning approaches are being investigated to help improve the effectiveness and speed of multiple stages of the drug discovery pipeline. Of these, those that use Knowledge Graphs (KG) have promise in many tasks, including drug repurposing, drug toxicity prediction and target gene-disease prioritisation. In a drug discovery KG… ▽ More

    Submitted 26 November, 2021; v1 submitted 19 February, 2021; originally announced February 2021.

    Journal ref: Briefings in Bioinformatics, 2022

  9. arXiv:2102.06626  [pdf, other

    cs.LG

    Do-calculus enables estimation of causal effects in partially observed biomolecular pathways

    Authors: Sara Mohammad-Taheri, Jeremy Zucker, Charles Tapley Hoyt, Karen Sachs, Vartika Tewari, Robert Ness, and Olga Vitek

    Abstract: Estimating causal queries, such as changes in protein abundance in response to a perturbation, is a fundamental task in the analysis of biomolecular pathways. The estimation requires experimental measurements on the pathway components. However, in practice many pathway components are left unobserved (latent) because they are either unknown, or difficult to measure. Latent variable models (LVMs) ar… ▽ More

    Submitted 24 October, 2022; v1 submitted 12 February, 2021; originally announced February 2021.

    Comments: https://academic.oup.com/bioinformatics/article/38/Supplement_1/i350/6617530

    Journal ref: Bioinformatics 38(2022): i350-i358

  10. arXiv:2101.05136  [pdf, other

    q-bio.QM cs.AI cs.LG

    Leveraging Structured Biological Knowledge for Counterfactual Inference: a Case Study of Viral Pathogenesis

    Authors: Jeremy Zucker, Kaushal Paneri, Sara Mohammad-Taheri, Somya Bhargava, Pallavi Kolambkar, Craig Bakker, Jeremy Teuton, Charles Tapley Hoyt, Kristie Oxford, Robert Ness, Olga Vitek

    Abstract: Counterfactual inference is a useful tool for comparing outcomes of interventions on complex systems. It requires us to represent the system in form of a structural causal model, complete with a causal diagram, probabilistic assumptions on exogenous variables, and functional assignments. Specifying such models can be extremely difficult in practice. The process requires substantial domain expertis… ▽ More

    Submitted 13 January, 2021; originally announced January 2021.

    Comments: In proceeding of IEEE, Transactions on Big Data

  11. arXiv:2007.14175  [pdf, ps, other

    cs.LG cs.AI stat.ML

    PyKEEN 1.0: A Python Library for Training and Evaluating Knowledge Graph Embeddings

    Authors: Mehdi Ali, Max Berrendorf, Charles Tapley Hoyt, Laurent Vermue, Sahand Sharifzadeh, Volker Tresp, Jens Lehmann

    Abstract: Recently, knowledge graph embeddings (KGEs) received significant attention, and several software libraries have been developed for training and evaluating KGEs. While each of them addresses specific needs, we re-designed and re-implemented PyKEEN, one of the first KGE libraries, in a community effort. PyKEEN 1.0 enables users to compose knowledge graph embedding models (KGEMs) based on a wide rang… ▽ More

    Submitted 30 July, 2020; v1 submitted 28 July, 2020; originally announced July 2020.

  12. arXiv:2006.13365  [pdf, other

    cs.LG cs.AI stat.ML

    Bringing Light Into the Dark: A Large-scale Evaluation of Knowledge Graph Embedding Models Under a Unified Framework

    Authors: Mehdi Ali, Max Berrendorf, Charles Tapley Hoyt, Laurent Vermue, Mikhail Galkin, Sahand Sharifzadeh, Asja Fischer, Volker Tresp, Jens Lehmann

    Abstract: The heterogeneity in recently published knowledge graph embedding models' implementations, training, and evaluation has made fair and thorough comparisons difficult. In order to assess the reproducibility of previously published results, we re-implemented and evaluated 21 interaction models in the PyKEEN software package. Here, we outline which results could be reproduced with their reported hyper… ▽ More

    Submitted 1 November, 2021; v1 submitted 23 June, 2020; originally announced June 2020.

  13. arXiv:2006.08589  [pdf

    cs.DL cs.SE

    The role of metadata in reproducible computational research

    Authors: Jeremy Leipzig, Daniel Nüst, Charles Tapley Hoyt, Stian Soiland-Reyes, Karthik Ram, Jane Greenberg

    Abstract: Reproducible computational research (RCR) is the keystone of the scientific method for in silico analyses, packaging the transformation of raw data to published results. In addition to its role in research integrity, RCR has the capacity to significantly accelerate evaluation and reuse. This potential and wide-support for the FAIR principles have motivated interest in metadata standards supporting… ▽ More

    Submitted 19 April, 2021; v1 submitted 15 June, 2020; originally announced June 2020.

    Comments: 53 pages, 18 figures, 2 tables, 216 references

  14. arXiv:2001.10560  [pdf, other

    cs.LG cs.AI stat.ML

    The KEEN Universe: An Ecosystem for Knowledge Graph Embeddings with a Focus on Reproducibility and Transferability

    Authors: Mehdi Ali, Hajira Jabeen, Charles Tapley Hoyt, Jens Lehman

    Abstract: There is an emerging trend of embedding knowledge graphs (KGs) in continuous vector spaces in order to use those for machine learning tasks. Recently, many knowledge graph embedding (KGE) models have been proposed that learn low dimensional representations while trying to maintain the structural properties of the KGs such as the similarity of nodes depending on their edges to other nodes. KGEs can… ▽ More

    Submitted 28 January, 2020; originally announced January 2020.