Showing 1–6 of 6 results for author: Martínez-Romero, M

  1. arXiv:2312.09107

    cs.DL

    A Comprehensive Approach to Ensuring Quality in Spreadsheet-Based Metadata

    Authors: Martin J. O'Connor, Marcos Martínez-Romero, Mete Ugur Akdogan, Josef Hardi, Mark A. Musen

    Abstract: While scientists increasingly recognize the importance of metadata in describing their data, spreadsheets remain the preferred tool for supplying this information despite their limitations in ensuring compliance and quality. Various tools have been developed to address these limitations, but they suffer from their own shortcomings, such as steep learning curves and limited customization. In this p…

    Submitted 14 December, 2023; originally announced December 2023.

  2. arXiv:2208.02836

    cs.DL

    Modeling community standards for metadata as templates makes data FAIR

    Authors: Mark A. Musen, Martin J. O'Connor, Erik Schultes, Marcos Martinez-Romero, Josef Hardi, John Graybeal

    Abstract: It is challenging to determine whether datasets are findable, accessible, interoperable, and reusable (FAIR) because the FAIR Guiding Principles refer to highly idiosyncratic criteria regarding the metadata used to annotate datasets. Specifically, the FAIR principles require metadata to be "rich" and to adhere to "domain-relevant" community standards. Scientific communities should be able to defin…

    Submitted 14 October, 2022; v1 submitted 4 August, 2022; originally announced August 2022.

    Comments: 20 pages, 1 table, 5 figures

  3. The CEDAR Workbench: An Ontology-Assisted Environment for Authoring Metadata that Describe Scientific Experiments

    Authors: Rafael S. Gonçalves, Martin J. O'Connor, Marcos Martínez-Romero, Attila L. Egyedi, Debra Willrett, John Graybeal, Mark A. Musen

    Abstract: The Center for Expanded Data Annotation and Retrieval (CEDAR) aims to revolutionize the way that metadata describing scientific experiments are authored. The software we have developed--the CEDAR Workbench--is a suite of Web-based tools and REST APIs that allows users to construct metadata templates, to fill in templates to generate high-quality metadata, and to share and manage these resources. T…

    Submitted 15 May, 2019; originally announced May 2019.

  4. Using association rule mining and ontologies to generate metadata recommendations from multiple biomedical databases

    Authors: Marcos Martínez-Romero, Martin J. O'Connor, Attila L. Egyedi, Debra Willrett, Josef Hardi, John Graybeal, Mark A. Musen

    Abstract: Metadata (the machine-readable descriptions of the data) are increasingly seen as crucial for describing the vast array of biomedical datasets that are currently being deposited in public repositories. While most public repositories have firm requirements that metadata must accompany submitted datasets, the quality of those metadata is generally very poor. A key problem is that the typical metadata…

    Submitted 21 March, 2019; originally announced March 2019.

  5. arXiv:1708.01286

    cs.DB

    Metadata in the BioSample Online Repository are Impaired by Numerous Anomalies

    Authors: Rafael S. Gonçalves, Martin J. O'Connor, Marcos Martínez-Romero, John Graybeal, Mark A. Musen

    Abstract: The metadata about scientific experiments are crucial for finding, reproducing, and reusing the data that the metadata describe. We present a study of the quality of the metadata stored in BioSample--a repository of metadata about samples used in biomedical experiments managed by the U.S. National Center for Biotechnology Information (NCBI). We tested whether 6.6 million BioSample metadata…

    Submitted 3 August, 2017; originally announced August 2017.

  6. NCBO Ontology Recommender 2.0: An Enhanced Approach for Biomedical Ontology Recommendation

    Authors: Marcos Martinez-Romero, Clement Jonquet, Martin J. O'Connor, John Graybeal, Alejandro Pazos, Mark A. Musen

    Abstract: Biomedical researchers use ontologies to annotate their data with ontology terms, enabling better data integration and interoperability. However, the number, variety and complexity of current biomedical ontologies make it cumbersome for researchers to determine which ones to reuse for their specific needs. To overcome this problem, in 2010 the National Center for Biomedical Ontology (NCBO) release…

    Submitted 25 May, 2017; v1 submitted 17 November, 2016; originally announced November 2016.

    Comments: 29 pages, 8 figures, 11 tables

    ACM Class: I.2.4

    Journal ref: Journal of Biomedical Semantics 8 (2017) 1-22