Search | arXiv e-print repository

The Digitalization of Bioassays in the Open Research Knowledge Graph

Authors: Jennifer D'Souza, Anita Monteverdi, Muhammad Haris, Marco Anteghini, Kheir Eddine Farfar, Markus Stocker, Vitor A. P. Martins dos Santos, Sören Auer

Abstract: Background: Recent years are seeing a growing impetus in the semantification of scholarly knowledge at the fine-grained level of scientific entities in knowledge graphs. The Open Research Knowledge Graph (ORKG) https://www.orkg.org/ represents an important step in this direction, with thousands of scholarly contributions as structured, fine-grained, machine-readable data. There is a need, however, to engender change in traditional community practices of recording contributions as unstructured, non-machine-readable text. For this in turn, there is a strong need for AI tools designed for scientists that permit easy and accurate semantification of their scholarly contributions. We present one such tool, ORKG-assays. Implementation: ORKG-assays is a freely available AI micro-service in ORKG written in Python designed to assist scientists obtain semantified bioassays as a set of triples. It uses an AI-based clustering algorithm which on gold-standard evaluations over 900 bioassays with 5,514 unique property-value pairs for 103 predicates shows competitive performance. Results and Discussion: As a result, semantified assay collections can be surveyed on the ORKG platform via tabulation or chart-based visualizations of key property values of the chemicals and compounds offering smart knowledge access to biochemists and pharmaceutical researchers in the advancement of drug development.

Submitted 28 March, 2022; originally announced March 2022.

Comments: 12 pages, 5 figures, In Review at DeXa 2022 https://www.dexa.org/dexa2022

arXiv:2111.15182 [pdf, other]

Easy Semantification of Bioassays

Authors: Marco Anteghini, Jennifer D'Souza, Vitor A. P. Martins dos Santos, Sören Auer

Abstract: Biological data and knowledge bases increasingly rely on Semantic Web technologies and the use of knowledge graphs for data integration, retrieval and federated queries. We propose a solution for automatically semantifying biological assays. Our solution contrasts the problem of automated semantification as labeling versus clustering where the two methods are on opposite ends of the method complexity spectrum. Characteristically modeling our problem, we find the clustering solution significantly outperforms a deep neural network state-of-the-art labeling approach. This novel contribution is based on two factors: 1) a learning objective closely modeled after the data outperforms an alternative approach with sophisticated semantic modeling; 2) automatically semantifying biological assays achieves a high performance F1 of nearly 83%, which to our knowledge is the first reported standardized evaluation of the task offering a strong benchmark model.

Submitted 2 December, 2021; v1 submitted 30 November, 2021; originally announced November 2021.

Comments: 12 pages, 5 figures, Accepted for Publication in AIxIA 2021 (https://aixia2021.disco.unimib.it/home-page)

arXiv:2110.04731 [pdf, ps, other]

Universal Adversarial Attacks on Neural Networks for Power Allocation in a Massive MIMO System

Authors: Pablo Millán Santos, B. R. Manoj, Meysam Sadeghi, Erik G. Larsson

Abstract: Deep learning (DL) architectures have been successfully used in many applications including wireless systems. However, they have been shown to be susceptible to adversarial attacks. We analyze DL-based models for a regression problem in the context of downlink power allocation in massive multiple-input-multiple-output systems and propose universal adversarial perturbation (UAP)-crafting methods as white-box and black-box attacks. We benchmark the UAP performance of white-box and black-box attacks for the considered application and show that the adversarial success rate can achieve up to 60% and 40%, respectively. The proposed UAP-based attacks make a more practical and realistic approach as compared to classical white-box attacks.

Submitted 10 October, 2021; originally announced October 2021.

Comments: accepted for publication in IEEE Wireless Communications Letters

arXiv:2009.08801 [pdf, other]

SciBERT-based Semantification of Bioassays in the Open Research Knowledge Graph

Authors: Marco Anteghini, Jennifer D'Souza, Vitor A. P. Martins dos Santos, Sören Auer

Abstract: As a novel contribution to the problem of semantifying biological assays, in this paper, we propose a neural-network-based approach to automatically semantify, thereby structure, unstructured bioassay text descriptions. Experimental evaluations, to this end, show promise as the neural-based semantification significantly outperforms a naive frequency-based baseline approach. Specifically, the neural method attains 72% F1 versus 47% F1 from the frequency-based method.

Submitted 16 September, 2020; originally announced September 2020.

Comments: In proceedings of the '22nd International Conference on Knowledge Engineering and Knowledge Management' 'Demo and Poster section'

arXiv:2009.07642 [pdf, other]

Representing Semantified Biological Assays in the Open Research Knowledge Graph

Authors: Marco Anteghini, Jennifer D'Souza, Vitor A. P. Martins dos Santos, Sören Auer

Abstract: In the biotechnology and biomedical domains, recent text mining efforts advocate for machine-interpretable, and preferably, semantified, documentation formats of laboratory processes. This includes wet-lab protocols, (in)organic materials synthesis reactions, genetic manipulations and procedures for faster computer-mediated analysis and predictions. Herein, we present our work on the representation of semantified bioassays in the Open Research Knowledge Graph (ORKG). In particular, we describe a semantification system work-in-progress to generate, automatically and quickly, the critical semantified bioassay data mass needed to foster a consistent user audience to adopt the ORKG for recording their bioassays and facilitate the organisation of research, according to FAIR principles.

Submitted 16 September, 2020; originally announced September 2020.

Comments: In Proceedings of 'The 22nd International Conference on Asia-Pacific Digital Libraries'

Showing 1–5 of 5 results for author: Santos, P M