doi 10.1093/database/baac035

A Simple Standard for Sharing Ontological Mappings (SSSOM)

Authors: Nicolas Matentzoglu, James P. Balhoff, Susan M. Bello, Chris Bizon, Matthew Brush, Tiffany J. Callahan, Christopher G Chute, William D. Duncan, Chris T. Evelo, Davera Gabriel, John Graybeal, Alasdair Gray, Benjamin M. Gyori, Melissa Haendel, Henriette Harmse, Nomi L. Harris, Ian Harrow, Harshad Hegde, Amelia L. Hoyt, Charles T. Hoyt, Dazhi Jiao, Ernesto Jiménez-Ruiz, Simon Jupp, Hyeongsik Kim, Sebastian Koehler, et al. (19 additional authors not shown)

Abstract: Despite progress in the development of standards for describing and exchanging scientific information, the lack of easy-to-use standards for mapping between different representations of the same or similar objects in different databases poses a major impediment to data integration and interoperability. Mappings often lack the metadata needed to be correctly interpreted and applied. For example, are two terms equivalent or merely related? Are they narrow or broad matches? Are they associated in some other way? Such relationships between the mapped terms are often not documented, leading to incorrect assumptions and making them hard to use in scenarios that require a high degree of precision (such as diagnostics or risk prediction). Also, the lack of descriptions of how mappings were done makes it hard to combine and reconcile mappings, particularly curated and automated ones. The Simple Standard for Sharing Ontological Mappings (SSSOM) addresses these problems by: 1. Introducing a machine-readable and extensible vocabulary to describe metadata that makes imprecision, inaccuracy and incompleteness in mappings explicit. 2. Defining an easy-to-use table-based format that can be integrated into existing data science pipelines without the need to parse or query ontologies, and that integrates seamlessly with Linked Data standards. 3. Implementing open and community-driven collaborative workflows designed to evolve the standard continuously to address changing requirements and mapping practices. 4. Providing reference tools and software libraries for working with the standard.
In this paper, we present the SSSOM standard, describe several use cases, and survey some existing work on standardizing the exchange of mappings, with the goal of making mappings Findable, Accessible, Interoperable, and Reusable (FAIR). The SSSOM specification is at http://w3id.org/sssom/spec.

Submitted 13 December, 2021; originally announced December 2021.

Comments: Corresponding author: Christopher J. Mungall <[email protected]>

arXiv:2110.01742 [pdf, other]

doi 10.1109/MELECON53508.2022.9843099

Epileptic Seizure Classification Using Combined Labels and a Genetic Algorithm

Authors: Scot Davidson, Niamh McCallan, Kok Yew Ng, Pardis Biglarbeigi, Dewar Finlay, Boon Leong Lan, James McLaughlin

Abstract: Epilepsy affects 50 million people worldwide and is one of the most common serious neurological disorders. Seizure detection and classification is a valuable tool for diagnosing and maintaining the condition. An automated classification algorithm will allow for accurate diagnosis. Utilising the Temple University Hospital (TUH) Seizure Corpus, six seizure types are compared: absence, complex partial, myoclonic, simple partial, tonic and tonic-clonic models. This study proposes a method that utilises unique features with a novel parallel classifier - Parallel Genetic Naive Bayes (NB) Seizure Classifier (PGNBSC). The PGNBSC algorithm searches through the features and by reclassifying the data each time, the algorithm will create a matrix for optimum search criteria. Ictal states from the EEGs are segmented into 1.8 s windows, where the epochs are then further decomposed into 13 different features from the first intrinsic mode function (IMF). The features are compared using an original NB classifier in the first model. This is improved upon in a second model by using a genetic algorithm (Binary Grey Wolf Optimisation, Option 1) with a NB classifier. The third model uses a combination of the simple partial and complex partial seizures to provide the highest classification accuracy for each of the six seizures amongst the three models (20%, 53%, and 85% for first, second, and third model, respectively).

Submitted 28 April, 2022; v1 submitted 4 October, 2021; originally announced October 2021.

Comments: 6 pages, 3 figures, accepted for publication at the 21st IEEE Mediterranean Electrotechnical Conference (MELECON 2022)

Journal ref: 2022 IEEE 21st Mediterranean Electrotechnical Conference (MELECON)

arXiv:2104.15106 [pdf, other]

Latent Factor Decomposition Model: Applications for Questionnaire Data

Authors: Connor J. McLaughlin, Efi G. Kokkotou, Jean A. King, Lisa A. Conboy, Ali Yousefi

Abstract: The analysis of clinical questionnaire data comes with many inherent challenges. These challenges include the handling of data with missing fields, as well as the overall interpretation of a dataset with many fields of different scales and forms. While numerous methods have been developed to address these challenges, they are often not robust, statistically sound, or easily interpretable. Here, we propose a latent factor modeling framework that extends the principal component analysis for both categorical and quantitative data with missing elements. The model simultaneously provides the principal components (basis) and each patient's projections on these bases in a latent space. We show an application of our modeling framework through Irritable Bowel Syndrome (IBS) symptoms, where we find correlations between these projections and other standardized patient symptom scales. This latent factor model can be easily applied to different clinical questionnaire datasets for clustering analysis and interpretable inference.

Submitted 2 August, 2021; v1 submitted 30 April, 2021; originally announced April 2021.

Comments: Accepted for the 43rd IEEE Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2021

arXiv:1903.04421

Augmenting expert detection of early coronary artery occlusion from 12 lead electrocardiograms using deep learning

Authors: Rob Brisk, Raymond R Bond, Dewar D Finlay, James McLaughlin, Alicja Piadlo, Stephen J Leslie, David E Gossman, Ian B A Menown, David J McEneaney

Abstract: Early diagnosis of acute coronary artery occlusion based on electrocardiogram (ECG) findings is essential for prompt delivery of primary percutaneous coronary intervention. Current ST elevation (STE) criteria are specific but insensitive. Consequently, it is likely that many patients are missing out on potentially life-saving treatment. Experts combining non-specific ECG changes with STE detect ischaemia with higher sensitivity, but at the cost of specificity. We show that a deep learning model can detect ischaemia caused by acute coronary artery occlusion with a better balance of sensitivity and specificity than STE criteria, existing computerised analysers or expert cardiologists.

Submitted 18 November, 2019; v1 submitted 11 March, 2019; originally announced March 2019.

Comments: Our attempts to produce what we considered to be an acceptable level of explainability from our algorithm have not yielded a satisfactory account of its internal logic and we do not feel this is acceptable from a clinical application. We will publish a fuller account of our work on this issue and its implications on the validation of clinical deep learning algorithms in the near future

ACM Class: I.2.1

arXiv:1301.6972 [pdf, ps, other]

Using evolutionary computation to create vectorial Boolean functions with low differential uniformity and high nonlinearity

Authors: James McLaughlin, John A. Clark

Abstract: The two most important criteria for vectorial Boolean functions used as S-boxes in block ciphers are differential uniformity and nonlinearity. Previous work in this field has focused only on nonlinearity and a different criterion, autocorrelation. In this paper, we describe the results of experiments in using simulated annealing, memetic algorithms, and ant colony optimisation to create vectorial Boolean functions with low differential uniformity. Keywords: Metaheuristics, simulated annealing, memetic algorithms, ant colony optimization, cryptography, Boolean functions, vectorial Boolean functions.

Submitted 29 January, 2013; originally announced January 2013.

arXiv:1009.5626 [pdf, ps, other]

The Realizable Extension Problem and the Weighted Graph $(K_{3,3},l)$

Authors: Jonathan McLaughlin

Abstract: This note outlines the realizable extension problem for weighted graphs and provides results of a detailed analysis of this problem for the weighted graph $(K_{3,3},l)$. This analysis is then utilized to provide a result relating to the connectedness of the moduli space of planar realizations of $(K_{3,3},l)$. The note culminates with two examples which show that in general, realizability and connectedness results relating to the moduli spaces of weighted cycles which are contained in a larger weighted graph cannot be extended to similar results regarding the moduli space of the larger weighted graph.

Submitted 28 September, 2010; originally announced September 2010.

Comments: 15 pages, 14 figures

Showing 1–6 of 6 results for author: McLaughlin, J