-
EpiGeoPop: A Tool for Develo** Spatially Accurate Country-level Epidemiological Models
Authors:
Lara Herriott,
Henriette L. Capel,
Isaac Ellmen,
Nathan Schofield,
Jiayuan Zhu,
Ben Lambert,
David Gavaghan,
Ioana Bouros,
Richard Creswell,
Kit Gallagher
Abstract:
Mathematical models play a crucial role in understanding the spread of infectious disease outbreaks and influencing policy decisions. These models aid pandemic preparedness by predicting outcomes under hypothetical scenarios and identifying weaknesses in existing frameworks. However, their accuracy, utility, and comparability are being scrutinized. Agent-based models (ABMs) have emerged as a valua…
▽ More
Mathematical models play a crucial role in understanding the spread of infectious disease outbreaks and influencing policy decisions. These models aid pandemic preparedness by predicting outcomes under hypothetical scenarios and identifying weaknesses in existing frameworks. However, their accuracy, utility, and comparability are being scrutinized. Agent-based models (ABMs) have emerged as a valuable tool, capturing population heterogeneity and spatial effects, particularly when assessing intervention strategies. Here we present EpiGeoPop, a user-friendly tool for rapidly preparing spatially accurate population configurations of entire countries. EpiGeoPop helps to address the problem of complex and time-consuming model set up in ABMs, specifically improving the integration of spatial detail. We subsequently demonstrate the importance of accurate spatial detail in ABM simulations of disease outbreaks using Epiabm, an ABM based on Imperial College London's CovidSim with improved modularity, documentation and testing. Our investigation involves the interplay between population density, the implementation of spatial transmission, and realistic interventions implemented in Epiabm.
△ Less
Submitted 20 October, 2023;
originally announced October 2023.
-
A Deep Metric Learning Approach to Account Linking
Authors:
Aleem Khan,
Elizabeth Fleming,
Noah Schofield,
Marcus Bishop,
Nicholas Andrews
Abstract:
We consider the task of linking social media accounts that belong to the same author in an automated fashion on the basis of the content and metadata of their corresponding document streams. We focus on learning an embedding that maps variable-sized samples of user activity -- ranging from single posts to entire months of activity -- to a vector space, where samples by the same author map to nearb…
▽ More
We consider the task of linking social media accounts that belong to the same author in an automated fashion on the basis of the content and metadata of their corresponding document streams. We focus on learning an embedding that maps variable-sized samples of user activity -- ranging from single posts to entire months of activity -- to a vector space, where samples by the same author map to nearby points. The approach does not require human-annotated data for training purposes, which allows us to leverage large amounts of social media content. The proposed model outperforms several competitive baselines under a novel evaluation framework modeled after established recognition benchmarks in other domains. Our method achieves high linking accuracy, even with small samples from accounts not seen at training time, a prerequisite for practical applications of the proposed linking framework.
△ Less
Submitted 15 May, 2021;
originally announced May 2021.
-
Survey on data management in radiation protection research
Authors:
Balázs G. Madas,
Paul N. Schofield
Abstract:
The importance of datasharing is of increasing concern to funding bodies and institutions. With some prescience, the radiobiology community has established data sharing infrastructures over the last two decades, including STORE; however, the utilisation of these databases is disappointing. The aim of the present study was to identify the current state of datasharing amongst researchers in radiatio…
▽ More
The importance of datasharing is of increasing concern to funding bodies and institutions. With some prescience, the radiobiology community has established data sharing infrastructures over the last two decades, including STORE; however, the utilisation of these databases is disappointing. The aim of the present study was to identify the current state of datasharing amongst researchers in radiation protection, and to identify barriers to effective sharing. An electronic survey was prepared, including questions on post-publication data provision, institutional, funding agency, and journal policies, awareness of datasharing infrastructures, attitudinal barriers, and technical support. The survey was sent to the members of a mailing list maintained by the EC funded CONCERT project. Responses identified that the radiation protection community shared similar concerns to other groups canvassed in earlier studies; the perceived negative impact of datasharing on competitiveness, career development and reputation, along with concern about the costs of data management. More surprising was the lack of awareness of existing datasharing platforms. We find that there is a clear need for education and training in data management and for a significant programme of improving awareness of Open Data issues.
△ Less
Submitted 8 May, 2018;
originally announced May 2018.
-
Analysis of the human diseasome reveals phenotype modules across common, genetic, and infectious diseases
Authors:
Robert Hoehndorf,
Paul N Schofield,
Georgios V Gkoutos
Abstract:
Phenotypes are the observable characteristics of an organism arising from its response to the environment. Phenotypes associated with engineered and natural genetic variation are widely recorded using phenotype ontologies in model organisms, as are signs and symptoms of human Mendelian diseases in databases such as OMIM and Orphanet. Exploiting these resources, several computational methods have b…
▽ More
Phenotypes are the observable characteristics of an organism arising from its response to the environment. Phenotypes associated with engineered and natural genetic variation are widely recorded using phenotype ontologies in model organisms, as are signs and symptoms of human Mendelian diseases in databases such as OMIM and Orphanet. Exploiting these resources, several computational methods have been developed for integration and analysis of phenotype data to identify the genetic etiology of diseases or suggest plausible interventions. A similar resource would be highly useful not only for rare and Mendelian diseases, but also for common, complex and infectious diseases. We apply a semantic text- mining approach to identify the phenotypes (signs and symptoms) associated with over 8,000 diseases. We demonstrate that our method generates phenotypes that correctly identify known disease-associated genes in mice and humans with high accuracy. Using a phenotypic similarity measure, we generate a human disease network in which diseases that share signs and symptoms cluster together, and we use this network to identify phenotypic disease modules.
△ Less
Submitted 26 November, 2014; v1 submitted 3 November, 2014;
originally announced November 2014.
-
Aber-OWL: a framework for ontology-based data access in biology
Authors:
Robert Hoehndorf,
Luke Slater,
Paul N. Schofield,
Georgios V. Gkoutos
Abstract:
Many ontologies have been developed in biology and these ontologies increasingly contain large volumes of formalized knowledge commonly expressed in the Web Ontology Language (OWL). Computational access to the knowledge contained within these ontologies relies on the use of automated reasoning. We have developed the Aber-OWL infrastructure that provides reasoning services for bio-ontologies. Aber-…
▽ More
Many ontologies have been developed in biology and these ontologies increasingly contain large volumes of formalized knowledge commonly expressed in the Web Ontology Language (OWL). Computational access to the knowledge contained within these ontologies relies on the use of automated reasoning. We have developed the Aber-OWL infrastructure that provides reasoning services for bio-ontologies. Aber-OWL consists of an ontology repository, a set of web services and web interfaces that enable ontology-based semantic access to biological data and literature. Aber-OWL is freely available at http://aber-owl.net.
△ Less
Submitted 25 July, 2014;
originally announced July 2014.