-
Data-Driven Switchback Experiments: Theoretical Tradeoffs and Empirical Bayes Designs
Authors:
Ruoxuan Xiong,
Alex Chin,
Sean J. Taylor
Abstract:
We study the design and analysis of switchback experiments conducted on a single aggregate unit. The design problem is to partition the continuous time space into intervals and switch treatments between intervals, in order to minimize the estimation error of the treatment effect. We show that the estimation error depends on four factors: carryover effects, periodicity, serially correlated outcomes, and impacts from simultaneous experiments. We derive a rigorous bias-variance decomposition and show the tradeoffs of the estimation error from these factors. The decomposition provides three new insights in choosing a design: First, balancing the periodicity between treated and control intervals reduces the variance; second, switching less frequently reduces the bias from carryover effects while increasing the variance from correlated outcomes, and vice versa; third, randomizing interval start and end points reduces both bias and variance from simultaneous experiments. Combining these insights, we propose a new empirical Bayes design approach. This approach uses prior data and experiments for designing future experiments. We illustrate this approach using real data from a ride-sharing platform, yielding a design that reduces MSE by 33% compared to the status quo design used on the platform.
Submitted 10 June, 2024;
originally announced June 2024.
-
Brain Morphology Normative modelling platform for abnormality and Centile estimation: Brain MoNoCle
Authors:
Bethany Little,
Nida Alyas,
Alexander Surtees,
Gavin P Winston,
John S Duncan,
David A Cousins,
John-Paul Taylor,
Peter Taylor,
Karoline Leiberg,
Yujiang Wang
Abstract:
Normative models of brain structure estimate the effects of covariates such as age and sex using large samples of healthy controls. These models can then be applied to smaller clinical cohorts to distinguish disease effects from other covariates. However, these advanced statistical modelling approaches can be difficult to access, and processing large healthy cohorts is computationally demanding. Thus, accessible platforms with pre-trained normative models are needed.
We present such a platform for brain morphology analysis as an open-source web application (https://cnnplab.shinyapps.io/normativemodelshiny/), with six key features: (i) user-friendly web interface, (ii) individual and group outputs, (iii) multi-site analysis, (iv) regional and whole-brain analysis, (v) integration with existing tools, and (vi) multiple morphology metrics.
Using a diverse sample of 3,276 healthy controls across 21 sites, we pre-trained normative models on various metrics. We validated the models with a small clinical sample of individuals with bipolar disorder, showing outputs that aligned closely with existing literature only after applying our normative modelling. Further validation with a cohort of temporal lobe epilepsy showed agreement with previous group-level findings and individual-level seizure lateralisation. Finally, with the ability to investigate multiple morphology measures in the same framework, we found that biological covariates are better explained in specific morphology measures, and for clinical applications, only some measures are sensitive to the disease process.
Our platform offers a comprehensive framework to analyse brain morphology in clinical and research settings. Validations confirm the superiority of normative models and the advantage of investigating a range of brain morphology metrics together.
Submitted 26 June, 2024; v1 submitted 3 June, 2024;
originally announced June 2024.
-
Learning to Abstract Visuomotor Mappings using Meta-Reinforcement Learning
Authors:
Carlos A. Velazquez-Vargas,
Isaac Ray Christian,
Jordan A. Taylor,
Sreejan Kumar
Abstract:
We investigated the human capacity to acquire multiple visuomotor mappings for de novo skills. Using a grid navigation paradigm, we tested whether contextual cues, implemented as different "grid worlds", allow participants to learn two distinct key-mappings more efficiently. Our results indicate that when contextual information is provided, task performance is significantly better. The same held true for meta-reinforcement learning agents that differed in whether or not they receive contextual information when performing the task. We evaluated their accuracy in predicting human performance in the task and analyzed their internal representations. The results indicate that contextual cues allow the formation of separate representations in space and time when using different visuomotor mappings, whereas the absence of them favors sharing one representation. While both strategies can allow learning of multiple visuomotor mappings, we showed contextual cues provide a computational advantage in terms of how many mappings can be learned.
Submitted 5 February, 2024;
originally announced February 2024.
-
Inferring the dynamics of ionic currents from recursive piecewise data assimilation of approximate neuron models
Authors:
Stephen A. Wells,
Joseph D. Taylor,
Paul G. Morris,
Alain Nogaret
Abstract:
We construct neuron models from data by transferring information from an observed time series to the state variables and parameters of Hodgkin-Huxley models. When the learning period completes, the model will predict additional observations and its parameters uniquely characterise the complement of ion channels. However, the assimilation of biological data, as opposed to model data, is complicated by the lack of knowledge of the true neuron equations. Reliance on guessed conductance models is plagued with multi-valued parameter solutions. Here, we report on the distributions of parameters and currents predicted with intentionally erroneous models, over-specified models, and an approximate model fitting hippocampal neuron data. We introduce a recursive piecewise data assimilation (RPDA) algorithm that converges with near-perfect reliability when the model is known. When the model is unknown, we show model error introduces correlations between certain parameters. The ionic currents reconstructed from these parameters are excellent predictors of true currents and carry a higher degree of confidence, >95.5%, than underlying parameters, >53%. Unexpressed ionic currents are correctly filtered out even in the presence of mild model error. When the model is unknown, the covariance eigenvalues of parameter estimates are found to be a good gauge of model error. Our results suggest that biological information may be retrieved from data by focussing on current estimates rather than parameters.
Submitted 20 December, 2023;
originally announced December 2023.
-
Reproducible image-based profiling with Pycytominer
Authors:
Erik Serrano,
Srinivas Niranj Chandrasekaran,
Dave Bunten,
Kenneth I. Brewer,
Jenna Tomkinson,
Roshan Kern,
Michael Bornholdt,
Stephen Fleming,
Ruifan Pei,
John Arevalo,
Hillary Tsang,
Vincent Rubinetti,
Callum Tromans-Coia,
Tim Becker,
Erin Weisbart,
Charlotte Bunne,
Alexandr A. Kalinin,
Rebecca Senft,
Stephen J. Taylor,
Nasim Jamali,
Adeniyi Adeboye,
Hamdah Shafqat Abbasi,
Allen Goodman,
Juan C. Caicedo,
Anne E. Carpenter
, et al. (3 additional authors not shown)
Abstract:
Advances in high-throughput microscopy have enabled the rapid acquisition of large numbers of high-content microscopy images. Whether by deep learning or classical algorithms, image analysis pipelines then produce single-cell features. To process these single cells for downstream applications, we present Pycytominer, a user-friendly, open-source Python package that implements the bioinformatics steps, known as image-based profiling. We demonstrate Pycytominer's usefulness in a machine learning project to predict nuisance compounds that cause undesirable cell injuries.
Submitted 2 July, 2024; v1 submitted 22 November, 2023;
originally announced November 2023.
-
Multivariate brain-cognition associations in euthymic bipolar disorder
Authors:
Bethany Little,
Carly Flowers,
Andrew Blamire,
Peter Thelwall,
John-Paul Taylor,
Peter Gallagher,
David Andrew Cousins,
Yujiang Wang
Abstract:
Background: People with bipolar disorder (BD) tend to show widespread cognitive impairment compared to healthy controls. Impairments in processing speed (PS), attention, and executive function (EF) may represent 'core' impairments that have a role in wider cognitive dysfunction. Cognitive impairments appear to relate to structural brain abnormalities in BD, but whether core deficits are related to particular brain regions is unclear and much of the research on brain-cognition associations is limited by univariate analysis and small samples.
Methods: Euthymic BD patients (n=56) and matched healthy controls (n=26) underwent T1-weighted MRI scans and completed neuropsychological tests of PS, attention, and EF. We utilised public datasets to develop a normative model of cortical thickness (n=5,977) to generate robust estimations of cortical abnormalities in patients. Canonical correlation analysis was used to assess multivariate brain-cognition associations in BD, controlling for age, sex, and premorbid IQ.
Results: BD showed impairments on tests of PS, attention, and EF, and abnormal cortical thickness in several brain regions compared to healthy controls. Impairments in tests of PS and EF were most strongly associated with cortical thickness in left inferior temporal, right entorhinal, and right temporal pole areas.
Conclusion: Impairments in PS, attention, and EF can be observed in euthymic BD and may be related to abnormal cortical thickness in temporal regions. Future research should continue to leverage multivariate methods to examine complex brain-cognition associations in BD. Future research may benefit from exploring covariance between traditional brain structural morphological metrics such as cortical thickness, cortical volume, and surface area.
Submitted 2 November, 2023;
originally announced November 2023.
-
Single shot diagnosis of ion channel dysfunction from assimilation of cell membrane dynamics
Authors:
Paul G Morris,
Joseph D. Taylor,
Julian F. R. Paton,
Alain Nogaret
Abstract:
Many neurological diseases originate in the dysfunction of cellular ion channels. Their diagnosis presents a challenge especially when alterations in the complement of ion channels are a priori unknown. Current approaches based on voltage clamps lack the throughput necessary to identify the mutations causing changes in electrical activity. Here, we introduce a single-shot method for diagnosing changes in the complement of ion channels from changes in the electrical activity of a cell. We developed data assimilation (DA) to estimate the parameters of individual ion channels and from these parameters reconstruct the ionic currents of hippocampal CA1 neurons to within 11% of their actual value. DA correctly predicts which ionic current is altered and by how much after we blocked the BK, SK, A and HCN channels with selective antagonists of known potency. We anticipate our assay will transform the treatment of neurological disease through comprehensive diagnosis and drug screening.
Submitted 19 July, 2023;
originally announced July 2023.
-
MITI Minimum Information guidelines for highly multiplexed tissue images
Authors:
Denis Schapiro,
Clarence Yapp,
Artem Sokolov,
Sheila M. Reynolds,
Yu-An Chen,
Damir Sudar,
Yubin Xie,
Jeremy L. Muhlich,
Raquel Arias-Camison,
Sarah Arena,
Adam J. Taylor,
Milen Nikolov,
Madison Tyler,
Jia-Ren Lin,
Erik A. Burlingame,
Human Tumor Atlas Network,
Young H. Chang,
Samouil L Farhi,
Vésteinn Thorsson,
Nithya Venkatamohan,
Julia L. Drewes,
Dana Pe'er,
David A. Gutman,
Markus D. Herrmann,
Nils Gehlenborg
, et al. (14 additional authors not shown)
Abstract:
The imminent release of tissue atlases combining multi-channel microscopy with single cell sequencing and other omics data from normal and diseased specimens creates an urgent need for data and metadata standards that guide data deposition, curation and release. We describe a Minimum Information about highly multiplexed Tissue Imaging (MITI) standard that applies best practices developed for genomics and other microscopy data to highly multiplexed tissue images and traditional histology.
Submitted 23 February, 2022; v1 submitted 21 August, 2021;
originally announced August 2021.
-
How do we generalize?
Authors:
Jessica Elizabeth Taylor,
Aurelio Cortese,
Helen C. Barron,
Xiaochuan Pan,
Masamichi Sakagami,
Dagmar Zeithamova
Abstract:
Humans and animals are able to generalize or transfer information from previous experience so that they can behave appropriately in novel situations. What mechanisms--computations, representations, and neural systems--give rise to this remarkable ability? The members of this Generative Adversarial Collaboration (GAC) come from a range of academic backgrounds but are all interested in uncovering the mechanisms of generalization. We started out this GAC with the aim of arbitrating between two alternative conceptual accounts: (1) generalization stems from integration of multiple experiences into summary representations that reflect generalized knowledge, and (2) generalization is computed on-the-fly using separately stored individual memories. Across the course of this collaboration, we found that--despite using different terminology and techniques, and although some of our specific papers may provide evidence one way or the other--we in fact largely agree that both of these broad accounts (as well as several others) are likely valid. We believe that future research and theoretical synthesis across multiple lines of research is necessary to help determine the degree to which different candidate generalization mechanisms may operate simultaneously, operate on different scales, or be employed under distinct conditions. Here, as the first step, we introduce some of these candidate mechanisms and we discuss the issues currently hindering better synthesis of generalization research. Finally, we introduce some of our own research questions that have arisen over the course of this GAC, that we believe would benefit from future collaborative efforts.
Submitted 27 August, 2021; v1 submitted 2 April, 2021;
originally announced April 2021.
-
Quantum Computing at the Frontiers of Biological Sciences
Authors:
Prashant S. Emani,
Jonathan Warrell,
Alan Anticevic,
Stefan Bekiranov,
Michael Gandal,
Michael J. McConnell,
Guillermo Sapiro,
Alán Aspuru-Guzik,
Justin Baker,
Matteo Bastiani,
Patrick McClure,
John Murray,
Stamatios N Sotiropoulos,
Jacob Taylor,
Geetha Senthil,
Thomas Lehner,
Mark B. Gerstein,
Aram W. Harrow
Abstract:
The search for meaningful structure in biological data has relied on cutting-edge advances in computational technology and data science methods. However, challenges arise as we push the limits of scale and complexity in biological problems. Innovation in massively parallel, classical computing hardware and algorithms continues to address many of these challenges, but there is a need to simultaneously consider new paradigms to circumvent current barriers to processing speed. Accordingly, we articulate a view towards quantum computation and quantum information science, where algorithms have demonstrated potential polynomial and exponential computational speedups in certain applications, such as machine learning. The maturation of the field of quantum computing, in hardware and algorithm development, also coincides with the growth of several collaborative efforts to address questions across length and time scales, and scientific disciplines. We use this coincidence to explore the potential for quantum computing to aid in one such endeavor: the merging of insights from genetics, genomics, neuroimaging and behavioral phenotyping. By examining joint opportunities for computational innovation across fields, we highlight the need for a common language between biological data analysis and quantum computing. Ultimately, we consider current and future prospects for the employment of quantum computing algorithms in the biological sciences.
Submitted 16 November, 2019;
originally announced November 2019.
-
The reliability of a deep learning model in clinical out-of-distribution MRI data: a multicohort study
Authors:
Gustav Mårtensson,
Daniel Ferreira,
Tobias Granberg,
Lena Cavallin,
Ketil Oppedal,
Alessandro Padovani,
Irena Rektorova,
Laura Bonanni,
Matteo Pardini,
Milica Kramberger,
John-Paul Taylor,
Jakub Hort,
Jón Snædal,
Jaime Kulisevsky,
Frederic Blanc,
Angelo Antonini,
Patrizia Mecocci,
Bruno Vellas,
Magda Tsolaki,
Iwona Kłoszewska,
Hilkka Soininen,
Simon Lovestone,
Andrew Simmons,
Dag Aarsland,
Eric Westman
Abstract:
Deep learning (DL) methods have in recent years yielded impressive results in medical imaging, with the potential to function as a clinical aid to radiologists. However, DL models in medical imaging are often trained on public research cohorts with images acquired with a single scanner or with strict protocol harmonization, which is not representative of a clinical setting. The aim of this study was to investigate how well a DL model performs in unseen clinical data sets (collected with different scanners, protocols and disease populations) and whether more heterogeneous training data improves generalization. In total, 3117 MRI scans of brains from multiple dementia research cohorts and memory clinics, which had been visually rated by a neuroradiologist according to Scheltens' scale of medial temporal atrophy (MTA), were included in this study. By training multiple versions of a convolutional neural network on different subsets of this data to predict MTA ratings, we assessed the impact that including images from a wider distribution during training had on performance in external memory clinic data. Our results showed that our model generalized well to data sets acquired with protocols similar to those of the training data, but substantially worse in clinical cohorts with visibly different tissue contrasts in the images. This implies that future DL studies investigating performance in out-of-distribution (OOD) MRI data need to assess multiple external cohorts for reliable results. Further, by including data from a wider range of scanners and protocols the performance improved in OOD data, which suggests that more heterogeneous training data makes the model generalize better. To conclude, this is the most comprehensive study to date investigating the domain shift in deep learning on MRI data, and we advocate rigorous evaluation of DL models on clinical data prior to being certified for deployment.
Submitted 1 November, 2019;
originally announced November 2019.
-
Whole genome sequencing identifies putative associations between genomic polymorphisms and clinical response to the antiepileptic drug levetiracetam
Authors:
DV Vavoulis,
AT Pagnamenta,
SJL Knight,
MM Pentony,
M Armstrong,
EC Galizia,
S Balestrini,
SM Sisodiya,
JC Taylor
Abstract:
In the context of pharmacogenomics, whole genome sequencing provides a powerful approach for identifying correlations between response variability to specific drugs and genomic polymorphisms in a population, in an unbiased manner. In this study, we employed whole genome sequencing of DNA samples from patients showing extreme response (n=72) and non-response (n=27) to the antiepileptic drug levetiracetam, in order to identify genomic variants that underlie response to the drug. Although no common SNP (MAF>5%) crossed the conventional genome-wide significance threshold of 5e-8, we found common polymorphisms in genes SPNS3, HDC, MDGA2, NSG1 and RASGEF1C, which collectively predict clinical response to levetiracetam in our cohort with ~91% predictive accuracy. Among these genes, HDC, NSG1, MDGA2 and RASGEF1C are potentially implicated in synaptic neurotransmission, while SPNS3 is an atypical solute carrier transporter homologous to SV2A, the known molecular target of levetiracetam. Furthermore, we performed gene- and pathway-based statistical analysis on sets of rare and low-frequency variants (MAF<5%) and we identified associations between the following genes or pathways and response to levetiracetam: a) genes PRKCB and DLG2, which are involved in glutamatergic neurotransmission, a known target of anticonvulsants, including levetiracetam; b) genes FILIP1 and SEMA6D, which are involved in axon guidance and modelling of neural connections; and c) pathways with a role in synaptic neurotransmission, such as WNT5A-dependent internalization of FZD4 and disinhibition of SNARE formation. In summary, our approach to utilise whole genome sequencing on subjects with extreme response phenotypes is a feasible route to generate plausible hypotheses for investigating the genetic factors underlying drug response variability in cases of pharmaco-resistant epilepsy.
Submitted 23 September, 2019;
originally announced September 2019.
-
Imaging the Developing Heart: Synchronized Timelapse Microscopy During Developmental Changes
Authors:
Carl J. Nelson,
Charlotte Buckley,
John J. Mullins,
Martin A. Denvir,
Jonathan Taylor
Abstract:
How do you use imaging to analyse the development of the heart, which not only changes shape but also undergoes constant, high-speed, quasi-periodic changes? We have integrated ideas from prospective and retrospective optical gating to capture long-term, phase-locked developmental time-lapse videos. In this paper we demonstrate the success of this approach over a key developmental time period: heart looping, where large changes in heart shape prevent previous prospective gating approaches from capturing phase-locked videos. We use the comparison with other approaches to in vivo heart imaging to highlight the importance of collecting the most appropriate data for the biological question.
Submitted 15 February, 2018;
originally announced February 2018.
-
4D blood flow mapping using SPIM-microPIV in the developing zebrafish heart
Authors:
Vytautas Zickus,
Jonathan M. Taylor
Abstract:
Fluid-structure interaction in the developing heart is an active area of research in developmental biology. However, investigation of heart dynamics is mostly limited to computational fluid dynamics simulations using heart wall structure information only, or single plane blood flow information - so there is a need for 3D + time resolved data to fully understand cardiac function. We present an imaging platform combining selective plane illumination microscopy (SPIM) with micro particle image velocimetry (µPIV) to enable 3D-resolved flow mapping in a microscopic environment, free from many of the sources of error and bias present in traditional epifluorescence-based µPIV systems. By using our new system in conjunction with optical heart beat synchronisation, we demonstrate the ability to obtain non-invasive 3D + time resolved blood flow measurements in the heart of a living zebrafish embryo.
Submitted 15 February, 2018;
originally announced February 2018.
-
Proteins analysed as virtual knots
Authors:
Keith Alexander,
Alexander J Taylor,
Mark R Dennis
Abstract:
Long, flexible physical filaments are naturally tangled and knotted, from macroscopic string down to long-chain molecules. The existence of knotting in a filament naturally affects its configuration and properties, and may be very stable or disappear rapidly under manipulation and interaction. Knotting has been previously identified in protein backbone chains, for which these mechanical constraints are of fundamental importance to their molecular functionality, despite their being open curves in which the knots are not mathematically well defined; knotting can only be identified by closing the termini of the chain somehow. We introduce a new method for resolving knotting in open curves using virtual knots, a wider class of topological objects that do not require a classical closure and so naturally capture the topological ambiguity inherent in open curves. We describe the results of analysing proteins in the Protein Data Bank by this new scheme, recovering and extending previous knotting results, and identifying topological interest in some new cases. The statistics of virtual knots in protein chains are compared with those of open random walks and Hamiltonian subchains on cubic lattices, identifying a regime of open curves in which the virtual knotting description is likely to be important.
Submitted 18 November, 2016;
originally announced November 2016.
-
Single Molecule Data Analysis: An Introduction
Authors:
Meysam Tavakoli,
J. Nicholas Taylor,
Chun-Biu Li,
Tamiki Komatsuzaki,
Steve Pressé
Abstract:
We review methods of data analysis for biophysical data with a special emphasis on single molecule applications. Our review is intended for anyone, from student to established researcher. For someone just getting started, we focus on exposing the logic, strength and limitations of each method and cite, as appropriate, the relevant literature for implementation details. We review traditional frequentist and Bayesian parametric approaches to data analysis and subsequently extend our discussion to recent non-parametric and information theoretic methods.
Submitted 23 November, 2016; v1 submitted 1 June, 2016;
originally announced June 2016.
-
Predicting age of human subjects based on structural connectivity from diffusion tensor imaging
Authors:
Cheol E. Han,
Luis R. Peraza,
John-Paul Taylor,
Marcus Kaiser
Abstract:
Predicting brain maturity using noninvasive magnetic resonance images (MRI) can distinguish different age groups and help to assess neurodevelopmental disorders. However, group-wise differences are often less informative for assessing features of individuals. Here, we propose a simple method to predict the age of an individual subject solely based on structural connectivity data from diffusion tensor imaging (DTI). Our simple predictor computed a weighted sum of the strength of all connections of an individual. The weight consists of the fiber strength, given by the number of streamlines following tract tracing, multiplied by the importance of that connection for an observed feature--age in this case. We tested this approach using DTI data from 121 healthy subjects aged 4 to 85 years. After determining importance in a training dataset, our predicted ages in the test dataset showed a strong correlation (rho = 0.77) with real age deviating by, on average, only 10 years.
Submitted 20 May, 2014;
originally announced May 2014.
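The weighted-sum predictor described in the abstract above can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how connection "importance" is estimated or how the summed score is mapped to years, so the sketch assumes a Pearson correlation with age for the importance and a linear fit for the mapping. All function names are hypothetical.

```python
import numpy as np

def fit_importance(train_conn, train_age):
    """Assumed importance: Pearson correlation of each connection with age.

    train_conn: (n_subjects, n_connections) streamline counts, flattened.
    train_age:  (n_subjects,) ages in years.
    """
    x = train_conn - train_conn.mean(axis=0)
    y = train_age - train_age.mean()
    num = x.T @ y
    denom = np.sqrt((x ** 2).sum(axis=0) * (y ** 2).sum())
    # Connections with zero variance get zero importance.
    return np.divide(num, denom, out=np.zeros_like(num), where=denom > 0)

def connectivity_score(conn, importance):
    """Weighted sum over connections: fiber strength times importance."""
    return conn @ importance

def fit_age_map(scores, ages):
    """Assumed linear map from the summed score to age in years."""
    slope, intercept = np.polyfit(scores, ages, 1)
    return slope, intercept

def predict_age(conn, importance, slope, intercept):
    """Predict age for new subjects from their connectivity rows."""
    return slope * connectivity_score(conn, importance) + intercept
```

In use, `fit_importance` and `fit_age_map` would be run on the training subjects only, and `predict_age` applied to held-out subjects, mirroring the train/test split mentioned in the abstract.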
-
Worldwide Patterns of Ancestry, Divergence, and Admixture in Domesticated Cattle
Authors:
Jared E. Decker,
Stephanie D. McKay,
Megan M. Rolf,
JaeWoo Kim,
Antonio Molina Alcalá,
Tad S. Sonstegard,
Olivier Hanotte,
Anders Götherström,
Christopher M. Seabury,
Lisa Praharani,
Masroor Ellahi Babar,
Luciana Correia de Almeida Regitano,
Mehmet Ali Yildiz,
Michael P. Heaton,
Wansheng Lui,
Chu-Zhao Lei,
James M. Reecy,
Muhammad Saif-Ur-Rehman,
Robert D. Schnabel,
Jeremy F. Taylor
Abstract:
The domestication and development of cattle has considerably impacted human societies, but the histories of cattle breeds have been poorly understood, especially for African, Asian, and American breeds. Using genotypes from 43,043 autosomal single nucleotide polymorphism markers scored in 1,543 animals, we evaluate the population structure of 134 domesticated bovid breeds. Regardless of the analytical method or sample subset, the three major groups of Asian indicine, Eurasian taurine, and African taurine were consistently observed. Patterns of geographic dispersal resulting from co-migration with humans and exportation are recognizable in phylogenetic networks. All analytical methods reveal patterns of hybridization which occurred after divergence. Using 19 breeds, we map the cline of indicine introgression into Africa. We infer that African taurine possess a large portion of wild African auroch ancestry, causing their divergence from Eurasian taurine. We detect exportation patterns in Asia and identify a cline of Eurasian taurine/indicine hybridization in Asia. We also identify the influence of species other than Bos taurus in the formation of Asian breeds. We detect the pronounced influence of Shorthorn cattle in the formation of European breeds. Iberian and Italian cattle possess introgression from African taurine. American Criollo cattle are shown to be of Iberian, and not African, descent. Indicine introgression into American cattle occurred in the Americas, and not Europe. We argue that cattle migration, movement and trading followed by admixture have been important forces in shaping modern bovine genomic variation.
Submitted 2 January, 2014; v1 submitted 19 September, 2013;
originally announced September 2013.
-
Identification of criticality in neuronal avalanches: II. A theoretical and empirical investigation of the driven case
Authors:
Caroline Hartley,
Timothy J Taylor,
Istvan Z Kiss,
Simon F Farmer,
Luc Berthouze
Abstract:
The observation of apparent power-laws in neuronal systems has led to the suggestion that the brain is at, or close to, a critical state and may be a self-organised critical system. Within the framework of self-organised criticality a separation of timescales is thought to be crucial for the observation of power-law dynamics and computational models are often constructed with this property. However, this is not necessarily a characteristic of physiological neural networks - external input does not only occur when the network is at rest/a steady state. In this paper we study a simple neuronal network model driven by a continuous external input (i.e., the model does not have a separation of timescales) and analytically tuned to operate in the region of a critical state (it reaches the critical regime exactly in the absence of input - the case studied in the companion paper to this article). The system displays avalanche dynamics in the form of cascades of neuronal firing separated by periods of silence. We observe partial scale-free behaviour in the distribution of avalanche size for low levels of external input. We analytically derive the distributions of waiting times and investigate their temporal behaviour in relation to different levels of external input, showing that the system's dynamics can exhibit partial long-range temporal correlations. We further show that as the system approaches the critical state by two alternative 'routes', different markers of criticality (partial scale-free behaviour and long-range temporal correlations) are displayed. This suggests that signatures of criticality exhibited by a particular system in close proximity to a critical state are dependent on the region in parameter space at which the system (currently) resides.
Submitted 13 September, 2013;
originally announced September 2013.
-
Interdependency and hierarchy of exact epidemic models on networks
Authors:
Timothy J Taylor,
Istvan Z Kiss
Abstract:
Over the years numerous models of SIS (susceptible - infected - susceptible) disease dynamics unfolding on networks have been proposed. Here, we discuss the links between many of these models and how they can be viewed as more general motif-based models. We illustrate how the different models can be derived from one another and, where this is not possible, discuss extensions to established models that enable this derivation. We also derive a general result for the exact differential equations for the expected number of an arbitrary motif directly from the Kolmogorov/master equations and conclude with a comparison of the performance of the different closed systems of equations on networks of varying structure.
Submitted 9 April, 2013; v1 submitted 13 December, 2012;
originally announced December 2012.
-
Detection of selective sweeps in cattle using genome-wide SNP data
Authors:
Holly R. Ramey,
Jared E. Decker,
Stephanie D. McKay,
Megan M. Rolf,
Robert D. Schnabel,
Jeremy F. Taylor
Abstract:
The domestication and subsequent selection by humans to create breeds of cattle undoubtedly altered the patterning of variation within their genomes. Strong selection to fix advantageous large-effect mutations underlying domesticability, breed characteristics or productivity created selective sweeps in which variation was lost in the chromosomal region flanking the selected allele. Selective sweeps have been identified in the genomes of many species including humans, dogs, horses, and chickens. We attempt to identify regions of the bovine genome that have been subjected to selective sweeps. Two datasets were used for the discovery and validation of selective sweeps via the fixation of alleles at a series of contiguous SNP loci. BovineSNP50 data were used to identify 28 putative sweep regions among 14 cattle breeds. Affymetrix BOS 1 prescreening assay data for five breeds were used to identify 114 regions and validate 5 regions identified using the BovineSNP50 data. Many genes are located within these regions; however, phenotypes that we predict to have historically been under strong selection include horned-polled, coat color, stature, ear morphology, and behavior. The identified selective sweeps represent recent events associated with breed formation rather than ancient events associated with domestication. No sweep regions were shared between indicine and taurine breeds reflecting their divergent selection histories. A primary finding of this study is the sensitivity of results to assay resolution. Despite the bias towards common SNPs in the BovineSNP50 design, false positive sweep regions appear to be common due to the limited resolution of the assay. This assay design bias leads to the detection of breed-specific sweep regions, or regions shared by a small number of breeds, restricting the suite of selected phenotypes detected to primarily those associated with breed characteristics.
Submitted 10 December, 2012;
originally announced December 2012.
-
Identification of criticality in neuronal avalanches: I. A theoretical investigation of the non-driven case
Authors:
Timothy J. Taylor,
Caroline Hartley,
Péter L. Simon,
Istvan Z Kiss,
Luc Berthouze
Abstract:
In this paper we study a simple model of a purely excitatory neural network that, by construction, operates at a critical point. This model allows us to consider various markers of criticality and illustrate how they should perform in a finite-size system. By calculating the exact distribution of avalanche sizes we are able to show that, over a limited range of avalanche sizes which we precisely identify, the distribution has scale free properties but is not a power law. This suggests that it would be inappropriate to dismiss a system as not being critical purely based on an inability to rigorously fit a power law distribution as has been recently advocated. In assessing whether a system, especially a finite-size one, is critical it is thus important to consider other possible markers. We illustrate one of these by showing the divergence of susceptibility as the critical point of the system is approached. Finally, we provide evidence that power laws may underlie other observables of the system, that may be more amenable to robust experimental assessment.
Submitted 31 October, 2012;
originally announced October 2012.
-
Punishment can promote defection in group-structured populations
Authors:
Simon T. Powers,
Daniel J. Taylor,
Joanna J. Bryson
Abstract:
Pro-social punishment, whereby cooperators punish defectors, is often suggested as a mechanism that maintains cooperation in large human groups. Importantly, models that support this idea have to date only allowed defectors to be the target of punishment. However, recent empirical work has demonstrated the existence of anti-social punishment in public goods games. That is, individuals that defect have been found to also punish cooperators. Some recent theoretical studies have found that such anti-social punishment can prevent the evolution of pro-social punishment and cooperation. However, the evolution of anti-social punishment in group-structured populations has not been formally addressed. Previous work has informally argued that group-structure must favour pro-social punishment. Here we formally investigate how two demographic factors, group size and dispersal frequency, affect selection pressures on pro- and anti-social punishment. Contrary to the suggestions of previous work, we find that anti-social punishment can prevent the evolution of pro-social punishment and cooperation under a range of group structures. Given that anti-social punishment has now been found in all studied extant human cultures, the claims of previous models showing the co-evolution of pro-social punishment and cooperation in group-structured populations should be re-evaluated.
Submitted 22 August, 2012; v1 submitted 20 June, 2012;
originally announced June 2012.
-
Epidemic threshold and control in a dynamic network
Authors:
Michael Taylor,
Timothy J. Taylor,
Istvan Z. Kiss
Abstract:
In this paper we present a model describing Susceptible-Infected-Susceptible (SIS) type epidemics spreading on a dynamic contact network with random link activation and deletion where link activation can be locally constrained. We use and adapt an improved effective degree compartmental modelling framework recently proposed by Lindquist et al. [J. Lindquist et al., J. Math Biol. 62, 2, 143 (2010)] and Marceau et al. [V. Marceau et al., Phys. Rev. E 82, 036116 (2010)]. The resulting set of ordinary differential equations (ODEs) is solved numerically and results are compared to those obtained using individual-based stochastic network simulation. We show that the ODEs display excellent agreement with simulation for the evolution of both the disease and the network, and are able to accurately capture the epidemic threshold for a wide range of parameters. We also present an analytical R0 calculation for the dynamic network model and show that depending on the relative timescales of the network evolution and disease transmission two limiting cases are recovered: (i) the static network case when network evolution is slow and (ii) homogeneous random mixing when the network evolution is rapid. We also use our threshold calculation to highlight the dangers of relying on local stability analysis when predicting epidemic outbreaks on evolving networks.
Submitted 13 December, 2011; v1 submitted 18 October, 2011;
originally announced October 2011.
-
An Algorithm for RNA Pseudoknots
Authors:
M. Pillsbury,
J. A. Taylor,
H. Orland,
A. Zee
Abstract:
We further develop the large $N$ formalism presented by some of us in earlier works in order to recursively calculate the partition function of a singly pseudoknotted RNA. We demonstrate that this calculation takes time proportional to the sixth power of the length of the RNA. The algorithm itself is presented in a self-contained form for the convenience of readers interested in implementing it.
Submitted 2 September, 2005; v1 submitted 21 October, 2003;
originally announced October 2003.