Search | arXiv e-print repository

Bayesian design for mathematical models of fruit growth based on misspecified prior information

Authors: Nushrath Najimuddin, David J. Warne, Helen Thompson, James M. McGree

Abstract: Bayesian design can be used for efficient data collection over time when the process can be described by the solution to an ordinary differential equation (ODE). Typically, Bayesian designs in such settings are obtained by maximising the expected value of a utility function that is derived from the joint probability distribution of the parameters and the response, given prior information about an… ▽ More Bayesian design can be used for efficient data collection over time when the process can be described by the solution to an ordinary differential equation (ODE). Typically, Bayesian designs in such settings are obtained by maximising the expected value of a utility function that is derived from the joint probability distribution of the parameters and the response, given prior information about an appropriate ODE. However, in practice, appropriately defining such information \textit{a priori} can be difficult due to incomplete knowledge about the mechanisms that govern how the process evolves over time. In this paper, we propose a method for finding Bayesian designs based on a flexible class of ODEs. Specifically, we consider the inclusion of spline terms into ODEs to provide flexibility in modelling how the process changes over time. We then propose to leverage this flexibility to form designs that are efficient even when the prior information is misspecified. Our approach is motivated by a sampling problem in agriculture where the goal is to provide a better understanding of fruit growth where prior information is based on studies conducted overseas, and therefore is potentially misspecified. △ Less

Submitted 8 July, 2024; originally announced July 2024.

Comments: 24 pages, 6 Figures

arXiv:2406.19591 [pdf, other]

Mathematical modelling and uncertainty quantification for analysis of biphasic coral reef recovery patterns

Authors: David J. Warne, Kerryn Crossman, Grace E. M. Heron, Jesse A. Sharp, Wang **, Paul Pao-Yen Wu, Matthew J. Simpson, Kerrie Mengersen, Juan-Carlos Ortiz

Abstract: Coral reefs are increasingly subjected to major disturbances threatening the health of marine ecosystems. Substantial research underway to develop intervention strategies that assist reefs in recovery from, and resistance to, inevitable future climate and weather extremes. To assess potential benefits of interventions, mechanistic understanding of coral reef recovery and resistance patterns is ess… ▽ More Coral reefs are increasingly subjected to major disturbances threatening the health of marine ecosystems. Substantial research underway to develop intervention strategies that assist reefs in recovery from, and resistance to, inevitable future climate and weather extremes. To assess potential benefits of interventions, mechanistic understanding of coral reef recovery and resistance patterns is essential. Recent evidence suggests that more than half of the reefs surveyed across the Great Barrier Reef (GBR) exhibit deviations from standard recovery modelling assumptions when the initial coral cover is low ($\leq 10$\%). New modelling is necessary to account for these observed patterns to better inform management strategies. We consider a new model for reef recovery at the coral cover scale that accounts for biphasic recovery patterns. The model is based on a multispecies Richards' growth model that includes a change point in the recovery patterns. Bayesian inference is applied for uncertainty quantification of key parameters for assessing reef health and recovery patterns. This analysis is applied to benthic survey data from the Australian Institute of Marine Sciences (AIMS). We demonstrate agreement between model predictions and data across every recorded recovery trajectory with at least two years of observations following disturbance events occurring between 1992--2020. This new approach will enable new insights into the biological, ecological and environmental factors that contribute to the duration and severity of biphasic coral recovery patterns across the GBR. These new insights will help to inform managements and monitoring practice to mitigate the impacts of climate change on coral reefs. △ Less

Submitted 27 June, 2024; originally announced June 2024.

MSC Class: 62P12 (Primary)

arXiv:2406.16296 [pdf, other]

Likelihood-based inference, identifiability and prediction using count data from lattice-based random walk models

Authors: Yihan Liu, David J Warne, Matthew J Simpson

Abstract: In vitro cell biology experiments are routinely used to characterize cell migration properties under various experimental conditions. These experiments can be interpreted using lattice-based random walk models to provide insight into underlying biological mechanisms, and continuum limit partial differential equation (PDE) descriptions of the stochastic models can be used to efficiently explore mod… ▽ More In vitro cell biology experiments are routinely used to characterize cell migration properties under various experimental conditions. These experiments can be interpreted using lattice-based random walk models to provide insight into underlying biological mechanisms, and continuum limit partial differential equation (PDE) descriptions of the stochastic models can be used to efficiently explore model properties instead of relying on repeated stochastic simulations. Working with efficient PDE models is of high interest for parameter estimation algorithms that typically require a large number of forward model simulations. Quantitative data from cell biology experiments usually involves non-negative cell counts in different regions of the experimental images, and it is not obvious how to relate finite, noisy count data to the solutions of continuous PDE models that correspond to noise-free density profiles. In this work we illustrate how to develop and implement likelihood-based methods for parameter estimation, parameter identifiability and model prediction for lattice-based models describing collective migration with an arbitrary number of interacting subpopulations. We implement a standard additive Gaussian measurement error model as well as a new physically-motivated multinomial measurement error model that relates noisy count data with the solution of continuous PDE models. Both measurement error models lead to similar outcomes for parameter estimation and parameter identifiability, whereas the standard additive Gaussian measurement error model leads to non-physical prediction outcomes. In contrast, the new multinomial measurement error model involves a lower computational overhead for parameter estimation and identifiability analysis, as well as leading to physically meaningful model predictions. △ Less

Submitted 23 June, 2024; originally announced June 2024.

Comments: 34 pages, 7 figures

MSC Class: 92B99; 82M99; 62M99

arXiv:2404.13557 [pdf, other]

Preconditioned Neural Posterior Estimation for Likelihood-free Inference

Authors: Xiaoyu Wang, Ryan P. Kelly, David J. Warne, Christopher Drovandi

Abstract: Simulation based inference (SBI) methods enable the estimation of posterior distributions when the likelihood function is intractable, but where model simulation is feasible. Popular neural approaches to SBI are the neural posterior estimator (NPE) and its sequential version (SNPE). These methods can outperform statistical SBI approaches such as approximate Bayesian computation (ABC), particularly… ▽ More Simulation based inference (SBI) methods enable the estimation of posterior distributions when the likelihood function is intractable, but where model simulation is feasible. Popular neural approaches to SBI are the neural posterior estimator (NPE) and its sequential version (SNPE). These methods can outperform statistical SBI approaches such as approximate Bayesian computation (ABC), particularly for relatively small numbers of model simulations. However, we show in this paper that the NPE methods are not guaranteed to be highly accurate, even on problems with low dimension. In such settings the posterior cannot be accurately trained over the prior predictive space, and even the sequential extension remains sub-optimal. To overcome this, we propose preconditioned NPE (PNPE) and its sequential version (PSNPE), which uses a short run of ABC to effectively eliminate regions of parameter space that produce large discrepancy between simulations and data and allow the posterior emulator to be more accurately trained. We present comprehensive empirical evidence that this melding of neural and statistical SBI methods improves performance over a range of examples, including a motivating example involving a complex agent-based model applied to real tumour growth data. △ Less

Submitted 21 April, 2024; originally announced April 2024.

Comments: 31 pages, 11 figures

arXiv:2309.09452 [pdf, other]

doi 10.1016/j.ecolind.2024.111828

Beyond expected values: Making environmental decisions using value of information analysis when measurement outcome matters

Authors: Morenikeji D. Akinlotan, David J. Warne, Kate J. Helmstedt, Sarah A. Vollert, Iadine Chadès, Ryan F. Heneghan, Hui Xiao, Matthew P. Adams

Abstract: In ecological and environmental contexts, management actions must sometimes be chosen urgently. Value of information (VoI) analysis provides a quantitative toolkit for projecting the improved management outcomes expected after making additional measurements. However, traditional VoI analysis reports metrics as expected values (i.e. risk-neutral). This can be problematic because expected values hid… ▽ More In ecological and environmental contexts, management actions must sometimes be chosen urgently. Value of information (VoI) analysis provides a quantitative toolkit for projecting the improved management outcomes expected after making additional measurements. However, traditional VoI analysis reports metrics as expected values (i.e. risk-neutral). This can be problematic because expected values hide uncertainties in projections. The true value of a measurement will only be known after the measurement's outcome is known, leaving large uncertainty in the measurement's value before it is performed. As a result, the expected value metrics produced in traditional VoI analysis may not align with the priorities of a risk-averse decision-maker who wants to avoid low-value measurement outcomes. In the present work, we introduce four new VoI metrics that can address a decision-maker's risk-aversion to different measurement outcomes. We demonstrate the benefits of the new metrics with two ecological case studies for which traditional VoI analysis has been previously applied. Using the new metrics, we also demonstrate a clear mathematical link between the often-separated environmental decision-making disciplines of VoI and optimal design of experiments. This mathematical link has the potential to catalyse future collaborations between ecologists and statisticians to work together to quantitatively address environmental decision-making questions of fundamental importance. Overall, the introduced VoI metrics complement existing metrics to provide decision-makers with a comprehensive view of the value of, and risks associated with, a proposed monitoring or measurement activity. This is critical for improved environmental outcomes when decisions must be urgently made. △ Less

Submitted 14 March, 2024; v1 submitted 17 September, 2023; originally announced September 2023.

Comments: 53 pages, 3 figures

Journal ref: Ecological Indicators 160 (2024) 111828

arXiv:2307.16357 [pdf]

Communicating uncertainty in Indigenous sea Country monitoring with Bayesian statistics: towards more informed decision-making

Authors: Katherine Cure, Diego R Barneche, Martial Depczynski, Rebecca Fisher, David J Warne, James M McGree, Jim Underwood, Frank Weisenberger, Elizabeth Evans-Illidge, Daniel Oades, Azton Howard, Phillip McCarthy, Damon Pyke, Zac Edgar, Rodney Maher, Trevor Sampi, Bardi Jawi Traditional Owners

Abstract: First Nations Australians have a cultural obligation to look after land and sea Country, and Indigenous-partnered science is beginning to drive socially inclusive initiatives in conservation. The Australian Institute of Marine Science has partnered with Indigenous communities in systematically collecting monitoring data to understand the natural variability of ecological communities and better inf… ▽ More First Nations Australians have a cultural obligation to look after land and sea Country, and Indigenous-partnered science is beginning to drive socially inclusive initiatives in conservation. The Australian Institute of Marine Science has partnered with Indigenous communities in systematically collecting monitoring data to understand the natural variability of ecological communities and better inform sea Country management. Monitoring partnerships are centred around the 2-way sharing of Traditional Ecological Knowledge, training in science and technology, and develo** communication products that can be accessed across the broader community. We present a case study with the Bardi Jawi Rangers in northwest Australia focusing on a 3-year co-developed and co-delivered monitoring dataset for culturally important fish in coral reef ecosystems. We show how uncertainty estimated by Bayesian statistics can be incorporated into monitoring indicators and facilitate fuller communication between scientists and First Nations partners about the limitations of monitoring to identify change. △ Less

Submitted 30 July, 2023; originally announced July 2023.

arXiv:2305.10710 [pdf, other]

doi 10.1007/s11222-023-10361-w

Generalised likelihood profiles for models with intractable likelihoods

Authors: David J. Warne, Oliver J. Maclaren, Elliot J. Carr, Matthew J. Simpson, Christopher Drovandi

Abstract: Likelihood profiling is an efficient and powerful frequentist approach for parameter estimation, uncertainty quantification and practical identifiablity analysis. Unfortunately, these methods cannot be easily applied for stochastic models without a tractable likelihood function. Such models are typical in many fields of science, rendering these classical approaches impractical in these settings. T… ▽ More Likelihood profiling is an efficient and powerful frequentist approach for parameter estimation, uncertainty quantification and practical identifiablity analysis. Unfortunately, these methods cannot be easily applied for stochastic models without a tractable likelihood function. Such models are typical in many fields of science, rendering these classical approaches impractical in these settings. To address this limitation, we develop a new approach to generalising the methods of likelihood profiling for situations when the likelihood cannot be evaluated but stochastic simulations of the assumed data generating process are possible. Our approach is based upon recasting developments from generalised Bayesian inference into a frequentist setting. We derive a method for constructing generalised likelihood profiles and calibrating these profiles to achieve desired frequentist coverage for a given coverage level. We demonstrate the performance of our method on realistic examples from the literature and highlight the capability of our approach for the purpose of practical identifability analysis for models with intractable likelihoods. △ Less

Submitted 19 May, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

MSC Class: 62M20 (Primary) 62-08; 62F40 (Secondary)

arXiv:2301.13368 [pdf, other]

Misspecification-robust Sequential Neural Likelihood for Simulation-based Inference

Authors: Ryan P. Kelly, David J. Nott, David T. Frazier, David J. Warne, Chris Drovandi

Abstract: Simulation-based inference techniques are indispensable for parameter estimation of mechanistic and simulable models with intractable likelihoods. While traditional statistical approaches like approximate Bayesian computation and Bayesian synthetic likelihood have been studied under well-specified and misspecified settings, they often suffer from inefficiencies due to wasted model simulations. Neu… ▽ More Simulation-based inference techniques are indispensable for parameter estimation of mechanistic and simulable models with intractable likelihoods. While traditional statistical approaches like approximate Bayesian computation and Bayesian synthetic likelihood have been studied under well-specified and misspecified settings, they often suffer from inefficiencies due to wasted model simulations. Neural approaches, such as sequential neural likelihood (SNL) avoid this wastage by utilising all model simulations to train a neural surrogate for the likelihood function. However, the performance of SNL under model misspecification is unreliable and can result in overconfident posteriors centred around an inaccurate parameter estimate. In this paper, we propose a novel SNL method, which through the incorporation of additional adjustment parameters, is robust to model misspecification and capable of identifying features of the data that the model is not able to recover. We demonstrate the efficacy of our approach through several illustrative examples, where our method gives more accurate point estimates and uncertainty quantification than SNL. △ Less

Submitted 7 March, 2024; v1 submitted 30 January, 2023; originally announced January 2023.

arXiv:2212.09999 [pdf, other]

doi 10.1109/WSC57314.2022.10015326

Robust simulation design for generalized linear models in conditions of heteroscedasticity or correlation

Authors: Andrew Gill, David J. Warne, Antony M. Overstall, Clare McGrory, James M. McGree

Abstract: A meta-model of the input-output data of a computationally expensive simulation is often employed for prediction, optimization, or sensitivity analysis purposes. Fitting is enabled by a designed experiment, and for computationally expensive simulations, the design efficiency is of importance. Heteroscedasticity in simulation output is common, and it is potentially beneficial to induce dependence t… ▽ More A meta-model of the input-output data of a computationally expensive simulation is often employed for prediction, optimization, or sensitivity analysis purposes. Fitting is enabled by a designed experiment, and for computationally expensive simulations, the design efficiency is of importance. Heteroscedasticity in simulation output is common, and it is potentially beneficial to induce dependence through the reuse of pseudo-random number streams to reduce the variance of the meta-model parameter estimators. In this paper, we develop a computational approach to robust design for computer experiments without the need to assume independence or identical distribution of errors. Through explicit inclusion of the variance or correlation structures into the meta-model distribution, either maximum likelihood estimation or generalized estimating equations can be employed to obtain an appropriate Fisher information matrix. Robust designs can then be computationally sought which maximize some relevant summary measure of this matrix, averaged across a prior distribution of any unknown parameters. △ Less

Submitted 20 December, 2022; originally announced December 2022.

MSC Class: 62K05

arXiv:2211.05357 [pdf, other]

Bayesian score calibration for approximate models

Authors: Joshua J Bon, David J Warne, David J Nott, Christopher Drovandi

Abstract: Scientists continue to develop increasingly complex mechanistic models to reflect their knowledge more realistically. Statistical inference using these models can be challenging since the corresponding likelihood function is often intractable and model simulation may be computationally burdensome. Fortunately, in many of these situations, it is possible to adopt a surrogate model or approximate li… ▽ More Scientists continue to develop increasingly complex mechanistic models to reflect their knowledge more realistically. Statistical inference using these models can be challenging since the corresponding likelihood function is often intractable and model simulation may be computationally burdensome. Fortunately, in many of these situations, it is possible to adopt a surrogate model or approximate likelihood function. It may be convenient to conduct Bayesian inference directly with the surrogate, but this can result in bias and poor uncertainty quantification. In this paper we propose a new method for adjusting approximate posterior samples to reduce bias and produce more accurate uncertainty quantification. We do this by optimizing a transform of the approximate posterior that maximizes a scoring rule. Our approach requires only a (fixed) small number of complex model simulations and is numerically stable. We demonstrate good performance of the new method on several examples of increasing complexity. △ Less

Submitted 27 October, 2023; v1 submitted 10 November, 2022; originally announced November 2022.

Comments: 27 pages, 8 figures, 5 tables

arXiv:2112.11971 [pdf, other]

Efficient Multifidelity Likelihood-Free Bayesian Inference with Adaptive Computational Resource Allocation

Authors: Thomas P Prescott, David J Warne, Ruth E Baker

Abstract: Likelihood-free Bayesian inference algorithms are popular methods for calibrating the parameters of complex, stochastic models, required when the likelihood of the observed data is intractable. These algorithms characteristically rely heavily on repeated model simulations. However, whenever the computational cost of simulation is even moderately expensive, the significant burden incurred by likeli… ▽ More Likelihood-free Bayesian inference algorithms are popular methods for calibrating the parameters of complex, stochastic models, required when the likelihood of the observed data is intractable. These algorithms characteristically rely heavily on repeated model simulations. However, whenever the computational cost of simulation is even moderately expensive, the significant burden incurred by likelihood-free algorithms leaves them unviable in many practical applications. The multifidelity approach has been introduced (originally in the context of approximate Bayesian computation) to reduce the simulation burden of likelihood-free inference without loss of accuracy, by using the information provided by simulating computationally cheap, approximate models in place of the model of interest. The first contribution of this work is to demonstrate that multifidelity techniques can be applied in the general likelihood-free Bayesian inference setting. Analytical results on the optimal allocation of computational resources to simulations at different levels of fidelity are derived, and subsequently implemented practically. We provide an adaptive multifidelity likelihood-free inference algorithm that learns the relationships between models at different fidelities and adapts resource allocation accordingly, and demonstrate that this algorithm produces posterior estimates with near-optimal efficiency. △ Less

Submitted 22 December, 2021; originally announced December 2021.

arXiv:2110.14082 [pdf, other]

doi 10.1016/j.jcp.2022.111543

Multifidelity multilevel Monte Carlo to accelerate approximate Bayesian parameter inference for partially observed stochastic processes

Authors: David J. Warne, Thomas P. Prescott, Ruth E. Baker, Matthew J. Simpson

Abstract: Models of stochastic processes are widely used in almost all fields of science. Theory validation, parameter estimation, and prediction all require model calibration and statistical inference using data. However, data are almost always incomplete observations of reality. This leads to a great challenge for statistical inference because the likelihood function will be intractable for almost all par… ▽ More Models of stochastic processes are widely used in almost all fields of science. Theory validation, parameter estimation, and prediction all require model calibration and statistical inference using data. However, data are almost always incomplete observations of reality. This leads to a great challenge for statistical inference because the likelihood function will be intractable for almost all partially observed stochastic processes. This renders many statistical methods, especially within a Bayesian framework, impossible to implement. Therefore, computationally expensive likelihood-free approaches are applied that replace likelihood evaluations with realisations of the model and observation process. For accurate inference, however, likelihood-free techniques may require millions of expensive stochastic simulations. To address this challenge, we develop a new method based on recent advances in multilevel and multifidelity. Our approach combines the multilevel Monte Carlo telesco** summation, applied to a sequence of approximate Bayesian posterior targets, with a multifidelity rejection sampler to minimise the number of computationally expensive exact simulations required for accurate inference. We present the derivation of our new algorithm for likelihood-free Bayesian inference, discuss practical implementation details, and demonstrate substantial performance improvements. Using examples from systems biology, we demonstrate improvements of more than two orders of magnitude over standard rejection sampling techniques. Our approach is generally applicable to accelerate other sampling schemes, such as sequential Monte Carlo, to enable feasible Bayesian analysis for realistic practical applications in physics, chemistry, biology, epidemiology, ecology and economics. △ Less

Submitted 1 June, 2022; v1 submitted 26 October, 2021; originally announced October 2021.

MSC Class: 65C05

arXiv:1912.12404 [pdf, other]

doi 10.1016/j.jtbi.2020.110255

A practical guide to pseudo-marginal methods for computational inference in systems biology

Authors: David J. Warne, Ruth E. Baker, Matthew J. Simpson

Abstract: For many stochastic models of interest in systems biology, such as those describing biochemical reaction networks, exact quantification of parameter uncertainty through statistical inference is intractable. Likelihood-free computational inference techniques enable parameter inference when the likelihood function for the model is intractable but the generation of many sample paths is feasible throu… ▽ More For many stochastic models of interest in systems biology, such as those describing biochemical reaction networks, exact quantification of parameter uncertainty through statistical inference is intractable. Likelihood-free computational inference techniques enable parameter inference when the likelihood function for the model is intractable but the generation of many sample paths is feasible through stochastic simulation of the forward problem. The most common likelihood-free method in systems biology is approximate Bayesian computation that accepts parameters that result in low discrepancy between stochastic simulations and measured data. However, it can be difficult to assess how the accuracy of the resulting inferences are affected by the choice of acceptance threshold and discrepancy function. The pseudo-marginal approach is an alternative likelihood-free inference method that utilises a Monte Carlo estimate of the likelihood function. This approach has several advantages, particularly in the context of noisy, partially observed, time-course data typical in biochemical reaction network studies. Specifically, the pseudo-marginal approach facilitates exact inference and uncertainty quantification, and may be efficiently combined with particle filters for low variance, high-accuracy likelihood estimation. In this review, we provide a practical introduction to the pseudo-marginal approach using inference for biochemical reaction networks as a series of case studies. Implementations of key algorithms and examples are provided using the Julia programming language; a high performance, open source programming language for scientific computing. △ Less

Submitted 28 December, 2019; originally announced December 2019.

MSC Class: 92C42 (Primary) 62F15; 97K80 (Secondary)

arXiv:1909.06540 [pdf, other]

doi 10.1080/10618600.2021.2000419

Rapid Bayesian inference for expensive stochastic models

Authors: David J. Warne, Ruth E. Baker, Matthew J. Simpson

Abstract: Almost all fields of science rely upon statistical inference to estimate unknown parameters in theoretical and computational models. While the performance of modern computer hardware continues to grow, the computational requirements for the simulation of models are growing even faster. This is largely due to the increase in model complexity, often including stochastic dynamics, that is necessary t… ▽ More Almost all fields of science rely upon statistical inference to estimate unknown parameters in theoretical and computational models. While the performance of modern computer hardware continues to grow, the computational requirements for the simulation of models are growing even faster. This is largely due to the increase in model complexity, often including stochastic dynamics, that is necessary to describe and characterize phenomena observed using modern, high resolution, experimental techniques. Such models are rarely analytically tractable, meaning that extremely large numbers of stochastic simulations are required for parameter inference. In such cases, parameter inference can be practically impossible. In this work, we present new computational Bayesian techniques that accelerate inference for expensive stochastic models by using computationally inexpensive approximations to inform feasible regions in parameter space, and through learning transforms that adjust the biased approximate inferences to closer represent the correct inferences under the expensive stochastic model. Using topical examples from ecology and cell biology, we demonstrate a speed improvement of an order of magnitude without any loss in accuracy. This represents a substantial improvement over current state-of-the-art methods for Bayesian computations when appropriate model approximations are available. △ Less

Submitted 22 February, 2021; v1 submitted 14 September, 2019; originally announced September 2019.

arXiv:1902.09046 [pdf, ps, other]

doi 10.1214/21-BA1265

Vector operations for accelerating expensive Bayesian computations -- a tutorial guide

Authors: David J. Warne, Scott A. Sisson, Christopher Drovandi

Abstract: Many applications in Bayesian statistics are extremely computationally intensive. However, they are often inherently parallel, making them prime targets for modern massively parallel processors. Multi-core and distributed computing is widely applied in the Bayesian community, however, very little attention has been given to fine-grain parallelisation using single instruction multiple data (SIMD) o… ▽ More Many applications in Bayesian statistics are extremely computationally intensive. However, they are often inherently parallel, making them prime targets for modern massively parallel processors. Multi-core and distributed computing is widely applied in the Bayesian community, however, very little attention has been given to fine-grain parallelisation using single instruction multiple data (SIMD) operations that are available on most modern commodity CPUs and is the basis of GPGPU computing. In this work, we practically demonstrate, using standard programming libraries, the utility of the SIMD approach for several topical Bayesian applications. We show that SIMD can improve the floating point arithmetic performance resulting in up to $6\times$ improvement in serial algorithm performance. Importantly, these improvements are multiplicative to any gains achieved through multi-core processing. We illustrate the potential of SIMD for accelerating Bayesian computations and provide the reader with techniques for exploiting modern massively parallel processing environments using standard tools. △ Less

Submitted 14 December, 2020; v1 submitted 24 February, 2019; originally announced February 2019.

MSC Class: 62F15; 62C10; 68W10; 65Y05;

arXiv:1812.05759 [pdf, other]

doi 10.1098/rsif.2018.0943

Simulation and inference algorithms for stochastic biochemical reaction networks: from basic concepts to state-of-the-art

Authors: David J. Warne, Ruth E. Baker, Matthew J. Simpson

Abstract: Stochasticity is a key characteristic of intracellular processes such as gene regulation and chemical signalling. Therefore, characterising stochastic effects in biochemical systems is essential to understand the complex dynamics of living things. Mathematical idealisations of biochemically reacting systems must be able to capture stochastic phenomena. While robust theory exists to describe such s… ▽ More Stochasticity is a key characteristic of intracellular processes such as gene regulation and chemical signalling. Therefore, characterising stochastic effects in biochemical systems is essential to understand the complex dynamics of living things. Mathematical idealisations of biochemically reacting systems must be able to capture stochastic phenomena. While robust theory exists to describe such stochastic models, the computational challenges in exploring these models can be a significant burden in practice since realistic models are analytically intractable. Determining the expected behaviour and variability of a stochastic biochemical reaction network requires many probabilistic simulations of its evolution. Using a biochemical reaction network model to assist in the interpretation of time course data from a biological experiment is an even greater challenge due to the intractability of the likelihood function for determining observation probabilities. These computational challenges have been subjects of active research for over four decades. In this review, we present an accessible discussion of the major historical developments and state-of-the-art computational techniques relevant to simulation and inference problems for stochastic biochemical reaction network models. Detailed algorithms for particularly important methods are described and complemented with MATLAB implementations. As a result, this review provides a practical and accessible introduction to computational methods for stochastic models within the life sciences community. △ Less

Submitted 29 January, 2019; v1 submitted 13 December, 2018; originally announced December 2018.

arXiv:1709.05059 [pdf, ps, other]

doi 10.1016/j.bpj.2017.09.016

Optimal quantification of contact inhibition in cell populations

Authors: David J. Warne, Ruth E. Baker, Matthew J. Simpson

Abstract: Contact inhibition refers to a reduction in the rate of cell migration and/or cell proliferation in regions of high cell density. Under normal conditions contact inhibition is associated with the proper functioning tissues, whereas abnormal regulation of contact inhibition is associated with pathological conditions, such as tumor spreading. Unfortunately, standard mathematical modeling practices m… ▽ More Contact inhibition refers to a reduction in the rate of cell migration and/or cell proliferation in regions of high cell density. Under normal conditions contact inhibition is associated with the proper functioning tissues, whereas abnormal regulation of contact inhibition is associated with pathological conditions, such as tumor spreading. Unfortunately, standard mathematical modeling practices mask the importance of parameters that control contact inhibition through scaling arguments. Furthermore, standard experimental protocols are insufficient to quantify the effects of contact inhibition because they focus on data describing early time, low-density dynamics only. Here we use the logistic growth equation as a caricature model of contact inhibition to make recommendations as to how to best mitigate these issues. Taking a Bayesian approach we quantify the trade-off between different features of experimental design and estimates of parameter uncertainty so that we can re-formulate a standard cell proliferation assay to provide estimates of both the low-density intrinsic growth rate, $λ$, and the carrying capacity density, $K$, which is a measure of contact inhibition. △ Less

Submitted 15 September, 2017; originally announced September 2017.

MSC Class: 92C37

arXiv:1702.03126 [pdf, ps, other]

doi 10.1016/j.csda.2018.02.009

Multilevel rejection sampling for approximate Bayesian computation

Authors: David J. Warne, Ruth E. Baker, Matthew J. Simpson

Abstract: Likelihood-free methods, such as approximate Bayesian computation, are powerful tools for practical inference problems with intractable likelihood functions. Markov chain Monte Carlo and sequential Monte Carlo variants of approximate Bayesian computation can be effective techniques for sampling posterior distributions in an approximate Bayesian computation setting. However, without careful conside… ▽ More Likelihood-free methods, such as approximate Bayesian computation, are powerful tools for practical inference problems with intractable likelihood functions. Markov chain Monte Carlo and sequential Monte Carlo variants of approximate Bayesian computation can be effective techniques for sampling posterior distributions in an approximate Bayesian computation setting. However, without careful consideration of convergence criteria and selection of proposal kernels, such methods can lead to very biased inference or computationally inefficient sampling. In contrast, rejection sampling for approximate Bayesian computation, despite being computationally intensive, results in independent, identically distributed samples from the approximated posterior. An alternative method is proposed for the acceleration of likelihood-free Bayesian inference that applies multilevel Monte Carlo variance reduction techniques directly to rejection sampling. The resulting method retains the accuracy advantages of rejection sampling while significantly improving the computational efficiency. △ Less

Submitted 28 February, 2018; v1 submitted 10 February, 2017; originally announced February 2017.

MSC Class: 62F15; 65C05

Showing 1–18 of 18 results for author: Warne, D J