-
The Lifebelt Particle Filter for robust estimation from low-valued count data
Authors:
Alice Corbella,
Trevelyan J. McKinley,
Paul J. Birrell,
Anne M. Presanis,
Simon E. F. Spencer,
Gareth O. Roberts,
Daniela De Angelis
Abstract:
Particle filtering methods are well developed for continuous state-space models. When dealing with discrete spaces on bounded domains, particle filtering methods can still be applied to sample from and marginalise over the unknown hidden states. Nevertheless, problems such as particle degradation can arise in this context and be even more severe than they are within the continuous-state domain: pr…
▽ More
Particle filtering methods are well developed for continuous state-space models. When dealing with discrete spaces on bounded domains, particle filtering methods can still be applied to sample from and marginalise over the unknown hidden states. Nevertheless, problems such as particle degradation can arise in this context and be even more severe than they are within the continuous-state domain: proposed particles can easily be incompatible with the data and the discrete system could often result in all particles having weights of zero. However, if the boundaries of the discrete hidden space are known, then these could be used to prevent particle collapse. In this paper we introduce the Lifebelt Particle Filter (LBPF), a novel method for robust likelihood estimation when low-valued count data arise. The LBPF combines a standard particle filter with one (or more) \textit{lifebelt particles} which, by construction, will tend not to be incompatible with the data. A mixture of resampled and non-resampled particles allows for the preservation of the lifebelt particle, which, together with the remaining particle swarm, provides samples from the filtering distribution, and can be used to generate estimates of the likelihood. The LBPF can be used within a pseudo-marginal scheme to draw inference on static parameters, $ \boldsymbolθ $, governing a discrete state-space model with low-valued counts. We present here the applied case estimating a parameter governing probabilities and timings of deaths and recoveries of hospitalised patients during an epidemic.
△ Less
Submitted 8 December, 2022;
originally announced December 2022.
-
Emulation and History Matching using the hmer Package
Authors:
Andrew Iskauskas,
Ian Vernon,
Michael Goldstein,
Danny Scarponi,
Trevelyan J. McKinley,
Richard G. White,
Nicky McCreesh
Abstract:
Modelling complex real-world situations such as infectious diseases, geological phenomena, and biological processes can present a dilemma: the computer model (referred to as a simulator) needs to be complex enough to capture the dynamics of the system, but each increase in complexity increases the evaluation time of such a simulation, making it difficult to obtain an informative description of par…
▽ More
Modelling complex real-world situations such as infectious diseases, geological phenomena, and biological processes can present a dilemma: the computer model (referred to as a simulator) needs to be complex enough to capture the dynamics of the system, but each increase in complexity increases the evaluation time of such a simulation, making it difficult to obtain an informative description of parameter choices that would be consistent with observed reality. While methods for identifying acceptable matches to real-world observations exist, for example optimisation or Markov chain Monte Carlo methods, they may result in non-robust inferences or may be infeasible for computationally intensive simulators. The techniques of emulation and history matching can make such determinations feasible, efficiently identifying regions of parameter space that produce acceptable matches to data while also providing valuable information about the simulator's structure, but the mathematical considerations required to perform emulation can present a barrier for makers and users of such simulators compared to other methods. The hmer package provides an accessible framework for using history matching and emulation on simulator data, leveraging the computational efficiency of the approach while enabling users to easily match to, visualise, and robustly predict from their complex simulators.
△ Less
Submitted 14 December, 2023; v1 submitted 12 September, 2022;
originally announced September 2022.
-
Efficient Bayesian model selection for coupled hidden Markov models with application to infectious diseases
Authors:
Jake Carson,
Trevelyan J. McKinley,
Peter Neal,
Simon E. F. Spencer
Abstract:
Performing model selection for coupled hidden Markov models (CHMMs) is highly challenging, owing to the large dimension of the hidden state process. Whilst in principle the hidden state process can be marginalized out via forward filtering, in practice the computational cost of doing so increases exponentially with the number of coupled Markov chains, making this approach infeasible in most applic…
▽ More
Performing model selection for coupled hidden Markov models (CHMMs) is highly challenging, owing to the large dimension of the hidden state process. Whilst in principle the hidden state process can be marginalized out via forward filtering, in practice the computational cost of doing so increases exponentially with the number of coupled Markov chains, making this approach infeasible in most applications. Monte Carlo methods can be utilized, but despite many remarkable developments in model selection methodology, generic approaches continue to be ill-suited for such high-dimensional problems. Here we develop specialized solutions for CHMMs with weak inter-chain dependencies. Specifically we construct effective proposal distributions for the hidden state process that remain computationally viable as the number of chains increases, and that require little user input or tuning. This methodology is particularly applicable to individual-level infectious disease models characterized as CHMMs, in which each chain represents an individual, and the coupling represents contact between individuals. Since the only significant contacts are between susceptible and infectious individuals, and since multiple infection pathways are often possible, the resulting CHMMs naturally have low inter-chain dependencies. We demonstrate the utility of our methodology with an application to a study of highly pathogenic avian influenza in chickens.
△ Less
Submitted 25 May, 2021;
originally announced May 2021.
-
Key Questions for Modelling COVID-19 Exit Strategies
Authors:
Robin N Thompson,
T Deirdre Hollingsworth,
Valerie Isham,
Daniel Arribas-Bel,
Ben Ashby,
Tom Britton,
Peter Challoner,
Lauren H K Chappell,
Hannah Clapham,
Nik J Cunniffe,
A Philip Dawid,
Christl A Donnelly,
Rosalind Eggo,
Sebastian Funk,
Nigel Gilbert,
Julia R Gog,
Paul Glendinning,
William S Hart,
Hans Heesterbeek,
Thomas House,
Matt Keeling,
Istvan Z Kiss,
Mirjam Kretzschmar,
Alun L Lloyd,
Emma S McBryde
, et al. (18 additional authors not shown)
Abstract:
Combinations of intense non-pharmaceutical interventions ('lockdowns') were introduced in countries worldwide to reduce SARS-CoV-2 transmission. Many governments have begun to implement lockdown exit strategies that allow restrictions to be relaxed while attempting to control the risk of a surge in cases. Mathematical modelling has played a central role in guiding interventions, but the challenge…
▽ More
Combinations of intense non-pharmaceutical interventions ('lockdowns') were introduced in countries worldwide to reduce SARS-CoV-2 transmission. Many governments have begun to implement lockdown exit strategies that allow restrictions to be relaxed while attempting to control the risk of a surge in cases. Mathematical modelling has played a central role in guiding interventions, but the challenge of designing optimal exit strategies in the face of ongoing transmission is unprecedented. Here, we report discussions from the Isaac Newton Institute 'Models for an exit strategy' workshop (11-15 May 2020). A diverse community of modellers who are providing evidence to governments worldwide were asked to identify the main questions that, if answered, will allow for more accurate predictions of the effects of different exit strategies. Based on these questions, we propose a roadmap to facilitate the development of reliable models to guide exit strategies. The roadmap requires a global collaborative effort from the scientific community and policy-makers, and is made up of three parts: i) improve estimation of key epidemiological parameters; ii) understand sources of heterogeneity in populations; iii) focus on requirements for data collection, particularly in Low-to-Middle-Income countries. This will provide important information for planning exit strategies that balance socio-economic benefits with public health.
△ Less
Submitted 21 July, 2020; v1 submitted 21 June, 2020;
originally announced June 2020.
-
Model comparison with missing data using MCMC and importance sampling
Authors:
Panayiota Touloupou,
Naif Alzahrani,
Peter Neal,
Simon E. F. Spencer,
Trevelyan J. McKinley
Abstract:
Selecting between competing statistical models is a challenging problem especially when the competing models are non-nested. In this paper we offer a simple solution by devising an algorithm which combines MCMC and importance sampling to obtain computationally efficient estimates of the marginal likelihood which can then be used to compare the models. The algorithm is successfully applied to longi…
▽ More
Selecting between competing statistical models is a challenging problem especially when the competing models are non-nested. In this paper we offer a simple solution by devising an algorithm which combines MCMC and importance sampling to obtain computationally efficient estimates of the marginal likelihood which can then be used to compare the models. The algorithm is successfully applied to longitudinal epidemic and time series data sets and shown to outperform existing methods for computing the marginal likelihood.
△ Less
Submitted 15 December, 2015;
originally announced December 2015.
-
Bayesian Model Choice in Cumulative Link Ordinal Regression Models
Authors:
Trevelyan J. McKinley,
Michelle Morters,
James L. N. Wood
Abstract:
The use of the proportional odds (PO) model for ordinal regression is ubiquitous in the literature. If the assumption of parallel lines does not hold for the data, then an alternative is to specify a non-proportional odds (NPO) model, where the regression parameters are allowed to vary depending on the level of the response. However, it is often difficult to fit these models, and challenges regard…
▽ More
The use of the proportional odds (PO) model for ordinal regression is ubiquitous in the literature. If the assumption of parallel lines does not hold for the data, then an alternative is to specify a non-proportional odds (NPO) model, where the regression parameters are allowed to vary depending on the level of the response. However, it is often difficult to fit these models, and challenges regarding model choice and fitting are further compounded if there are a large number of explanatory variables. We make two contributions towards tackling these issues: firstly, we develop a Bayesian method for fitting these models, that ensures the stochastic ordering conditions hold for an arbitrary finite range of the explanatory variables, allowing NPO models to be fitted to any observed data set. Secondly, we use reversible-jump Markov chain Monte Carlo to allow the model to choose between PO and NPO structures for each explanatory variable, and show how variable selection can be incorporated. These methods can be adapted for any monotonic increasing link functions. We illustrate the utility of these approaches on novel data from a longitudinal study of individual-level risk factors affecting body condition score in a dog population in Zenzele, South Africa.
△ Less
Submitted 26 March, 2015;
originally announced March 2015.