-
Sample-efficient neural likelihood-free Bayesian inference of implicit HMMs
Authors:
Sanmitra Ghosh,
Paul J. Birrell,
Daniela De Angelis
Abstract:
Likelihood-free inference methods based on neural conditional density estimation were shown to drastically reduce the simulation burden in comparison to classical methods such as ABC. When applied in the context of any latent variable model, such as a Hidden Markov model (HMM), these methods are designed to only estimate the parameters, rather than the joint distribution of the parameters and the…
▽ More
Likelihood-free inference methods based on neural conditional density estimation were shown to drastically reduce the simulation burden in comparison to classical methods such as ABC. When applied in the context of any latent variable model, such as a Hidden Markov model (HMM), these methods are designed to only estimate the parameters, rather than the joint distribution of the parameters and the hidden states. Naive application of these methods to a HMM, ignoring the inference of this joint posterior distribution, will thus produce an inaccurate estimate of the posterior predictive distribution, in turn hampering the assessment of goodness-of-fit. To rectify this problem, we propose a novel, sample-efficient likelihood-free method for estimating the high-dimensional hidden states of an implicit HMM. Our approach relies on learning directly the intractable posterior distribution of the hidden states, using an autoregressive-flow, by exploiting the Markov property. Upon evaluating our approach on some implicit HMMs, we found that the quality of the estimates retrieved using our method is comparable to what can be achieved using a much more computationally expensive SMC algorithm.
△ Less
Submitted 2 May, 2024;
originally announced May 2024.
-
The NOSTRA model: coherent estimation of infection sources in the case of possible nosocomial transmission
Authors:
David J Pascall,
Chris Jackson,
Stephanie Evans,
Theodore Gouliouris,
Chris Illingworth,
Stefan Piatek,
Julie V Robotham,
Oliver Stirrup,
Ben Warne,
Judith Breuer,
Daniela De Angelis
Abstract:
Nosocomial infections have important consequences for patients and hospital staff: they worsen patient outcomes and their management stresses already overburdened health systems. Accurate judgements of whether an infection is nosocomial helps staff make appropriate choices to protect other patients within the hospital. Nosocomiality cannot be properly assessed without considering whether the infec…
▽ More
Nosocomial infections have important consequences for patients and hospital staff: they worsen patient outcomes and their management stresses already overburdened health systems. Accurate judgements of whether an infection is nosocomial helps staff make appropriate choices to protect other patients within the hospital. Nosocomiality cannot be properly assessed without considering whether the infected patient came into contact with high risk potential infectors within the hospital. We developed a Bayesian model that integrates epidemiological, contact and pathogen genetic data to determine how likely an infection is to be nosocomial and the probability of given infection candidates being the source of the infection.
△ Less
Submitted 22 January, 2024;
originally announced January 2024.
-
The Lifebelt Particle Filter for robust estimation from low-valued count data
Authors:
Alice Corbella,
Trevelyan J. McKinley,
Paul J. Birrell,
Anne M. Presanis,
Simon E. F. Spencer,
Gareth O. Roberts,
Daniela De Angelis
Abstract:
Particle filtering methods are well developed for continuous state-space models. When dealing with discrete spaces on bounded domains, particle filtering methods can still be applied to sample from and marginalise over the unknown hidden states. Nevertheless, problems such as particle degradation can arise in this context and be even more severe than they are within the continuous-state domain: pr…
▽ More
Particle filtering methods are well developed for continuous state-space models. When dealing with discrete spaces on bounded domains, particle filtering methods can still be applied to sample from and marginalise over the unknown hidden states. Nevertheless, problems such as particle degradation can arise in this context and be even more severe than they are within the continuous-state domain: proposed particles can easily be incompatible with the data and the discrete system could often result in all particles having weights of zero. However, if the boundaries of the discrete hidden space are known, then these could be used to prevent particle collapse. In this paper we introduce the Lifebelt Particle Filter (LBPF), a novel method for robust likelihood estimation when low-valued count data arise. The LBPF combines a standard particle filter with one (or more) \textit{lifebelt particles} which, by construction, will tend not to be incompatible with the data. A mixture of resampled and non-resampled particles allows for the preservation of the lifebelt particle, which, together with the remaining particle swarm, provides samples from the filtering distribution, and can be used to generate estimates of the likelihood. The LBPF can be used within a pseudo-marginal scheme to draw inference on static parameters, $ \boldsymbolθ $, governing a discrete state-space model with low-valued counts. We present here the applied case estimating a parameter governing probabilities and timings of deaths and recoveries of hospitalised patients during an epidemic.
△ Less
Submitted 8 December, 2022;
originally announced December 2022.
-
An approximate diffusion process for environmental stochasticity in infectious disease transmission modelling
Authors:
Sanmitra Ghosh,
Paul J. Birrell,
Daniela De Angelis
Abstract:
Modelling the transmission dynamics of an infectious disease is a complex task. Not only it is difficult to accurately model the inherent non-stationarity and heterogeneity of transmission, but it is nearly impossible to describe, mechanistically, changes in extrinsic environmental factors including public behaviour and seasonal fluctuations. An elegant approach to capturing environmental stochast…
▽ More
Modelling the transmission dynamics of an infectious disease is a complex task. Not only it is difficult to accurately model the inherent non-stationarity and heterogeneity of transmission, but it is nearly impossible to describe, mechanistically, changes in extrinsic environmental factors including public behaviour and seasonal fluctuations. An elegant approach to capturing environmental stochasticity is to model the force of infection as a stochastic process. However, inference in this context requires solving a computationally expensive ``missing data" problem, using data-augmentation techniques. We propose to model the time-varying transmission-potential as an approximate diffusion process using a path-wise series expansion of Brownian motion. This approximation replaces the ``missing data" imputation step with the inference of the expansion coefficients: a simpler and computationally cheaper task. We illustrate the merit of this approach through two examples: modelling influenza using a canonical SIR model, and the modelling of COVID-19 pandemic using a multi-type SEIR model.
△ Less
Submitted 30 August, 2022;
originally announced August 2022.
-
Inferring Epidemics from Multiple Dependent Data via Pseudo-Marginal Methods
Authors:
Alice Corbella,
Anne M Presanis,
Paul J Birrell,
Daniela De Angelis
Abstract:
Health-policy planning requires evidence on the burden that epidemics place on healthcare systems. Multiple, often dependent, datasets provide a noisy and fragmented signal from the unobserved epidemic process including transmission and severity dynamics. This paper explores important challenges to the use of state-space models for epidemic inference when multiple dependent datasets are analysed.…
▽ More
Health-policy planning requires evidence on the burden that epidemics place on healthcare systems. Multiple, often dependent, datasets provide a noisy and fragmented signal from the unobserved epidemic process including transmission and severity dynamics. This paper explores important challenges to the use of state-space models for epidemic inference when multiple dependent datasets are analysed. We propose a new semi-stochastic model that exploits deterministic approximations for large-scale transmission dynamics while retaining stochasticity in the occurrence and reporting of relatively rare severe events. This model is suitable for many real-time situations including large seasonal epidemics and pandemics. Within this context, we develop algorithms to provide exact parameter inference and test them via simulation. Finally, we apply our joint model and the proposed algorithm to several surveillance data on the 2017-18 influenza epidemic in England to reconstruct transmission dynamics and estimate the daily new influenza infections as well as severity indicators as the case-hospitalisation risk and the hospital-intensive care risk.
△ Less
Submitted 19 April, 2022;
originally announced April 2022.
-
A comparison of two frameworks for multi-state modelling, applied to outcomes after hospital admissions with COVID-19
Authors:
Christopher Jackson,
Brian Tom,
Peter Kirwan,
Sema Mandal,
Shaun Seaman,
Kevin Kunzmann,
Anne Presanis,
Daniela De Angelis
Abstract:
We compare two multi-state modelling frameworks that can be used to represent dates of events following hospital admission for people infected during an epidemic. The methods are applied to data from people admitted to hospital with COVID-19, to estimate the probability of admission to ICU, the probability of death in hospital for patients before and after ICU admission, the lengths of stay in hos…
▽ More
We compare two multi-state modelling frameworks that can be used to represent dates of events following hospital admission for people infected during an epidemic. The methods are applied to data from people admitted to hospital with COVID-19, to estimate the probability of admission to ICU, the probability of death in hospital for patients before and after ICU admission, the lengths of stay in hospital, and how all these vary with age and gender. One modelling framework is based on defining transition-specific hazard functions for competing risks. A less commonly used framework defines partially-latent subpopulations who will experience each subsequent event, and uses a mixture model to estimate the probability that an individual will experience each event, and the distribution of the time to the event given that it occurs. We compare the advantages and disadvantages of these two frameworks, in the context of the COVID-19 example. The issues include the interpretation of the model parameters, the computational efficiency of estimating the quantities of interest, implementation in software and assessing goodness of fit. In the example, we find that some groups appear to be at very low risk of some events, in particular ICU admission, and these are best represented by using "cure-rate" models to define transition-specific hazards. We provide general-purpose software to implement all the models we describe in the "flexsurv" R package, which allows arbitrarily-flexible distributions to be used to represent the cause-specific hazards or times to events.
△ Less
Submitted 24 March, 2022;
originally announced March 2022.
-
Trends in COVID-19 hospital outcomes in England before and after vaccine introduction, a cohort study
Authors:
Peter Kirwan,
Andre Charlett,
Paul Birrell,
Suzanne Elgohari,
Russell Hope,
Sema Mandal,
Daniela De Angelis,
Anne Presanis
Abstract:
Widespread vaccination campaigns have changed the landscape for COVID-19, vastly altering symptoms and reducing morbidity and mortality. We estimate trends in mortality by month of admission and vaccination status among those hospitalised with COVID-19 in England between March 2020 to September 2021, controlling for demographic factors and hospital load.
Among 259,727 hospitalised COVID-19 cases…
▽ More
Widespread vaccination campaigns have changed the landscape for COVID-19, vastly altering symptoms and reducing morbidity and mortality. We estimate trends in mortality by month of admission and vaccination status among those hospitalised with COVID-19 in England between March 2020 to September 2021, controlling for demographic factors and hospital load.
Among 259,727 hospitalised COVID-19 cases, 51,948 (20.0%) experienced mortality in hospital. Hospitalised fatality risk ranged from 40.3% (95% confidence interval 39.4-41.3%) in March 2020 to 8.1% (7.2-9.0%) in June 2021. Older individuals and those with multiple co-morbidities were more likely to die or else experienced longer stays prior to discharge. Compared to unvaccinated people, the hazard of hospitalised mortality was 0.71 (0.67-0.77) with a first vaccine dose, and 0.56 (0.52-0.61) with a second vaccine dose. Compared to hospital load at 0-20% of the busiest week, the hazard of hospitalised mortality during periods of peak load (90-100%), was 1.23 (1.12-1.34).
The prognosis for people hospitalised with COVID-19 in England has varied substantially throughout the pandemic and according to case-mix, vaccination, and hospital load. Our estimates provide an indication for demands on hospital resources, and the relationship between hospital burden and outcomes.
△ Less
Submitted 3 August, 2022; v1 submitted 20 December, 2021;
originally announced December 2021.
-
Evaluating the impact of local tracing partnerships on the performance of contact tracing for COVID-19 in England
Authors:
Pantelis Samartsidis,
Shaun R. Seaman,
Abbie Harrison,
Angelos Alexopoulos,
Gareth J. Hughes,
Christopher Rawlinson,
Charlotte Anderson,
Andre Charlett,
Isabel Oliver,
Daniela De Angelis
Abstract:
Assessing the impact of an intervention using time-series observational data on multiple units and outcomes is a frequent problem in many fields of scientific research. In this paper, we present a novel method to estimate intervention effects in such a setting by generalising existing approaches based on the factor analysis model and develo** a Bayesian algorithm for inference. Our method is one…
▽ More
Assessing the impact of an intervention using time-series observational data on multiple units and outcomes is a frequent problem in many fields of scientific research. In this paper, we present a novel method to estimate intervention effects in such a setting by generalising existing approaches based on the factor analysis model and develo** a Bayesian algorithm for inference. Our method is one of the few that can simultaneously: deal with outcomes of mixed type (continuous, binomial, count); increase efficiency in the estimates of the causal effects by jointly modelling multiple outcomes affected by the intervention; easily provide uncertainty quantification for all causal estimands of interest. We use the proposed approach to evaluate the impact that local tracing partnerships (LTP) had on the effectiveness of England's Test and Trace (TT) programme for COVID-19. Our analyses suggest that, overall, LTPs had a small positive impact on TT. However, there is considerable heterogeneity in the estimates of the causal effects over units and time.
△ Less
Submitted 5 October, 2021;
originally announced October 2021.
-
Hospitalisation risk for COVID-19 patients infected with SARS-CoV-2 variant B.1.1.7: cohort analysis
Authors:
Tommy Nyberg,
Katherine A. Twohig,
Ross J. Harris,
Shaun R. Seaman,
Joe Flannagan,
Hester Allen,
Andre Charlett,
Daniela De Angelis,
Gavin Dabrera,
Anne M. Presanis
Abstract:
Objective: To evaluate the relationship between coronavirus disease 2019 (COVID-19) diagnosis with SARS-CoV-2 variant B.1.1.7 (also known as Variant of Concern 202012/01) and the risk of hospitalisation compared to diagnosis with wildtype SARS-CoV-2 variants.
Design: Retrospective cohort, analysed using stratified Cox regression.
Setting: Community-based SARS-CoV-2 testing in England, individu…
▽ More
Objective: To evaluate the relationship between coronavirus disease 2019 (COVID-19) diagnosis with SARS-CoV-2 variant B.1.1.7 (also known as Variant of Concern 202012/01) and the risk of hospitalisation compared to diagnosis with wildtype SARS-CoV-2 variants.
Design: Retrospective cohort, analysed using stratified Cox regression.
Setting: Community-based SARS-CoV-2 testing in England, individually linked with hospitalisation data.
Participants: 839,278 laboratory-confirmed COVID-19 patients, of whom 36,233 had been hospitalised within 14 days, tested between 23rd November 2020 and 31st January 2021 and analysed at a laboratory with an available TaqPath assay that enables assessment of S-gene target failure (SGTF). SGTF is a proxy test for the B.1.1.7 variant. Patient data were stratified by age, sex, ethnicity, deprivation, region of residence, and date of positive test.
Main outcome measures: Hospitalisation between 1 and 14 days after the first positive SARS-CoV-2 test.
Results: 27,710 of 592,409 SGTF patients (4.7%) and 8,523 of 246,869 non-SGTF patients (3.5%) had been hospitalised within 1-14 days. The stratum-adjusted hazard ratio (HR) of hospitalisation was 1.52 (95% confidence interval [CI] 1.47 to 1.57) for COVID-19 patients infected with SGTF variants, compared to those infected with non-SGTF variants. The effect was modified by age (P<0.001), with HRs of 0.93-1.21 for SGTF compared to non-SGTF patients below age 20 years, 1.29 in those aged 20-29, and 1.45-1.65 in age groups 30 years or older.
Conclusions: The results suggest that the risk of hospitalisation is higher for individuals infected with the B.1.1.7 variant compared to wildtype SARS-CoV-2, likely reflecting a more severe disease. The higher severity may be specific to adults above the age of 30.
△ Less
Submitted 29 May, 2021; v1 submitted 12 April, 2021;
originally announced April 2021.
-
Quantifying efficiency gains of innovative designs of two-arm vaccine trials for COVID-19 using an epidemic simulation model
Authors:
Rob Johnson,
Chris Jackson,
Anne Presanis,
Sofia S. Villar,
Daniela De Angelis
Abstract:
Clinical trials of a vaccine during an epidemic face particular challenges, such as the pressure to identify an effective vaccine quickly to control the epidemic, and the effect that time-space-varying infection incidence has on the power of a trial. We illustrate how the operating characteristics of different trial design elements may be evaluated using a network epidemic and trial simulation mod…
▽ More
Clinical trials of a vaccine during an epidemic face particular challenges, such as the pressure to identify an effective vaccine quickly to control the epidemic, and the effect that time-space-varying infection incidence has on the power of a trial. We illustrate how the operating characteristics of different trial design elements may be evaluated using a network epidemic and trial simulation model, based on COVID-19 and individually randomised two-arm trials with a binary outcome. We show that "ring" recruitment strategies, prioritising participants at high risk of infection, can result in substantial improvement in terms of power, if sufficiently many contacts of observed cases are at high risk. In addition, we introduce a novel method to make more efficient use of the data from the earliest cases of infection observed in the trial, whose infection may have been too early to be vaccine-preventable. Finally, we compare several methods of response-adaptive randomisation, discussing their advantages and disadvantages in this two-arm context and identifying particular adaptation strategies that preserve power and estimation properties, while slightly reducing the number of infections, given an effective vaccine.
△ Less
Submitted 20 May, 2021; v1 submitted 13 March, 2021;
originally announced April 2021.
-
Trends in risks of severe events and lengths of stay for COVID-19 hospitalisations in England over the pre-vaccination era: results from the Public Health England SARI-Watch surveillance scheme
Authors:
Peter D. Kirwan,
Suzanne Elgohari,
Christopher H. Jackson,
Brian D. M. Tom,
Sema Mandal,
Daniela De Angelis,
Anne M. Presanis
Abstract:
Background: Trends in hospitalised case-fatality risk (HFR), risk of intensive care unit (ICU) admission and lengths of stay for patients hospitalised for COVID-19 in England over the pre-vaccination era are unknown.
Methods: Data on hospital and ICU admissions with COVID-19 at 31 NHS trusts in England were collected by Public Health England's Severe Acute Respiratory Infections surveillance sys…
▽ More
Background: Trends in hospitalised case-fatality risk (HFR), risk of intensive care unit (ICU) admission and lengths of stay for patients hospitalised for COVID-19 in England over the pre-vaccination era are unknown.
Methods: Data on hospital and ICU admissions with COVID-19 at 31 NHS trusts in England were collected by Public Health England's Severe Acute Respiratory Infections surveillance system and linked to death information. We applied parametric multi-state mixture models, accounting for censored outcomes and regressing risks and times between events on month of admission, geography, and baseline characteristics.
Findings: 20,785 adults were admitted with COVID-19 in 2020. Between March and June/July/August estimated HFR reduced from 31.9% (95% confidence interval 30.3-33.5%) to 10.9% (9.4-12.7%), then rose steadily from 21.6% (18.4-25.5%) in September to 25.7% (23.0-29.2%) in December, with steeper increases among older patients, those with multi-morbidity and outside London/South of England. ICU admission risk reduced from 13.9% (12.8-15.2%) in March to 6.2% (5.3-7.1%) in May, rising to a high of 14.2% (11.1-17.2%) in September. Median length of stay in non-critical care increased during 2020, from 6.6 to 12.3 days for those dying, and from 6.1 to 9.3 days for those discharged.
Interpretation: Initial improvements in patient outcomes, corresponding to developments in clinical practice, were not sustained throughout 2020, with HFR in December approaching the levels seen at the start of the pandemic, whilst median hospital stays have lengthened. The role of increased transmission, new variants, case-mix and hospital pressures in increasing COVID-19 severity requires urgent further investigation.
△ Less
Submitted 22 March, 2021; v1 submitted 8 March, 2021;
originally announced March 2021.
-
Implications for HIV elimination by 2030 of recent trends in undiagnosed infection in England: an evidence synthesis
Authors:
Anne M Presanis,
Peter Kirwan,
Ada Miltz,
Sara Croxford,
Ross Harris,
Ellen Heinsbroek,
Chris Jackson,
Hamish Mohammed,
Alison Brown,
Valerie Delpech,
O Noel Gill,
Daniela De Angelis
Abstract:
A target to eliminate Human Immuno-deficiency Virus (HIV) transmission in England by 2030 was set in early 2019. Estimates of recent trends in HIV prevalence, particularly the number of people living with undiagnosed HIV, by exposure group, ethnicity, gender, age group and region, are essential to monitor progress towards elimination. A Bayesian synthesis of evidence from multiple surveillance, de…
▽ More
A target to eliminate Human Immuno-deficiency Virus (HIV) transmission in England by 2030 was set in early 2019. Estimates of recent trends in HIV prevalence, particularly the number of people living with undiagnosed HIV, by exposure group, ethnicity, gender, age group and region, are essential to monitor progress towards elimination. A Bayesian synthesis of evidence from multiple surveillance, demographic and survey datasets relevant to HIV in England is employed to estimate trends in: the number of people living with HIV (PLWH); the proportion of these people unaware of their HIV infection; and the corresponding prevalence of undiagnosed HIV. All estimates are stratified by exposure group, ethnicity, gender, age group (15-34, 35-44, 45-59, 60-74), region (London, outside London) and year (2012-2017). The total number of PLWH aged 15-74 in England increased from 82,400 (95% credible interval, CrI, 78,700 to 89,100) in 2012 to 89,500 (95% CrI 87,400 to 93,300) in 2017. The proportion diagnosed steadily increased from 84% (95% CrI 77 to 88%) to 92% (95% CrI 89 to 94%) over the same time period, corresponding to a halving in the number of undiagnosed infections from 13,500 (95% CrI 9,800 to 20,200) to 6,900 (95% CrI 4,900 to 10,700). This decrease is equivalent to a halving in prevalence of undiagnosed infection and is reflected in all sub-groups of gay, bisexual and other men who have sex with men and most sub-groups of black African heterosexuals. However, decreases were not detected for some sub-groups of other ethnicity heterosexuals, particularly outside London. In 2016, the Joint United Nations Programme on HIV/ AIDS target of diagnosing 90% of people living with HIV was reached in England. To achieve HIV elimination by 2030, current testing efforts should be enhanced to address the numbers of heterosexuals living with undiagnosed HIV, especially outside London.
△ Less
Submitted 16 December, 2019;
originally announced December 2019.
-
Analysing Multiple Epidemic Data Sources
Authors:
Daniela De Angelis,
Anne M. Presanis
Abstract:
Evidence-based knowledge of infectious disease burden, including prevalence, incidence, severity and transmission, in different population strata and locations, and possibly in real time, is crucial to the planning and evaluation of public health policies. Direct observation of a disease process is rarely possible. However, latent characteristics of an epidemic and its evolution can often be infer…
▽ More
Evidence-based knowledge of infectious disease burden, including prevalence, incidence, severity and transmission, in different population strata and locations, and possibly in real time, is crucial to the planning and evaluation of public health policies. Direct observation of a disease process is rarely possible. However, latent characteristics of an epidemic and its evolution can often be inferred from the synthesis of indirect information from various routine data sources, as well as expert opinion. The simultaneous synthesis of multiple data sources, often conveniently carried out in a Bayesian framework, poses a number of statistical and computational challenges: the heterogeneity in type, relevance and granularity of the data, together with selection and informative observation biases, lead to complex probabilistic models that are difficult to build and fit, and challenging to criticize. Using motivating case studies of influenza, this chapter illustrates the cycle of model development and criticism in the context of Bayesian evidence synthesis, highlighting the challenges of complex model building, computationally efficient inference, and conflicting evidence.
△ Less
Submitted 13 August, 2018;
originally announced August 2018.
-
Assessing the causal effect of binary interventions from observational panel data with few treated units
Authors:
Pantelis Samartsidis,
Shaun R. Seaman,
Anne M. Presanis,
Matthew Hickman,
Daniela De Angelis
Abstract:
Researchers are often challenged with assessing the impact of an intervention on an outcome of interest in situations where the intervention is non-randomised, the intervention is only applied to one or few units, the intervention is binary, and outcome measurements are available at multiple time points. In this paper, we review existing methods for causal inference in these situations. We detail…
▽ More
Researchers are often challenged with assessing the impact of an intervention on an outcome of interest in situations where the intervention is non-randomised, the intervention is only applied to one or few units, the intervention is binary, and outcome measurements are available at multiple time points. In this paper, we review existing methods for causal inference in these situations. We detail the assumptions underlying each method, emphasize connections between the different approaches and provide guidelines regarding their practical implementation. Several open problems are identified thus highlighting the need for future research.
△ Less
Submitted 19 December, 2019; v1 submitted 20 April, 2018;
originally announced April 2018.
-
Evidence synthesis for stochastic epidemic models
Authors:
Paul J Birrell,
Daniela De Angelis,
Anne M Presanis
Abstract:
In recent years the role of epidemic models in informing public health policies has progressively grown. Models have become increasingly realistic and more complex, requiring the use of multiple data sources to estimate all quantities of interest. This review summarises the different types of stochastic epidemic models that use evidence synthesis and highlights current challenges.
In recent years the role of epidemic models in informing public health policies has progressively grown. Models have become increasingly realistic and more complex, requiring the use of multiple data sources to estimate all quantities of interest. This review summarises the different types of stochastic epidemic models that use evidence synthesis and highlights current challenges.
△ Less
Submitted 8 June, 2017;
originally announced June 2017.
-
Exploiting routinely collected severe case data to monitor and predict influenza outbreaks
Authors:
Alice Corbella,
Xu-Sheng Zhang,
Paul J. Birrell,
Nicky Boddington,
Anne M. Presanis,
Richard G. Pebody,
Daniela De Angelis
Abstract:
Influenza remains a significant burden on health systems. Effective responses rely on the timely understanding of the magnitude and the evolution of an outbreak. For monitoring purposes, data on severe cases of influenza in England are reported weekly to Public Health England. These data are both readily available and have the potential to provide valuable information to estimate and predict the k…
▽ More
Influenza remains a significant burden on health systems. Effective responses rely on the timely understanding of the magnitude and the evolution of an outbreak. For monitoring purposes, data on severe cases of influenza in England are reported weekly to Public Health England. These data are both readily available and have the potential to provide valuable information to estimate and predict the key transmission features of seasonal and pandemic influenza. We propose an epidemic model that links the underlying unobserved influenza transmission process to data on severe influenza cases. Within a Bayesian framework, we infer retrospectively the parameters of the epidemic model for each seasonal outbreak from 2012 to 2015, including: the effective reproduction number; the initial susceptibility; the probability of admission to intensive care given infection; and the effect of school closure on transmission. The model is also implemented in real time to assess whether early forecasting of the number of admission to intensive care is possible. Our model of admissions data allows reconstruction of the underlying transmission dynamics revealing: increased transmission during the season 2013/14 and a noticeable effect of Christmas school holiday on disease spread during season 2012/13 and 2014/15. When information on the initial immunity of the population is available, forecasts of the number of admissions to intensive care can be substantially improved. Readily available severe case data can be effectively used to estimate epidemiological characteristics and to predict the evolution of an epidemic, crucially allowing real-time monitoring of the transmission and severity of the outbreak.
△ Less
Submitted 13 November, 2017; v1 submitted 8 June, 2017;
originally announced June 2017.
-
Quantifying the recency of HIV infection using multiple longitudinal biomarkers
Authors:
Loumpiana Koulai,
Anne Presanis,
Gary Murphy,
Barbara Suligoi,
Daniela De Angelis
Abstract:
Knowledge of the time at which an HIV-infected individual seroconverts, when the immune system starts responding to HIV infection, plays a vital role in the design and implementation of interventions to reduce the impact of the HIV epidemic. A number of biomarkers have been developed to distinguish between recent and long-term HIV infection, based on the antibody response to HIV. To quantify the r…
▽ More
Knowledge of the time at which an HIV-infected individual seroconverts, when the immune system starts responding to HIV infection, plays a vital role in the design and implementation of interventions to reduce the impact of the HIV epidemic. A number of biomarkers have been developed to distinguish between recent and long-term HIV infection, based on the antibody response to HIV. To quantify the recency of infection at an individual level, we propose characterising the growth of such biomarkers from observations from a panel of individuals with known seroconversion time, using Bayesian mixed effect models. We combine this knowledge of the growth patterns with observations from a newly diagnosed individual, to estimate the probability seroconversion occurred in the X months prior to diagnosis. We explore, through a simulation study, the characteristics of different biomarkers that affect our ability to estimate recency, such as the growth rate. In particular, we find that predictive ability is improved by using joint models of two biomarkers, accounting for their correlation, rather than univariate models of single biomarkers.
△ Less
Submitted 8 June, 2017;
originally announced June 2017.
-
MultiBUGS: A parallel implementation of the BUGS modelling framework for faster Bayesian inference
Authors:
Robert J. B. Goudie,
Rebecca M. Turner,
Daniela De Angelis,
Andrew Thomas
Abstract:
MultiBUGS (https://www.multibugs.org) is a new version of the general-purpose Bayesian modelling software BUGS that implements a generic algorithm for parallelising Markov chain Monte Carlo (MCMC) algorithms to speed up posterior inference of Bayesian models. The algorithm parallelises evaluation of the product-form likelihoods formed when a parameter has many children in the directed acyclic grap…
▽ More
MultiBUGS (https://www.multibugs.org) is a new version of the general-purpose Bayesian modelling software BUGS that implements a generic algorithm for parallelising Markov chain Monte Carlo (MCMC) algorithms to speed up posterior inference of Bayesian models. The algorithm parallelises evaluation of the product-form likelihoods formed when a parameter has many children in the directed acyclic graph (DAG) representation; and parallelises sampling of conditionally-independent sets of parameters. A heuristic algorithm is used to decide which approach to use for each parameter and to apportion computation across computational cores. This enables MultiBUGS to automatically parallelise the broad range of statistical models that can be fitted using BUGS-language software, making the dramatic speed-ups of modern multi-core computing accessible to applied statisticians, without requiring any experience of parallel programming. We demonstrate the use of MultiBUGS on simulated data designed to mimic a hierarchical e-health linked-data study of methadone prescriptions including 425,112 observations and 20,426 random effects. Posterior inference for the e-health model takes several hours in existing software, but MultiBUGS can perform inference in only 28 minutes using 48 computational cores.
△ Less
Submitted 29 November, 2018; v1 submitted 11 April, 2017;
originally announced April 2017.
-
Value of Information: Sensitivity Analysis and Research Design in Bayesian Evidence Synthesis
Authors:
Christopher Jackson,
Anne Presanis,
Stefano Conti,
Daniela De Angelis
Abstract:
Suppose we have a Bayesian model which combines evidence from several different sources. We want to know which model parameters most affect the estimate or decision from the model, or which of the parameter uncertainties drive the decision uncertainty. Furthermore we want to prioritise what further data should be collected. These questions can be addressed by Value of Information (VoI) analysis, i…
▽ More
Suppose we have a Bayesian model which combines evidence from several different sources. We want to know which model parameters most affect the estimate or decision from the model, or which of the parameter uncertainties drive the decision uncertainty. Furthermore we want to prioritise what further data should be collected. These questions can be addressed by Value of Information (VoI) analysis, in which we estimate expected reductions in loss from learning specific parameters or collecting data of a given design. We describe the theory and practice of VoI for Bayesian evidence synthesis, using and extending ideas from health economics, computer modelling and Bayesian design. The methods are general to a range of decision problems including point estimation and choices between discrete actions. We apply them to a model for estimating prevalence of HIV infection, combining indirect information from several surveys, registers and expert beliefs. This analysis shows which parameters contribute most of the uncertainty about each prevalence estimate, and provides the expected improvements in precision from collecting specific amounts of additional data.
△ Less
Submitted 27 March, 2017;
originally announced March 2017.
-
Conflict diagnostics for evidence synthesis in a multiple testing framework
Authors:
Anne M. Presanis,
David Ohlssen,
Kai Cui,
Magdalena Rosinska,
Daniela De Angelis
Abstract:
Evidence synthesis models that combine multiple datasets of varying design, to estimate quantities that cannot be directly observed, require the formulation of complex probabilistic models that can be expressed as graphical models. An assessment of whether the different datasets synthesised contribute information that is consistent with each other, and in a Bayesian context, with the prior distrib…
▽ More
Evidence synthesis models that combine multiple datasets of varying design, to estimate quantities that cannot be directly observed, require the formulation of complex probabilistic models that can be expressed as graphical models. An assessment of whether the different datasets synthesised contribute information that is consistent with each other, and in a Bayesian context, with the prior distribution, is a crucial component of the model criticism process. However, a systematic assessment of conflict suffers from the multiple testing problem, through testing for conflict at multiple locations in a model. We demonstrate the systematic use of conflict diagnostics, while accounting for the multiple hypothesis tests of no conflict at each location in the graphical model. The method is illustrated by a network meta-analysis to estimate treatment effects in smoking cessation programs and an evidence synthesis to estimate HIV prevalence in Poland.
△ Less
Submitted 13 September, 2017; v1 submitted 23 February, 2017;
originally announced February 2017.
-
Efficient real-time monitoring of an emerging influenza epidemic: how feasible?
Authors:
Paul J Birrell,
Lorenz Wernisch,
Brian D M Tom,
Leonhard Held,
Gareth O Roberts,
Richard G Pebody,
Daniela De Angelis
Abstract:
A prompt public health response to a new epidemic relies on the ability to monitor and predict its evolution in real time as data accumulate. The 2009 A/H1N1 outbreak in the UK revealed pandemic data as noisy, contaminated, potentially biased, and originating from multiple sources. This seriously challenges the capacity for real-time monitoring. Here we assess the feasibility of real-time inferenc…
▽ More
A prompt public health response to a new epidemic relies on the ability to monitor and predict its evolution in real time as data accumulate. The 2009 A/H1N1 outbreak in the UK revealed pandemic data as noisy, contaminated, potentially biased, and originating from multiple sources. This seriously challenges the capacity for real-time monitoring. Here we assess the feasibility of real-time inference based on such data by constructing an analytic tool combining an age-stratified SEIR transmission model with various observation models describing the data generation mechanisms. As batches of data become available, a sequential Monte Carlo (SMC) algorithm is developed to synthesise multiple imperfect data streams, iterate epidemic inferences and assess model adequacy amidst a rapidly evolving epidemic environment, substantially reducing computation time in comparison to standard MCMC, to ensure timely delivery of real-time epidemic assessments. In application to simulated data designed to mimic the 2009 A/H1N1 epidemic, SMC is shown to have additional benefits in terms of assessing predictive performance and co** with parameter non-identifiability.
△ Less
Submitted 3 May, 2019; v1 submitted 18 August, 2016;
originally announced August 2016.
-
Joining and splitting models with Markov melding
Authors:
Robert J. B. Goudie,
Anne M. Presanis,
David Lunn,
Daniela De Angelis,
Lorenz Wernisch
Abstract:
Analysing multiple evidence sources is often feasible only via a modular approach, with separate submodels specified for smaller components of the available evidence. Here we introduce a generic framework that enables fully Bayesian analysis in this setting. We propose a generic method for forming a suitable joint model when joining submodels, and a convenient computational algorithm for fitting t…
▽ More
Analysing multiple evidence sources is often feasible only via a modular approach, with separate submodels specified for smaller components of the available evidence. Here we introduce a generic framework that enables fully Bayesian analysis in this setting. We propose a generic method for forming a suitable joint model when joining submodels, and a convenient computational algorithm for fitting this joint model in stages, rather than as a single, monolithic model. The approach also enables splitting of large joint models into smaller submodels, allowing inference for the original joint model to be conducted via our multi-stage algorithm. We motivate and demonstrate our approach through two examples: joining components of an evidence synthesis of A/H1N1 influenza, and splitting a large ecology model.
△ Less
Submitted 12 September, 2017; v1 submitted 22 July, 2016;
originally announced July 2016.
-
Synthesising evidence to estimate pandemic (2009) A/H1N1 influenza severity in 2009-2011
Authors:
Anne M. Presanis,
Richard G. Pebody,
Paul J. Birrell,
Brian D. M. Tom,
Helen K. Green,
Hayley Durnall,
Douglas Fleming,
Daniela De Angelis
Abstract:
Knowledge of the severity of an influenza outbreak is crucial for informing and monitoring appropriate public health responses, both during and after an epidemic. However, case-fatality, case-intensive care admission and case-hospitalisation risks are difficult to measure directly. Bayesian evidence synthesis methods have previously been employed to combine fragmented, under-ascertained and biased…
▽ More
Knowledge of the severity of an influenza outbreak is crucial for informing and monitoring appropriate public health responses, both during and after an epidemic. However, case-fatality, case-intensive care admission and case-hospitalisation risks are difficult to measure directly. Bayesian evidence synthesis methods have previously been employed to combine fragmented, under-ascertained and biased surveillance data coherently and consistently, to estimate case-severity risks in the first two waves of the 2009 A/H1N1 influenza pandemic experienced in England. We present in detail the complex probabilistic model underlying this evidence synthesis, and extend the analysis to also estimate severity in the third wave of the pandemic strain during the 2010/2011 influenza season. We adapt the model to account for changes in the surveillance data available over the three waves. We consider two approaches: (a) a two-stage approach using posterior distributions from the model for the first two waves to inform priors for the third wave model; and (b) a one-stage approach modelling all three waves simultaneously. Both approaches result in the same key conclusions: (1) that the age-distribution of the case-severity risks is "u"-shaped, with children and older adults having the highest severity; (2) that the age-distribution of the infection attack rate changes over waves, school-age children being most affected in the first two waves and the attack rate in adults over 25 increasing from the second to third waves; and (3) that when averaged over all age groups, case-severity appears to increase over the three waves. The extent to which the final conclusion is driven by the change in age-distribution of those infected over time is subject to discussion.
△ Less
Submitted 3 February, 2015; v1 submitted 29 August, 2014;
originally announced August 2014.
-
Estimation of HIV Burden through Bayesian Evidence Synthesis
Authors:
Daniela De Angelis,
Anne M. Presanis,
Stefano Conti,
A. E. Ades
Abstract:
Planning, implementation and evaluation of public health policies to control the human immunodeficiency virus (HIV) epidemic require regular monitoring of disease burden. This includes the proportion living with HIV, whether diagnosed or not, and the rate of new infections in the general population and in specific risk groups and regions. Estimation of these quantities is not straightforward: data…
▽ More
Planning, implementation and evaluation of public health policies to control the human immunodeficiency virus (HIV) epidemic require regular monitoring of disease burden. This includes the proportion living with HIV, whether diagnosed or not, and the rate of new infections in the general population and in specific risk groups and regions. Estimation of these quantities is not straightforward: data informing them directly are not typically available, but a wealth of indirect information from surveillance systems and ad hoc studies can inform functions of these quantities. In this paper we show how the estimation problem can be successfully solved through a Bayesian evidence synthesis approach, relaxing the focus on "best available" data to which classical methods are typically restricted. This more comprehensive and flexible use of evidence has led to the adoption of our proposed approach as the official method to estimate HIV prevalence in the United Kingdom since 2005.
△ Less
Submitted 19 May, 2014;
originally announced May 2014.
-
Reconstructing transmission trees for communicable diseases using densely sampled genetic data
Authors:
Colin J. Worby,
Philip D. O'Neill,
Theodore Kypraios,
Julie V. Robotham,
Daniela De Angelis,
Edward J. P. Cartwright,
Sharon J. Peacock,
Ben S. Cooper
Abstract:
Whole genome sequencing of pathogens from multiple hosts in an epidemic offers the potential to investigate who infected whom with unparalleled resolution, potentially yielding important insights into disease dynamics and the impact of control measures. We considered disease outbreaks in a setting with dense genomic sampling, and formulated stochastic epidemic models to investigate person-to-perso…
▽ More
Whole genome sequencing of pathogens from multiple hosts in an epidemic offers the potential to investigate who infected whom with unparalleled resolution, potentially yielding important insights into disease dynamics and the impact of control measures. We considered disease outbreaks in a setting with dense genomic sampling, and formulated stochastic epidemic models to investigate person-to-person transmission, based on observed genomic and epidemiological data. We constructed models in which the genetic distance between sampled genotypes depends on the epidemiological relationship between the hosts. A data augmented Markov chain Monte Carlo algorithm was used to sample over the transmission trees, providing a posterior probability for any given transmission route. We investigated the predictive performance of our methodology using simulated data, demonstrating high sensitivity and specificity, particularly for rapidly mutating pathogens with low transmissibility. We then analyzed data collected during an outbreak of methicillin-resistant Staphylococcus aureus in a hospital, identifying probable transmission routes and estimating epidemiological parameters. Our approach overcomes limitations of previous methods, providing a framework with the flexibility to allow for unobserved infection times, multiple independent introductions of the pathogen, and within-host genetic diversity, as well as allowing forward simulation.
△ Less
Submitted 6 December, 2015; v1 submitted 8 January, 2014;
originally announced January 2014.
-
Conflict Diagnostics in Directed Acyclic Graphs, with Applications in Bayesian Evidence Synthesis
Authors:
Anne M. Presanis,
David Ohlssen,
David J. Spiegelhalter,
Daniela De Angelis
Abstract:
Complex stochastic models represented by directed acyclic graphs (DAGs) are increasingly employed to synthesise multiple, imperfect and disparate sources of evidence, to estimate quantities that are difficult to measure directly. The various data sources are dependent on shared parameters and hence have the potential to conflict with each other, as well as with the model. In a Bayesian framework,…
▽ More
Complex stochastic models represented by directed acyclic graphs (DAGs) are increasingly employed to synthesise multiple, imperfect and disparate sources of evidence, to estimate quantities that are difficult to measure directly. The various data sources are dependent on shared parameters and hence have the potential to conflict with each other, as well as with the model. In a Bayesian framework, the model consists of three components: the prior distribution, the assumed form of the likelihood and structural assumptions. Any of these components may be incompatible with the observed data. The detection and quantification of such conflict and of data sources that are inconsistent with each other is therefore a crucial component of the model criticism process. We first review Bayesian model criticism, with a focus on conflict detection, before describing a general diagnostic for detecting and quantifying conflict between the evidence in different partitions of a DAG. The diagnostic is a p-value based on splitting the information contributing to inference about a "separator" node or group of nodes into two independent groups and testing whether the two groups result in the same inference about the separator node(s). We illustrate the method with three comprehensive examples: an evidence synthesis to estimate HIV prevalence; an evidence synthesis to estimate influenza case-severity; and a hierarchical growth model for rat weights.
△ Less
Submitted 2 October, 2013;
originally announced October 2013.
-
Modeling of the HIV infection epidemic in the Netherlands: A multi-parameter evidence synthesis approach
Authors:
Stefano Conti,
Anne M. Presanis,
Maaike G. van Veen,
Maria Xiridou,
Martin C. Donoghoe,
Annemarie Rinder Stengaard,
Daniela De Angelis
Abstract:
Multi-parameter evidence synthesis (MPES) is receiving growing attention from the epidemiological community as a coherent and flexible analytical framework to accommodate a disparate body of evidence available to inform disease incidence and prevalence estimation. MPES is the statistical methodology adopted by the Health Protection Agency in the UK for its annual national assessment of the HIV epi…
▽ More
Multi-parameter evidence synthesis (MPES) is receiving growing attention from the epidemiological community as a coherent and flexible analytical framework to accommodate a disparate body of evidence available to inform disease incidence and prevalence estimation. MPES is the statistical methodology adopted by the Health Protection Agency in the UK for its annual national assessment of the HIV epidemic, and is acknowledged by the World Health Organization and UNAIDS as a valuable technique for the estimation of adult HIV prevalence from surveillance data. This paper describes the results of utilizing a Bayesian MPES approach to model HIV prevalence in the Netherlands at the end of 2007, using an array of field data from different study designs on various population risk subgroups and with a varying degree of regional coverage. Auxiliary data and expert opinion were additionally incorporated to resolve issues arising from biased, insufficient or inconsistent evidence. This case study offers a demonstration of the ability of MPES to naturally integrate and critically reconcile disparate and heterogeneous sources of evidence, while producing reliable estimates of HIV prevalence used to support public health decision-making.
△ Less
Submitted 27 February, 2012;
originally announced February 2012.