-
Compositional dynamic modelling for causal prediction in multivariate time series
Authors:
Kevin Li,
Graham Tierney,
Christoph Hellmayr,
Mike West
Abstract:
Theoretical developments in sequential Bayesian analysis of multivariate dynamic models underlie new methodology for causal prediction. This extends the utility of existing models with computationally efficient methodology, enabling routine exploration of Bayesian counterfactual analyses with multiple selected time series as synthetic controls. Methodological contributions also define the concept of outcome adaptive modelling to monitor and inferentially respond to changes in experimental time series following interventions designed to explore causal effects. The benefits of sequential analyses with time-varying parameter models for causal investigations are inherited in this broader setting. A case study in commercial causal analysis, involving retail revenue outcomes related to marketing interventions, highlights the methodological advances.
Submitted 4 June, 2024;
originally announced June 2024.
-
Reply to Discussions of "Multivariate Dynamic Modeling for Bayesian Forecasting of Business Revenue"
Authors:
Anna K. Yanchenko,
Graham Tierney,
Joseph Lawson,
Christoph Hellmayr,
Andrew Cron,
Mike West
Abstract:
We are most grateful to all discussants for their positive comments and many thought-provoking questions. In addition, the discussants provide a number of useful leads into various areas of the literatures on time series, forecasting and commercial application within which the work in our paper is, of course, just one contribution linked to multiple threads. Our view is that, collectively, the discussion contributions nicely expand on the core of the paper and together -- with multiple additional references -- provide an excellent point-of-entrée to the broader field of retail forecasting and its research challenges. Interested readers are encouraged to dig deeply into the discussions and our responses here, and explore referenced sources.
There are several themes that recur across discussants, as well as a range of specific points/questions raised. Following some "big-picture" comments on our perspectives on Bayesian forecasting systems, we comment in turn on some specifics in each contribution.
Submitted 2 April, 2023;
originally announced April 2023.
-
Frequency content and filtering of head sensor kinematics: A method to enable field-based inter-study comparisons
Authors:
Gregory Tierney,
Steve Rowson,
Ryan Gellner,
Sadaf Iqbal,
Pardis Biglarbeigi,
James Tooby,
James Woodward,
Amir Farokh Payam
Abstract:
Wearable head sensor systems use different kinematic signal processing approaches, which limits field-based inter-study comparisons, especially when artefacts are present in the signal. The aim of this study is to assess the frequency content and characteristics of head kinematic signals from head impact reconstruction laboratory and field-based environments to develop an artefact attenuation filtering method (artefact attenuation method). Laboratory impacts (n=72) on a test-dummy headform ranging from 25-150 g were conducted and 126 elite-level rugby union players were equipped with instrumented mouthguards (iMG) for up to four matches. Power spectral density (PSD) characteristics of the laboratory impacts and on-field HAE (n=5694), such as the 95th percentile cumulative sum PSD frequency, were utilised to develop the artefact attenuation method. The artefact attenuation method was compared to two other common filtering approaches (a fourth-order (2x2 pole), zero-lag Butterworth filter with a 200 Hz (-6 dB) cut-off frequency (Butterworth-200Hz) and a CFC180 filter) through signal-to-noise ratio (SNR) and mixed linear effects models for laboratory and on-field events, respectively. The artefact attenuation method produced an overall higher SNR than the Butterworth-200Hz and CFC180 filters and on-field peak linear acceleration (PLA) and peak angular acceleration (PAA) values within the magnitude range tested in the laboratory. Median PLA and PAA were higher for the CFC180 filter than the Butterworth-200Hz (p<0.01) and artefact attenuation method (p<0.01), reporting values as high as 294 g and 31.2 krad/s². The artefact attenuation method can be applied to all commercially available iMG kinematic signals with adequate sample rates to enable field-based inter-study comparisons.
Submitted 6 March, 2023;
originally announced March 2023.
-
Multivariate Bayesian dynamic modeling for causal prediction
Authors:
Graham Tierney,
Christoph Hellmayr,
Greg Barkimer,
Kevin Li,
Mike West
Abstract:
Bayesian forecasting is developed in multivariate time series analysis for causal inference. Causal evaluation of sequentially observed time series data from control and treated units focuses on the impacts of interventions using contemporaneous outcomes in control units. Methodological developments here concern multivariate dynamic models for time-varying effects across multiple treated units with explicit foci on sequential learning and aggregation of intervention effects. Analysis explores dimension reduction across multiple synthetic counterfactual predictors. Computational advances leverage fully conjugate models for efficient sequential learning and inference, including cross-unit correlations and their time variation. This allows full uncertainty quantification on model hyper-parameters via Bayesian model averaging. A detailed case study evaluates interventions in a supermarket promotions experiment, with coupled predictive analyses in selected regions of a large-scale commercial system. Comparisons with existing methods highlight the issues of appropriate uncertainty quantification in causal inference in aggregation across treated units, among other practical concerns.
Submitted 19 June, 2024; v1 submitted 6 February, 2023;
originally announced February 2023.
-
Bias and Excess Variance in Election Polling: A Not-So-Hidden Markov Model
Authors:
Graham Tierney,
Alexander Volfovsky
Abstract:
With historic misses in the 2016 and 2020 US Presidential elections, interest in measuring polling errors has increased. The most common method for measuring directional errors and non-sampling excess variability during a postmortem for an election is to assess the difference between the poll result and election result for polls conducted within a few days of the day of the election. Analyzing such polling error data is notoriously difficult, with typical models being extremely sensitive to the time between the poll and the election. We leverage hidden Markov models traditionally used for election forecasting to flexibly capture time-varying preferences and treat the election result as a peek at the typically hidden Markovian process. Our results are much less sensitive to the choice of time window, avoid conflating shifting preferences with polling error, and are more interpretable despite a highly flexible model. We demonstrate these results with data on polls from the 2004 through 2020 US Presidential elections and 1992 through 2020 US Senate elections, concluding that previously reported estimates of bias in Presidential elections were too extreme by 10%, estimated bias in Senatorial elections was too extreme by 25%, and excess variability estimates were also too large.
Submitted 17 February, 2023; v1 submitted 16 June, 2022;
originally announced June 2022.
-
Multivariate Dynamic Modeling for Bayesian Forecasting of Business Revenue
Authors:
Anna K. Yanchenko,
Graham Tierney,
Joseph Lawson,
Christoph Hellmayr,
Andrew Cron,
Mike West
Abstract:
Forecasting enterprise-wide revenue is critical to many companies and presents several challenges and opportunities for significant business impact. This case study is based on model developments to address these challenges for forecasting in a large-scale retail company. Focused on multivariate revenue forecasting across collections of supermarkets and product Categories, hierarchical dynamic models are natural: these are able to couple revenue streams in an integrated forecasting model, while allowing conditional decoupling to enable relevant and sensitive analysis together with scalable computation. Structured models exploit multi-scale modeling to cascade information on price and promotion activities as predictors relevant across Categories and groups of stores. With a context-relevant focus on forecasting revenue 12 weeks ahead, the study highlights product Categories that benefit from multi-scale information, defines insights into when, how and why multivariate models improve forecast accuracy, and shows how cross-Category dependencies can relate to promotion decisions in one Category impacting others. Bayesian modeling developments underlying the case study are accessible in custom code for interested readers.
Submitted 10 December, 2021;
originally announced December 2021.
-
Author Clustering and Topic Estimation for Short Texts
Authors:
Graham Tierney,
Christopher Bail,
Alexander Volfovsky
Abstract:
Analysis of short text, such as social media posts, is extremely difficult because of their inherent brevity. In addition to classifying topics of such posts, a common downstream task is grouping the authors of these documents for subsequent analyses. We propose a novel model that expands on Latent Dirichlet Allocation by modeling strong dependence among the words in the same document, with user-level topic distributions. We also simultaneously cluster users, removing the need for post-hoc cluster estimation and improving topic estimation by shrinking noisy user-level topic distributions towards typical values. Our method performs as well as, or better than, traditional approaches, and we demonstrate its usefulness on a dataset of tweets from United States Senators, recovering both meaningful topics and clusters that reflect partisan ideology. We also develop a novel measure of echo chambers among these politicians by characterizing insularity of topics discussed by groups of Senators and provide uncertainty quantification.
Submitted 16 June, 2022; v1 submitted 15 June, 2021;
originally announced June 2021.