Showing 1–16 of 16 results for author: Helske, J

  1. arXiv:2405.16885  [pdf, other]

    stat.ME q-bio.PE

    Hidden Markov modelling of spatio-temporal dynamics of measles in 1750-1850 Finland

    Authors: Tiia-Maria Pasanen, Jouni Helske, Tarmo Ketola

    Abstract: Real world spatio-temporal datasets, and phenomena related to them, are often challenging to visualise or gain a general overview of. In order to summarise information encompassed in such data, we combine two well known statistical modelling methods. To account for the spatial dimension, we use the intrinsic modification of the conditional autoregression, and incorporate it with the hidden Markov…

    Submitted 27 May, 2024; originally announced May 2024.

  2. arXiv:2310.06538  [pdf, other]

    stat.AP q-bio.PE

    Spatio-temporal modeling of co-dynamics of smallpox, measles and pertussis in pre-healthcare Finland

    Authors: Tiia-Maria Pasanen, Jouni Helske, Harri Högmander, Tarmo Ketola

    Abstract: Infections are known to interact as previous infections may have an effect on risk of succumbing to a new infection. The co-dynamics can be mediated by immunosuppression or -modulation, shared environmental or climatic drivers, or competition for susceptible hosts. Research and statistical methods in epidemiology often concentrate on large pooled datasets, or high quality data from cities, leaving…

    Submitted 20 May, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

  3. arXiv:2309.08558  [pdf, other]

    stat.ME cs.CY

    A modern approach to transition analysis and process mining with Markov models: A tutorial with R

    Authors: Jouni Helske, Satu Helske, Mohammed Saqr, Sonsoles López-Pernas, Keefe Murphy

    Abstract: This chapter presents an introduction to Markovian modeling for the analysis of sequence data. Contrary to the deterministic approach seen in the previous sequence analysis chapters, Markovian models are probabilistic models, focusing on the transitions between states instead of studying sequences as a whole. The chapter provides an introduction to this method and differentiates between its most c…

    Submitted 2 September, 2023; originally announced September 2023.

    MSC Class: 60J10

  4. Price Optimization Combining Conjoint Data and Purchase History: A Causal Modeling Approach

    Authors: Lauri Valkonen, Santtu Tikka, Jouni Helske, Juha Karvanen

    Abstract: Pricing decisions of companies require an understanding of the causal effect of a price change on the demand. When real-life pricing experiments are infeasible, data-driven decision-making must be based on alternative data sources such as purchase history (sales data) and conjoint studies where a group of customers is asked to make imaginary purchases in an artificial setup. We present an approach…

    Submitted 30 April, 2024; v1 submitted 29 March, 2023; originally announced March 2023.

    Journal ref: Observational Studies, 10(1), 37-53, 2024

  5. arXiv:2302.01607  [pdf, other]

    stat.ME

    dynamite: An R Package for Dynamic Multivariate Panel Models

    Authors: Santtu Tikka, Jouni Helske

    Abstract: dynamite is an R package for Bayesian inference of intensive panel (time series) data comprising multiple measurements per multiple individuals measured in time. The package supports joint modeling of multiple response variables, time-varying and time-invariant effects, a wide range of discrete and continuous distributions, group-specific random effects, latent factors, and customization of prior…

    Submitted 27 May, 2024; v1 submitted 3 February, 2023; originally announced February 2023.

  6. arXiv:2111.04513  [pdf, ps, other]

    stat.ML cs.LG stat.ME

    Clustering and Structural Robustness in Causal Diagrams

    Authors: Santtu Tikka, Jouni Helske, Juha Karvanen

    Abstract: Graphs are commonly used to represent and visualize causal relations. For a small number of variables, this approach provides a succinct and clear view of the scenario at hand. As the number of variables under study increases, the graphical approach may become impractical, and the clarity of the representation is lost. Clustering of variables is a natural way to reduce the size of the causal diagr…

    Submitted 15 August, 2023; v1 submitted 8 November, 2021; originally announced November 2021.

    Comments: This is the version published in JMLR

    Journal ref: Journal of Machine Learning Research, 24(195):1-32, 2023

  7. A Bayesian spatio-temporal analysis of markets during the Finnish 1860s famine

    Authors: Tiia-Maria Pasanen, Miikka Voutilainen, Jouni Helske, Harri Högmander

    Abstract: We develop a Bayesian spatio-temporal model to study pre-industrial grain market integration during the Finnish famine of the 1860s. Our model takes into account several problematic features often present when analysing multiple spatially interdependent time series. For example, compared with the error correction methodology commonly applied in econometrics, our approach allows simultaneous modell…

    Submitted 12 April, 2022; v1 submitted 11 June, 2021; originally announced June 2021.

    Journal ref: Journal of the Royal Statistical Society: Series C (Applied Statistics), 71(5), 1282-1302. (2022)

  8. bssm: Bayesian Inference of Non-linear and Non-Gaussian State Space Models in R

    Authors: Jouni Helske, Matti Vihola

    Abstract: We present an R package bssm for Bayesian non-linear/non-Gaussian state space modelling. Unlike the existing packages, bssm allows for easy-to-use approximate inference based on Gaussian approximations such as the Laplace approximation and the extended Kalman filter. The package accommodates also discretely observed latent diffusion processes. The inference is based on fully automatic, adaptive Ma…

    Submitted 28 May, 2021; v1 submitted 21 January, 2021; originally announced January 2021.

    Journal ref: The R Journal (2021) 13:2, pages 578-589

  9. Efficient Bayesian generalized linear models with time-varying coefficients: The walker package in R

    Authors: Jouni Helske

    Abstract: The R package walker extends standard Bayesian general linear models to the case where the effects of the explanatory variables can vary in time. This allows, for example, to model the effects of interventions such as changes in tax policy which gradually increases their effect over time. The Markov chain Monte Carlo algorithms powering the Bayesian inference are based on Hamiltonian Monte Carlo p…

    Submitted 15 September, 2020; originally announced September 2020.

    Comments: 9 pages, 1 figure

    Journal ref: SoftwareX, 2022, 18:101016

  10. arXiv:2003.03187  [pdf, other]

    stat.ME stat.CO

    Estimation of causal effects with small data in the presence of trapdoor variables

    Authors: Jouni Helske, Santtu Tikka, Juha Karvanen

    Abstract: We consider the problem of estimating causal effects of interventions from observational data when well-known back-door and front-door adjustments are not applicable. We show that when an identifiable causal effect is subject to an implicit functional constraint that is not deducible from conditional independence relations, the estimator of the causal effect can exhibit bias in small samples. This…

    Submitted 24 March, 2021; v1 submitted 6 March, 2020; originally announced March 2020.

    Comments: 25 pages, 8 figures

    Journal ref: Journal of Royal Statistical Society: Series A. 2021, 184:1030-1051

  11. Can visualization alleviate dichotomous thinking? Effects of visual representations on the cliff effect

    Authors: Jouni Helske, Satu Helske, Matthew Cooper, Anders Ynnerman, Lonni Besançon

    Abstract: Common reporting styles for statistical results in scientific articles, such as p-values and confidence intervals (CI), have been reported to be prone to dichotomous interpretations, especially with respect to the null hypothesis significance testing framework. For example when the p-value is small enough or the CIs of the mean effects of a studied drug and a placebo are not overlapping, scientist…

    Submitted 28 May, 2021; v1 submitted 17 February, 2020; originally announced February 2020.

    Journal ref: IEEE Transactions on Visualization and Computer Graphics. 2021; 27(8)

  12. arXiv:1901.02374  [pdf, other]

    stat.ML cs.LG

    Graphical model inference: Sequential Monte Carlo meets deterministic approximations

    Authors: Fredrik Lindsten, Jouni Helske, Matti Vihola

    Abstract: Approximate inference in probabilistic graphical models (PGMs) can be grouped into deterministic methods and Monte-Carlo-based methods. The former can often provide accurate and rapid inferences, but are typically associated with biases that are hard to quantify. The latter enjoy asymptotic consistency, but can suffer from high computational costs. In this paper we present a way of bridging the ga…

    Submitted 8 January, 2019; originally announced January 2019.

    Journal ref: 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Montréal, Canada

  13. Introducing libeemd: A program package for performing the ensemble empirical mode decomposition

    Authors: P. J. J. Luukko, J. Helske, E. Räsänen

    Abstract: The ensemble empirical mode decomposition (EEMD) and its complete variant (CEEMDAN) are adaptive, noise-assisted data analysis methods that improve on the ordinary empirical mode decomposition (EMD). All these methods decompose possibly nonlinear and/or nonstationary time series data into a finite amount of components separated by instantaneous frequencies. This decomposition provides a powerful m…

    Submitted 3 July, 2017; originally announced July 2017.

    Comments: The final publication is available at Springer via https://dx.doi.org/10.1007/s00180-015-0603-9

    Journal ref: Comput. Stat. 31 545 (2016)

  14. arXiv:1704.00543  [pdf, other]

    stat.CO stat.AP

    Mixture Hidden Markov Models for Sequence Data: The seqHMM Package in R

    Authors: Satu Helske, Jouni Helske

    Abstract: Sequence analysis is being more and more widely used for the analysis of social sequences and other multivariate categorical time series data. However, it is often complex to describe, visualize, and compare large sequence data, especially when there are multiple parallel sequences per subject. Hidden (latent) Markov models (HMMs) are able to detect underlying latent structures and they can be use…

    Submitted 26 January, 2019; v1 submitted 3 April, 2017; originally announced April 2017.

    Comments: 33 pages, 8 figures

    Journal ref: Journal of Statistical Software, 88(3), 1 - 32 (2019)

  15. arXiv:1612.01907  [pdf, other]

    stat.CO stat.ME

    KFAS: Exponential Family State Space Models in R

    Authors: Jouni Helske

    Abstract: State space modelling is an efficient and flexible method for statistical inference of a broad class of time series and other data. This paper describes an R package KFAS for state space modelling with the observations from an exponential family, namely Gaussian, Poisson, binomial, negative binomial and gamma distributions. After introducing the basic theory behind Gaussian and non-Gaussian state…

    Submitted 24 March, 2017; v1 submitted 6 December, 2016; originally announced December 2016.

    Comments: 39 pages, 7 figures. This is a preprint version of an article to appear in the Journal of Statistical Software. Change to previous version: Added grant number to acknowledgments

    Journal ref: Journal of Statistical Software, 78(10), 1 - 39 (2017)

  16. arXiv:1609.02541  [pdf, other]

    stat.CO math.PR

    Importance sampling type estimators based on approximate marginal MCMC

    Authors: Matti Vihola, Jouni Helske, Jordan Franks

    Abstract: We consider importance sampling (IS) type weighted estimators based on Markov chain Monte Carlo (MCMC) targeting an approximate marginal of the target distribution. In the context of Bayesian latent variable models, the MCMC typically operates on the hyperparameters, and the subsequent weighting may be based on IS or sequential Monte Carlo (SMC), but allows for multilevel techniques as well. The I…

    Submitted 9 March, 2020; v1 submitted 8 September, 2016; originally announced September 2016.

    Comments: 34 pages, 1 figure

    Journal ref: Scand J Statist. 2020; 47: 1339-1376