StepMix: A Python Package for Pseudo-Likelihood Estimation of Generalized Mixture Models with External Variables
Authors:
Sacha Morin,
Robin Legault,
Félix Laliberté,
Zsuzsa Bakk,
Charles-Édouard Giguère,
Roxane de la Sablonnière,
Éric Lacourse
Abstract:
StepMix is an open-source Python package for the pseudo-likelihood estimation (one-, two- and three-step approaches) of generalized finite mixture models (latent profile and latent class analysis) with external variables (covariates and distal outcomes). In many applications in social sciences, the main objective is not only to cluster individuals into latent classes, but also to use these classes…
▽ More
StepMix is an open-source Python package for the pseudo-likelihood estimation (one-, two- and three-step approaches) of generalized finite mixture models (latent profile and latent class analysis) with external variables (covariates and distal outcomes). In many applications in social sciences, the main objective is not only to cluster individuals into latent classes, but also to use these classes to develop more complex statistical models. These models generally divide into a measurement model that relates the latent classes to observed indicators, and a structural model that relates covariates and outcome variables to the latent classes. The measurement and structural models can be estimated jointly using the so-called one-step approach or sequentially using stepwise methods, which present significant advantages for practitioners regarding the interpretability of the estimated latent classes. In addition to the one-step approach, StepMix implements the most important stepwise estimation methods from the literature, including the bias-adjusted three-step methods with Bolk-Croon-Hagenaars and maximum likelihood corrections and the more recent two-step approach. These pseudo-likelihood estimators are presented in this paper under a unified framework as specific expectation-maximization subroutines. To facilitate and promote their adoption among the data science community, StepMix follows the object-oriented design of the scikit-learn library and provides an additional R wrapper.
△ Less
Submitted 16 June, 2024; v1 submitted 7 April, 2023;
originally announced April 2023.
Unfolding selection to infer individual risk heterogeneity for optimising disease forecasts and policy development
Authors:
M. Gabriela M. Gomes,
Nicholas A. Feasey,
Marcelo U. Ferreira,
E. James LaCourse,
Kate E. Langwig,
Lisa Reimer,
Beate Ringwald,
Jamie Rylance,
J. Russell Stothard,
Miriam Taegtmeyer,
Dianne J. Terlouw,
Rachel Tolhurst,
Tom Wingfield,
Stephen B. Gordon
Abstract:
Mathematical models are increasing adopted for setting targets for disease prevention and control. As model-informed policies are implemented, however, the inaccuracies of some forecasts become apparent, for example overprediction of infection burdens and overestimation of intervention impacts. Here, we attribute these discrepancies to methodological limitations in capturing the heterogeneities of…
▽ More
Mathematical models are increasing adopted for setting targets for disease prevention and control. As model-informed policies are implemented, however, the inaccuracies of some forecasts become apparent, for example overprediction of infection burdens and overestimation of intervention impacts. Here, we attribute these discrepancies to methodological limitations in capturing the heterogeneities of real-world systems. The mechanisms underpinning single factors for infection and their interactions determine individual propensities to acquire disease. These are potentially so numerous that to attain a full mechanistic description may be unfeasible. To contribute constructively to the development of health policies, model developers either leave factors out (reductionism) or adopt a broader but coarse description (holism). In our view, predictive capacity requires holistic descriptions of heterogeneity which are currently underutilised in infectious disease epidemiology but common in other disciplines.
△ Less
Submitted 16 May, 2022; v1 submitted 2 September, 2020;
originally announced September 2020.