Showing 1–7 of 7 results for author: Bakk, Z

  1. arXiv:2310.14726

    stat.OT

    Unraveling the Skillsets of Data Scientists: Text Mining Analysis of Dutch University Master Programs in Data Science and Artificial Intelligence

    Authors: Mathijs J. Mol, Barbara Belfi, Zsuzsa Bakk

    Abstract: The growing demand for data scientists in the global labor market and the Netherlands has led to a rise in data science and artificial intelligence (AI) master programs offered by universities. However, there is still a lack of clarity regarding the specific skillsets of data scientists. This study aims to address this issue by employing Correlated Topic Modeling (CTM) to analyse the content of 41…

    Submitted 23 October, 2023; originally announced October 2023.

  2. arXiv:2307.10720

    stat.ME

    Multilevel latent class analysis with covariates: Analysis of cross-national citizenship norms with a two-stage approach

    Authors: Roberto Di Mari, Zsuzsa Bakk, Jennifer Oser, Jouni Kuha

    Abstract: This paper focuses on the substantive application of multilevel LCA to the evolution of citizenship norms in a diverse array of democratic countries. To do so, we present a two-stage approach to fit multilevel latent class models: in the first stage (measurement model construction), unconditional class enumeration is done separately on both low and high level latent variables, estimating only a pa…

    Submitted 20 July, 2023; originally announced July 2023.

  3. arXiv:2305.07276

    stat.CO stat.ME

    multilevLCA: An R Package for Single-Level and Multilevel Latent Class Analysis with Covariates

    Authors: Johan Lyrvall, Roberto Di Mari, Zsuzsa Bakk, Jennifer Oser, Jouni Kuha

    Abstract: This contribution presents a guide to the R package multilevLCA, which offers a complete and innovative set of technical tools for the latent class analysis of single-level and multilevel categorical data. We describe the available model specifications, mainly falling within the fixed-effect or random-effect approaches. Maximum likelihood estimation of the model parameters, enhanced by a refined i…

    Submitted 10 April, 2024; v1 submitted 12 May, 2023; originally announced May 2023.

  4. arXiv:2304.03853

    stat.ME cs.LG stat.ML

    StepMix: A Python Package for Pseudo-Likelihood Estimation of Generalized Mixture Models with External Variables

    Authors: Sacha Morin, Robin Legault, Félix Laliberté, Zsuzsa Bakk, Charles-Édouard Giguère, Roxane de la Sablonnière, Éric Lacourse

    Abstract: StepMix is an open-source Python package for the pseudo-likelihood estimation (one-, two- and three-step approaches) of generalized finite mixture models (latent profile and latent class analysis) with external variables (covariates and distal outcomes). In many applications in social sciences, the main objective is not only to cluster individuals into latent classes, but also to use these classes…

    Submitted 16 June, 2024; v1 submitted 7 April, 2023; originally announced April 2023.

    Comments: Sacha Morin and Robin Legault contributed equally

  5. arXiv:2303.16101

    stat.ME

    Two-step estimation of latent trait models

    Authors: Jouni Kuha, Zsuzsa Bakk

    Abstract: We consider two-step estimation of latent variable models, in which just the measurement model is estimated in the first step and the measurement parameters are then fixed at their estimated values in the second step where the structural model is estimated. We show how this approach can be implemented for latent trait models (item response theory models) where the latent variables are continuous a…

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: 39 pages, 2 figures, 17 tables

  6. arXiv:2303.06091

    stat.ME

    A two-step estimator for multilevel latent class analysis with covariates

    Authors: Roberto Di Mari, Zsuzsa Bakk, Jennifer Oser, Jouni Kuha

    Abstract: We propose a two-step estimator for multilevel latent class analysis (LCA) with covariates. The measurement model for observed items is estimated in its first step, and in the second step covariates are added in the model, keeping the measurement model parameters fixed. We discuss model identification, and derive an Expectation Maximization algorithm for efficient implementation of the estimator.…

    Submitted 5 July, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

    Comments: Manuscript version accepted for publication in Psychometrika

  7. arXiv:1801.01464

    stat.ME

    Cluster-weighted latent class modeling

    Authors: Roberto Di Mari, Antonio Punzo, Zsuzsa Bakk

    Abstract: Usually in Latent Class Analysis (LCA), external predictors are taken to be cluster conditional probability predictors (LC models with covariates), and/or score conditional probability predictors (LC regression models). In such cases, their distribution is not of interest. Class specific distribution is of interest in the distal outcome model, when the distribution of the external variable(s) is a…

    Submitted 4 January, 2018; originally announced January 2018.