Showing 1–11 of 11 results for author: Groenen, P J F

  1. arXiv:2407.00644

    stat.ML cs.LG

    Clusterpath Gaussian Graphical Modeling

    Authors: D. J. W. Touw, A. Alfons, P. J. F. Groenen, I. Wilms

    Abstract: Graphical models serve as effective tools for visualizing conditional dependencies between variables. However, as the number of variables grows, interpretation becomes increasingly difficult, and estimation uncertainty increases due to the large number of parameters relative to the number of observations. To address these challenges, we introduce the Clusterpath estimator of the Gaussian Graphical…

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 43 pages, 11 figures

  2. arXiv:2302.04627

    stat.ME

    Dual scaling of rating data

    Authors: Michel van de Velden, Patrick J. F. Groenen

    Abstract: When applied to contingency tables, dual scaling and correspondence analysis are mathematically equivalent methods. For the analysis of rating data, however, the methods differ. To a large extent this is due to differences in preprocessing of the data. In particular, in dual scaling, ratings are either transformed to rank order, or to successive category data before applying a customised dual scaling appro…

    Submitted 9 February, 2023; originally announced February 2023.

  3. arXiv:2212.02914

    stat.AP

    Effects of Visual Priming on Rating Scale Usage

    Authors: Pieter C. Schoonees, Patrick J. F. Groenen, Michel van de Velden, Hester van Herk

    Abstract: Rating scales are much used in survey research. Often, it is assumed that the scores obtained through rating scales can be compared within and between respondents when studies are in one country. In addition, it is assumed that they can be treated as a numerical scale. In this paper, we study the anchoring effect of a visual stimulus on rating scale usage. To do so, we set up a randomized experime…

    Submitted 6 December, 2022; originally announced December 2022.

  4. arXiv:2211.01877

    stat.ML cs.LG

    Convex Clustering through MM: An Efficient Algorithm to Perform Hierarchical Clustering

    Authors: Daniel J. W. Touw, Patrick J. F. Groenen, Yoshikazu Terada

    Abstract: Convex clustering is a modern method with both hierarchical and $k$-means clustering characteristics. Although convex clustering can capture complex clustering structures hidden in data, the existing convex clustering algorithms are not scalable to large data sets with sample sizes greater than several thousand. Moreover, it is known that convex clustering sometimes fails to produce a complete hi…

    Submitted 21 December, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

    Comments: 27 pages, 8 figures

  5. Robust Mediation Analysis: The R Package robmed

    Authors: Andreas Alfons, Nüfer Y. Ateş, Patrick J. F. Groenen

    Abstract: Mediation analysis is one of the most widely used statistical techniques in the social, behavioral, and medical sciences. Mediation models allow one to study how an independent variable affects a dependent variable indirectly through one or more intervening variables, which are called mediators. The analysis is often carried out via a series of linear regressions, in which case the indirect effects ca…

    Submitted 17 August, 2022; v1 submitted 24 February, 2022; originally announced February 2022.

    Journal ref: Journal of Statistical Software, 103(13), 1-45 (2022)

  6. arXiv:2102.08232

    stat.ME stat.CO stat.ML

    The MELODIC family for simultaneous binary logistic regression in a reduced space

    Authors: Mark de Rooij, Patrick J. F. Groenen

    Abstract: Logistic regression is a commonly used method for binary classification. Researchers often have more than a single binary response variable and simultaneous analysis is beneficial because it provides insight into the dependencies among response variables as well as between the predictor variables and the responses. Moreover, in such a simultaneous analysis the equations can lend each other strengt…

    Submitted 24 June, 2022; v1 submitted 16 February, 2021; originally announced February 2021.

    Comments: [v2] Added a paragraph on page 7 about the equivalence to a logistic reduced rank model; updated the description of the relationship to logistic reduced rank models on page 37

  7. arXiv:2002.08146

    stat.AP

    A censored mixture model for modeling risk taking

    Authors: Nienke F. S. Dijkstra, Henning Tiemeier, Bernd C. Figner, Patrick J. F. Groenen

    Abstract: Risk behavior can have substantial consequences for health, well-being, and functioning. Previous studies have shown an association between real-world risk behavior and risk behavior on experimental tasks, such as the Columbia Card Task, but their modeling is challenging for several reasons. First, many of the experimental risk tasks may end prematurely, leading to censored observations. Second, ce…

    Submitted 19 February, 2020; originally announced February 2020.

    Comments: 29 pages, 9 figures

  8. arXiv:1807.04982

    stat.ME

    Generalized simultaneous component analysis of binary and quantitative data

    Authors: Yipeng Song, Johan A. Westerhuis, Nanne Aben, Lodewyk F. A. Wessels, Patrick J. F. Groenen, Age K. Smilde

    Abstract: In the current era of systems biological research there is a need for the integrative analysis of binary and quantitative genomics data sets measured on the same objects. One standard tool for exploring the underlying dependence structure present in multiple quantitative data sets is the simultaneous component analysis (SCA) model. However, it does not have any provisions when a part of the data are bi…

    Submitted 3 June, 2019; v1 submitted 13 July, 2018; originally announced July 2018.

    Comments: 19 pages, 10 figures

  9. arXiv:1701.06967

    stat.ME

    SparseStep: Approximating the Counting Norm for Sparse Regularization

    Authors: Gerrit J. J. van den Burg, Patrick J. F. Groenen, Andreas Alfons

    Abstract: The SparseStep algorithm is presented for the estimation of a sparse parameter vector in the linear regression problem. The algorithm works by adding an approximation of the exact counting norm as a constraint on the model parameters and iteratively strengthening this approximation to arrive at a sparse solution. Theoretical analysis of the penalty function shows that the estimator yields unbiased…

    Submitted 24 January, 2017; originally announced January 2017.

    MSC Class: 62J05; 62J07

  10. arXiv:1603.03174

    stat.ME

    Multinomial Multiple Correspondence Analysis

    Authors: Patrick J. F. Groenen, Julie Josse

    Abstract: Relations between categorical variables can be analyzed conveniently by multiple correspondence analysis (MCA). It is well suited to discover relations that may exist between categories of different variables. The graphical representation of MCA results in so-called biplots makes it easy to interpret the most important associations. However, a major drawback of MCA is that it does not have an und…

    Submitted 10 March, 2016; originally announced March 2016.

  11. arXiv:1504.07005

    stat.ME

    Regularized Consensus PCA

    Authors: Michel Tenenhaus, Arthur Tenenhaus, Patrick J. F. Groenen

    Abstract: A new framework for many multiblock component methods (including consensus and hierarchical PCA) is proposed. It is based on the consensus PCA model: a scheme connecting each block of variables to a superblock obtained by concatenation of all blocks. Regularized consensus PCA is obtained by applying regularized generalized canonical correlation analysis to this scheme for the function…

    Submitted 27 April, 2015; originally announced April 2015.