Modelling Complex Survey Data Using R, SAS, SPSS and Stata: A Comparison Using CLSA Datasets
Authors:
Hon Yiu So,
Urun Erbas Oz,
Lauren Griffith,
Susan Kirkland,
**hua Ma,
Parminder Raina,
Nazmul Sohel,
Mary E. Thompson,
Christina Wolfson,
Changbao Wu
Abstract:
The R software has become popular among researchers due to its flexibility and open-source nature. However, researchers in the fields of public health and epidemiological studies are more customary to commercial statistical softwares such as SAS, SPSS and Stata. This paper provides a comprehensive comparison on analysis of health survey data using the R survey package, SAS, SPSS and Stata. We desc…
▽ More
The R software has become popular among researchers due to its flexibility and open-source nature. However, researchers in the fields of public health and epidemiological studies are more customary to commercial statistical softwares such as SAS, SPSS and Stata. This paper provides a comprehensive comparison on analysis of health survey data using the R survey package, SAS, SPSS and Stata. We describe detailed R codes and procedures for other software packages on commonly encountered statistical analyses, such as estimation of population means and regression analysis, using datasets from the Canadian Longitudinal Study on Aging (CLSA). It is hoped that the paper stimulates interest among health science researchers to carry data analysis using R and also serves as a cookbook for statistical analysis using different software packages.
△ Less
Submitted 24 October, 2020; v1 submitted 19 October, 2020;
originally announced October 2020.
Using bootstrap for statistical inference on random graphs
Authors:
Mary E. Thompson,
Lilia Leticia Ramirez Ramirez,
Vyacheslav Lyubchich,
Yulia R. Gel
Abstract:
In this paper, we propose new nonparametric approach to network inference that may be viewed as a fusion of block sampling procedures for temporally and spatially dependent processes with the classical network methodology. We develop estimation and uncertainty quantification procedures for network mean degree using a "patchwork" sample and nonparametric bootstrap, under the assumption of unknown d…
▽ More
In this paper, we propose new nonparametric approach to network inference that may be viewed as a fusion of block sampling procedures for temporally and spatially dependent processes with the classical network methodology. We develop estimation and uncertainty quantification procedures for network mean degree using a "patchwork" sample and nonparametric bootstrap, under the assumption of unknown degree distribution. We investigate asymptotic properties of the proposed patchwork bootstrap procedure and present cross-validation methodology for selecting an optimal patch size. We validate the new patchwork bootstrap on simulated networks with short and long tailed mean degree distributions, and revisit the Erdos collaboration data to illustrate the proposed methodology.
△ Less
Submitted 18 January, 2015; v1 submitted 15 February, 2014;
originally announced February 2014.