Assessing the distribution of discrete survival time in presence of recall error
Authors:
Sedigheh Mirzaei Salehabadi,
Edwina Yeung,
Germaine M. Buck Louis,
Rajeshwari Sundaram
Abstract:
Retrospectively ascertained survival time may be subject to recall error. An example of discrete survival time with such recall error is time-to-pregnancy (TTP), the number of months non-contracepting couples require to get pregnant, which is a measure of human fecundity. The epidemiological literature has demonstrated that retrospective TTP is subject to recall error, yet statistical models focusing on TTP have not accounted for it. We propose a multistage model that uses women's retrospectively reported TTP and the associated certainty to estimate the TTP distribution. The model combines a discrete survival function that accounts for random heterogeneity between women with a multinomial regression model for each woman's certainty, since accuracy may decline over time, i.e., may depend on the time since pregnancy. Other novel features of the model include attention to whether the pregnancy was (un)planned and an approach to predicting the survival function for women without a reported TTP. Our model allows covariates for each of the underlying factors: (un)planned pregnancy, measure of certainty, and TTP distribution. The proposed model is applicable to any discrete survival time when certainty in reporting may be a consideration. We use Monte Carlo simulations to assess the finite-sample performance of the proposed estimators. We illustrate our proposed method using data from the Upstate KIDS Study.
Submitted 16 October, 2018;
originally announced October 2018.
Estimating menarcheal age distribution from partially recalled data
Authors:
Sedigheh Mirzaei Salehabadi,
Debasis Sengupta,
Rahul Ghosal
Abstract:
In a cross-sectional study, adolescent and young adult females were asked to recall the time of menarche, if experienced. Some respondents recalled the date exactly, some recalled only the month or the year of the event, and some were unable to recall anything. We consider estimation of the menarcheal age distribution from these interval-censored data. A complicated interplay between age-at-event and calendar time, together with the evident fact of memory fading with time, makes the censoring informative. We propose a model in which the probabilities of various types of recall depend on the time since menarche. For parametric estimation, we model these probabilities using a multinomial regression function. Establishing consistency and asymptotic normality of the parametric MLE requires some adjustment of the standard asymptotic theory, as the data format varies from case to case. We also provide a non-parametric MLE, propose a computationally simpler approximation, and establish the consistency of both these estimators under mild conditions. We study the small-sample performance of the parametric and non-parametric estimators through Monte Carlo simulations. Moreover, we provide a graphical check of the assumption of the multinomial model for the recall probabilities, which appears to hold for the menarcheal data set. Our analysis shows that using the partially recalled part of the data indeed leads to narrower confidence intervals for the survival function.
Submitted 3 March, 2019; v1 submitted 10 October, 2018;
originally announced October 2018.