Search | arXiv e-print repository

Analysis of cohort stepped wedge cluster-randomized trials with non-ignorable dropout via joint modeling

Authors: Alessandro Gasparini, Michael J. Crowther, Emiel O. Hoogendijk, Fan Li, Michael O. Harhay

Abstract: Stepped wedge cluster-randomized trial (CRTs) designs randomize clusters of individuals to intervention sequences, ensuring that every cluster eventually transitions from a control period to receive the intervention under study by the end of the study period. The analysis of stepped wedge CRTs is usually more complex than parallel-arm CRTs due to potential secular trends that result in changing in… ▽ More Stepped wedge cluster-randomized trial (CRTs) designs randomize clusters of individuals to intervention sequences, ensuring that every cluster eventually transitions from a control period to receive the intervention under study by the end of the study period. The analysis of stepped wedge CRTs is usually more complex than parallel-arm CRTs due to potential secular trends that result in changing intra-cluster and period-cluster correlations over time. A further challenge in the analysis of closed-cohort stepped wedge CRTs, which follow groups of individuals enrolled in each period longitudinally, is the occurrence of dropout. This is particularly problematic in studies of individuals at high risk for mortality, which causes non-ignorable missing outcomes. If not appropriately addressed, missing outcomes from death will erode statistical power, at best, and bias treatment effect estimates, at worst. Joint longitudinal-survival models can accommodate informative dropout and missingness patterns in longitudinal studies. Specifically, within this framework one directly models the dropout process via a time-to-event submodel together with the longitudinal outcome of interest. The two submodels are then linked using a variety of possible association structures. This work extends linear mixed-effects models by jointly modeling the dropout process to accommodate informative missing outcome data in closed-cohort stepped wedge CRTs. We focus on constant intervention and general time-on-treatment effect parametrizations for the longitudinal submodel and study the performance of the proposed methodology using Monte Carlo simulation under several data-generating scenarios. We illustrate the joint modeling methodology in practice by reanalyzing the `Frail Older Adults: Care in Transition' (ACT) trial, a stepped wedge CRT of a multifaceted geriatric care model versus usual care in the Netherlands. △ Less

Submitted 23 April, 2024; originally announced April 2024.

arXiv:2404.10629 [pdf, other]

Weighting methods for truncation by death in cluster-randomized trials

Authors: Dane Isenberg, Michael Harhay, Nandita Mitra, Fan Li

Abstract: Patient-centered outcomes, such as quality of life and length of hospital stay, are the focus in a wide array of clinical studies. However, participants in randomized trials for elderly or critically and severely ill patient populations may have truncated or undefined non-mortality outcomes if they do not survive through the measurement time point. To address truncation by death, the survivor aver… ▽ More Patient-centered outcomes, such as quality of life and length of hospital stay, are the focus in a wide array of clinical studies. However, participants in randomized trials for elderly or critically and severely ill patient populations may have truncated or undefined non-mortality outcomes if they do not survive through the measurement time point. To address truncation by death, the survivor average causal effect (SACE) has been proposed as a causally interpretable subgroup treatment effect defined under the principal stratification framework. However, the majority of methods for estimating SACE have been developed in the context of individually-randomized trials. Only limited discussions have been centered around cluster-randomized trials (CRTs), where methods typically involve strong distributional assumptions for outcome modeling. In this paper, we propose two weighting methods to estimate SACE in CRTs that obviate the need for potentially complicated outcome distribution modeling. We establish the requisite assumptions that address latent clustering effects to enable point identification of SACE, and we provide computationally-efficient asymptotic variance estimators for each weighting estimator. In simulations, we evaluate our weighting estimators, demonstrating their finite-sample operating characteristics and robustness to certain departures from the identification assumptions. We illustrate our methods using data from a CRT to assess the impact of a sedation protocol on mechanical ventilation among children with acute respiratory failure. △ Less

Submitted 16 April, 2024; originally announced April 2024.

Comments: Code for simulations and R package is available on https://github.com/abcdane1/PtSaceCrts

arXiv:2303.18093 [pdf, other]

Designing a Bayesian adaptive clinical trial to evaluate novel mechanical ventilation strategies in acute respiratory failure using Integrated Nested Laplace Approximations

Authors: Reyhaneh Hosseini, Ziming Chen, Ewan Goligher, Eddy Fan, Niall D. Ferguson, Michael O. Harhay, Sarina Sahetya, Martin Urner, Christopher J. Yarnell, Anna Heath

Abstract: Background: We aimed to design a Bayesian adaption trial through extensive simulations to determine values for key design parameters, demonstrate error rates, and establish the expected sample size. The complexity of the proposed outcome and analysis meant that Markov Chain Monte Carlo methods were required, resulting in an infeasible computational burden. Thus, we leveraged the Integrated Nested… ▽ More Background: We aimed to design a Bayesian adaption trial through extensive simulations to determine values for key design parameters, demonstrate error rates, and establish the expected sample size. The complexity of the proposed outcome and analysis meant that Markov Chain Monte Carlo methods were required, resulting in an infeasible computational burden. Thus, we leveraged the Integrated Nested Laplace Approximations (INLA) algorithm, a fast approximation method, to ensure the feasibility of these simulations. Methods: We simulated Bayesian adaptive two-arm superiority trials that stratified participants into two disease severity states. The outcome was analyzed with proportional odds logistic regression. Trials were stopped for superiority or futility, separately for each state. We calculated the type I error and power across 64 scenarios that varied the stop** thresholds and the minimum sample size before commencing adaptive analyses. We incorporated dynamic borrowing and used INLA to compute the posterior distributions at each adaptive analysis. Designs that maintained a type I error below 5%, a power above 80%, and a feasible mean sample size were then evaluated across 22 scenarios that varied the odds ratios for the two severity states. Results: Power generally increased as the initial sample size and the threshold for declaring futility increased. Two designs were selected for further analysis. In the comprehensive simulations, the one design had a higher chance of reaching a trial conclusion before the maximum sample size and higher probability of declaring superiority when appropriate without a substantial increase in sample size for the more realistic scenarios and was selected as the trial design. Conclusions: We designed a Bayesian adaptive trial to evaluate novel strategies for ventilation using the INLA algorithm to and optimize the trial design through simulation. △ Less

Submitted 31 March, 2023; originally announced March 2023.

arXiv:2303.13960 [pdf]

Demystifying estimands in cluster-randomised trials

Authors: Brennan C Kahan, Bryan Blette, Michael Harhay, Scott Halpern, Vipul Jairath, Andrew Copas, Fan Li

Abstract: Estimands can help clarify the interpretation of treatment effects and ensure that estimators are aligned to the study's objectives. Cluster randomised trials require additional attributes to be defined within the estimand compared to individually randomised trials, including whether treatment effects are marginal or cluster specific, and whether they are participant or cluster average. In this pa… ▽ More Estimands can help clarify the interpretation of treatment effects and ensure that estimators are aligned to the study's objectives. Cluster randomised trials require additional attributes to be defined within the estimand compared to individually randomised trials, including whether treatment effects are marginal or cluster specific, and whether they are participant or cluster average. In this paper, we provide formal definitions of estimands encompassing both these attributes using potential outcomes notation and describe differences between them. We then provide an overview of estimators for each estimand, describe their assumptions, and show consistency (i.e. asymptotically unbiased estimation) for a series of analyses based on cluster level summaries. Then, through a reanalysis of a published cluster randomised trial, we demonstrate that the choice of both estimand and estimator can affect interpretation. For instance, the estimated odds ratio ranged from 1.38 (p=0.17) to 1.83 (p=0.03) depending on the target estimand, and for some estimands, the choice of estimator affected the conclusions by leading to smaller treatment effect estimates. We conclude that careful specification of the estimand, along with an appropriate choice of estimator, are essential to ensuring that cluster randomised trials address the right question. △ Less

Submitted 22 February, 2024; v1 submitted 24 March, 2023; originally announced March 2023.

arXiv:2209.01297 [pdf, other]

Assessing treatment effect heterogeneity in the presence of missing effect modifier data in cluster-randomized trials

Authors: Bryan S. Blette, Scott D. Halpern, Fan Li, Michael O. Harhay

Abstract: Understanding whether and how treatment effects vary across subgroups is crucial to inform clinical practice and recommendations. Accordingly, the assessment of heterogeneous treatment effects (HTE) based on pre-specified potential effect modifiers has become a common goal in modern randomized trials. However, when one or more potential effect modifiers are missing, complete-case analysis may lead… ▽ More Understanding whether and how treatment effects vary across subgroups is crucial to inform clinical practice and recommendations. Accordingly, the assessment of heterogeneous treatment effects (HTE) based on pre-specified potential effect modifiers has become a common goal in modern randomized trials. However, when one or more potential effect modifiers are missing, complete-case analysis may lead to bias and under-coverage. While statistical methods for handling missing data have been proposed and compared for individually randomized trials with missing effect modifier data, few guidelines exist for the cluster-randomized setting, where intracluster correlations in the effect modifiers, outcomes, or even missingness mechanisms may introduce further threats to accurate assessment of HTE. In this article, the performance of several missing data methods are compared through a simulation study of cluster-randomized trials with continuous outcome and missing binary effect modifier data, and further illustrated using real data from the Work, Family, and Health Study. Our results suggest that multilevel multiple imputation (MMI) and Bayesian MMI have better performance than other available methods, and that Bayesian MMI has lower bias and closer to nominal coverage than standard MMI when there are model specification or compatibility issues. △ Less

Submitted 1 December, 2023; v1 submitted 2 September, 2022; originally announced September 2022.

arXiv:2206.10453 [pdf]

doi 10.1177/17407745231160074

Using modified intention-to-treat as a principal stratum estimator for failure to initiate treatment

Authors: Brennan C Kahan, Ian R White, Mark Edwards, Michael O Harhay

Abstract: Background: A common intercurrent event affecting many trials is when some participants do not begin their assigned treatment. Many trials use a modified intention-to-treat (mITT) approach, whereby participants who do not initiate treatment are excluded from the analysis. However, it is not clear the estimand being targeted by such an approach or the assumptions necessary for it to be unbiased.… ▽ More Background: A common intercurrent event affecting many trials is when some participants do not begin their assigned treatment. Many trials use a modified intention-to-treat (mITT) approach, whereby participants who do not initiate treatment are excluded from the analysis. However, it is not clear the estimand being targeted by such an approach or the assumptions necessary for it to be unbiased. Methods: We demonstrate that a mITT analysis which excludes participants who do not begin treatment is estimating a principal stratum estimand (i.e. the treatment effect in the subpopulation of participants who would begin treatment, regardless of which arm they were assigned to). The mITT estimator is unbiased for the principal stratum estimand under the assumption that the intercurrent event is not affected by the assigned treatment arm, that is, participants who initiate treatment in one arm would also do so in the other arm. Results: We identify two key criteria in determining whether the mITT estimator is likely to be unbiased: first, we must be able to measure the participants in each treatment arm who experience the intercurrent event, and second, the assumption that treatment allocation will not affect whether the participant begins treatment must be reasonable. Most double-blind trials will satisfy these criteria, and we provide an example of an open-label trial where these criteria are likely to be satisfied as well. Conclusions: A modified intention-to-treat analysis which excludes participants who do not begin treatment can be an unbiased estimator for the principal stratum estimand. Our framework can help identify when the assumptions for unbiasedness are likely to hold, and thus whether modified intention-to-treat is appropriate or not. △ Less

Submitted 30 January, 2023; v1 submitted 8 June, 2022; originally announced June 2022.

Comments: Changes to Introduction and Abstract, minor changes to Methods

Journal ref: Clin Trials. 2023 Jun;20(3):269-275

arXiv:2205.05624 [pdf, other]

Leveraging baseline covariates to analyze small cluster-randomized trials with a rare binary outcome

Authors: Angela Y. Zhu, Nandita Mitra, Karla Hemming, Michael O. Harhay, Fan Li

Abstract: Cluster-randomized trials (CRTs) involve randomizing entire groups of participants -- called clusters -- to treatment arms but are often comprised of a limited or fixed number of available clusters. While covariate adjustment can account for chance imbalances between treatment arms and increase statistical efficiency in individually-randomized trials, analytical methods for individual-level covari… ▽ More Cluster-randomized trials (CRTs) involve randomizing entire groups of participants -- called clusters -- to treatment arms but are often comprised of a limited or fixed number of available clusters. While covariate adjustment can account for chance imbalances between treatment arms and increase statistical efficiency in individually-randomized trials, analytical methods for individual-level covariate adjustment in small CRTs have received little attention to date. In this paper, we systematically investigate, through extensive simulations, the operating characteristics of propensity score weighting and multivariable regression as two individual-level covariate adjustment strategies for estimating the participant-average causal effect in small CRTs with a rare binary outcome and identify scenarios where each adjustment strategy has a relative efficiency advantage over the other to make practical recommendations. We also examine the finite-sample performance of the bias-corrected sandwich variance estimators associated with propensity score weighting and multivariable regression for quantifying the uncertainty in estimating the participant-average treatment effect. To illustrate the methods for individual-level covariate adjustment, we reanalyze a recent CRT testing a sedation protocol in 31 pediatric intensive care units. △ Less

Submitted 28 November, 2022; v1 submitted 11 May, 2022; originally announced May 2022.

arXiv:2204.06657 [pdf, other]

A Bayesian Machine Learning Approach for Estimating Heterogeneous Survivor Causal Effects: Applications to a Critical Care Trial

Authors: Xinyuan Chen, Michael O. Harhay, Guangyu Tong, Fan Li

Abstract: Motivated by the Acute Respiratory Distress Syndrome Network (ARDSNetwork) ARDS respiratory management (ARMA) trial, we developed a flexible Bayesian machine learning approach to estimate the average causal effect and heterogeneous causal effects among the always-survivors stratum when clinical outcomes are subject to truncation. We adopted Bayesian additive regression trees (BART) to flexibly spe… ▽ More Motivated by the Acute Respiratory Distress Syndrome Network (ARDSNetwork) ARDS respiratory management (ARMA) trial, we developed a flexible Bayesian machine learning approach to estimate the average causal effect and heterogeneous causal effects among the always-survivors stratum when clinical outcomes are subject to truncation. We adopted Bayesian additive regression trees (BART) to flexibly specify separate models for the potential outcomes and latent strata membership. In the analysis of the ARMA trial, we found that the low tidal volume treatment had an overall benefit for participants sustaining acute lung injuries on the outcome of time to returning home, but substantial heterogeneity in treatment effects among the always-survivors, driven most strongly by sex and the alveolar-arterial oxygen gradient at baseline (a physiologic measure of lung function and source of hypoxemia). These findings illustrate how the proposed methodology could guide the prognostic enrichment of future trials in the field. We also demonstrated through a simulation study that our proposed Bayesian machine learning approach outperforms other parametric methods in reducing the estimation bias in both the average causal effect and heterogeneous causal effects for always-survivors. △ Less

Submitted 19 June, 2023; v1 submitted 13 April, 2022; originally announced April 2022.

Comments: arXiv admin note: text overlap with arXiv:1706.06611 by other authors

MSC Class: 62P10

arXiv:2112.00832 [pdf, other]

On the mixed-model analysis of covariance in cluster-randomized trials

Authors: Bingkai Wang, Michael O. Harhay, Jiaqi Tong, Dylan S. Small, Tim P. Morris, Fan Li

Abstract: In the analyses of cluster-randomized trials, mixed-model analysis of covariance (ANCOVA) is a standard approach for covariate adjustment and handling within-cluster correlations. However, when the normality, linearity, or the random-intercept assumption is violated, the validity and efficiency of the mixed-model ANCOVA estimators for estimating the average treatment effect remain unclear. Under t… ▽ More In the analyses of cluster-randomized trials, mixed-model analysis of covariance (ANCOVA) is a standard approach for covariate adjustment and handling within-cluster correlations. However, when the normality, linearity, or the random-intercept assumption is violated, the validity and efficiency of the mixed-model ANCOVA estimators for estimating the average treatment effect remain unclear. Under the potential outcomes framework, we prove that the mixed-model ANCOVA estimators for the average treatment effect are consistent and asymptotically normal under arbitrary misspecification of its working model. If the probability of receiving treatment is 0.5 for each cluster, we further show that the model-based variance estimator under mixed-model ANCOVA1 (ANCOVA without treatment-covariate interactions) remains consistent, clarifying that the confidence interval given by standard software is asymptotically valid even under model misspecification. Beyond robustness, we discuss several insights on precision among classical methods for analyzing cluster-randomized trials, including the mixed-model ANCOVA, individual-level ANCOVA, and cluster-level ANCOVA estimators. These insights may inform the choice of methods in practice. Our analytical results and insights are illustrated via simulation studies and analyses of three cluster-randomized trials. △ Less

Submitted 8 October, 2023; v1 submitted 1 December, 2021; originally announced December 2021.

arXiv:2010.05684 [pdf]

The implications of outcome truncation in reproductive medicine RCTs: a simulation platform for trialists and simulation study

Authors: Jack Wilkinson, Jonathan Huang, Antonia Marsden, Michael Harhay, Andy Vail, Stephen A Roberts

Abstract: Randomised controlled trials in reproductive medicine are often subject to outcome truncation, where study outcomes are only defined in a subset of participants. Examples include birthweight (measurable only in the subgroup of participants who give birth) and miscarriage (which can only occur in participants who become pregnant). These are typically analysed by making a comparison between treatmen… ▽ More Randomised controlled trials in reproductive medicine are often subject to outcome truncation, where study outcomes are only defined in a subset of participants. Examples include birthweight (measurable only in the subgroup of participants who give birth) and miscarriage (which can only occur in participants who become pregnant). These are typically analysed by making a comparison between treatment arms within the subgroup (comparing birthweights in the subgroup who gave birth, or miscarriages in the subgroup who became pregnant). However, this approach does not represent a randomised comparison when treatment influences the probability of being observed (i.e. survival). The practical implications of this for reproductive trials are unclear. We developed a simulation platform to investigate the implications of outcome truncation for reproductive medicine trials. We used this to perform a simulation study, in which we considered the bias, Type 1 error, coverage, and precision of standard statistical analyses for truncated continuous and binary outcomes. Increasing treatment effect on the intermediate variable, strength of confounding between the intermediate and outcome variables, and interactions between treatment and confounder were found to adversely affect performance. However, within parameter ranges we would consider to be more realistic, the adverse effects were generally not drastic. For binary outcomes, the study highlighted that outcome truncation may lead to none of the participants in a study arm experiencing the outcome event. This was found to have severe consequences for inferences, and this may have implications for meta-analysis. △ Less

Submitted 12 October, 2020; originally announced October 2020.

Comments: 63 pages (including supplementary files). 49 Figures (7 in main document, 42 supplementary figures). Code available at, https://osf.io/gzqbr/

arXiv:1304.3752 [pdf]

doi 10.1186/gb-2013-14-9-r101

Reducing assembly complexity of microbial genomes with single-molecule sequencing

Authors: Sergey Koren, Gregory P Harhay, Timothy PL Smith, James L Bono, Dayna M Harhay, D. Scott Mcvey, Diana Radune, Nicholas H Bergman, Adam M Phillippy

Abstract: Background: The short reads output by first- and second-generation DNA sequencing instruments cannot completely reconstruct microbial chromosomes. Therefore, most genomes have been left unfinished due to the significant resources required to manually close gaps in draft assemblies. Third-generation, single-molecule sequencing addresses this problem by greatly increasing sequencing read length, whi… ▽ More Background: The short reads output by first- and second-generation DNA sequencing instruments cannot completely reconstruct microbial chromosomes. Therefore, most genomes have been left unfinished due to the significant resources required to manually close gaps in draft assemblies. Third-generation, single-molecule sequencing addresses this problem by greatly increasing sequencing read length, which simplifies the assembly problem. Results: To measure the benefit of single-molecule sequencing on microbial genome assembly, we sequenced and assembled the genomes of six bacteria and analyzed the repeat complexity of 2,267 complete bacteria and archaea. Our results indicate that the majority of known bacterial and archaeal genomes can be assembled without gaps, at finished-grade quality, using a single PacBio RS sequencing library. These single-library assemblies are also more accurate than typical short-read assemblies and hybrid assemblies of short and long reads. Conclusions: Automated assembly of long, single-molecule sequencing data reduces the cost of microbial finishing to $1,000 for most genomes, and future advances in this technology are expected to drive the cost lower. This is expected to increase the number of completed genomes, improve the quality of microbial genome databases, and enable high-fidelity, population-scale studies of pan-genomes and chromosomal organization. △ Less

Submitted 15 November, 2013; v1 submitted 12 April, 2013; originally announced April 2013.

Comments: Published in Genome Biology: http://genomebiology.com/2013/14/9/R101/ Supplementary materials available from: http://www.cbcb.umd.edu/software/PBcR/closure/index.html Reducing assembly complexity of microbial genomes with single-molecule sequencing. Koren S, Harhay GP, Smith TPL, Bono JL, Harhay DM, Mcvey SD, Radune D, Bergman NH, Phillippy AM. Genome Biology 14:R101 2013

Showing 1–11 of 11 results for author: Harhay, M