-
Study Duration Prediction for Clinical Trials with Time-to-Event Endpoints Using Mixture Distributions Accounting for Heterogeneous Population
Authors:
Hong Zhang,
Jie Pu,
Shibing Deng,
Satrajit Roychoudhury,
Haitao Chu,
Douglas Robinson
Abstract:
In the era of precision medicine, more and more clinical trials are now driven or guided by biomarkers, which are patient characteristics objectively measured and evaluated as indicators of normal biological processes, pathogenic processes, or pharmacologic responses to therapeutic interventions. With the overarching objective to optimize and personalize disease management, biomarker-guided clinical trials increase the efficiency by appropriately utilizing prognostic or predictive biomarkers in the design. However, the efficiency gain is often not quantitatively compared to the traditional all-comers design, in which a faster enrollment rate is expected (e.g. due to no restriction to biomarker positive patients) potentially leading to a shorter duration. To accurately predict biomarker-guided trial duration, we propose a general framework using mixture distributions accounting for a heterogeneous population. Extensive simulations are performed to evaluate the impact of a heterogeneous population and the dynamics of biomarker characteristics and disease on the study duration. Several influential parameters including median survival time, enrollment rate, biomarker prevalence and effect size are identified. Re-assessments of two publicly available trials are conducted to empirically validate the prediction accuracy and to demonstrate the practical utility. The R package detest is developed to implement the proposed method and is publicly available on CRAN.
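As a rough illustration of the mixture idea in this abstract, the sketch below computes the survival function and median of a two-component mixture of biomarker-positive and biomarker-negative patients. It is a minimal sketch under stated assumptions, not the authors' method or the detest package's API: the exponential components, the function names and the parameter names (prevalence, median_pos, median_neg) are all hypothetical choices made for this example.

```python
import math


def mixture_survival(t, prevalence, median_pos, median_neg):
    """Survival probability at time t for a two-component exponential mixture.

    Assumption: each subgroup has exponential survival, parameterized by its median.
    """
    rate_pos = math.log(2) / median_pos
    rate_neg = math.log(2) / median_neg
    return (prevalence * math.exp(-rate_pos * t)
            + (1 - prevalence) * math.exp(-rate_neg * t))


def mixture_median(prevalence, median_pos, median_neg, tol=1e-9):
    """Median of the mixture, found by bisection on S(t) = 0.5."""
    # S(0) = 1 > 0.5, and at the larger component median both
    # component survivals are at most 0.5, so the root is bracketed.
    lo, hi = 0.0, max(median_pos, median_neg)
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if mixture_survival(mid, prevalence, median_pos, median_neg) > 0.5:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


if __name__ == "__main__":
    # Hypothetical inputs: 40% biomarker-positive, medians of 6 and 18 months.
    print(mixture_median(0.4, 6.0, 18.0))
```

The overall median lies between the two subgroup medians and shifts with biomarker prevalence, which is one reason a single-population assumption can misjudge how quickly events accrue.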
Submitted 31 December, 2023;
originally announced January 2024.
-
On fixed and uncertain mixture prior weights
Authors:
Beat Neuenschwander,
Simon Wandel,
Satrajit Roychoudhury,
Heinz Schmidli
Abstract:
This paper focuses on the specification of the weights for the components of mixture priors.
Submitted 30 June, 2023; v1 submitted 27 June, 2023;
originally announced June 2023.
-
Duration of and time to response in oncology clinical trials from the perspective of the estimand framework
Authors:
Hans-Jochen Weber,
Stephen Corson,
Jiang Li,
Francois Mercier,
Satrajit Roychoudhury,
Martin Oliver Sailer,
Stephen Sun,
Alexander Todd,
Godwin Yung
Abstract:
Duration of response (DOR) and time to response (TTR) are typically evaluated as secondary endpoints in early-stage clinical studies in oncology when efficacy is assessed by the best overall response (BOR) and presented as the overall response rate (ORR). Despite common use of DOR and TTR in particular in single-arm studies, the definition of these endpoints and the questions they are intended to answer remain unclear. Motivated by the estimand framework, we present relevant scientific questions of interest for DOR and TTR and propose corresponding estimand definitions. We elaborate on how to deal with relevant intercurrent events which should follow the same considerations as implemented for the primary response estimand. A case study in mantle cell lymphoma illustrates the implementation of relevant estimands of DOR and TTR. We close the paper with practical recommendations to implement DOR and TTR in clinical study protocols.
Submitted 21 December, 2022;
originally announced December 2022.
-
The Predictive Individual Effect for Survival Data
Authors:
Beat Neuenschwander,
Satrajit Roychoudhury,
Simon Wandel,
Kannan Natarajan,
Emmanuel Zuber
Abstract:
The call for patient-focused drug development is loud and clear, as expressed in the 21st Century Cures Act and in recent guidelines and initiatives of regulatory agencies. Among the factors contributing to modernized drug development and improved health-care activities are easily interpretable measures of clinical benefit. In addition, special care is needed for cancer trials with time-to-event endpoints if the treatment effect is not constant over time. We propose the predictive individual effect which is a patient-centric and tangible measure of clinical benefit under a wide variety of scenarios. It can be obtained by standard predictive calculations under a rank preservation assumption that has been used previously in trials with treatment switching. We discuss four recent Oncology trials that cover situations with proportional as well as non-proportional hazards (delayed treatment effect or crossing of survival curves). It is shown that the predictive individual effect offers valuable insights beyond p-values, estimates of hazard ratios or differences in median survival. Compared to standard statistical measures, the predictive individual effect is a direct, easily interpretable measure of clinical benefit. It facilitates communication among clinicians, patients, and other parties and should therefore be considered in addition to standard statistical results.
Submitted 20 December, 2021;
originally announced December 2021.
-
Estimands in Hematologic Oncology Trials
Authors:
Steven Sun,
Hans-Jochen Weber,
Emily Butler,
Kaspar Rufibach,
Satrajit Roychoudhury
Abstract:
The estimand framework included in the addendum to the ICH E9 guideline facilitates discussions to ensure alignment between the key question of interest, the analysis, and interpretation. Therapeutic knowledge and drug mechanism play a crucial role in determining the strategy and defining the estimand for clinical trial designs. Clinical trials in patients with hematological malignancies often present unique challenges for trial design due to the complexity of treatment options and the existence of potentially curative but highly risky procedures, e.g. stem cell transplant or treatment sequence across different phases (induction, consolidation, maintenance). Here, we illustrate how to apply the estimand framework in hematological clinical trials and how the estimand framework can address potential difficulties in trial result interpretation.
This paper is a result of a cross-industry collaboration to connect the International Conference on Harmonisation (ICH) E9 addendum concepts to applications. Three randomized phase 3 trials will be used to consider common challenges including intercurrent events in hematologic oncology trials to illustrate different scientific questions and the consequences of the estimand choice for trial design, data collection, analysis, and interpretation. Template language for describing estimands in both study protocols and statistical analysis plans is suggested for statisticians' reference.
Submitted 1 October, 2020;
originally announced October 2020.
-
Principal Stratum Strategy: Potential Role in Drug Development
Authors:
Björn Bornkamp,
Kaspar Rufibach,
Jianchang Lin,
Yi Liu,
Devan V. Mehrotra,
Satrajit Roychoudhury,
Heinz Schmidli,
Yue Shentu,
Marcel Wolbers
Abstract:
A randomized trial allows estimation of the causal effect of an intervention compared to a control in the overall population and in subpopulations defined by baseline characteristics. Often, however, clinical questions also arise regarding the treatment effect in subpopulations of patients who would experience clinical or disease-related events post-randomization. Events that occur after treatment initiation and potentially affect the interpretation or the existence of the measurements are called intercurrent events in the ICH E9(R1) guideline. If the intercurrent event is a consequence of treatment, randomization alone is no longer sufficient to meaningfully estimate the treatment effect. Analyses comparing the subgroups of patients without the intercurrent events for intervention and control will not estimate a causal effect. This is well known, but post-hoc analyses of this kind are commonly performed in drug development. An alternative approach is the principal stratum strategy, which classifies subjects according to their potential occurrence of an intercurrent event on both study arms. We illustrate with examples that questions formulated through principal strata occur naturally in drug development and argue that approaching these questions with the ICH E9(R1) estimand framework has the potential to lead to more transparent assumptions as well as more adequate analyses and conclusions. In addition, we provide an overview of assumptions required for estimation of effects in principal strata. Most of these assumptions are unverifiable and should hence be based on solid scientific understanding. Sensitivity analyses are needed to assess robustness of conclusions.
Submitted 8 February, 2021; v1 submitted 12 August, 2020;
originally announced August 2020.
-
Alternative Analysis Methods for Time to Event Endpoints under Non-proportional Hazards: A Comparative Analysis
Authors:
Ray S. Lin,
Ji Lin,
Satrajit Roychoudhury,
Keaven M. Anderson,
Tianle Hu,
Bo Huang,
Larry F Leon,
Jason JZ Liao,
Rong Liu,
Xiaodong Luo,
Pralay Mukhopadhyay,
Rui Qin,
Kay Tatsuoka,
Xuejing Wang,
Yang Wang,
Jian Zhu,
Tai-Tsang Chen,
Renee Iacona,
Cross-Pharma Non-proportional Hazards Working Group
Abstract:
The log-rank test is most powerful under proportional hazards (PH). In practice, non-PH patterns are often observed in clinical trials, such as in immuno-oncology; therefore, alternative methods are needed to restore the efficiency of statistical testing. Three categories of testing methods were evaluated, including weighted log-rank tests, Kaplan-Meier curve-based tests (including weighted Kaplan-Meier and Restricted Mean Survival Time, RMST), and combination tests (including Breslow test, Lee's combo test, and MaxCombo test). Nine scenarios representing the PH and various non-PH patterns were simulated. The power, type I error, and effect estimates of each method were compared. In general, all tests control type I error well. There is not a single most powerful test across all scenarios. In the absence of prior knowledge regarding the PH or non-PH patterns, the MaxCombo test is relatively robust across patterns. Since the treatment effect changes over time under non-PH, the overall profile of the treatment effect may not be represented comprehensively based on a single measure. Thus, multiple measures of the treatment effect should be pre-specified as sensitivity analyses to evaluate the totality of the data.
Submitted 20 September, 2019;
originally announced September 2019.
-
Beyond p-values: a phase II dual-criterion design with statistical significance and clinical relevance
Authors:
Satrajit Roychoudhury,
Nicolas Scheuer,
Beat Neuenschwander
Abstract:
Background: Well-designed phase II trials must have acceptable error rates relative to a pre-specified success criterion, usually a statistically significant p-value. Such standard designs may not always suffice from a clinical perspective because clinical relevance may call for more. For example, proof-of-concept in phase II often requires not only statistical significance but also a sufficiently large effect estimate.
Purpose: We propose dual-criterion designs to complement statistical significance with clinical relevance, discuss their methodology, and illustrate their implementation in phase II.
Methods: Clinical relevance requires the effect estimate to pass a clinically motivated threshold (the decision value). In contrast to standard designs, the required effect estimate is an explicit design input whereas study power is implicit. The sample size for a dual-criterion design needs careful considerations of the study's operating characteristics (type-I error, power).
Results: Dual-criterion designs are discussed for a randomized controlled and a single-arm phase II trial, including decision criteria, sample size calculations, decisions under various data scenarios, and operating characteristics. The designs facilitate GO/NO-GO decisions due to their complementary statistical-clinical criterion.
Conclusion: To improve evidence-based decision-making, a formal yet transparent quantitative framework is important. Dual-criterion designs offer an appealing statistical-clinical compromise, which may be preferable to standard designs if evidence against the null hypothesis alone does not suffice for an efficacy claim.
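The dual-criterion idea described in this abstract can be sketched for a single-arm trial with a binary response endpoint. This is a minimal sketch under stated assumptions, not the authors' design: it uses an exact one-sided binomial test for the significance criterion, and the function names, the 2.5% alpha level and the example numbers are hypothetical.

```python
from math import comb


def binomial_upper_tail(n, k, p):
    """P(X >= k) for X ~ Binomial(n, p), computed exactly."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


def dual_criterion_go(responders, n, null_rate, decision_value, alpha=0.025):
    """Return True only if both criteria are met.

    Criterion 1 (statistical significance): one-sided exact p-value <= alpha.
    Criterion 2 (clinical relevance): observed response rate >= decision value.
    """
    significant = binomial_upper_tail(n, responders, null_rate) <= alpha
    relevant = responders / n >= decision_value
    return significant and relevant


if __name__ == "__main__":
    # Hypothetical scenario: null response rate 20%, decision value 35%.
    print(dual_criterion_go(20, 40, 0.20, 0.35))   # both criteria met
    print(dual_criterion_go(60, 200, 0.20, 0.35))  # significant, but estimate below 35%
```

In the second call the result is statistically significant against the null rate, yet the observed rate of 30% falls short of the decision value, so the dual criterion returns no GO. That is the situation the abstract describes, where evidence against the null hypothesis alone does not suffice.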
Submitted 21 August, 2019;
originally announced August 2019.
-
Bayesian leveraging of historical control data for a clinical trial with time-to-event endpoint
Authors:
Satrajit Roychoudhury,
Beat Neuenschwander
Abstract:
The recent 21st Century Cures Act propagates innovations to accelerate the discovery, development, and delivery of 21st century cures. It includes the broader application of Bayesian statistics and the use of evidence from clinical expertise. An example of the latter is the use of trial-external (or historical) data, which promises more efficient or ethical trial designs. We propose a Bayesian meta-analytic approach to leveraging historical data for time-to-event endpoints, which are common in oncology and cardiovascular diseases. The approach is based on a robust hierarchical model for piecewise exponential data. It allows for various degrees of between-trial heterogeneity and for leveraging individual as well as aggregate data. An ovarian carcinoma trial and a non-small-cell cancer trial illustrate methodological and practical aspects of leveraging historical data for the analysis and design of time-to-event trials.
Submitted 6 February, 2020; v1 submitted 20 August, 2019;
originally announced August 2019.
-
Robust Design and Analysis of Clinical Trials With Non-proportional Hazards: A Straw Man Guidance from a Cross-pharma Working Group
Authors:
Satrajit Roychoudhury,
Keaven M Anderson,
Jiabu Ye,
Pralay Mukhopadhyay
Abstract:
Loss of power and clear description of treatment differences are key issues in designing and analyzing a clinical trial where non-proportional hazards are a possibility. A log-rank test may be very inefficient and interpretation of the hazard ratio estimated using Cox regression is potentially problematic. In this case, the current ICH E9 (R1) addendum would suggest designing a trial with a clinically relevant estimand, e.g., expected life gain. This approach considers appropriate analysis methods for supporting the chosen estimand. However, such an approach is case specific and may suffer from a lack of power for important choices of the underlying alternate hypothesis distribution. On the other hand, there may be a desire to have robust power under different deviations from proportional hazards. Also, we would contend that no single number adequately describes the treatment effect under non-proportional hazards scenarios. The cross-pharma working group has proposed a combination test to provide robust power under a variety of alternative hypotheses. These can be specified for primary analysis at the design stage, and methods appropriately accounting for combination test correlations are efficient for a variety of scenarios. We have provided design and analysis considerations based on a combination test under different non-proportional hazard types and present a straw man proposal for practitioners. The proposals are illustrated with a real-life example and simulation.
Submitted 12 January, 2021; v1 submitted 19 August, 2019;
originally announced August 2019.