Search | arXiv e-print repository

Optimal Bayesian stepped-wedge cluster randomised trial designs for binary outcome data

Authors: Laura Etfer, James M. S. Wason, Michael J. Grayling

Abstract: Under a generalised estimating equation analysis approach, approximate design theory is used to determine Bayesian D-optimal designs. For two examples, considering simple exchangeable and exponential decay correlation structures, we compare the efficiency of identified optimal designs to balanced stepped-wedge designs and corresponding stepped-wedge designs determined by optimising using a normal… ▽ More Under a generalised estimating equation analysis approach, approximate design theory is used to determine Bayesian D-optimal designs. For two examples, considering simple exchangeable and exponential decay correlation structures, we compare the efficiency of identified optimal designs to balanced stepped-wedge designs and corresponding stepped-wedge designs determined by optimising using a normal approximation approach. The dependence of the Bayesian D-optimal designs on the assumed correlation structure is explored; for the considered settings, smaller decay in the correlation between outcomes across time periods, along with larger values of the intra-cluster correlation, leads to designs closer to a balanced design being optimal. Unlike for normal data, it is shown that the optimal design need not be centro-symmetric in the binary outcome case. The efficiency of the Bayesian D-optimal design relative to a balanced design can be large, but situations are demonstrated in which the advantages are small. Similarly, the optimal design from a normal approximation approach is often not much less efficient than the Bayesian D-optimal design. Bayesian D-optimal designs can be readily identified for stepped-wedge cluster randomised trials with binary outcome data. In certain circumstances, principally ones with strong time period effects, they will indicate that a design unlikely to have been identified by previous methods may be substantially more efficient. However, they require a larger number of assumptions than existing optimal designs, and in many situations existing theory under a normal approximation will provide an easier means of identifying an efficient design for binary outcome data. △ Less

Submitted 15 February, 2024; originally announced February 2024.

arXiv:2306.04430 [pdf, other]

Evaluating the impact of outcome delay on the efficiency of two-arm group-sequential trials

Authors: Aritra Mukherjee, Michael J. Grayling, James M. S. Wason

Abstract: Adaptive designs(AD) are a broad class of trial designs that allow preplanned modifications based on patient data providing improved efficiency and flexibility. However, a delay in observing the primary outcome variable can harm this added efficiency. In this paper, we aim to ascertain the size of such outcome delay that results in the realised efficiency gains of ADs becoming negligible compared… ▽ More Adaptive designs(AD) are a broad class of trial designs that allow preplanned modifications based on patient data providing improved efficiency and flexibility. However, a delay in observing the primary outcome variable can harm this added efficiency. In this paper, we aim to ascertain the size of such outcome delay that results in the realised efficiency gains of ADs becoming negligible compared to classical fixed sample RCTs. We measure the impact of delay by develo** formulae for the no. of overruns in 2 arm GSDs with normal data, assuming different recruitment models. The efficiency of a GSD is usually measured in terms of the expected sample size (ESS), with GSDs generally reducing the ESS compared to a standard RCT. Our formulae measures the efficiency gain from a GSD in terms of ESS reduction that is lost due to delay. We assess whether careful choice of design (e.g., altering the spacing of the IAs) can help recover the benefits of GSDs in presence of delay. We also analyse the efficiency of GSDs with respect to time to complete the trial. Comparing the expected efficiency gains, with and without consideration of delay, it is evident GSDs suffer considerable losses due to delay. Even a small delay can have a significant impact on the trial's efficiency. In contrast, even in the presence of substantial delay, a GSD will have a smaller expected time to trial completion in comparison to a simple RCT. Although the no. of stages have little influence on the efficiency losses, the timing of IAs can impact the efficiency of a GSDs with delay. Particularly, for unequally spaced IAs, pushing IAs towards latter end of the trial can be harmful for the design with delay. △ Less

Submitted 7 June, 2023; originally announced June 2023.

arXiv:2305.10174 [pdf, other]

Utilising high-dimensional data in randomised clinical trials: a review of methods and practice

Authors: Svetlana Cherlin, Theophile Bigirumurame, Michael J Grayling, Jérémie Nsengimana, Luke Ouma, Aida Santaolalla, Fang Wan, S Faye Williamson, James M S Wason

Abstract: Introduction: Even in effectively conducted randomised trials, the probability of a successful study remains relatively low. With recent advances in the next-generation sequencing technologies, there is a rapidly growing number of high-dimensional data, including genetic, molecular and phenotypic information, that have improved our understanding of driver genes, drug targets, and drug mechanisms o… ▽ More Introduction: Even in effectively conducted randomised trials, the probability of a successful study remains relatively low. With recent advances in the next-generation sequencing technologies, there is a rapidly growing number of high-dimensional data, including genetic, molecular and phenotypic information, that have improved our understanding of driver genes, drug targets, and drug mechanisms of action. The leveraging of high-dimensional data holds promise for increased success of clinical trials. Methods: We provide an overview of methods for utilising high-dimensional data in clinical trials. We also investigate the use of these methods in practice through a review of recently published randomised clinical trials that utilise high-dimensional genetic data. The review includes articles that were published between 2019 and 2021, identified through the PubMed database. Results: Out of 174 screened articles, 100 (57.5%) were randomised clinical trials that collected high-dimensional data. The most common clinical area was oncology (30%), followed by chronic diseases (28%), nutrition and ageing (18%) and cardiovascular diseases (7%). The most common types of data analysed were gene expression data (70%), followed by DNA data (21%). The most common method of analysis (36.3%) was univariable analysis. Articles that described multivariable analyses used standard statistical methods. Most of the clinical trials had two arms. Discussion: New methodological approaches are required for more efficient analysis of the increasing amount of high-dimensional data collected in randomised clinical trials. We highlight the limitations and barriers to the current use of high-dimensional data in trials, and suggest potential avenues for improvement and future work. △ Less

Submitted 5 February, 2024; v1 submitted 17 May, 2023; originally announced May 2023.

Comments: 17 pages, 3 figures, 2 tables

arXiv:2303.09548 [pdf, other]

doi 10.1093/mnras/stad841

Photometric study of the late-time near-infrared plateau in Type Ia supernovae

Authors: M. Deckers, O. Graur, K. Maguire, L. Shingles, S. J. Brennan, J. P. Anderson, J. Burke, T. -W. Chen, L. Galbany, M. J. P. Grayling, C. P. Gutiérrez, L. Harvey, D. Hiramatsu, D. A. Howell, C. Inserra, T. Killestein, C. McCully, T. E. Müller-Bravo, M. Nicholl, M. Newsome, E. Padilla Gonzalez, C. Pellegrino, G. Terreran, J. H. Terwel, M. Toy , et al. (1 additional authors not shown)

Abstract: We present an in-depth study of the late-time near-infrared plateau in Type Ia supernovae (SNe Ia), which occurs between 70-500 d. We double the existing sample of SNe Ia observed during the late-time near-infrared plateau with new observations taken with the Hubble Space Telescope, Gemini, New Technology Telescope, the 3.5m Calar Alto Telescope, and the Nordic Optical Telescope. Our sample consis… ▽ More We present an in-depth study of the late-time near-infrared plateau in Type Ia supernovae (SNe Ia), which occurs between 70-500 d. We double the existing sample of SNe Ia observed during the late-time near-infrared plateau with new observations taken with the Hubble Space Telescope, Gemini, New Technology Telescope, the 3.5m Calar Alto Telescope, and the Nordic Optical Telescope. Our sample consists of 24 nearby SNe Ia at redshift < 0.025. We are able to confirm that no plateau exists in the Ks band for most normal SNe Ia. SNe Ia with broader optical light curves at peak tend to have a higher average brightness on the plateau in J and H, most likely due to a shallower decline in the preceding 100 d. SNe Ia that are more luminous at peak also show a steeper decline during the plateau phase in H. We compare our data to state-of-the-art radiative transfer models of nebular SNe Ia in the near-infrared. We find good agreement with the sub-Mch model that has reduced non-thermal ionisation rates, but no physical justification for reducing these rates has yet been proposed. An analysis of the spectral evolution during the plateau demonstrates that the ratio of [Fe II] to [Fe III] contribution in a near-infrared filter determines the light curve evolution in said filter. We find that overluminous SNe decline slower during the plateau than expected from the trend seen for normal SNe Ia △ Less

Submitted 16 March, 2023; originally announced March 2023.

Comments: 17 pages, 8 figures, Accepted for publication in MNRAS

arXiv:2205.12227 [pdf, other]

doi 10.1093/biostatistics/kxac033

Bayesian sample size determination in basket trials borrowing information between subsets

Authors: Haiyan Zheng, Michael J. Grayling, Pavel Mozgunov, Thomas Jaki, James M. S. Wason

Abstract: Basket trials are increasingly used for the simultaneous evaluation of a new treatment in various patient subgroups under one overarching protocol. We propose a Bayesian approach to sample size determination in basket trials that permit borrowing of information between commensurate subsets. Specifically, we consider a randomised basket trial design where patients are randomly assigned to the new t… ▽ More Basket trials are increasingly used for the simultaneous evaluation of a new treatment in various patient subgroups under one overarching protocol. We propose a Bayesian approach to sample size determination in basket trials that permit borrowing of information between commensurate subsets. Specifically, we consider a randomised basket trial design where patients are randomly assigned to the new treatment or a control within each trial subset (`subtrial' for short). Closed-form sample size formulae are derived to ensure each subtrial has a specified chance of correctly deciding whether the new treatment is superior to or not better than the control by some clinically relevant difference. Given pre-specified levels of pairwise (in)commensurability, the subtrial sample sizes are solved simultaneously. The proposed Bayesian approach resembles the frequentist formulation of the problem in yielding comparable sample sizes for circumstances of no borrowing. When borrowing is enabled between commensurate subtrials, a considerably smaller trial sample size is required compared to the widely implemented approach of no borrowing. We illustrate the use of our sample size formulae with two examples based on real basket trials. A comprehensive simulation study further shows that the proposed methodology can maintain the true positive and false positive rates at desired levels. △ Less

Submitted 24 May, 2022; originally announced May 2022.

Comments: A submitted paper with 18 pages (inclusive of 3 figures and 1 table) and 9 pages of Supplementary Materials

Journal ref: Biostatistics 2022

arXiv:2012.10194 [pdf, other]

Multi-outcome trials with a generalised number of efficacious outcomes

Authors: Martin Law, Michael J. Grayling, Adrian P. Mander

Abstract: Existing multi-outcome designs focus almost entirely on evaluating whether all outcomes show evidence of efficacy or whether at least one outcome shows evidence of efficacy. While a small number of authors have provided multi-outcome designs that evaluate when a general number of outcomes show promise, these designs have been single-stage in nature only. We therefore propose two designs, of group-… ▽ More Existing multi-outcome designs focus almost entirely on evaluating whether all outcomes show evidence of efficacy or whether at least one outcome shows evidence of efficacy. While a small number of authors have provided multi-outcome designs that evaluate when a general number of outcomes show promise, these designs have been single-stage in nature only. We therefore propose two designs, of group-sequential and drop the loser form, that provide this design characteristic in a multi-stage setting. Previous such multi-outcome multi-stage designs have allowed only for a maximum of two outcomes; our designs thus also extend previous related proposals by permitting any number of outcomes. △ Less

Submitted 18 December, 2020; originally announced December 2020.

Comments: 38 pages, 11 figures, 4 tables. Submitted for the degree of Doctor of Philosophy at University of Cambridge

arXiv:2010.06567 [pdf, other]

Conditional Power and Friends: The Why and How of (Un)planned, Unblinded Sample Size Recalculations in Confirmatory Trials

Authors: Kevin Kunzmann, Michael J. Grayling, Kim M. Lee, David S. Robertson, Kaspar Rufibach, James M. S. Wason

Abstract: Adapting the final sample size of a trial to the evidence accruing during the trial is a natural way to address planning uncertainty. Designs with adaptive sample size need to account for their optional stop** to guarantee strict type-I error-rate control. A variety of different methods to maintain type-I error-rate control after unplanned changes of the initial sample size have been proposed in… ▽ More Adapting the final sample size of a trial to the evidence accruing during the trial is a natural way to address planning uncertainty. Designs with adaptive sample size need to account for their optional stop** to guarantee strict type-I error-rate control. A variety of different methods to maintain type-I error-rate control after unplanned changes of the initial sample size have been proposed in the literature. This makes interim analyses for the purpose of sample size recalculation feasible in a regulatory context. Since the sample size is usually determined via an argument based on the power of the trial, an interim analysis raises the question of how the final sample size should be determined conditional on the accrued information. Conditional power is a concept often put forward in this context. Since it depends on the unknown effect size, we take a strict estimation perspective and compare assumed conditional power, observed conditional power, and predictive power with respect to their properties as estimators of the unknown conditional power. We then demonstrate that pre-planning an interim analysis using methodology for unplanned interim analyses is ineffective and naturally leads to the concept of optimal two-stage designs. We conclude that unplanned design adaptations should only be conducted as reaction to trial-external new evidence, operational needs to violate the originally chosen design, or post hoc changes in the objective criterion. Finally, we show that commonly discussed sample size recalculation rules can lead to paradoxical outcomes and propose two alternative ways of reacting to newly emerging trial-external evidence. △ Less

Submitted 13 October, 2020; originally announced October 2020.

arXiv:2007.04168 [pdf, other]

Two-stage single-arm trials are rarely reported adequately

Authors: Michael J. Grayling, Adrian P. Mander

Abstract: Purpose: Two-stage single-arm trial designs are commonly used in phase II oncology to infer treatment effects for a binary primary outcome (e.g., tumour response). It is imperative that such studies be designed, analysed, and reported effectively. However, there is little available evidence on whether this is the case, particularly for key statistical considerations. We therefore comprehensively r… ▽ More Purpose: Two-stage single-arm trial designs are commonly used in phase II oncology to infer treatment effects for a binary primary outcome (e.g., tumour response). It is imperative that such studies be designed, analysed, and reported effectively. However, there is little available evidence on whether this is the case, particularly for key statistical considerations. We therefore comprehensively review such trials, examining in particular quality of reporting. Methods: Published oncology trials that utilised "Simon's two-stage design" over a 5 year period were identified and reviewed. Articles were evaluated on whether they reported sufficient design details, such as the required sample size, and analysis details, such as a confidence interval (CI). The articles that did not adjust their inference for the incorporation of an interim analysis were re-analysed to evaluate the impact on their reported point estimate and CI. Results: Four hundred and twenty five articles that reported the results of a single treatment arm were included. Of these, 47.5% provided the five components that ensure design reproducibility. Only 1.2% and 2.1% reported an adjusted point estimate or CI, respectively. Just 55.3% of trials provided the final stage rejection bound, indicating many trials did not test the hypothesis the design is constructed to assess. Re-analysis of the trials suggests that reported point estimates underestimated treatment effects and that reported CIs were too narrow. Conclusion: Key design details of two-stage single-arm trials are often unreported. Whilst inference is regularly performed, it is rarely done so in a way that removes the bias introduced by the interim analysis. In order to maximise their value, future studies must improve the way that they are analysed and reported. △ Less

Submitted 8 July, 2020; originally announced July 2020.

arXiv:2006.15715 [pdf, other]

doi 10.1080/00031305.2021.1901782

A review of Bayesian perspectives on sample size derivation for confirmatory trials

Authors: Kevin Kunzmann, Michael J. Grayling, Kim May Lee, David S. Robertson, Kaspar Rufibach, James M. S. Wason

Abstract: Sample size derivation is a crucial element of the planning phase of any confirmatory trial. A sample size is typically derived based on constraints on the maximal acceptable type I error rate and a minimal desired power. Here, power depends on the unknown true effect size. In practice, power is typically calculated either for the smallest relevant effect size or a likely point alternative. The fo… ▽ More Sample size derivation is a crucial element of the planning phase of any confirmatory trial. A sample size is typically derived based on constraints on the maximal acceptable type I error rate and a minimal desired power. Here, power depends on the unknown true effect size. In practice, power is typically calculated either for the smallest relevant effect size or a likely point alternative. The former might be problematic if the minimal relevant effect is close to the null, thus requiring an excessively large sample size. The latter is dubious since it does not account for the a priori uncertainty about the likely alternative effect size. A Bayesian perspective on the sample size derivation for a frequentist trial naturally emerges as a way of reconciling arguments about the relative a priori plausibility of alternative effect sizes with ideas based on the relevance of effect sizes. Many suggestions as to how such `hybrid' approaches could be implemented in practice have been put forward in the literature. However, key quantities such as assurance, probability of success, or expected power are often defined in subtly different ways in the literature. Starting from the traditional and entirely frequentist approach to sample size derivation, we derive consistent definitions for the most commonly used `hybrid' quantities and highlight connections, before discussing and demonstrating their use in the context of sample size derivation for clinical trials. △ Less

Submitted 28 June, 2020; originally announced June 2020.

Journal ref: Am. Stat., 2021, 75(4), 424--432

arXiv:1909.03017 [pdf, other]

Optimal curtailed designs for single arm phase II clinical trials

Authors: Martin Law, Michael J. Grayling, Adrian P. Mander

Abstract: In single-arm phase II oncology trials, the most popular choice of design is Simon's two-stage design, which allows early stop** at one interim analysis. However, the expected trial sample size can be reduced further by allowing curtailment. Curtailment is stop** when the final go or no-go decision is certain, so-called non-stochastic curtailment, or very likely, known as stochastic curtailmen… ▽ More In single-arm phase II oncology trials, the most popular choice of design is Simon's two-stage design, which allows early stop** at one interim analysis. However, the expected trial sample size can be reduced further by allowing curtailment. Curtailment is stop** when the final go or no-go decision is certain, so-called non-stochastic curtailment, or very likely, known as stochastic curtailment. In the context of single-arm phase II oncology trials, stochastic curtailment has previously been restricted to stop** in the second stage and/or stop** for a no-go decision only. We introduce two designs that incorporate stochastic curtailment and allow stop** after every observation, for either a go or no-go decision. We obtain optimal stop** boundaries by searching over a range of potential conditional powers, beyond which the trial will stop for a go or no-go decision. This search is novel: firstly, the search is undertaken over a range of values unique to each possible design realisation. Secondly, these values are evaluated taking into account the possibility of early stop**. Finally, each design realisation's operating characteristics are obtained exactly. The proposed designs are compared to existing designs in a real data example. They are also compared under three scenarios, both with respect to four single optimality criteria and using a loss function. The proposed designs are superior in almost all cases. Optimising for the expected sample size under either the null or alternative hypothesis, the saving compared to the popular Simon's design ranges from 22% to 55%. △ Less

Submitted 6 September, 2019; originally announced September 2019.

arXiv:1906.09178 [pdf, other]

A web application for the design of multi-arm clinical trials

Authors: Michael J Grayling, James MS Wason

Abstract: Multi-arm designs provide an effective means of evaluating several treatments within the same clinical trial. Given the large number of treatments now available for testing in many disease areas, it has been argued that their utilisation should increase. However, for any given clinical trial there are numerous possible multi-arm designs that could be used, and choosing between them can be a diffic… ▽ More Multi-arm designs provide an effective means of evaluating several treatments within the same clinical trial. Given the large number of treatments now available for testing in many disease areas, it has been argued that their utilisation should increase. However, for any given clinical trial there are numerous possible multi-arm designs that could be used, and choosing between them can be a difficult task. This task is complicated further by a lack of available easy-to-use software for designing multi-arm trials. To aid the wider implementation of multi-arm clinical trial designs, we have developed a web application for sample size calculation when using a variety of popular multiple comparison corrections. Furthermore, the application supports sample size calculation to control several varieties of power, as well as the determination of optimised arm-wise allocation ratios. It is built using the Shiny package in the R programming language, is free to access on any device with an internet browser, and requires no programming knowledge to use. The application provides the core information required by statisticians and clinicians to review the operating characteristics of a chosen multi-arm clinical trial design. We hope that it will assist with the future utilisation of such designs in practice. △ Less

Submitted 21 June, 2019; originally announced June 2019.

arXiv:1906.05603 [pdf]

A review of available software for adaptive clinical trial design

Authors: Michael J Grayling, Graham M Wheeler

Abstract: Background/Aims: The increasing expense of the drug development process has seen interest in the use of adaptive designs (ADs) grow substantially in recent years. Accordingly, much research has been conducted to identify potential barriers to increasing the use of ADs in practice, and several articles have argued that the availability of user-friendly software will be an important step in making A… ▽ More Background/Aims: The increasing expense of the drug development process has seen interest in the use of adaptive designs (ADs) grow substantially in recent years. Accordingly, much research has been conducted to identify potential barriers to increasing the use of ADs in practice, and several articles have argued that the availability of user-friendly software will be an important step in making ADs easier to implement. Therefore, in this paper we present a review of the current state of software availability for AD. Methods: We first review articles from 31 journals published in 2013-17 that relate to methodology for adaptive trials, in order to assess how often code and software for implementing novel ADs is made available at the time of publication. We contrast our findings against these journals' current policies on code distribution. Secondly, we conduct additional searches of popular code repositories, such as CRAN and GitHub, to identify further existing user-contributed software for ADs. From this, we are able to direct interested parties towards solutions for their problem of interest by classifying available code by type of adaptation. Results: Only 29% of included articles made their code available in some form. In many instances, articles published in journals that had mandatory requirements on code provision still did not make code available. There are several areas in which available software is currently limited or saturated. In particular, many packages are available to address group sequential design, but comparatively little code is present in the public domain to determine biomarker-guided ADs. Conclusions: There is much room for improvement in the provision of software alongside AD publications. Additionally, whilst progress has been made, well-established software for various types of trial adaptation remains sparsely available. △ Less

Submitted 13 June, 2019; originally announced June 2019.

Showing 1–12 of 12 results for author: Grayling, M J