Utilising high-dimensional data in randomised clinical trials: a review of methods and practice
Authors:
Svetlana Cherlin,
Theophile Bigirumurame,
Michael J Grayling,
Jérémie Nsengimana,
Luke Ouma,
Aida Santaolalla,
Fang Wan,
S Faye Williamson,
James M S Wason
Abstract:
Introduction: Even in effectively conducted randomised trials, the probability of a successful study remains relatively low. With recent advances in the next-generation sequencing technologies, there is a rapidly growing number of high-dimensional data, including genetic, molecular and phenotypic information, that have improved our understanding of driver genes, drug targets, and drug mechanisms o…
▽ More
Introduction: Even in effectively conducted randomised trials, the probability of a successful study remains relatively low. With recent advances in the next-generation sequencing technologies, there is a rapidly growing number of high-dimensional data, including genetic, molecular and phenotypic information, that have improved our understanding of driver genes, drug targets, and drug mechanisms of action. The leveraging of high-dimensional data holds promise for increased success of clinical trials. Methods: We provide an overview of methods for utilising high-dimensional data in clinical trials. We also investigate the use of these methods in practice through a review of recently published randomised clinical trials that utilise high-dimensional genetic data. The review includes articles that were published between 2019 and 2021, identified through the PubMed database. Results: Out of 174 screened articles, 100 (57.5%) were randomised clinical trials that collected high-dimensional data. The most common clinical area was oncology (30%), followed by chronic diseases (28%), nutrition and ageing (18%) and cardiovascular diseases (7%). The most common types of data analysed were gene expression data (70%), followed by DNA data (21%). The most common method of analysis (36.3%) was univariable analysis. Articles that described multivariable analyses used standard statistical methods. Most of the clinical trials had two arms. Discussion: New methodological approaches are required for more efficient analysis of the increasing amount of high-dimensional data collected in randomised clinical trials. We highlight the limitations and barriers to the current use of high-dimensional data in trials, and suggest potential avenues for improvement and future work.
△ Less
Submitted 5 February, 2024; v1 submitted 17 May, 2023;
originally announced May 2023.
Improved resection margins in breast-conserving surgery using Terahertz Pulsed imaging data
Authors:
A Santaolalla,
M Sheikh,
M Van Hemelrijck,
A Portieri,
ACC Coolen
Abstract:
New statistical methods were employed to improve the ability to distinguish benign from malignant breast tissue ex vivo in a recent study. The ultimately aim was to improve the intraoperative assessment of positive tumour margins in breast-conserving surgery (BCS), potentially reducing patient re-operation rates. A multivariate Bayesian classifier was applied to the waveform samples produced by a…
▽ More
New statistical methods were employed to improve the ability to distinguish benign from malignant breast tissue ex vivo in a recent study. The ultimately aim was to improve the intraoperative assessment of positive tumour margins in breast-conserving surgery (BCS), potentially reducing patient re-operation rates. A multivariate Bayesian classifier was applied to the waveform samples produced by a Terahertz Pulsed Imaging (TPI) handheld probe system in order to discriminate tumour from benign breast tissue, obtaining a sensitivity of 96% and specificity of 95%.
We compare these results to traditional and to state-of-the-art methods for determining resection margins. Given the general nature of the classifier, it is expected that this method can be applied to other tumour types where resection margins are also critical.
△ Less
Submitted 3 May, 2018;
originally announced May 2018.