-
Credal Learning Theory
Authors:
Michele Caprio,
Maryam Sultana,
Eleni Elia,
Fabio Cuzzolin
Abstract:
Statistical learning theory is the foundation of machine learning, providing theoretical bounds for the risk of models learnt from a (single) training set, assumed to issue from an unknown probability distribution. In actual deployment, however, the data distribution may (and often does) vary, causing domain adaptation/generalization issues. In this paper we lay the foundations for a `credal' theo…
▽ More
Statistical learning theory is the foundation of machine learning, providing theoretical bounds for the risk of models learnt from a (single) training set, assumed to issue from an unknown probability distribution. In actual deployment, however, the data distribution may (and often does) vary, causing domain adaptation/generalization issues. In this paper we lay the foundations for a `credal' theory of learning, using convex sets of probabilities (credal sets) to model the variability in the data-generating distribution. Such credal sets, we argue, may be inferred from a finite sample of training sets. Bounds are derived for the case of finite hypotheses spaces (both assuming realizability or not) as well as infinite model spaces, which directly generalize classical results.
△ Less
Submitted 2 May, 2024; v1 submitted 1 February, 2024;
originally announced February 2024.
-
A regression-based method for detecting publication bias in multivariate meta-analysis
Authors:
Chuan Hong,
**g Zhang,
Yang Li,
Elena Elia,
Richard Riley,
Yong Chen
Abstract:
Publication bias occurs when the publication of research results depends not only on the quality of the research but also on its nature and direction. The consequence is that published studies may not be truly representative of all valid studies undertaken, and this bias may threaten the validity of systematic reviews and meta-analyses - on which evidence-based medicine increasingly relies. Multiv…
▽ More
Publication bias occurs when the publication of research results depends not only on the quality of the research but also on its nature and direction. The consequence is that published studies may not be truly representative of all valid studies undertaken, and this bias may threaten the validity of systematic reviews and meta-analyses - on which evidence-based medicine increasingly relies. Multivariate meta-analysis has recently received increasing attention for its ability reducing potential bias and improving statistical efficiency by borrowing information across outcomes. However, detecting and accounting for publication bias are more challenging in multivariate meta-analysis setting because some studies may be completely unpublished whereas some studies may selectively report part of multiple outcomes. In this paper, we propose a score test for jointly testing publication bias for multiple outcomes, which is novel to the multivariate setting. The proposed test is a natural multivariate extension of the univariate Egger's test, and can handle the above mentioned scenarios simultaneously, It accounts for correlations among multivariate outcomes, while allowing different types of outcomes, and can borrow information across outcomes. The proposed test is shown to be more powerful than the Egger's test, Begg's test and Trim and Fill method through simulation studies. Two data analyses are given to illustrate the performance of the proposed test in practice.
△ Less
Submitted 11 February, 2020;
originally announced February 2020.
-
Combining tumour response and progression free survival as surrogate endpoints for overall survival in advanced colorectal cancer
Authors:
Eleni G. Elia,
Nicolas Städler,
Oriana Ciani,
Rod S. Taylor,
Sylwia Bujkiewicz
Abstract:
Progression free survival (PFS) and tumour response (TR) have been investigated as surrogate endpoints for overall survival (OS) in advanced colorectal cancer (aCRC), however their validity has been shown to be suboptimal. In recent years, meta-analytic methods allowing for use of multiple surrogate endpoints jointly have been proposed. The aim of this research was to assess if PFS and TR used joi…
▽ More
Progression free survival (PFS) and tumour response (TR) have been investigated as surrogate endpoints for overall survival (OS) in advanced colorectal cancer (aCRC), however their validity has been shown to be suboptimal. In recent years, meta-analytic methods allowing for use of multiple surrogate endpoints jointly have been proposed. The aim of this research was to assess if PFS and TR used jointly as surrogate endpoints to OS improve their predictive value. Data were obtained from a systematic review of randomised controlled trials investigating effectiveness of different pharmacological therapies in aCRC: systemic chemotherapies, anti-epidermal growth factor receptor therapies, anti-angiogenic agents, other multi-targeted antifolate treatments and intra-hepatic arterial chemotherapy. Multivariate meta-analysis was used to model the association patterns between treatment effects on the surrogate endpoints (PFS, TR) and the final outcome (OS). Analysis of 33 trials which reported treatment effects on all three outcomes showed reasonably strong association between treatment effects on PFS and OS. A weak surrogate relationship was noted between the treatment effects on TR and OS. Modelling the two surrogate endpoints, TR and PFS, jointly as predictors of OS gave no marked improvement in neither surrogacy patterns nor the precision of predicted treatment effect in the cross-validation procedure. When investigating subgroups of therapy, only small improvement in precision of predicted treatment effects on the final outcome in studies investigating anti-angiogenic therapy was noted. Overall, the simultaneous modelling of two surrogate endpoints did not lead to improvement in association between treatment effects on surrogate and final endpoints in aCRC.
△ Less
Submitted 9 September, 2018;
originally announced September 2018.