Skip to main content

Showing 1–7 of 7 results for author: Vallejos, C A

.
  1. arXiv:2406.03161  [pdf, other

    cs.LG cs.CY

    Ethical considerations of use of hold-out sets in clinical prediction model management

    Authors: Louis Chislett, Louis JM Aslett, Alisha R Davies, Catalina A Vallejos, James Liley

    Abstract: Clinical prediction models are statistical or machine learning models used to quantify the risk of a certain health outcome using patient data. These can then inform potential interventions on patients, causing an effect called performative prediction: predictions inform interventions which influence the outcome they were trying to predict, leading to a potential underestimation of risk in some pa… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  2. arXiv:2212.05157  [pdf, ps, other

    stat.ME

    A review on competing risks methods for survival analysis

    Authors: Karla Monterrubio-Gómez, Nathan Constantine-Cooke, Catalina A. Vallejos

    Abstract: When modelling competing risks survival data, several techniques have been proposed in both the statistical and machine learning literature. State-of-the-art methods have extended classical approaches with more flexible assumptions that can improve predictive performance, allow high dimensional data and missing values, among others. Despite this, modern approaches have not been widely employed in… ▽ More

    Submitted 9 December, 2022; originally announced December 2022.

    Comments: 22 pages, 2 tables

    MSC Class: 62N02

  3. arXiv:2010.11530  [pdf, other

    stat.ML cs.LG

    Model updating after interventions paradoxically introduces bias

    Authors: James Liley, Samuel R Emerson, Bilal A Mateen, Catalina A Vallejos, Louis J M Aslett, Sebastian J Vollmer

    Abstract: Machine learning is increasingly being used to generate prediction models for use in a number of real-world settings, from credit risk assessment to clinical decision support. Recent discussions have highlighted potential problems in the updating of a predictive score for a binary outcome when an existing predictive score forms part of the standard workflow, driving interventions. In this setting,… ▽ More

    Submitted 22 February, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: Sections of this preprint on 'Successive adjuvancy' (section 4, theorem 2, figures 4,5, and associated discussions) were not included in the originally submitted version of this paper due to length. This material does not appear in the published version of this manuscript, and the reader should be aware that these sections did not undergo peer review

  4. arXiv:1908.08737  [pdf, other

    cs.CR

    Design choices for productive, secure, data-intensive research at scale in the cloud

    Authors: Diego Arenas, Jon Atkins, Claire Austin, David Beavan, Alvaro Cabrejas Egea, Steven Carlysle-Davies, Ian Carter, Rob Clarke, James Cunningham, Tom Doel, Oliver Forrest, Evelina Gabasova, James Geddes, James Hetherington, Radka Jersakova, Franz Kiraly, Catherine Lawrence, Jules Manser, Martin T. O'Reilly, James Robinson, Helen Sherwood-Taylor, Serena Tierney, Catalina A. Vallejos, Sebastian Vollmer, Kirstie Whitaker

    Abstract: We present a policy and process framework for secure environments for productive data science research projects at scale, by combining prevailing data security threat and risk profiles into five sensitivity tiers, and, at each tier, specifying recommended policies for data classification, data ingress, software ingress, data egress, user access, user device control, and analysis environments. By p… ▽ More

    Submitted 15 September, 2019; v1 submitted 23 August, 2019; originally announced August 2019.

  5. arXiv:1809.08024  [pdf, other

    stat.ME stat.AP

    Shrinkage estimation of large covariance matrices using multiple shrinkage targets

    Authors: Harry Gray, Gwenaël G. R. Leday, Catalina A. Vallejos, Sylvia Richardson

    Abstract: Linear shrinkage estimators of a covariance matrix --- defined by a weighted average of the sample covariance matrix and a pre-specified shrinkage target matrix --- are popular when analysing high-throughput molecular data. However, their performance strongly relies on an appropriate choice of target matrix. This paper introduces a more flexible class of linear shrinkage estimators that can accomm… ▽ More

    Submitted 21 September, 2018; originally announced September 2018.

  6. arXiv:1406.6728  [pdf, other

    stat.AP

    Bayesian Survival Modelling of University Outcomes

    Authors: Catalina A. Vallejos, Mark F. J. Steel

    Abstract: The aim of this paper is to model the length of registration at university and its associated academic outcome for undergraduate students at the Pontificia Universidad Católica de Chile. Survival time is defined as the time until the end of the enrollment period, which can relate to different reasons - graduation or two types of dropout - that are driven by different processes. Hence, a competing… ▽ More

    Submitted 25 June, 2014; originally announced June 2014.

  7. arXiv:1311.1454  [pdf, ps, other

    stat.ME math.ST stat.AP

    On posterior propriety for the Student-$t$ linear regression model under Jeffreys priors

    Authors: Catalina A. Vallejos, Mark F. J. Steel

    Abstract: Regression models with fat-tailed error terms are an increasingly popular choice to obtain more robust inference to the presence of outlying observations. This article focuses on Bayesian inference for the Student-$t$ linear regression model under objective priors that are based on the Jeffreys rule. Posterior propriety results presented in Fonseca et al. (2008) are revisited and corrected. In par… ▽ More

    Submitted 8 November, 2013; v1 submitted 6 November, 2013; originally announced November 2013.

    Comments: minor editorial changes in this version