Showing 1–6 of 6 results for author: Salerno, S

  1. arXiv:2406.19597

    stat.ME

    What's the Weight? Estimating Controlled Outcome Differences in Complex Surveys for Health Disparities Research

    Authors: Stephen Salerno, Emily K. Roberts, Belinda L. Needham, Tyler H. McCormick, Bhramar Mukherjee, Xu Shi

    Abstract: A basic descriptive question in statistics often asks whether there are differences in mean outcomes between groups based on levels of a discrete covariate (e.g., racial disparities in health outcomes). However, when this categorical covariate of interest is correlated with other factors related to the outcome, direct comparisons may lead to biased estimates and invalid inferential conclusions wit…

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2405.13926

    stat.ME econ.EM

    Some models are useful, but for how long?: A decision theoretic approach to choosing when to refit large-scale prediction models

    Authors: Kentaro Hoffman, Stephen Salerno, Jeff Leek, Tyler McCormick

    Abstract: Large-scale prediction models (typically using tools from artificial intelligence, AI, or machine learning, ML) are increasingly ubiquitous across a variety of industries and scientific domains. Such methods are often paired with detailed data from sources such as electronic health records, wearable sensors, and omics data (high-throughput technology used to understand biology). Despite their util…

    Submitted 22 May, 2024; originally announced May 2024.

  3. arXiv:2404.02438

    cs.CL cs.LG stat.ML

    From Narratives to Numbers: Valid Inference Using Language Model Predictions from Verbal Autopsy Narratives

    Authors: Shuxian Fan, Adam Visokay, Kentaro Hoffman, Stephen Salerno, Li Liu, Jeffrey T. Leek, Tyler H. McCormick

    Abstract: In settings where most deaths occur outside the healthcare system, verbal autopsies (VAs) are a common tool to monitor trends in causes of death (COD). VAs are interviews with a surviving caregiver or relative that are used to predict the decedent's COD. Turning VAs into actionable insights for researchers and policymakers requires two steps (i) predicting likely COD using the VA interview and (ii…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 12 pages, 7 figures

  4. arXiv:2401.08702

    stat.ME cs.LG

    Do We Really Even Need Data?

    Authors: Kentaro Hoffman, Stephen Salerno, Awan Afiaz, Jeffrey T. Leek, Tyler H. McCormick

    Abstract: As artificial intelligence and machine learning tools become more accessible, and scientists face new obstacles to data collection (e.g. rising costs, declining survey response rates), researchers increasingly use predictions from pre-trained algorithms as outcome variables. Though appealing for financial and logistical reasons, using standard tools for inference can misrepresent the association b…

    Submitted 2 February, 2024; v1 submitted 14 January, 2024; originally announced January 2024.

  5. arXiv:2212.12028

    stat.ML cs.LG

    Deep Learning of Semi-Competing Risk Data via a New Neural Expectation-Maximization Algorithm

    Authors: Stephen Salerno, Yi Li

    Abstract: Prognostication for lung cancer, a leading cause of mortality, remains a complex task, as it needs to quantify the associations of risk factors and health events spanning a patient's entire life. One challenge is that an individual's disease course involves non-terminal (e.g., disease progression) and terminal (e.g., death) events, which form semi-competing relationships. Our motivation comes from…

    Submitted 22 December, 2022; originally announced December 2022.

  6. High-Dimensional Survival Analysis: Methods and Applications

    Authors: Stephen Salerno, Yi Li

    Abstract: In the era of precision medicine, time-to-event outcomes such as time to death or progression are routinely collected, along with high-throughput covariates. These high-dimensional data defy classical survival regression models, which are either infeasible to fit or likely to incur low predictability due to over-fitting. To overcome this, recent emphasis has been placed on developing novel approac…

    Submitted 5 May, 2022; originally announced May 2022.