Showing 1–7 of 7 results for author: Goldstein, B A

  1. arXiv:2304.03779

    cs.LG cs.AI cs.CY

    A roadmap to fair and trustworthy prediction model validation in healthcare

    Authors: Yilin Ning, Victor Volovici, Marcus Eng Hock Ong, Benjamin Alan Goldstein, Nan Liu

    Abstract: A prediction model is most useful if it generalizes beyond the development data with external validations, but to what extent should it generalize remains unclear. In practice, prediction models are externally validated using data from very different settings, including populations from other health systems or countries, with predictably poor results. This may not be a fair reflection of the perfo…

    Submitted 7 April, 2023; originally announced April 2023.

    Comments: 12 pages, 2 figures

  2. arXiv:2302.01536

    cs.CL cs.LG stat.ML

    Using natural language processing and structured medical data to phenotype patients hospitalized due to COVID-19

    Authors: Feier Chang, Jay Krishnan, Jillian H Hurst, Michael E Yarrington, Deverick J Anderson, Emily C O'Brien, Benjamin A Goldstein

    Abstract: To identify patients who are hospitalized because of COVID-19 as opposed to those who were admitted for other indications, we compared the performance of different computable phenotype definitions for COVID-19 hospitalizations that use different types of data from the electronic health records (EHR), including structured EHR data elements, provider notes, or a combination of both data types. And c…

    Submitted 2 February, 2023; originally announced February 2023.

    Comments: 21 pages, 2 figures, 3 tables, 1 supplemental figure, 2 supplemental tables

  3. arXiv:2110.02484

    cs.LG cs.HC

    Shapley variable importance clouds for interpretable machine learning

    Authors: Yilin Ning, Marcus Eng Hock Ong, Bibhas Chakraborty, Benjamin Alan Goldstein, Daniel Shu Wei Ting, Roger Vaughan, Nan Liu

    Abstract: Interpretable machine learning has been focusing on explaining final models that optimize performance. The current state-of-the-art is the Shapley additive explanations (SHAP) that locally explains variable impact on individual predictions, and it is recently extended for a global assessment across the dataset. Recently, Dong and Rudin proposed to extend the investigation to models from the same c…

    Submitted 5 October, 2021; originally announced October 2021.

  4. AutoScore-Imbalance: An interpretable machine learning tool for development of clinical scores with rare events data

    Authors: Han Yuan, Feng Xie, Marcus Eng Hock Ong, Yilin Ning, Marcel Lucas Chee, Seyed Ehsan Saffari, Hairil Rizal Abdullah, Benjamin Alan Goldstein, Bibhas Chakraborty, Nan Liu

    Abstract: Background: Medical decision-making impacts both individual and public health. Clinical scores are commonly used among a wide variety of decision-making models for determining the degree of disease deterioration at the bedside. AutoScore was proposed as a useful clinical score generator based on machine learning and a generalized linear model. Its current framework, however, still leaves room for…

    Submitted 13 July, 2021; originally announced July 2021.

  5. AutoScore-Survival: Developing interpretable machine learning-based time-to-event scores with right-censored survival data

    Authors: Feng Xie, Yilin Ning, Han Yuan, Benjamin Alan Goldstein, Marcus Eng Hock Ong, Nan Liu, Bibhas Chakraborty

    Abstract: Scoring systems are highly interpretable and widely used to evaluate time-to-event outcomes in healthcare research. However, existing time-to-event scores are predominantly created ad-hoc using a few manually selected variables based on clinician's knowledge, suggesting an unmet need for a robust and efficient generic score-generating method. AutoScore was previously developed as an interpretabl…

    Submitted 13 June, 2021; originally announced June 2021.

  6. arXiv:2009.08541

    stat.ML cs.LG

    Variational Disentanglement for Rare Event Modeling

    Authors: Zidi Xiu, Chenyang Tao, Michael Gao, Connor Davis, Benjamin A. Goldstein, Ricardo Henao

    Abstract: Combining the increasing availability and abundance of healthcare data and the current advances in machine learning methods have created renewed opportunities to improve clinical decision support systems. However, in healthcare risk prediction applications, the proportion of cases with the condition (label) of interest is often very low relative to the available sample size. Though very prevalent…

    Submitted 16 June, 2021; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: Accepted to AAAI 2021

  7. arXiv:2003.04430

    stat.ML cs.LG stat.AP

    Variational Learning of Individual Survival Distributions

    Authors: Zidi Xiu, Chenyang Tao, Benjamin A. Goldstein, Ricardo Henao

    Abstract: The abundance of modern health data provides many opportunities for the use of machine learning techniques to build better statistical models to improve clinical decision making. Predicting time-to-event distributions, also known as survival analysis, plays a key role in many clinical applications. We introduce a variational time-to-event prediction model, named Variational Survival Inference (VSI…

    Submitted 13 December, 2020; v1 submitted 9 March, 2020; originally announced March 2020.