Skip to main content

Showing 1–15 of 15 results for author: Chen, G H

Searching in archive stat. Search in all archives.
.
  1. arXiv:2312.05933  [pdf, other

    cs.LG cs.AI stat.ML

    Temporal Supervised Contrastive Learning for Modeling Patient Risk Progression

    Authors: Shahriar Noroozizadeh, Jeremy C. Weiss, George H. Chen

    Abstract: We consider the problem of predicting how the likelihood of an outcome of interest for a patient changes over time as we observe more of the patient data. To solve this problem, we propose a supervised contrastive learning framework that learns an embedding representation for each time step of a patient time series. Our framework learns the embedding space to have the following properties: (1) nea… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: Machine Learning for Health (ML4H 2023)

    Journal ref: In Machine Learning for Health (ML4H), pages 403-427. PMLR, 2023

  2. arXiv:2305.06862  [pdf, other

    stat.ML cs.HC cs.LG

    A General Framework for Visualizing Embedding Spaces of Neural Survival Analysis Models Based on Angular Information

    Authors: George H. Chen

    Abstract: We propose a general framework for visualizing any intermediate embedding representation used by any neural survival analysis model. Our framework is based on so-called anchor directions in an embedding space. We show how to estimate these anchor directions using clustering or, alternatively, using user-supplied "concepts" defined by collections of raw inputs (e.g., feature vectors all from female… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

    Comments: Conference on Health, Inference, and Learning (CHIL 2023)

  3. arXiv:2211.10508  [pdf, other

    stat.ML cs.LG

    Distributionally Robust Survival Analysis: A Novel Fairness Loss Without Demographics

    Authors: Shu Hu, George H. Chen

    Abstract: We propose a general approach for training survival analysis models that minimizes a worst-case error across all subpopulations that are large enough (occurring with at least a user-specified minimum probability). This approach uses a training loss function that does not know any demographic information to treat as sensitive. Despite this, we demonstrate that our proposed approach often scores bet… ▽ More

    Submitted 18 November, 2022; originally announced November 2022.

    Comments: Machine Learning for Health (ML4H 2022)

  4. arXiv:2206.10477  [pdf, other

    cs.LG stat.ML

    Survival Kernets: Scalable and Interpretable Deep Kernel Survival Analysis with an Accuracy Guarantee

    Authors: George H. Chen

    Abstract: Kernel survival analysis models estimate individual survival distributions with the help of a kernel function, which measures the similarity between any two data points. Such a kernel function can be learned using deep kernel survival models. In this paper, we present a new deep kernel survival model called a survival kernet, which scales to large datasets in a manner that is amenable to model int… ▽ More

    Submitted 19 February, 2024; v1 submitted 21 June, 2022; originally announced June 2022.

    Comments: Accepted at the Journal of Machine Learning Research; compared to the previous arXiv version, this draft includes some minor clarifications/edits

  5. arXiv:2201.00382  [pdf, other

    cs.LG cs.DB stat.AP stat.ML

    ECOD: Unsupervised Outlier Detection Using Empirical Cumulative Distribution Functions

    Authors: Zheng Li, Yue Zhao, Xiyang Hu, Nicola Botta, Cezar Ionescu, George H. Chen

    Abstract: Outlier detection refers to the identification of data points that deviate from a general data distribution. Existing unsupervised approaches often suffer from high computational cost, complex hyperparameter tuning, and limited interpretability, especially when working with large, high-dimensional datasets. To address these issues, we present a simple yet effective algorithm called ECOD (Empirical… ▽ More

    Submitted 24 August, 2022; v1 submitted 2 January, 2022; originally announced January 2022.

    Comments: Accepted to IEEE Transactions on Knowledge and Data Engineering (TKDE) with fixed data statistics. Zheng Li and Yue Zhao contributed equally. Code is available in PyOD library at https://github.com/yzhao062/pyod

  6. arXiv:2007.12975  [pdf, other

    stat.ML cs.LG

    Deep Kernel Survival Analysis and Subject-Specific Survival Time Prediction Intervals

    Authors: George H. Chen

    Abstract: Kernel survival analysis methods predict subject-specific survival curves and times using information about which training subjects are most similar to a test subject. These most similar training subjects could serve as forecast evidence. How similar any two subjects are is given by the kernel function. In this paper, we present the first neural network framework that learns which kernel functions… ▽ More

    Submitted 25 July, 2020; originally announced July 2020.

    Comments: Machine Learning for Healthcare conference (MLHC 2020)

  7. arXiv:2007.07796  [pdf, other

    cs.LG stat.ML

    Neural Topic Models with Survival Supervision: Jointly Predicting Time-to-Event Outcomes and Learning How Clinical Features Relate

    Authors: George H. Chen, Linhong Li, Ren Zuo, Amanda Coston, Jeremy C. Weiss

    Abstract: We present a neural network framework for learning a survival model to predict a time-to-event outcome while simultaneously learning a topic model that reveals feature relationships. In particular, we model each subject as a distribution over "topics", where a topic could, for instance, correspond to an age group, a disorder, or a disease. The presence of a topic in a subject means that specific c… ▽ More

    Submitted 4 June, 2024; v1 submitted 15 July, 2020; originally announced July 2020.

    Comments: Accepted at the Artificial Intelligence in Medicine journal; preliminary conference version (see earlier arXiv draft) appeared in the International Conference on Artificial Intelligence in Medicine (AIME 2020)

  8. arXiv:2006.01898  [pdf, other

    stat.AP cs.LG stat.ML

    Predicting Mortality Risk in Viral and Unspecified Pneumonia to Assist Clinicians with COVID-19 ECMO Planning

    Authors: Helen Zhou, Cheng Cheng, Zachary C. Lipton, George H. Chen, Jeremy C. Weiss

    Abstract: Respiratory complications due to coronavirus disease COVID-19 have claimed tens of thousands of lives in 2020. Many cases of COVID-19 escalate from Severe Acute Respiratory Syndrome (SARS-CoV-2) to viral pneumonia to acute respiratory distress syndrome (ARDS) to death. Extracorporeal membranous oxygenation (ECMO) is a life-sustaining oxygenation and ventilation therapy that may be used for patient… ▽ More

    Submitted 2 June, 2020; originally announced June 2020.

  9. arXiv:2006.00707  [pdf, other

    econ.EM cs.CL stat.AP

    Influence via Ethos: On the Persuasive Power of Reputation in Deliberation Online

    Authors: Emaad Manzoor, George H. Chen, Dokyun Lee, Michael D. Smith

    Abstract: Deliberation among individuals online plays a key role in sha** the opinions that drive votes, purchases, donations and other critical offline behavior. Yet, the determinants of opinion-change via persuasion in deliberation online remain largely unexplored. Our research examines the persuasive power of $\textit{ethos}$ -- an individual's "reputation" -- using a 7-year panel of over a million deb… ▽ More

    Submitted 1 June, 2020; originally announced June 2020.

  10. arXiv:1910.12774  [pdf, other

    stat.ML cs.LG

    Missing Not at Random in Matrix Completion: The Effectiveness of Estimating Missingness Probabilities Under a Low Nuclear Norm Assumption

    Authors: Wei Ma, George H. Chen

    Abstract: Matrix completion is often applied to data with entries missing not at random (MNAR). For example, consider a recommendation system where users tend to only reveal ratings for items they like. In this case, a matrix completion method that relies on entries being revealed at uniformly sampled row and column indices can yield overly optimistic predictions of unseen user ratings. Recently, various pa… ▽ More

    Submitted 29 October, 2019; v1 submitted 28 October, 2019; originally announced October 2019.

    Comments: Advances in Neural Information Processing Systems (NeurIPS 2019)

  11. arXiv:1905.05285  [pdf, other

    stat.ML cs.LG

    Nearest Neighbor and Kernel Survival Analysis: Nonasymptotic Error Bounds and Strong Consistency Rates

    Authors: George H. Chen

    Abstract: We establish the first nonasymptotic error bounds for Kaplan-Meier-based nearest neighbor and kernel survival probability estimators where feature vectors reside in metric spaces. Our bounds imply rates of strong consistency for these nonparametric estimators and, up to a log factor, match an existing lower bound for conditional CDF estimation. Our proof strategy also yields nonasymptotic guarante… ▽ More

    Submitted 14 September, 2022; v1 submitted 13 May, 2019; originally announced May 2019.

    Comments: International Conference on Machine Learning (ICML 2019); this draft includes minor corrections

  12. arXiv:1712.00535  [pdf, ps, other

    stat.ML

    Survival-Supervised Topic Modeling with Anchor Words: Characterizing Pancreatitis Outcomes

    Authors: George H. Chen, Jeremy C. Weiss

    Abstract: We introduce a new approach for topic modeling that is supervised by survival analysis. Specifically, we build on recent work on unsupervised topic modeling with so-called anchor words by providing supervision through an elastic-net regularized Cox proportional hazards model. In short, an anchor word being present in a document provides strong indication that the document is partially about a spec… ▽ More

    Submitted 7 December, 2017; v1 submitted 1 December, 2017; originally announced December 2017.

    Comments: NIPS Workshop on Machine Learning for Health 2017, fixed some equation typos, some minor wording edits

  13. arXiv:1411.6591  [pdf, other

    cs.LG cs.IR stat.ML

    A Latent Source Model for Online Collaborative Filtering

    Authors: Guy Bresler, George H. Chen, Devavrat Shah

    Abstract: Despite the prevalence of collaborative filtering in recommendation systems, there has been little theoretical development on why and how well it works, especially in the "online" setting, where items are recommended to users over time. We address this theoretical gap by introducing a model for online recommendation systems, cast item recommendation under the model as a learning problem, and analy… ▽ More

    Submitted 31 October, 2014; originally announced November 2014.

    Comments: Advances in Neural Information Processing Systems (NIPS 2014)

  14. arXiv:1303.5508  [pdf, other

    cs.CV cs.LG stat.ML

    Sparse Projections of Medical Images onto Manifolds

    Authors: George H. Chen, Christian Wachinger, Polina Golland

    Abstract: Manifold learning has been successfully applied to a variety of medical imaging problems. Its use in real-time applications requires fast projection onto the low-dimensional space. To this end, out-of-sample extensions are applied by constructing an interpolation function that maps from the input space to the low-dimensional manifold. Commonly used approaches such as the Nyström extension and kern… ▽ More

    Submitted 28 March, 2013; v1 submitted 21 March, 2013; originally announced March 2013.

    Comments: International Conference on Information Processing in Medical Imaging (IPMI 2013)

  15. arXiv:1302.3639  [pdf, other

    stat.ML cs.LG cs.SI

    A Latent Source Model for Nonparametric Time Series Classification

    Authors: George H. Chen, Stanislav Nikolov, Devavrat Shah

    Abstract: For classifying time series, a nearest-neighbor approach is widely used in practice with performance often competitive with or better than more elaborate methods such as neural networks, decision trees, and support vector machines. We develop theoretical justification for the effectiveness of nearest-neighbor-like classification of time series. Our guiding hypothesis is that in many applications,… ▽ More

    Submitted 12 December, 2013; v1 submitted 14 February, 2013; originally announced February 2013.

    Comments: Advances in Neural Information Processing Systems (NIPS 2013)