Skip to main content

Showing 1–11 of 11 results for author: Bennett, K P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.17888  [pdf, other

    cs.CL cs.AI cs.LG

    CTBench: A Comprehensive Benchmark for Evaluating Language Model Capabilities in Clinical Trial Design

    Authors: Nafis Neehal, Bowen Wang, Shayom Debopadhaya, Soham Dan, Keerthiram Murugesan, Vibha Anand, Kristin P. Bennett

    Abstract: CTBench is introduced as a benchmark to assess language models (LMs) in aiding clinical study design. Given study-specific metadata, CTBench evaluates AI models' ability to determine the baseline features of a clinical trial (CT), which include demographic and relevant features collected at the trial's start from all participants. These baseline features, typically presented in CT publications (of… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2311.02272  [pdf, other

    cs.DC

    Enabling Cross-Language Data Integration and Scalable Analytics in Decentralized Finance

    Authors: Conor Flynn, Kristin P. Bennett, John S. Erickson, Aaron Green, Oshani Seneviratne

    Abstract: With the agile development process of most academic and corporate entities, designing a robust computational back-end system that can support their ever-changing data needs is a constantly evolving challenge. We propose the implementation of a data and language-agnostic system design that handles different data schemes and sources while subsequently providing researchers and developers a way to co… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: 10 pages

    ACM Class: C.3

  3. arXiv:2204.04353  [pdf, other

    cs.CL cs.AI cs.CY

    Should we tweet this? Generative response modeling for predicting reception of public health messaging on Twitter

    Authors: Abraham Sanders, Debjani Ray-Majumder, John S. Erickson, Kristin P. Bennett

    Abstract: The way people respond to messaging from public health organizations on social media can provide insight into public perceptions on critical health issues, especially during a global crisis such as COVID-19. It could be valuable for high-impact organizations such as the US Centers for Disease Control and Prevention (CDC) or the World Health Organization (WHO) to understand how these perceptions im… ▽ More

    Submitted 13 May, 2022; v1 submitted 8 April, 2022; originally announced April 2022.

    Comments: Accepted at ACM WebSci 2022

    ACM Class: I.2.7

  4. arXiv:2203.04462  [pdf, other

    cs.LG

    Downstream Fairness Caveats with Synthetic Healthcare Data

    Authors: Karan Bhanot, Ioana Baldini, Dennis Wei, Jiaming Zeng, Kristin P. Bennett

    Abstract: This paper evaluates synthetically generated healthcare data for biases and investigates the effect of fairness mitigation techniques on utility-fairness. Privacy laws limit access to health data such as Electronic Medical Records (EMRs) to preserve patient privacy. Albeit essential, these laws hinder research reproducibility. Synthetic data is a viable solution that can enable access to data simi… ▽ More

    Submitted 8 March, 2022; originally announced March 2022.

  5. arXiv:1911.06411  [pdf, other

    cs.LG cs.CY stat.ML

    Synthetic Event Time Series Health Data Generation

    Authors: Saloni Dash, Ritik Dutta, Isabelle Guyon, Adrien Pavao, Andrew Yale, Kristin P. Bennett

    Abstract: Synthetic medical data which preserves privacy while maintaining utility can be used as an alternative to real medical data, which has privacy costs and resource constraints associated with it. At present, most models focus on generating cross-sectional health data which is not necessarily representative of real data. In reality, medical data is longitudinal in nature, with a single patient having… ▽ More

    Submitted 27 November, 2019; v1 submitted 14 November, 2019; originally announced November 2019.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2019 - Extended Abstract

  6. arXiv:1907.04358  [pdf, other

    cs.LO q-bio.PE stat.ML

    Making Study Populations Visible through Knowledge Graphs

    Authors: Shruthi Chari, Miao Qi, Nkcheniyere N. Agu, Oshani Seneviratne, James P. McCusker, Kristin P. Bennett, Amar K. Das, Deborah L. McGuinness

    Abstract: Treatment recommendations within Clinical Practice Guidelines (CPGs) are largely based on findings from clinical trials and case studies, referred to here as research studies, that are often based on highly selective clinical populations, referred to here as study cohorts. When medical practitioners apply CPG recommendations, they need to understand how well their patient population matches the ch… ▽ More

    Submitted 9 July, 2019; originally announced July 2019.

    Comments: 16 pages, 4 figures, 1 table, accepted to the ISWC 2019 Resources Track (https://iswc2019.semanticweb.org/call-for-resources-track-papers/)

  7. arXiv:1811.11190  [pdf, other

    cs.LG cs.AI stat.ML

    Semantically-aware population health risk analyses

    Authors: Alexander New, Sabbir M. Rashid, John S. Erickson, Deborah L. McGuinness, Kristin P. Bennett

    Abstract: One primary task of population health analysis is the identification of risk factors that, for some subpopulation, have a significant association with some health condition. Examples include finding lifestyle factors associated with chronic diseases and finding genetic mutations associated with diseases in precision health. We develop a combined semantic and machine learning system that uses a hea… ▽ More

    Submitted 27 November, 2018; originally announced November 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:cs/0101200

  8. arXiv:1808.04880  [pdf, other

    stat.ML cs.LG stat.AP

    A Precision Environment-Wide Association Study of Hypertension via Supervised Cadre Models

    Authors: Alexander New, Kristin P. Bennett

    Abstract: We consider the problem in precision health of grou** people into subpopulations based on their degree of vulnerability to a risk factor. These subpopulations cannot be discovered with traditional clustering techniques because their quality is evaluated with a supervised metric: the ease of modeling a response variable over observations within them. Instead, we apply the supervised cadre model (… ▽ More

    Submitted 9 December, 2018; v1 submitted 14 August, 2018; originally announced August 2018.

    Comments: 9 pages, 5 figures

  9. arXiv:1807.07991  [pdf, other

    cs.AI

    Knowledge Integration for Disease Characterization: A Breast Cancer Example

    Authors: Oshani Seneviratne, Sabbir M. Rashid, Shruthi Chari, James P. McCusker, Kristin P. Bennett, James A. Hendler, Deborah L. McGuinness

    Abstract: With the rapid advancements in cancer research, the information that is useful for characterizing disease, staging tumors, and creating treatment and survivorship plans has been changing at a pace that creates challenges when physicians try to remain current. One example involves increasing usage of biomarkers when characterizing the pathologic prognostic stage of a breast tumor. We present our se… ▽ More

    Submitted 20 July, 2018; originally announced July 2018.

    Comments: International Semantic Web Conference (Resource Track)

  10. Cadre Modeling: Simultaneously Discovering Subpopulations and Predictive Models

    Authors: Alexander New, Curt Breneman, Kristin P. Bennett

    Abstract: We consider the problem in regression analysis of identifying subpopulations that exhibit different patterns of response, where each subpopulation requires a different underlying model. Unlike statistical cohorts, these subpopulations are not known a priori; thus, we refer to them as cadres. When the cadres and their associated models are interpretable, modeling leads to insights about the subpopu… ▽ More

    Submitted 23 October, 2018; v1 submitted 7 February, 2018; originally announced February 2018.

    Comments: 8 pages, 6 figures

    Journal ref: In 2018 International Joint Conference on Neural Networks (IJCNN), Rio de Janeiro, Brazil, 2018

  11. arXiv:1108.5397  [pdf, ps, other

    stat.ML cs.LG q-bio.QM

    Prediction of peptide bonding affinity: kernel methods for nonlinear modeling

    Authors: Charles Bergeron, Theresa Hepburn, C. Matthew Sundling, Michael Krein, Bill Katt, Nagamani Sukumar, Curt M. Breneman, Kristin P. Bennett

    Abstract: This paper presents regression models obtained from a process of blind prediction of peptide binding affinity from provided descriptors for several distinct datasets as part of the 2006 Comparative Evaluation of Prediction Algorithms (COEPRA) contest. This paper finds that kernel partial least squares, a nonlinear partial least squares (PLS) algorithm, outperforms PLS, and that the incorporation o… ▽ More

    Submitted 26 August, 2011; originally announced August 2011.