Skip to main content

Showing 1–16 of 16 results for author: Bennett, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.17888  [pdf, other

    cs.CL cs.AI cs.LG

    CTBench: A Comprehensive Benchmark for Evaluating Language Model Capabilities in Clinical Trial Design

    Authors: Nafis Neehal, Bowen Wang, Shayom Debopadhaya, Soham Dan, Keerthiram Murugesan, Vibha Anand, Kristin P. Bennett

    Abstract: CTBench is introduced as a benchmark to assess language models (LMs) in aiding clinical study design. Given study-specific metadata, CTBench evaluates AI models' ability to determine the baseline features of a clinical trial (CT), which include demographic and relevant features collected at the trial's start from all participants. These baseline features, typically presented in CT publications (of… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2402.00123  [pdf, other

    cs.CL cs.LG

    Comparing Template-based and Template-free Language Model Probing

    Authors: Sagi Shaier, Kevin Bennett, Lawrence E Hunter, Katharina von der Wense

    Abstract: The differences between cloze-task language model (LM) probing with 1) expert-made templates and 2) naturally-occurring text have often been overlooked. Here, we evaluate 16 different LMs on 10 probing English datasets -- 4 template-based and 6 template-free -- in general and biomedical domains to answer the following research questions: (RQ1) Do model rankings differ between the two approaches? (… ▽ More

    Submitted 31 January, 2024; originally announced February 2024.

    Comments: Accepted to EACL 2024

  3. arXiv:2312.12577  [pdf, other

    cs.CE

    An integrated EOS, pore-crush, strength and damage model framework for near-field ground-shock

    Authors: Kane C. Bennett, Alyson M. Stahl, Thomas R. Canfield, Garrett G. Euler

    Abstract: An integrated Equation of State (EOS) and strength/pore-crush/damage model framework is provided for modeling near to source (near-field) ground-shock response, where large deformations and pressures necessitate coupling EOS with pressure-dependent plastic yield and damage. Nonlinear pressure-dependence of strength up to high-pressures is combined with a Modified Cam-Clay-like cap-plasticity model… ▽ More

    Submitted 7 March, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Report number: LA-UR-23-34028

  4. arXiv:2311.02272  [pdf, other

    cs.DC

    Enabling Cross-Language Data Integration and Scalable Analytics in Decentralized Finance

    Authors: Conor Flynn, Kristin P. Bennett, John S. Erickson, Aaron Green, Oshani Seneviratne

    Abstract: With the agile development process of most academic and corporate entities, designing a robust computational back-end system that can support their ever-changing data needs is a constantly evolving challenge. We propose the implementation of a data and language-agnostic system design that handles different data schemes and sources while subsequently providing researchers and developers a way to co… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: 10 pages

    ACM Class: C.3

  5. arXiv:2310.10571  [pdf, other

    cs.CL cs.LG

    Emerging Challenges in Personalized Medicine: Assessing Demographic Effects on Biomedical Question Answering Systems

    Authors: Sagi Shaier, Kevin Bennett, Lawrence Hunter, Katharina von der Wense

    Abstract: State-of-the-art question answering (QA) models exhibit a variety of social biases (e.g., with respect to sex or race), generally explained by similar issues in their training data. However, what has been overlooked so far is that in the critical domain of biomedicine, any unjustified change in model output due to patient demographics is problematic: it results in the unfair treatment of patients.… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted to IJCNLP-AACL 2023

  6. arXiv:2204.04353  [pdf, other

    cs.CL cs.AI cs.CY

    Should we tweet this? Generative response modeling for predicting reception of public health messaging on Twitter

    Authors: Abraham Sanders, Debjani Ray-Majumder, John S. Erickson, Kristin P. Bennett

    Abstract: The way people respond to messaging from public health organizations on social media can provide insight into public perceptions on critical health issues, especially during a global crisis such as COVID-19. It could be valuable for high-impact organizations such as the US Centers for Disease Control and Prevention (CDC) or the World Health Organization (WHO) to understand how these perceptions im… ▽ More

    Submitted 13 May, 2022; v1 submitted 8 April, 2022; originally announced April 2022.

    Comments: Accepted at ACM WebSci 2022

    ACM Class: I.2.7

  7. arXiv:2203.04462  [pdf, other

    cs.LG

    Downstream Fairness Caveats with Synthetic Healthcare Data

    Authors: Karan Bhanot, Ioana Baldini, Dennis Wei, Jiaming Zeng, Kristin P. Bennett

    Abstract: This paper evaluates synthetically generated healthcare data for biases and investigates the effect of fairness mitigation techniques on utility-fairness. Privacy laws limit access to health data such as Electronic Medical Records (EMRs) to preserve patient privacy. Albeit essential, these laws hinder research reproducibility. Synthetic data is a viable solution that can enable access to data simi… ▽ More

    Submitted 8 March, 2022; originally announced March 2022.

  8. arXiv:2001.07827  [pdf, other

    cs.LG stat.ML

    Coarse-Grain Cluster Analysis of Tensors with Application to Climate Biome Identification

    Authors: Derek DeSantis, Phillip J. Wolfram, Katrina Bennett, Boian Alexandrov

    Abstract: A tensor provides a concise way to codify the interdependence of complex data. Treating a tensor as a d-way array, each entry records the interaction between the different indices. Clustering provides a way to parse the complexity of the data into more readily understandable information. Clustering methods are heavily dependent on the algorithm of choice, as well as the chosen hyperparameters of t… ▽ More

    Submitted 22 May, 2020; v1 submitted 21 January, 2020; originally announced January 2020.

    Report number: LA-UR-20-20548

  9. arXiv:1911.06411  [pdf, other

    cs.LG cs.CY stat.ML

    Synthetic Event Time Series Health Data Generation

    Authors: Saloni Dash, Ritik Dutta, Isabelle Guyon, Adrien Pavao, Andrew Yale, Kristin P. Bennett

    Abstract: Synthetic medical data which preserves privacy while maintaining utility can be used as an alternative to real medical data, which has privacy costs and resource constraints associated with it. At present, most models focus on generating cross-sectional health data which is not necessarily representative of real data. In reality, medical data is longitudinal in nature, with a single patient having… ▽ More

    Submitted 27 November, 2019; v1 submitted 14 November, 2019; originally announced November 2019.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2019 - Extended Abstract

  10. arXiv:1907.04358  [pdf, other

    cs.LO q-bio.PE stat.ML

    Making Study Populations Visible through Knowledge Graphs

    Authors: Shruthi Chari, Miao Qi, Nkcheniyere N. Agu, Oshani Seneviratne, James P. McCusker, Kristin P. Bennett, Amar K. Das, Deborah L. McGuinness

    Abstract: Treatment recommendations within Clinical Practice Guidelines (CPGs) are largely based on findings from clinical trials and case studies, referred to here as research studies, that are often based on highly selective clinical populations, referred to here as study cohorts. When medical practitioners apply CPG recommendations, they need to understand how well their patient population matches the ch… ▽ More

    Submitted 9 July, 2019; originally announced July 2019.

    Comments: 16 pages, 4 figures, 1 table, accepted to the ISWC 2019 Resources Track (https://iswc2019.semanticweb.org/call-for-resources-track-papers/)

  11. arXiv:1811.11190  [pdf, other

    cs.LG cs.AI stat.ML

    Semantically-aware population health risk analyses

    Authors: Alexander New, Sabbir M. Rashid, John S. Erickson, Deborah L. McGuinness, Kristin P. Bennett

    Abstract: One primary task of population health analysis is the identification of risk factors that, for some subpopulation, have a significant association with some health condition. Examples include finding lifestyle factors associated with chronic diseases and finding genetic mutations associated with diseases in precision health. We develop a combined semantic and machine learning system that uses a hea… ▽ More

    Submitted 27 November, 2018; originally announced November 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:cs/0101200

  12. arXiv:1808.04880  [pdf, other

    stat.ML cs.LG stat.AP

    A Precision Environment-Wide Association Study of Hypertension via Supervised Cadre Models

    Authors: Alexander New, Kristin P. Bennett

    Abstract: We consider the problem in precision health of grou** people into subpopulations based on their degree of vulnerability to a risk factor. These subpopulations cannot be discovered with traditional clustering techniques because their quality is evaluated with a supervised metric: the ease of modeling a response variable over observations within them. Instead, we apply the supervised cadre model (… ▽ More

    Submitted 9 December, 2018; v1 submitted 14 August, 2018; originally announced August 2018.

    Comments: 9 pages, 5 figures

  13. arXiv:1807.07991  [pdf, other

    cs.AI

    Knowledge Integration for Disease Characterization: A Breast Cancer Example

    Authors: Oshani Seneviratne, Sabbir M. Rashid, Shruthi Chari, James P. McCusker, Kristin P. Bennett, James A. Hendler, Deborah L. McGuinness

    Abstract: With the rapid advancements in cancer research, the information that is useful for characterizing disease, staging tumors, and creating treatment and survivorship plans has been changing at a pace that creates challenges when physicians try to remain current. One example involves increasing usage of biomarkers when characterizing the pathologic prognostic stage of a breast tumor. We present our se… ▽ More

    Submitted 20 July, 2018; originally announced July 2018.

    Comments: International Semantic Web Conference (Resource Track)

  14. Cadre Modeling: Simultaneously Discovering Subpopulations and Predictive Models

    Authors: Alexander New, Curt Breneman, Kristin P. Bennett

    Abstract: We consider the problem in regression analysis of identifying subpopulations that exhibit different patterns of response, where each subpopulation requires a different underlying model. Unlike statistical cohorts, these subpopulations are not known a priori; thus, we refer to them as cadres. When the cadres and their associated models are interpretable, modeling leads to insights about the subpopu… ▽ More

    Submitted 23 October, 2018; v1 submitted 7 February, 2018; originally announced February 2018.

    Comments: 8 pages, 6 figures

    Journal ref: In 2018 International Joint Conference on Neural Networks (IJCNN), Rio de Janeiro, Brazil, 2018

  15. arXiv:1711.02629  [pdf

    astro-ph.IM cs.HC

    Virtual Astronaut for Scientific Visualization - A Prototype for Santa Maria Crater on Mars

    Authors: Jue Wang, Keith J. Bennett, Edward A. Guinness

    Abstract: To support scientific visualization of multiple-mission data from Mars, the Virtual Astronaut (VA) creates an interactive virtual 3D environment built on the Unity3D Game Engine. A prototype study was conducted based on orbital and Opportunity Rover data covering Santa Maria Crater in Meridiani Planum on Mars. The VA at Santa Maria provides dynamic visual representations of the imaging, compositio… ▽ More

    Submitted 7 November, 2017; originally announced November 2017.

    Comments: 20 pages, 11 figures

    Journal ref: Future Internet 2012, 4, 1049-1068

  16. arXiv:1108.5397  [pdf, ps, other

    stat.ML cs.LG q-bio.QM

    Prediction of peptide bonding affinity: kernel methods for nonlinear modeling

    Authors: Charles Bergeron, Theresa Hepburn, C. Matthew Sundling, Michael Krein, Bill Katt, Nagamani Sukumar, Curt M. Breneman, Kristin P. Bennett

    Abstract: This paper presents regression models obtained from a process of blind prediction of peptide binding affinity from provided descriptors for several distinct datasets as part of the 2006 Comparative Evaluation of Prediction Algorithms (COEPRA) contest. This paper finds that kernel partial least squares, a nonlinear partial least squares (PLS) algorithm, outperforms PLS, and that the incorporation o… ▽ More

    Submitted 26 August, 2011; originally announced August 2011.