Skip to main content

Showing 1–9 of 9 results for author: Hunter, L E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16779  [pdf, other

    cs.CL

    It Is Not About What You Say, It Is About How You Say It: A Surprisingly Simple Approach for Improving Reading Comprehension

    Authors: Sagi Shaier, Lawrence E Hunter, Katharina von der Wense

    Abstract: Natural language processing has seen rapid progress over the past decade. Due to the speed of developments, some practices get established without proper evaluation. Considering one such case and focusing on reading comprehension, we ask our first research question: 1) How does the order of inputs -- i.e., question and context -- affect model performance? Additionally, given recent advancements in… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL Findings

  2. arXiv:2402.00123  [pdf, other

    cs.CL cs.LG

    Comparing Template-based and Template-free Language Model Probing

    Authors: Sagi Shaier, Kevin Bennett, Lawrence E Hunter, Katharina von der Wense

    Abstract: The differences between cloze-task language model (LM) probing with 1) expert-made templates and 2) naturally-occurring text have often been overlooked. Here, we evaluate 16 different LMs on 10 probing English datasets -- 4 template-based and 6 template-free -- in general and biomedical domains to answer the following research questions: (RQ1) Do model rankings differ between the two approaches? (… ▽ More

    Submitted 31 January, 2024; originally announced February 2024.

    Comments: Accepted to EACL 2024

  3. arXiv:2401.18001  [pdf, other

    cs.CL

    Desiderata for the Context Use of Question Answering Systems

    Authors: Sagi Shaier, Lawrence E Hunter, Katharina von der Wense

    Abstract: Prior work has uncovered a set of common problems in state-of-the-art context-based question answering (QA) systems: a lack of attention to the context when the latter conflicts with a model's parametric knowledge, little robustness to noise, and a lack of consistency with their answers. However, most prior work focus on one or two of those problems in isolation, which makes it difficult to see tr… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: Accepted to EACL 2024

  4. arXiv:2310.10583  [pdf, other

    cs.CL cs.LG

    Who Are All The Stochastic Parrots Imitating? They Should Tell Us!

    Authors: Sagi Shaier, Lawrence E. Hunter, Katharina von der Wense

    Abstract: Both standalone language models (LMs) as well as LMs within downstream-task systems have been shown to generate statements which are factually untrue. This problem is especially severe for low-resource languages, where training data is scarce and of worse quality than for high-resource languages. In this opinion piece, we argue that LMs in their current state will never be fully trustworthy in cri… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted to IJCNLP-AACL 2023

  5. arXiv:2307.05727  [pdf

    cs.AI cs.CE

    An Open-Source Knowledge Graph Ecosystem for the Life Sciences

    Authors: Tiffany J. Callahan, Ignacio J. Tripodi, Adrianne L. Stefanski, Luca Cappelletti, Sanya B. Taneja, Jordan M. Wyrwa, Elena Casiraghi, Nicolas A. Matentzoglu, Justin Reese, Jonathan C. Silverstein, Charles Tapley Hoyt, Richard D. Boyce, Scott A. Malec, Deepak R. Unni, Marcin P. Joachimiak, Peter N. Robinson, Christopher J. Mungall, Emanuele Cavalleri, Tommaso Fontana, Giorgio Valentini, Marco Mesiti, Lucas A. Gillenwater, Brook Santangelo, Nicole A. Vasilevsky, Robert Hoehndorf , et al. (7 additional authors not shown)

    Abstract: Translational research requires data at multiple scales of biological organization. Advancements in sequencing and multi-omics technologies have increased the availability of these data, but researchers face significant integration challenges. Knowledge graphs (KGs) are used to model complex phenomena, and methods exist to construct them automatically. However, tackling complex biomedical integrat… ▽ More

    Submitted 30 January, 2024; v1 submitted 11 July, 2023; originally announced July 2023.

  6. arXiv:2209.04732  [pdf

    cs.DB cs.AI

    Ontologizing Health Systems Data at Scale: Making Translational Discovery a Reality

    Authors: Tiffany J. Callahan, Adrianne L. Stefanski, Jordan M. Wyrwa, Chenjie Zeng, Anna Ostropolets, Juan M. Banda, William A. Baumgartner Jr., Richard D. Boyce, Elena Casiraghi, Ben D. Coleman, Janine H. Collins, Sara J. Deakyne-Davies, James A. Feinstein, Melissa A. Haendel, Asiyah Y. Lin, Blake Martin, Nicolas A. Matentzoglu, Daniella Meeker, Justin Reese, Jessica Sinclair, Sanya B. Taneja, Katy E. Trinkley, Nicole A. Vasilevsky, Andrew Williams, Xingman A. Zhang , et al. (7 additional authors not shown)

    Abstract: Background: Common data models solve many challenges of standardizing electronic health record (EHR) data, but are unable to semantically integrate all the resources needed for deep phenoty**. Open Biological and Biomedical Ontology (OBO) Foundry ontologies provide computable representations of biological knowledge and enable the integration of heterogeneous data. However, map** EHR data to OB… ▽ More

    Submitted 30 January, 2023; v1 submitted 10 September, 2022; originally announced September 2022.

    Comments: Supplementary Material is included at the end of the manuscript

    ACM Class: J.3

  7. arXiv:2207.14294  [pdf

    q-bio.GN cs.AI

    Knowledge-Driven Mechanistic Enrichment of the Preeclampsia Ignorome

    Authors: Tiffany J. Callahan, Adrianne L. Stefanski, **-Dong Kim, William A. Baumgartner Jr., Jordan M. Wyrwa, Lawrence E. Hunter

    Abstract: Preeclampsia is a leading cause of maternal and fetal morbidity and mortality. Currently, the only definitive treatment of preeclampsia is delivery of the placenta, which is central to the pathogenesis of the disease. Transcriptional profiling of human placenta from pregnancies complicated by preeclampsia has been extensively performed to identify differentially expressed genes (DEGs). The decisio… ▽ More

    Submitted 2 October, 2022; v1 submitted 27 July, 2022; originally announced July 2022.

    Comments: Preprint of an article submitted for consideration in Pacific Symposium on Biocomputing ©2022 copyright World Scientific Publishing Company https://psb.stanford.edu/

  8. arXiv:2003.11782  [pdf, other

    cs.DM

    Hypernetwork Science: From Multidimensional Networks to Computational Topology

    Authors: Cliff A. Joslyn, Sinan Aksoy, Tiffany J. Callahan, Lawrence E. Hunter, Brett Jefferson, Brenda Praggastis, Emilie A. H. Purvine, Ignacio J. Tripodi

    Abstract: As data structures and mathematical objects used for complex systems modeling, hypergraphs sit nicely poised between on the one hand the world of network models, and on the other that of higher-order mathematical abstractions from algebra, lattice theory, and topology. They are able to represent complex systems interactions more faithfully than graphs and networks, while also being some of the sim… ▽ More

    Submitted 26 March, 2020; originally announced March 2020.

    Report number: PNNL-SA-152208 MSC Class: 05C65; ACM Class: G.2.2

  9. Knowledge-based Biomedical Data Science 2019

    Authors: Tiffany J. Callahan, Harrison Pielke-Lombardo, Ignacio J. Tripodi, Lawrence E. Hunter

    Abstract: Knowledge-based biomedical data science (KBDS) involves the design and implementation of computer systems that act as if they knew about biomedicine. Such systems depend on formally represented knowledge in computer systems, often in the form of knowledge graphs. Here we survey the progress in the last year in systems that use formally represented knowledge to address data science problems in both… ▽ More

    Submitted 8 October, 2019; originally announced October 2019.

    Comments: Manuscript 43 pages with 3 tables; Supplemental material 43 pages with 3 tables

    ACM Class: I.2.0; I.2.1; I.2.4; I.2.7; I.2.m; I.5.0; I.7.0; J.3

    Journal ref: Annual Review of Biomedical Data Science. 2020 Jul 20;3:23-41