Skip to main content

Showing 1–5 of 5 results for author: O'Donoghue, O

.
  1. arXiv:2312.07559  [pdf, other

    cs.CL cs.AI cs.LG

    PaperQA: Retrieval-Augmented Generative Agent for Scientific Research

    Authors: Jakub Lála, Odhran O'Donoghue, Aleksandar Shtedritski, Sam Cox, Samuel G. Rodriques, Andrew D. White

    Abstract: Large Language Models (LLMs) generalize well across language tasks, but suffer from hallucinations and uninterpretability, making it difficult to assess their accuracy without ground-truth. Retrieval-Augmented Generation (RAG) models have been proposed to reduce hallucinations and provide provenance for how an answer was generated. Applying such models to the scientific literature may enable large… ▽ More

    Submitted 14 December, 2023; v1 submitted 8 December, 2023; originally announced December 2023.

  2. arXiv:2310.10632  [pdf, other

    cs.CL cs.AI cs.RO

    BioPlanner: Automatic Evaluation of LLMs on Protocol Planning in Biology

    Authors: Odhran O'Donoghue, Aleksandar Shtedritski, John Ginger, Ralph Abboud, Ali Essa Ghareeb, Justin Booth, Samuel G Rodriques

    Abstract: The ability to automatically generate accurate protocols for scientific experiments would represent a major step towards the automation of science. Large Language Models (LLMs) have impressive capabilities on a wide range of tasks, such as question answering and the generation of coherent text and code. However, LLMs can struggle with multi-step problems and long-term planning, which are crucial f… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023. Dataset and code: https://github.com/bioplanner/bioplanner

  3. COPER: Continuous Patient State Perceiver

    Authors: Vinod Kumar Chauhan, Anshul Thakur, Odhran O'Donoghue, David A. Clifton

    Abstract: In electronic health records (EHRs), irregular time-series (ITS) occur naturally due to patient health dynamics, reflected by irregular hospital visits, diseases/conditions and the necessity to measure different vitals signs at each visit etc. ITS present challenges in training machine learning algorithms which mostly are built on assumption of coherent fixed dimensional feature space. In this pap… ▽ More

    Submitted 24 November, 2022; v1 submitted 5 August, 2022; originally announced August 2022.

    Comments: 2 figures; presented in IEEE International Conference on Biomedical and Health Informatics (IEEE BHI-2022)

  4. arXiv:2205.12070  [pdf, other

    cs.LG cs.AI

    Deep Reinforcement Learning for Multi-class Imbalanced Training

    Authors: Jenny Yang, Rasheed El-Bouri, Odhran O'Donoghue, Alexander S. Lachapelle, Andrew A. S. Soltan, David A. Clifton

    Abstract: With the rapid growth of memory and computing power, datasets are becoming increasingly complex and imbalanced. This is especially severe in the context of clinical data, where there may be one rare event for many cases in the majority class. We introduce an imbalanced classification framework, based on reinforcement learning, for training extremely imbalanced data sets, and extend it for use in m… ▽ More

    Submitted 24 May, 2022; originally announced May 2022.

  5. arXiv:2111.07348  [pdf, other

    cs.LG cs.CR

    Invariant Risk Minimisation for Cross-Organism Inference: Substituting Mouse Data for Human Data in Human Risk Factor Discovery

    Authors: Odhran O'Donoghue, Paul Duckworth, Giuseppe Ughi, Linus Scheibenreif, Kia Khezeli, Adrienne Hoarfrost, Samuel Budd, Patrick Foley, Nicholas Chia, John Kalantari, Graham Mackintosh, Frank Soboczenski, Lauren Sanders

    Abstract: Human medical data can be challenging to obtain due to data privacy concerns, difficulties conducting certain types of experiments, or prohibitive associated costs. In many settings, data from animal models or in-vitro cell lines are available to help augment our understanding of human data. However, this data is known for having low etiological validity in comparison to human data. In this work,… ▽ More

    Submitted 13 February, 2022; v1 submitted 14 November, 2021; originally announced November 2021.

    Comments: Machine Learning for Health (ML4H) - Extended Abstract