Skip to main content

Showing 1–1 of 1 results for author: Morrell, E R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.14578  [pdf, other

    cs.LG cs.AI

    RAmBLA: A Framework for Evaluating the Reliability of LLMs as Assistants in the Biomedical Domain

    Authors: William James Bolton, Rafael Poyiadzi, Edward R. Morrell, Gabriela van Bergen Gonzalez Bueno, Lea Goetz

    Abstract: Large Language Models (LLMs) increasingly support applications in a wide range of domains, some with potential high societal impact such as biomedicine, yet their reliability in realistic use cases is under-researched. In this work we introduce the Reliability AssesMent for Biomedical LLM Assistants (RAmBLA) framework and evaluate whether four state-of-the-art foundation LLMs can serve as reliable… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: Published at ICLR 2024 Workshop on Reliable and Responsible Foundation Models