Showing 1–1 of 1 results for author: Ghareeb, A E

  1. arXiv:2310.10632  [pdf, other]

    cs.CL cs.AI cs.RO

    BioPlanner: Automatic Evaluation of LLMs on Protocol Planning in Biology

    Authors: Odhran O'Donoghue, Aleksandar Shtedritski, John Ginger, Ralph Abboud, Ali Essa Ghareeb, Justin Booth, Samuel G Rodriques

    Abstract: The ability to automatically generate accurate protocols for scientific experiments would represent a major step towards the automation of science. Large Language Models (LLMs) have impressive capabilities on a wide range of tasks, such as question answering and the generation of coherent text and code. However, LLMs can struggle with multi-step problems and long-term planning, which are crucial f…

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023. Dataset and code: https://github.com/bioplanner/bioplanner