Showing 1–2 of 2 results for author: Barabucci, G

  1. arXiv:2406.14981  [pdf, other]

    cs.AI cs.HC

    Human-AI collectives produce the most accurate differential diagnoses

    Authors: N. Zöller, J. Berger, I. Lin, N. Fu, J. Komarneni, G. Barabucci, K. Laskowski, V. Shia, B. Harack, E. A. Chu, V. Trianni, R. H. J. M. Kurvers, S. M. Herzog

    Abstract: Artificial intelligence systems, particularly large language models (LLMs), are increasingly being employed in high-stakes decisions that impact both individuals and society at large, often without adequate safeguards to ensure safety, quality, and equity. Yet LLMs hallucinate, lack common sense, and are biased - shortcomings that may reflect LLMs' inherent limitations and thus may not be remedied…

    Submitted 21 June, 2024; originally announced June 2024.

  2. arXiv:2402.08806  [pdf, other]

    cs.AI

    Combining Insights From Multiple Large Language Models Improves Diagnostic Accuracy

    Authors: Gioele Barabucci, Victor Shia, Eugene Chu, Benjamin Harack, Nathan Fu

    Abstract: Background: Large language models (LLMs) such as OpenAI's GPT-4 or Google's PaLM 2 are proposed as viable diagnostic support tools or even spoken of as replacements for "curbside consults". However, even LLMs specifically trained on medical topics may lack sufficient diagnostic accuracy for real-life applications. Methods: Using collective intelligence methods and a dataset of 200 clinical vigne…

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 5 pages, 2 figures, 1 table

    ACM Class: I.2.1; J.3