Skip to main content

Showing 1–1 of 1 results for author: Karny, S

.
  1. arXiv:2401.13835  [pdf, other

    cs.LG cs.AI cs.CL cs.HC

    The Calibration Gap between Model and Human Confidence in Large Language Models

    Authors: Mark Steyvers, Heliodoro Tejeda, Aakriti Kumar, Catarina Belem, Sheer Karny, Xinyue Hu, Lukas Mayer, Padhraic Smyth

    Abstract: For large language models (LLMs) to be trusted by humans they need to be well-calibrated in the sense that they can accurately assess and communicate how likely it is that their predictions are correct. Recent work has focused on the quality of internal LLM confidence assessments, but the question remains of how well LLMs can communicate this internal model confidence to human users. This paper ex… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

    Comments: 27 pages, 10 figures