Skip to main content

Showing 1–1 of 1 results for author: Farn, N

.
  1. arXiv:2311.10775  [pdf, other

    cs.CL cs.AI cs.LG

    ToolTalk: Evaluating Tool-Usage in a Conversational Setting

    Authors: Nicholas Farn, Richard Shin

    Abstract: Large language models (LLMs) have displayed massive improvements in reasoning and decision-making skills and can hold natural conversations with users. Many recent works seek to augment LLM-based assistants with external tools so they can access private or up-to-date information and carry out actions on behalf of users. To better measure the performance of these assistants, this paper introduces T… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: 10 pages, 1 figure, ICLR 2024 Submission, https://github.com/microsoft/ToolTalk