Showing 1–3 of 3 results for author: Alghisi, S

  1. arXiv:2406.06399  [pdf, other]

    cs.CL cs.AI

    Should We Fine-Tune or RAG? Evaluating Different Techniques to Adapt LLMs for Dialogue

    Authors: Simone Alghisi, Massimo Rizzoli, Gabriel Roccabruna, Seyed Mahed Mousavi, Giuseppe Riccardi

    Abstract: We study the limitations of Large Language Models (LLMs) for the task of response generation in human-machine dialogue. Several techniques have been proposed in the literature for different dialogue types (e.g., Open-Domain). However, the evaluations of these techniques have been limited in terms of base LLMs, dialogue types and evaluation metrics. In this work, we extensively analyze different LL…

    Submitted 5 July, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  2. arXiv:2404.08700  [pdf, other]

    cs.CL cs.AI

    DyKnow: Dynamically Verifying Time-Sensitive Factual Knowledge in LLMs

    Authors: Seyed Mahed Mousavi, Simone Alghisi, Giuseppe Riccardi

    Abstract: LLMs acquire knowledge from massive data snapshots collected at different timestamps. Their knowledge is then commonly evaluated using static benchmarks. However, factual knowledge is generally subject to time-sensitive changes, and static benchmarks cannot address those cases. We present an approach to dynamically evaluate the knowledge in LLMs and their time-sensitiveness against Wikidata, a pub…

    Submitted 12 June, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

  3. arXiv:2401.02297  [pdf, other]

    cs.CL

    Are LLMs Robust for Spoken Dialogues?

    Authors: Seyed Mahed Mousavi, Gabriel Roccabruna, Simone Alghisi, Massimo Rizzoli, Mirco Ravanelli, Giuseppe Riccardi

    Abstract: Large Pre-Trained Language Models have demonstrated state-of-the-art performance in different downstream tasks, including dialogue state tracking and end-to-end response generation. Nevertheless, most of the publicly available datasets and benchmarks on task-oriented dialogues focus on written conversations. Consequently, the robustness of the developed models to spoken interactions is unknown. In…

    Submitted 4 January, 2024; originally announced January 2024.