Skip to main content

Showing 1–2 of 2 results for author: Melero, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.17789  [pdf, other

    cs.CL cs.AI

    Spanish and LLM Benchmarks: is MMLU Lost in Translation?

    Authors: Irene Plaza, Nina Melero, Cristina del Pozo, Javier Conde, Pedro Reviriego, Marina Mayor-Rocher, María Grandury

    Abstract: The evaluation of Large Language Models (LLMs) is a key element in their continuous improvement process and many benchmarks have been developed to assess the performance of LLMs in different tasks and topics. As LLMs become adopted worldwide, evaluating them in languages other than English is increasingly important. However, most LLM benchmarks are simply translated using an automated tool and the… ▽ More

    Submitted 28 May, 2024; originally announced June 2024.

  2. arXiv:2403.15491  [pdf, other

    cs.CL

    Open Source Conversational LLMs do not know most Spanish words

    Authors: Javier Conde, Miguel González, Nina Melero, Raquel Ferrando, Gonzalo Martínez, Elena Merino-Gómez, José Alberto Hernández, Pedro Reviriego

    Abstract: The growing interest in Large Language Models (LLMs) and in particular in conversational models with which users can interact has led to the development of a large number of open-source chat LLMs. These models are evaluated on a wide range of benchmarks to assess their capabilities in answering questions or solving problems on almost any possible topic or to test their ability to reason or interpr… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: Under Review at SEPLN-2024

    Journal ref: SEPLN Journal 2024