Skip to main content

Showing 1–1 of 1 results for author: Brysbaert, M

.
  1. arXiv:2310.14703  [pdf

    cs.CL

    Establishing Vocabulary Tests as a Benchmark for Evaluating Large Language Models

    Authors: Gonzalo Martínez, Javier Conde, Elena Merino-Gómez, Beatriz Bermúdez-Margaretto, José Alberto Hernández, Pedro Reviriego, Marc Brysbaert

    Abstract: Vocabulary tests, once a cornerstone of language modeling evaluation, have been largely overlooked in the current landscape of Large Language Models (LLMs) like Llama, Mistral, and GPT. While most LLM evaluation benchmarks focus on specific tasks or domain-specific knowledge, they often neglect the fundamental linguistic aspects of language understanding and production. In this paper, we advocate… ▽ More

    Submitted 29 January, 2024; v1 submitted 23 October, 2023; originally announced October 2023.