Skip to main content

Showing 1–9 of 9 results for author: Ormazabal, A

.
  1. arXiv:2405.02287  [pdf, other

    cs.CL cs.AI cs.CV

    Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models

    Authors: Piotr Padlewski, Max Bain, Matthew Henderson, Zhongkai Zhu, Nishant Relan, Hai Pham, Donovan Ong, Kaloyan Aleksiev, Aitor Ormazabal, Samuel Phua, Ethan Yeo, Eugenie Lamprecht, Qi Liu, Yuqi Wang, Eric Chen, Deyu Fu, Lei Li, Che Zheng, Cyprien de Masson d'Autume, Dani Yogatama, Mikel Artetxe, Yi Tay

    Abstract: We introduce Vibe-Eval: a new open benchmark and framework for evaluating multimodal chat models. Vibe-Eval consists of 269 visual understanding prompts, including 100 of hard difficulty, complete with gold-standard responses authored by experts. Vibe-Eval is open-ended and challenging with dual objectives: (i) vibe checking multimodal chat models for day-to-day tasks and (ii) rigorously testing a… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  2. arXiv:2404.12387  [pdf, other

    cs.CL cs.CV

    Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models

    Authors: Reka Team, Aitor Ormazabal, Che Zheng, Cyprien de Masson d'Autume, Dani Yogatama, Deyu Fu, Donovan Ong, Eric Chen, Eugenie Lamprecht, Hai Pham, Isaac Ong, Kaloyan Aleksiev, Lei Li, Matthew Henderson, Max Bain, Mikel Artetxe, Nishant Relan, Piotr Padlewski, Qi Liu, Ren Chen, Samuel Phua, Yazheng Yang, Yi Tay, Yuqi Wang, Zhongkai Zhu , et al. (1 additional authors not shown)

    Abstract: We introduce Reka Core, Flash, and Edge, a series of powerful multimodal language models trained from scratch by Reka. Reka models are able to process and reason with text, images, video, and audio inputs. This technical report discusses details of training some of these models and provides comprehensive evaluation results. We show that Reka Edge and Reka Flash are not only state-of-the-art but al… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  3. arXiv:2403.20266  [pdf, other

    cs.CL cs.AI cs.LG

    Latxa: An Open Language Model and Evaluation Suite for Basque

    Authors: Julen Etxaniz, Oscar Sainz, Naiara Perez, Itziar Aldabe, German Rigau, Eneko Agirre, Aitor Ormazabal, Mikel Artetxe, Aitor Soroa

    Abstract: We introduce Latxa, a family of large language models for Basque ranging from 7 to 70 billion parameters. Latxa is based on Llama 2, which we continue pretraining on a new Basque corpus comprising 4.3M documents and 4.2B tokens. Addressing the scarcity of high-quality benchmarks for Basque, we further introduce 4 multiple choice evaluation datasets: EusProficiency, comprising 5,169 questions from… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

  4. arXiv:2310.20326  [pdf, other

    cs.CL

    Erato: Automatizing Poetry Evaluation

    Authors: Manex Agirrezabal, Hugo Gonçalo Oliveira, Aitor Ormazabal

    Abstract: We present Erato, a framework designed to facilitate the automated evaluation of poetry, including that generated by poetry generation systems. Our framework employs a diverse set of features, and we offer a brief overview of Erato's capabilities and its potential for expansion. Using Erato, we compare and contrast human-authored poetry with automatically-generated poetry, demonstrating its effect… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

    Journal ref: 22nd Portuguese Conference on Artificial Intelligence (EPIA 2023)

  5. arXiv:2305.16876  [pdf, other

    cs.CL

    CombLM: Adapting Black-Box Language Models through Small Fine-Tuned Models

    Authors: Aitor Ormazabal, Mikel Artetxe, Eneko Agirre

    Abstract: Methods for adapting language models (LMs) to new tasks and domains have traditionally assumed white-box access to the model, and work by modifying its parameters. However, this is incompatible with a recent trend in the field, where the highest quality models are only available as black-boxes through inference APIs. Even when the model weights are available, the computational cost of fine-tuning… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: This previously appeared as arXiv:2205.12213v2, which was submitted as new by mistake

  6. arXiv:2205.12213  [pdf, other

    cs.CL

    Principled Paraphrase Generation with Parallel Corpora

    Authors: Aitor Ormazabal, Mikel Artetxe, Aitor Soroa, Gorka Labaka, Eneko Agirre

    Abstract: Round-trip Machine Translation (MT) is a popular choice for paraphrase generation, which leverages readily available parallel corpora for supervision. In this paper, we formalize the implicit similarity function induced by this approach, and show that it is susceptible to non-paraphrase pairs sharing a single ambiguous translation. Based on these insights, we design an alternative similarity metri… ▽ More

    Submitted 23 May, 2023; v1 submitted 24 May, 2022; originally announced May 2022.

  7. arXiv:2205.12206  [pdf, other

    cs.CL cs.AI

    PoeLM: A Meter- and Rhyme-Controllable Language Model for Unsupervised Poetry Generation

    Authors: Aitor Ormazabal, Mikel Artetxe, Manex Agirrezabal, Aitor Soroa, Eneko Agirre

    Abstract: Formal verse poetry imposes strict constraints on the meter and rhyme scheme of poems. Most prior work on generating this type of poetry uses existing poems for supervision, which are difficult to obtain for most languages and poetic forms. In this work, we propose an unsupervised approach to generate poems following any given meter and rhyme scheme, without requiring any poetic text for training.… ▽ More

    Submitted 28 October, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

    Comments: EMNLP Findings 2022

  8. Beyond Offline Map**: Learning Cross Lingual Word Embeddings through Context Anchoring

    Authors: Aitor Ormazabal, Mikel Artetxe, Aitor Soroa, Gorka Labaka, Eneko Agirre

    Abstract: Recent research on cross-lingual word embeddings has been dominated by unsupervised map** approaches that align monolingual embeddings. Such methods critically rely on those embeddings having a similar structure, but it was recently shown that the separate training in different languages causes departures from this assumption. In this paper, we propose an alternative approach that does not have… ▽ More

    Submitted 3 August, 2021; v1 submitted 31 December, 2020; originally announced December 2020.

    Comments: ACL 2021

  9. Analyzing the Limitations of Cross-lingual Word Embedding Map**s

    Authors: Aitor Ormazabal, Mikel Artetxe, Gorka Labaka, Aitor Soroa, Eneko Agirre

    Abstract: Recent research in cross-lingual word embeddings has almost exclusively focused on offline methods, which independently train word embeddings in different languages and map them to a shared space through linear transformations. While several authors have questioned the underlying isomorphism assumption, which states that word embeddings in different languages have approximately the same structure,… ▽ More

    Submitted 12 June, 2019; originally announced June 2019.

    Comments: ACL 2019