Skip to main content

Showing 1–1 of 1 results for author: Abonizio, H Q

.
  1. arXiv:2108.13897  [pdf, other

    cs.CL cs.AI

    mMARCO: A Multilingual Version of the MS MARCO Passage Ranking Dataset

    Authors: Luiz Bonifacio, Vitor Jeronymo, Hugo Queiroz Abonizio, Israel Campiotti, Marzieh Fadaee, Roberto Lotufo, Rodrigo Nogueira

    Abstract: The MS MARCO ranking dataset has been widely used for training deep learning models for IR tasks, achieving considerable effectiveness on diverse zero-shot scenarios. However, this type of resource is scarce in languages other than English. In this work, we present mMARCO, a multilingual version of the MS MARCO passage ranking dataset comprising 13 languages that was created using machine translat… ▽ More

    Submitted 17 August, 2022; v1 submitted 31 August, 2021; originally announced August 2021.