Showing 1–1 of 1 results for author: Galoğlu, O

  1. arXiv:2305.07016  [pdf, other]

    cs.CL

    A General-Purpose Multilingual Document Encoder

    Authors: Onur Galoğlu, Robert Litschko, Goran Glavaš

    Abstract: Massively multilingual pretrained transformers (MMTs) have tremendously pushed the state of the art on multilingual NLP and cross-lingual transfer of NLP models in particular. While a large body of work leveraged MMTs to mine parallel data and induce bilingual document embeddings, much less effort has been devoted to training a general-purpose (massively) multilingual document encoder that can be us…

    Submitted 11 May, 2023; originally announced May 2023.