Skip to main content

Showing 1–1 of 1 results for author: Chavinda, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2211.00142  [pdf, other

    cs.CL cs.LG

    TaTa: A Multilingual Table-to-Text Dataset for African Languages

    Authors: Sebastian Gehrmann, Sebastian Ruder, Vitaly Nikolaev, Jan A. Botha, Michael Chavinda, Ankur Parikh, Clara Rivera

    Abstract: Existing data-to-text generation datasets are mostly limited to English. To address this lack of data, we create Table-to-Text in African languages (TaTa), the first large multilingual table-to-text dataset with a focus on African languages. We created TaTa by transcribing figures and accompanying text in bilingual reports by the Demographic and Health Surveys Program, followed by professional tra… ▽ More

    Submitted 31 October, 2022; originally announced November 2022.

    Comments: 24 pages, 6 figures