Showing 1–2 of 2 results for author: Kew, T

Searching in archive cs.
  1. arXiv:2312.12683  [pdf, other]

    cs.CL

    Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?

    Authors: Tannon Kew, Florian Schottmann, Rico Sennrich

    Abstract: The vast majority of today's large language models are English-centric, having been pretrained predominantly on English text. Yet, in order to meet user expectations, models need to be able to respond appropriately in multiple languages once deployed in downstream applications. Given limited exposure to other languages during pretraining, cross-lingual transfer is important for achieving decent pe…

    Submitted 19 December, 2023; originally announced December 2023.

  2. arXiv:2310.15773  [pdf, other]

    cs.CL

    BLESS: Benchmarking Large Language Models on Sentence Simplification

    Authors: Tannon Kew, Alison Chi, Laura Vásquez-Rodríguez, Sweta Agrawal, Dennis Aumiller, Fernando Alva-Manchego, Matthew Shardlow

    Abstract: We present BLESS, a comprehensive performance benchmark of the most recent state-of-the-art large language models (LLMs) on the task of text simplification (TS). We examine how well off-the-shelf LLMs can solve this challenging task, assessing a total of 44 models, differing in size, architecture, pre-training methods, and accessibility, on three test sets from different domains (Wikipedia, news,…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: This paper has been accepted to EMNLP 2023 as a main long paper. 9 pages, 7 figures.