Skip to main content

Showing 1–5 of 5 results for author: Štajner, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2302.02888  [pdf, other

    cs.CL cs.LG

    Findings of the TSAR-2022 Shared Task on Multilingual Lexical Simplification

    Authors: Horacio Saggion, Sanja Štajner, Daniel Ferrés, Kim Cheng Sheang, Matthew Shardlow, Kai North, Marcos Zampieri

    Abstract: We report findings of the TSAR-2022 shared task on multilingual lexical simplification, organized as part of the Workshop on Text Simplification, Accessibility, and Readability TSAR-2022 held in conjunction with EMNLP 2022. The task called the Natural Language Processing research community to contribute with methods to advance the state of the art in multilingual lexical simplification for English… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

  2. arXiv:2209.05301  [pdf, ps, other

    cs.CL

    Lexical Simplification Benchmarks for English, Portuguese, and Spanish

    Authors: Sanja Stajner, Daniel Ferres, Matthew Shardlow, Kai North, Marcos Zampieri, Horacio Saggion

    Abstract: Even in highly-developed countries, as many as 15-30\% of the population can only understand texts written using a basic vocabulary. Their understanding of everyday texts is limited, which prevents them from taking an active role in society and making informed decisions regarding healthcare, legal representation, or democratic choice. Lexical simplification is a natural language processing task th… ▽ More

    Submitted 12 September, 2022; originally announced September 2022.

  3. arXiv:2206.11249  [pdf, other

    cs.CL cs.AI cs.LG

    GEMv2: Multilingual NLG Benchmarking in a Single Line of Code

    Authors: Sebastian Gehrmann, Abhik Bhattacharjee, Abinaya Mahendiran, Alex Wang, Alexandros Papangelis, Aman Madaan, Angelina McMillan-Major, Anna Shvets, Ashish Upadhyay, Bingsheng Yao, Bryan Wilie, Chandra Bhagavatula, Chaobin You, Craig Thomson, Cristina Garbacea, Dakuo Wang, Daniel Deutsch, Deyi Xiong, Di **, Dimitra Gkatzia, Dragomir Radev, Elizabeth Clark, Esin Durmus, Faisal Ladhak, Filip Ginter , et al. (52 additional authors not shown)

    Abstract: Evaluation in machine learning is usually informed by past choices, for example which datasets or metrics to use. This standardization enables the comparison on equal footing using leaderboards, but the evaluation choices become sub-optimal as better alternatives arise. This problem is especially pertinent in natural language generation which requires ever-improving suites of datasets, metrics, an… ▽ More

    Submitted 24 June, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

  4. arXiv:2012.09692  [pdf, other

    cs.CL

    Five Psycholinguistic Characteristics for Better Interaction with Users

    Authors: Sanja Štajner, Seren Yenikent, Marc Franco-Salvador

    Abstract: When two people pay attention to each other and are interested in what the other has to say or write, they almost instantly adapt their writing/speaking style to match the other. For a successful interaction with a user, chatbots and dialogue systems should be able to do the same. We propose a framework consisting of five psycholinguistic textual characteristics for better human-computer interacti… ▽ More

    Submitted 21 March, 2022; v1 submitted 17 December, 2020; originally announced December 2020.

    Comments: 26 pages, 4 figures

  5. arXiv:1804.09132  [pdf, other

    cs.CL

    A Report on the Complex Word Identification Shared Task 2018

    Authors: Seid Muhie Yimam, Chris Biemann, Shervin Malmasi, Gustavo H. Paetzold, Lucia Specia, Sanja Štajner, Anaïs Tack, Marcos Zampieri

    Abstract: We report the findings of the second Complex Word Identification (CWI) shared task organized as part of the BEA workshop co-located with NAACL-HLT'2018. The second CWI shared task featured multilingual and multi-genre datasets divided into four tracks: English monolingual, German monolingual, Spanish monolingual, and a multilingual track with a French test set, and two tasks: binary classification… ▽ More

    Submitted 24 April, 2018; originally announced April 2018.

    Comments: Second CWI Shared Task co-located with the BEA Workshop 2018 at NAACL-HLT in New Orleans, USA