Showing 1–4 of 4 results for author: Passali, T

  1. arXiv:2312.05172  [pdf, other]

    cs.CL

    From Lengthy to Lucid: A Systematic Literature Review on NLP Techniques for Taming Long Sentences

    Authors: Tatiana Passali, Efstathios Chatzikyriakidis, Stelios Andreadis, Thanos G. Stavropoulos, Anastasia Matonaki, Anestis Fachantidis, Grigorios Tsoumakas

    Abstract: Long sentences have been a persistent issue in written communication for many years since they make it challenging for readers to grasp the main points or follow the initial intention of the writer. This survey, conducted using the PRISMA guidelines, systematically reviews two main strategies for addressing the issue of long sentences: a) sentence compression and b) sentence splitting. An increase…

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: Author's Version, Submitted to ACM CSUR

  2. arXiv:2211.09235  [pdf, other]

    cs.CL

    Artificial Disfluency Detection, Uh No, Disfluency Generation for the Masses

    Authors: T. Passali, T. Mavropoulos, G. Tsoumakas, G. Meditskos, S. Vrochidis

    Abstract: Existing approaches for disfluency detection typically require the existence of large annotated datasets. However, current datasets for this task are limited, suffer from class imbalance, and lack some types of disfluencies that can be encountered in real-world scenarios. This work proposes LARD, a method for automatically generating artificial disfluencies from fluent text. LARD can simulate all…

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: 10 pages

  3. arXiv:2206.04317  [pdf, other]

    cs.CL

    Topic-Controllable Summarization: Topic-Aware Evaluation and Transformer Methods

    Authors: Tatiana Passali, Grigorios Tsoumakas

    Abstract: Topic-controllable summarization is an emerging research area with a wide range of potential applications. However, existing approaches suffer from significant limitations. For example, the majority of existing methods built upon recurrent architectures, which can significantly limit their performance compared to more recent Transformer-based architectures, while they also require modifications to…

    Submitted 17 April, 2024; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: Accepted at LREC-COLING 2024

  4. arXiv:2201.05041  [pdf, other]

    cs.CL

    LARD: Large-scale Artificial Disfluency Generation

    Authors: T. Passali, T. Mavropoulos, G. Tsoumakas, G. Meditskos, S. Vrochidis

    Abstract: Disfluency detection is a critical task in real-time dialogue systems. However, despite its importance, it remains a relatively unexplored field, mainly due to the lack of appropriate datasets. At the same time, existing datasets suffer from various issues, including class imbalance issues, which can significantly affect the performance of the model on rare classes, as it is demonstrated in this p…

    Submitted 3 May, 2022; v1 submitted 13 January, 2022; originally announced January 2022.

    Comments: Accepted at LREC 2022