Skip to main content

Showing 1–8 of 8 results for author: Dell'Orletta, F

.
  1. arXiv:2406.12128  [pdf, other

    cs.CL

    AI "News" Content Farms Are Easy to Make and Hard to Detect: A Case Study in Italian

    Authors: Giovanni Puccetti, Anna Rogers, Chiara Alzetta, Felice Dell'Orletta, Andrea Esuli

    Abstract: Large Language Models (LLMs) are increasingly used as "content farm" models (CFMs), to generate synthetic text that could pass for real news articles. This is already happening even for languages that do not have high-quality monolingual LLMs. We show that fine-tuning Llama (v1), mostly trained on English, on as little as 40K Italian news articles, is sufficient for producing news-like texts that… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2406.07288  [pdf, other

    cs.CL

    Fine-tuning with HED-IT: The impact of human post-editing for dialogical language models

    Authors: Daniela Occhipinti, Michele Marchi, Irene Mondella, Huiyuan Lai, Felice Dell'Orletta, Malvina Nissim, Marco Guerini

    Abstract: Automatic methods for generating and gathering linguistic data have proven effective for fine-tuning Language Models (LMs) in languages less resourced than English. Still, while there has been emphasis on data quantity, less attention has been given to its quality. In this work, we investigate the impact of human intervention on machine-generated data when fine-tuning dialogical models. In particu… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  3. arXiv:2402.17608  [pdf, other

    cs.CL

    Linguistic Knowledge Can Enhance Encoder-Decoder Models (If You Let It)

    Authors: Alessio Miaschi, Felice Dell'Orletta, Giulia Venturi

    Abstract: In this paper, we explore the impact of augmenting pre-trained Encoder-Decoder models, specifically T5, with linguistic knowledge for the prediction of a target task. In particular, we investigate whether fine-tuning a T5 model on an intermediate task that predicts structural linguistic properties of sentences modifies its performance in the target task of predicting sentence-level complexity. Our… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted to LREC-COLING 2024

  4. arXiv:2401.03833  [pdf, other

    cs.SE

    T-FREX: A Transformer-based Feature Extraction Method from Mobile App Reviews

    Authors: Quim Motger, Alessio Miaschi, Felice Dell'Orletta, Xavier Franch, Jordi Marco

    Abstract: Mobile app reviews are a large-scale data source for software-related knowledge generation activities, including software maintenance, evolution and feedback analysis. Effective extraction of features (i.e., functionalities or characteristics) from these reviews is key to support analysis on the acceptance of these features, identification of relevant new feature requests and prioritization of fea… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: Accepted at IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER 2024). 12 pages (including references), 5 figures, 4 tables

  5. Outliers Dimensions that Disrupt Transformers Are Driven by Frequency

    Authors: Giovanni Puccetti, Anna Rogers, Aleksandr Drozd, Felice Dell'Orletta

    Abstract: While Transformer-based language models are generally very robust to pruning, there is the recently discovered outlier phenomenon: disabling only 48 out of 110M parameters in BERT-base drops its performance by nearly 30% on MNLI. We replicate the original evidence for the outlier phenomenon and we link it to the geometry of the embedding space. We find that in both BERT and RoBERTa the magnitude o… ▽ More

    Submitted 22 October, 2022; v1 submitted 23 May, 2022; originally announced May 2022.

    Comments: To appear in Findings of EMNLP 2022

  6. arXiv:2101.01634  [pdf, other

    cs.CL

    On the interaction of automatic evaluation and task framing in headline style transfer

    Authors: Lorenzo De Mattei, Michele Cafagna, Huiyuan Lai, Felice Dell'Orletta, Malvina Nissim, Albert Gatt

    Abstract: An ongoing debate in the NLG community concerns the best way to evaluate systems, with human evaluation often being considered the most reliable method, compared to corpus-based metrics. However, tasks involving subtle textual differences, such as style transfer, tend to be hard for humans to perform. In this paper, we propose an evaluation method for this task based on purposely-trained classifie… ▽ More

    Submitted 5 January, 2021; originally announced January 2021.

  7. Linguistic Profiling of a Neural Language Model

    Authors: Alessio Miaschi, Dominique Brunato, Felice Dell'Orletta, Giulia Venturi

    Abstract: In this paper we investigate the linguistic knowledge learned by a Neural Language Model (NLM) before and after a fine-tuning process and how this knowledge affects its predictions during several classification problems. We use a wide set of probing tasks, each of which corresponds to a distinct sentence-level feature extracted from different levels of linguistic annotation. We show that BERT is a… ▽ More

    Submitted 7 November, 2020; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: Accepted to COLING 2020

    Journal ref: Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020)

  8. arXiv:2004.14253  [pdf, other

    cs.CL

    GePpeTto Carves Italian into a Language Model

    Authors: Lorenzo De Mattei, Michele Cafagna, Felice Dell'Orletta, Malvina Nissim, Marco Guerini

    Abstract: In the last few years, pre-trained neural architectures have provided impressive improvements across several NLP tasks. Still, generative language models are available mainly for English. We develop GePpeTto, the first generative language model for Italian, built using the GPT-2 architecture. We provide a thorough analysis of GePpeTto's quality by means of both an automatic and a human-based evalu… ▽ More

    Submitted 29 April, 2020; originally announced April 2020.