Showing 1–3 of 3 results for author: Muffo, M

  1. arXiv:2304.10977  [pdf, other]

    cs.CL stat.ML

    Evaluating Transformer Language Models on Arithmetic Operations Using Number Decomposition

    Authors: Matteo Muffo, Aldo Cocco, Enrico Bertino

    Abstract: In recent years, Large Language Models such as GPT-3 have shown remarkable capabilities in performing NLP tasks in the zero- and few-shot settings. On the other hand, the experiments highlighted the difficulty of GPT-3 in carrying out tasks that require a certain degree of reasoning, such as arithmetic operations. In this paper we evaluate the ability of Transformer Language Models to perform arithmeti…

    Submitted 21 April, 2023; originally announced April 2023.

    Comments: 7 pages, 1 figure, published at LREC 2022

    Journal ref: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), pages 291-297

  2. arXiv:2304.03098  [pdf, other]

    cs.CL stat.ML

    Static Fuzzy Bag-of-Words: a lightweight sentence embedding algorithm

    Authors: Matteo Muffo, Roberto Tedesco, Licia Sbattella, Vincenzo Scotti

    Abstract: The introduction of embedding techniques has significantly pushed forward the Natural Language Processing field. Many of the proposed solutions have been presented for word-level encoding; however, in recent years, new mechanisms to treat information at a higher level of aggregation, such as the sentence and document level, have emerged. With this work we address specifically the sentence embeddings…

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: 9 pages, 2 figures

    Journal ref: Proceedings of the 4th International Conference on Natural Language and Speech Processing (ICNLSP 2021)

  3. arXiv:2303.18121  [pdf, ps, other]

    cs.CL stat.ML

    BERTino: an Italian DistilBERT model

    Authors: Matteo Muffo, Enrico Bertino

    Abstract: The recent introduction of Transformer language representation models has allowed great improvements in many natural language processing (NLP) tasks. However, if on one hand the performance achieved by this kind of architecture is surprising, on the other hand its usability is limited by the high number of parameters that constitute the network, resulting in high computational and memory demands.…

    Submitted 31 March, 2023; originally announced March 2023.

    Comments: 5 pages

    Journal ref: Proceedings of the Seventh Italian Conference on Computational Linguistics CLiC-it 2020