
Showing 1–5 of 5 results for author: Morrelli, M

  1. arXiv:2008.09209

    cs.DC cs.AI

    Addestramento con Dataset Sbilanciati (Training with Imbalanced Datasets)

    Authors: Massimiliano Morrelli

    Abstract: English. The following document pursues the objective of comparing some useful methods to balance a dataset and obtain a trained model. The dataset used for training is made up of short and medium length sentences, such as simple phrases or extracts from conversations that took place on web channels. The training of the models will take place with the help of the structures made available by the A…

    Submitted 18 August, 2020; originally announced August 2020.

    Comments: in Italian

  2. arXiv:2002.00757

    cs.CL

    Similarità per la ricerca del dominio di una frase (Similarity for Finding the Domain of a Sentence)

    Authors: Massimiliano Morrelli, Giacomo Pansini, Massimiliano Polito, Arturo Vitale

    Abstract: English. This document aims to study the best algorithms to verify the belonging of a specific document to a related domain by comparing different methods for calculating the distance between two vectors. This study has been made possible with the help of the structures made available by the Apache Spark framework. Starting from the study illustrated in the publication "New frontier of textual cla…

    Submitted 31 January, 2020; originally announced February 2020.

    Comments: in Italian

  3. arXiv:1908.07917

    cs.DC

    Nuova frontiera della classificazione testuale: Big data e calcolo distribuito (New Frontier of Textual Classification: Big Data and Distributed Computing)

    Authors: Marco Covelli, Massimiliano Morrelli

    Abstract: This document was created in order to study the algorithms for the categorization of phrases and rank them using the facilities provided by the framework Apache Spark. Starting from the study illustrated in the publication "Classifying textual data: shallow, deep and ensemble methods" by Laura Anderlucci, Lucia Guastadisegni, Cinzia Viroli, we wanted to carry out a study on the possible realizatio…

    Submitted 28 June, 2019; originally announced August 2019.

    Comments: in Italian

  4. arXiv:1901.06238

    cs.DB

    Integrazione di Apache Hive con Spark (Integration of Apache Hive with Spark)

    Authors: Michele Gentile, Massimiliano Morrelli

    Abstract: English. This document describes the solutions adopted, which arose from the need to transfer a large amount of information between the most famous distributed SQL and NoSQL storage systems to perform analysis and/or modification operations exploiting the peculiarities of the same. The goal was achieved using the Spark engine and studying and using the open source library "Hive Warehouse Connector…

    Submitted 15 January, 2019; originally announced January 2019.

    Comments: in Italian

  5. arXiv:1810.12059

    cs.DB

    Studio e confronto delle strutture di Apache Spark (Study and Comparison of Apache Spark Structures)

    Authors: Massimiliano Morrelli

    Abstract: English. This document is designed to study the data structures that can be used in the Apache Spark framework and to evaluate the best performing ones to implement solutions, in particular we will evaluate advantages / disadvantages deriving from the use of Dataset for job creation. The observation of the results provides further support in evaluating the use of Dataset as an alternative to RDD,…

    Submitted 29 October, 2018; originally announced October 2018.

    Comments: in Italian