Showing 1–6 of 6 results for author: Petrillo, U F

  1. arXiv:2404.08245

    cs.DC stat.CO

    A Distributed Approach for Persistent Homology Computation on a Large Scale

    Authors: Riccardo Ceccaroni, Lorenzo Di Rocco, Umberto Ferraro Petrillo, Pierpaolo Brutti

    Abstract: Persistent homology (PH) is a powerful mathematical method to automatically extract relevant insights from images, such as those obtained by high-resolution imaging devices like electron microscopes or new-generation telescopes. However, the application of this method comes at a very high computational cost, that is bound to explode more because new imaging devices generate an ever-growing amount…

    Submitted 12 April, 2024; originally announced April 2024.

  2. arXiv:2106.15531

    q-bio.GN cs.DC

    The Power of Word-Frequency Based Alignment-Free Functions: a Comprehensive Large-scale Experimental Analysis -- Version 3

    Authors: Giuseppe Cattaneo, Umberto Ferraro Petrillo, Raffaele Giancarlo, Francesco Palini, Chiara Romualdi

    Abstract: Motivation: Alignment-free (AF) distance/similarity functions are a key tool for sequence analysis. Experimental studies on real datasets abound and, to some extent, there are also studies regarding their control of false positive rate (Type I error). However, assessment of their power, i.e., their ability to identify true similarity, has been limited to some members of the D2 family by experiment…

    Submitted 19 October, 2021; v1 submitted 27 June, 2021; originally announced June 2021.

  3. arXiv:2007.13673

    cs.DC

    FASTA/Q Data Compressors for MapReduce-Hadoop Genomics: Space and Time Savings Made Easy -- Version 1

    Authors: Umberto Ferraro Petrillo, Francesco Palini, Giuseppe Cattaneo, Raffaele Giancarlo

    Abstract: Motivation: Storage of genomic data is a major cost for the Life Sciences, effectively addressed mostly via specialized data compression methods. For the same reasons of abundance in data production, the use of Big Data technologies is seen as the future for genomic data storage and processing, with MapReduce-Hadoop as leaders. Somewhat surprisingly, none of the specialized FASTA/Q compressors is…

    Submitted 27 July, 2020; originally announced July 2020.

  4. Alignment-free Genomic Analysis via a Big Data Spark Platform

    Authors: Umberto Ferraro Petrillo, Francesco Palini, Giuseppe Cattaneo, Raffaele Giancarlo

    Abstract: Motivation: Alignment-free distance and similarity functions (AF functions, for short) are a well established alternative to two and multiple sequence alignments for many genomic, metagenomic and epigenomic tasks. Due to data-intensive applications, the computation of AF functions is a Big Data problem, with the recent Literature indicating that the development of fast and scalable algorithms comp…

    Submitted 23 October, 2021; v1 submitted 2 May, 2020; originally announced May 2020.

    Journal ref: Bioinformatics, Volume 37, Issue 12, 15 June 2021, Pages 1658-1665

  5. arXiv:1807.01566

    cs.DC

    Analyzing Big Datasets of Genomic Sequences: Fast and Scalable Collection of k-mer Statistics

    Authors: Umberto Ferraro Petrillo, Mara Sorella, Giuseppe Cattaneo, Raffaele Giancarlo, Simona Rombo

    Abstract: Distributed approaches based on the map-reduce programming paradigm have started to be proposed in the bioinformatics domain, due to the large amount of data produced by the next-generation sequencing techniques. However, the use of map-reduce and related Big Data technologies and frameworks (e.g., Apache Hadoop and Spark) does not necessarily produce satisfactory results, in terms of both efficie…

    Submitted 4 July, 2018; originally announced July 2018.

  6. Using HTML5 to Prevent Detection of Drive-by-Download Web Malware

    Authors: Alfredo De Santis, Giancarlo De Maio, Umberto Ferraro Petrillo

    Abstract: The web is experiencing an explosive growth in the last years. New technologies are introduced at a very fast-pace with the aim of narrowing the gap between web-based applications and traditional desktop applications. The results are web applications that look and feel almost like desktop applications while retaining the advantages of being originated from the web. However, these advancements come…

    Submitted 13 July, 2015; originally announced July 2015.

    Comments: This is the pre-peer reviewed version of the article: Using HTML5 to Prevent Detection of Drive-by-Download Web Malware, which has been published in final form at http://dx.doi.org/10.1002/sec.1077. This article may be used for non-commercial purposes in accordance with Wiley Terms and Conditions for Self-Archiving.

    Journal ref: Security and Communication Networks, Volume 8, Issue 7, pages 1237-1255, 10 May 2015