Skip to main content

Showing 1–7 of 7 results for author: Romana, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.10426  [pdf, ps, other

    cs.DS cs.DM

    Bit catastrophes for the Burrows-Wheeler Transform

    Authors: Sara Giuliani, Shunsuke Inenaga, Zsuzsanna Lipták, Giuseppe Romana, Marinella Sciortino, Cristian Urbina

    Abstract: A bit catastrophe, loosely defined, is when a change in just one character of a string causes a significant change in the size of the compressed string. We study this phenomenon for the Burrows-Wheeler Transform (BWT), a string transform at the heart of several of the most popular compressors and aligners today. The parameter determining the size of the compressed data is the number of equal-lette… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: This work is an extended version of our conference article with the same title, published in the proceedings of DLT 2023

  2. arXiv:2404.07030  [pdf, other

    cs.DS cs.DM

    Exploring Repetitiveness Measures for Two-Dimensional Strings

    Authors: Giuseppe Romana, Marinella Sciortino, Cristian Urbina

    Abstract: Detecting and measuring repetitiveness of strings is a problem that has been extensively studied in data compression and text indexing. However, when the data are structured in a non-linear way, like in the context of two-dimensional strings, inherent redundancy offers a rich source for compression, yet systematic studies on repetitiveness measures are still lacking. In the paper we introduce exte… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  3. arXiv:2302.13647  [pdf, ps, other

    math.CO cs.DM

    String attractors of some simple-Parry automatic sequences

    Authors: France Gheeraert, Giuseppe Romana, Manon Stipulanti

    Abstract: Firstly studied by Kempa and Prezza in 2018 as the cement of text compression algorithms, string attractors have become a compelling object of theoretical research within the community of combinatorics on words. In this context, they have been studied for several families of finite and infinite words. In this paper, we obtain string attractors of prefixes of particular infinite words generalizing… ▽ More

    Submitted 22 March, 2024; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: Extended version of a paper published in WORDS 2023. The conference version is arXiv:2302.13647v1

    MSC Class: 68R15; 05A05; 11A67; 68P05; 68Q45

  4. arXiv:2206.00376  [pdf, ps, other

    cs.FL cs.DS

    String Attractors and Infinite Words

    Authors: Antonio Restivo, Giuseppe Romana, Marinella Sciortino

    Abstract: The notion of string attractor has been introduced in [Kempa and Prezza, 2018] in the context of Data Compression and it represents a set of positions of a finite word in which all of its factors can be "attracted". The smallest size $γ^*$ of a string attractor for a finite word is a lower bound for several repetitiveness measures associated with the most common compression schemes, including BWT-… ▽ More

    Submitted 1 June, 2022; originally announced June 2022.

  5. arXiv:2205.01576  [pdf, other

    cs.DS

    Computing Maximal Unique Matches with the r-index

    Authors: Sara Giuliani, Giuseppe Romana, Massimiliano Rossi

    Abstract: In recent years, pangenomes received increasing attention from the scientific community for their ability to incorporate population variation information and alleviate reference genome bias. Maximal Exact Matches (MEMs) and Maximal Unique Matches (MUMs) have proven themselves to be useful in multiple bioinformatic contexts, for example short-read alignment and multiple-genome alignment. However, s… ▽ More

    Submitted 3 May, 2022; originally announced May 2022.

    Comments: Our code is available at: https://github.com/saragiuliani/mum-phinder

  6. arXiv:2202.02609  [pdf, ps, other

    cs.FL cs.DS

    Logarithmic equal-letter runs for BWT of purely morphic words

    Authors: Andrea Frosini, Ilaria Mancini, Simone Rinaldi, Giuseppe Romana, Marinella Sciortino

    Abstract: In this paper we study the number $r_{bwt}$ of equal-letter runs produced by the Burrows-Wheeler transform ($BWT$) when it is applied to purely morphic finite words, which are words generated by iterating prolongable morphisms. Such a parameter $r_{bwt}$ is very significant since it provides a measure of the performances of the $BWT$, in terms of both compressibility and indexing. In particular, w… ▽ More

    Submitted 5 February, 2022; originally announced February 2022.

  7. arXiv:1907.04660  [pdf, other

    cs.DS cs.FL

    String Attractors and Combinatorics on Words

    Authors: Sabrina Mantaci, Antonio Restivo, Giuseppe Romana, Giovanna Rosone, Marinella Sciortino

    Abstract: The notion of \emph{string attractor} has recently been introduced in [Prezza, 2017] and studied in [Kempa and Prezza, 2018] to provide a unifying framework for known dictionary-based compressors. A string attractor for a word $w=w[1]w[2]\cdots w[n]$ is a subset $Γ$ of the positions $\{1,\ldots,n\}$, such that all distinct factors of $w$ have an occurrence crossing at least one of the elements of… ▽ More

    Submitted 10 July, 2019; originally announced July 2019.