Skip to main content

Showing 1–4 of 4 results for author: Giuliani, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.10426  [pdf, ps, other

    cs.DS cs.DM

    Bit catastrophes for the Burrows-Wheeler Transform

    Authors: Sara Giuliani, Shunsuke Inenaga, Zsuzsanna Lipták, Giuseppe Romana, Marinella Sciortino, Cristian Urbina

    Abstract: A bit catastrophe, loosely defined, is when a change in just one character of a string causes a significant change in the size of the compressed string. We study this phenomenon for the Burrows-Wheeler Transform (BWT), a string transform at the heart of several of the most popular compressors and aligners today. The parameter determining the size of the compressed data is the number of equal-lette… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: This work is an extended version of our conference article with the same title, published in the proceedings of DLT 2023

  2. arXiv:2205.01576  [pdf, other

    cs.DS

    Computing Maximal Unique Matches with the r-index

    Authors: Sara Giuliani, Giuseppe Romana, Massimiliano Rossi

    Abstract: In recent years, pangenomes received increasing attention from the scientific community for their ability to incorporate population variation information and alleviate reference genome bias. Maximal Exact Matches (MEMs) and Maximal Unique Matches (MUMs) have proven themselves to be useful in multiple bioinformatic contexts, for example short-read alignment and multiple-genome alignment. However, s… ▽ More

    Submitted 3 May, 2022; originally announced May 2022.

    Comments: Our code is available at: https://github.com/saragiuliani/mum-phinder

  3. Novel Results on the Number of Runs of the Burrows-Wheeler-Transform

    Authors: Sara Giuliani, Shunsuke Inenaga, Zsuzsanna Lipták, Nicola Prezza, Marinella Sciortino, Anna Toffanello

    Abstract: The Burrows-Wheeler-Transform (BWT), a reversible string transformation, is one of the fundamental components of many current data structures in string processing. It is central in data compression, as well as in efficient query algorithms for sequence data, such as webpages, genomic and other biological sequences, or indeed any textual data. The BWT lends itself well to compression because its nu… ▽ More

    Submitted 19 August, 2020; originally announced August 2020.

    Comments: 14 pages, 2 figues

    Report number: 47th Int. Conference on Current Trends in Theory and Practice of Computer Science (SOFSEM 2021), LNCS 12607: 249--262 (2021)

  4. When a Dollar Makes a BWT

    Authors: Sara Giuliani, Zsuzsanna Lipták, Francesco Masillo, Romeo Rizzi

    Abstract: The Burrows-Wheeler-Transform (BWT) is a reversible string transformation which plays a central role in text compression and is fundamental in many modern bioinformatics applications. The BWT is a permutation of the characters, which is in general better compressible and allows to answer several different query types more efficiently than the original string. It is easy to see that not every str… ▽ More

    Submitted 12 March, 2021; v1 submitted 24 August, 2019; originally announced August 2019.

    Comments: This is the journal version of paper at ICTCS 2019 (20th Italian Conference on Theoretical Computer Science, 9-11 Sept. 2019, Como, Italy). Journal version appeared in TCS 2021

    Journal ref: Theoretical Computer Science 857: 123-146 (2021)