Skip to main content

Showing 1–7 of 7 results for author: Daykin, J W

.
  1. arXiv:2402.17005  [pdf, other

    cs.HC

    A visualization tool to explore alphabet orderings for the Burrows-Wheeler Transform

    Authors: Lily Major, Dave Davies, Amanda Clare, Jacqueline W. Daykin, Benjamin Mora, Christine Zarges

    Abstract: The Burrows-Wheeler Transform (BWT) is an efficient invertible text transformation algorithm with the properties of tending to group identical characters together in a run, and enabling search of the text. This transformation has extensive uses particularly in lossless compression algorithms, indexing, and within bioinformatics for sequence alignment tasks. There has been recent interest in minimi… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: 8 pages, 2 figures

    ACM Class: H.5.2

  2. arXiv:2401.16435  [pdf, other

    cs.DM

    Heuristics for the Run-length Encoded Burrows-Wheeler Transform Alphabet Ordering Problem

    Authors: Lily Major, Amanda Clare, Jacqueline W. Daykin, Benjamin Mora, Christine Zarges

    Abstract: The Burrows-Wheeler Transform (BWT) is a string transformation technique widely used in areas such as bioinformatics and file compression. Many applications combine a run-length encoding (RLE) with the BWT in a way which preserves the ability to query the compressed data efficiently. However, these methods may not take full advantage of the compressibility of the BWT as they do not modify the alph… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: 32 pages, 8 figures

    ACM Class: I.2.8

  3. arXiv:2107.02503  [pdf, other

    math.CO cs.DS cs.FL

    On Arithmetically Progressed Suffix Arrays and related Burrows-Wheeler Transforms

    Authors: Jacqueline W. Daykin, Dominik Köppl, David Kübel, Florian Stober

    Abstract: We characterize those strings whose suffix arrays are based on arithmetic progressions, in particular, arithmetically progressed permutations where all pairs of successive entries of the permutation have the same difference modulo the respective string length. We show that an arithmetically progressed permutation $P$ coincides with the suffix array of a unary, binary, or ternary string. We further… ▽ More

    Submitted 6 July, 2021; originally announced July 2021.

  4. arXiv:1806.05942  [pdf, other

    cs.DS

    Enhanced string factoring from alphabet orderings

    Authors: Amanda Clare, Jacqueline W. Daykin

    Abstract: In this note we consider the concept of alphabet ordering in the context of string factoring. We propose a greedy-type algorithm which produces Lyndon factorizations with small numbers of factors along with a modification for large numbers of factors. For the technique we introduce the Exponent Parikh vector. Applications and research directions derived from circ-UMFFs are discussed.

    Submitted 15 June, 2018; originally announced June 2018.

    Comments: 9 pages

  5. arXiv:1708.01130  [pdf, ps, other

    cs.DS

    Efficient pattern matching in degenerate strings with the Burrows-Wheeler transform

    Authors: Jacqueline W. Daykin, Richard Groult, Yannick Guesnet, Thierry Lecroq, Arnaud Lefebvre, Martine Léonard, Laurent Mouchard, Élise Prieur-Gaston, Bruce Watson

    Abstract: A degenerate or indeterminate string on an alphabet $Σ$ is a sequence of non-empty subsets of $Σ$. Given a degenerate string $t$ of length $n$, we present a new method based on the Burrows--Wheeler transform for searching for a degenerate pattern of length $m$ in $t$ running in $O(mn)$ time on a constant size alphabet $Σ$. Furthermore, it is a hybrid pattern-matching technique that works on both r… ▽ More

    Submitted 3 August, 2017; originally announced August 2017.

    Comments: 7 pages, 1 figure

  6. arXiv:1507.07038  [pdf, ps, other

    cs.DS

    String Comparison in $V$-Order: New Lexicographic Properties & On-line Applications

    Authors: Ali Alatabbi, Jacqueline W. Daykin, M. Sohel Rahman, W. F. Smyth

    Abstract: $V$-order is a global order on strings related to Unique Maximal Factorization Families (UMFFs), which are themselves generalizations of Lyndon words. $V$-order has recently been proposed as an alternative to lexicographical order in the computation of suffix arrays and in the suffix-sorting induced by the Burrows-Wheeler transform. Efficient $V… ▽ More

    Submitted 24 July, 2015; originally announced July 2015.

  7. arXiv:1506.06983  [pdf, ps, other

    cs.DS

    Linear Algorithms for Computing the Lyndon Border Array and the Lyndon Suffix Array

    Authors: Ali Alatabbi, Jacqueline W. Daykin, M. Sohel Rahman

    Abstract: We consider the problem of finding repetitive structures and inherent patterns in a given string $\s{s}$ of length $n$ over a finite totally ordered alphabet. A border $\s{u}$ of a string $\s{s}$ is both a prefix and a suffix of $\s{s}$ such that $\s{u} \not= \s{s}$. The computation of the border array of a string $\s{s}$, namely the borders of each prefix of $\s{s}$, is strongly related to the st… ▽ More

    Submitted 23 June, 2015; originally announced June 2015.