-
arXiv:1908.06428
The smallest grammar problem revisited
Abstract: In a seminal paper of Charikar et al. on the smallest grammar problem, the authors derive upper and lower bounds on the approximation ratios for several grammar-based compressors, but in all cases there is a gap between the lower and upper bound. Here the gaps for $\mathsf{LZ78}$ and $\mathsf{BISECTION}$ are closed by showing that the approximation ratio of $\mathsf{LZ78}$ is…
Submitted 18 August, 2019; originally announced August 2019.
Comments: A short version of this paper appeared in the Proceedings of SPIRE 2016. This work has been supported by the DFG research project LO 748/10-1 (QUANT-KOMP)
-
arXiv:1802.05490
Grammar-based Compression of Unranked Trees
Abstract: We introduce forest straight-line programs (FSLPs) as a compressed representation of unranked ordered node-labelled trees. FSLPs are based on the operations of forest algebra and generalize tree straight-line programs. We compare the succinctness of FSLPs with two other compression schemes for unranked trees: top dags and tree straight-line programs of first-child/next-sibling encodings. Efficient…
Submitted 15 February, 2018; originally announced February 2018.
Comments: Extended version of a paper at CSR 2018
MSC Class: 68P30; 68Q42
ACM Class: E.4
-
arXiv:1712.05822
Optimal top dag compression
Abstract: It is shown that for a given ordered node-labelled tree of size $n$ and with $s$ many different node labels, one can construct in linear time a top dag of height $O(\log n)$ and size $O(n / \log_\sigma n) \cap O(d \cdot \log n)$, where $\sigma = \max\{2, s\}$ and $d$ is the size of the minimal dag. The size bound $O(n / \log_\sigma n)$ is optimal and improves on previous bounds.
Submitted 15 December, 2017; originally announced December 2017.
MSC Class: 68P30; 68P05
-
arXiv:1511.02141
Traversing Grammar-Compressed Trees with Constant Delay
Abstract: A grammar-compressed ranked tree is represented with a linear space overhead so that a single traversal step, i.e., the move to the parent or the $i$-th child, can be carried out in constant time. Moreover, we extend our data structure such that equality of subtrees can be checked in constant time.
Submitted 10 November, 2015; v1 submitted 6 November, 2015; originally announced November 2015.