The block spectrum of RNA pseudoknot structures
Authors:
Thomas J. X. Li,
Christina S. Burris,
Christian M. Reidys
Abstract:
In this paper we analyze the length-spectrum of blocks in $γ$-structures. $γ$-structures are a class of RNA pseudoknot structures that plays a key role in the context of polynomial time RNA folding. A $γ$-structure is constructed by nesting and concatenating specific building components having topological genus at most $γ$. A block is a substructure enclosed by crossing maximal arcs with respect t…
▽ More
In this paper we analyze the length-spectrum of blocks in $γ$-structures. $γ$-structures are a class of RNA pseudoknot structures that plays a key role in the context of polynomial time RNA folding. A $γ$-structure is constructed by nesting and concatenating specific building components having topological genus at most $γ$. A block is a substructure enclosed by crossing maximal arcs with respect to the partial order induced by nesting. We show that, in uniformly generated $γ$-structures, there is a significant gap in this length-spectrum, i.e., there asymptotically almost surely exists a unique longest block of length at least $n-O(n^{1/2})$ and that with high probability any other block has finite length. For fixed $γ$, we prove that the length of the longest block converges to a discrete limit law, and that the distribution of short blocks of given length tends to a negative binomial distribution in the limit of long sequences. We refine this analysis to the length spectrum of blocks of specific pseudoknot types, such as H-type and kissing hairpins. Our results generalize the rainbow spectrum on secondary structures by the first and third authors and are being put into context with the structural prediction of long non-coding RNAs.
△ Less
Submitted 12 June, 2018;
originally announced June 2018.
An Unoriented Variation on de Bruijn Sequences
Authors:
Christie S. Burris,
Francis C. Motta,
Patrick D. Shipman
Abstract:
For positive integers $k,n$, a de Bruijn sequence $B(k,n)$ is a finite sequence of elements drawn from $k$ characters whose subwords of length $n$ are exactly the $k^n$ words of length $n$ on $k$ characters. This paper introduces the unoriented de Bruijn sequence $uB(k,n)$, an analog to de Bruijn sequences, but for which the sequence is read both forwards and backwards to determine the set of subw…
▽ More
For positive integers $k,n$, a de Bruijn sequence $B(k,n)$ is a finite sequence of elements drawn from $k$ characters whose subwords of length $n$ are exactly the $k^n$ words of length $n$ on $k$ characters. This paper introduces the unoriented de Bruijn sequence $uB(k,n)$, an analog to de Bruijn sequences, but for which the sequence is read both forwards and backwards to determine the set of subwords of length $n$. We show that nontrivial unoriented de Bruijn sequences of optimal length exist if and only if $k$ is two or odd and $n$ is less than or equal to 3. Unoriented de Bruijn sequences for any $k$, $n$ may be constructed from certain Eulerian paths in Eulerizations of unoriented de Bruijn graphs.
△ Less
Submitted 30 August, 2016;
originally announced August 2016.