Skip to main content

Showing 1–3 of 3 results for author: Farnoud, F

Searching in archive q-bio. Search in all archives.
.
  1. arXiv:1611.05537  [pdf, other

    cs.IT cs.DM q-bio.GN

    Duplication Distance to the Root for Binary Sequences

    Authors: Noga Alon, Jehoshua Bruck, Farzad Farnoud, Siddharth Jain

    Abstract: We study the tandem duplication distance between binary sequences and their roots. In other words, the quantity of interest is the number of tandem duplication operations of the form $\seq x = \seq a \seq b \seq c \to \seq y = \seq a \seq b \seq b \seq c$, where $\seq x$ and $\seq y$ are sequences and $\seq a$, $\seq b$, and $\seq c$ are their substrings, needed to generate a binary sequence of le… ▽ More

    Submitted 16 November, 2016; originally announced November 2016.

    Comments: submitted to IEEE Transactions on Information Theory

  2. arXiv:1509.06029  [pdf, other

    cs.IT cs.DM cs.FL q-bio.GN

    Capacity and Expressiveness of Genomic Tandem Duplication

    Authors: Siddharth Jain, Farzad Farnoud, Jehoshua Bruck

    Abstract: The majority of the human genome consists of repeated sequences. An important type of repeated sequences common in the human genome are tandem repeats, where identical copies appear next to each other. For example, in the sequence $AGTC\underline{TGTG}C$, $TGTG$ is a tandem repeat, that may be generated from $AGTCTGC$ by a tandem duplication of length $2$. In this work, we investigate the possibil… ▽ More

    Submitted 20 September, 2015; originally announced September 2015.

    Comments: 19 pages, 3 figures, submitted to IEEE Transactions on Information Theory

  3. arXiv:1311.3932  [pdf, other

    q-bio.QM q-bio.GN

    MetaPar: Metagenomic Sequence Assembly via Iterative Reclassification

    Authors: Minji Kim, Jonathan G. Ligo, Amin Emad, Farzad Farnoud, Olgica Milenkovic, Venugopal V. Veeravalli

    Abstract: We introduce a parallel algorithmic architecture for metagenomic sequence assembly, termed MetaPar, which allows for significant reductions in assembly time and consequently enables the processing of large genomic datasets on computers with low memory usage. The gist of the approach is to iteratively perform read (re)classification based on phylogenetic marker genes and assembler outputs generated… ▽ More

    Submitted 15 November, 2013; originally announced November 2013.