Showing 1–42 of 42 results for author: Gabrys, R

  1. arXiv:2407.07292  [pdf, other]

    cs.CR

    HoneyGAN Pots: A Deep Learning Approach for Generating Honeypots

    Authors: Ryan Gabrys, Daniel Silva, Mark Bilinski

    Abstract: This paper investigates the feasibility and effectiveness of employing Generative Adversarial Networks (GANs) for the generation of decoy configurations in the field of cyber defense. The utilization of honeypots has been extensively studied in the past; however, selecting appropriate decoy configurations for a given cyber scenario (and subsequently retrieving/generating them) remain open challeng…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: Presented at the 2nd International Workshop on Adaptive Cyber Defense, 2023 (arXiv:2308.09520)

    Report number: ACD/2023/112

  2. arXiv:2406.17689  [pdf, ps, other]

    cs.IT cs.DS

    Robust Gray Codes Approaching the Optimal Rate

    Authors: Roni Con, Dorsa Fathollahi, Ryan Gabrys, Mary Wootters, Eitan Yaakobi

    Abstract: Robust Gray codes were introduced by (Lolck and Pagh, SODA 2024). Informally, a robust Gray code is a (binary) Gray code $\mathcal{G}$ so that, given a noisy version of the encoding $\mathcal{G}(j)$ of an integer $j$, one can recover $\hat{j}$ that is close to $j$ (with high probability over the noise). Such codes have found applications in differential privacy. In this work, we present near-opt…

    Submitted 25 June, 2024; originally announced June 2024.

  3. arXiv:2405.16370  [pdf, ps, other]

    cs.IT

    Quickly-Decodable Group Testing with Fewer Tests: Price-Scarlett and Cheraghchi-Nakos's Nonadaptive Splitting with Explicit Scalars

    Authors: Hsin-Po Wang, Ryan Gabrys, Venkatesan Guruswami

    Abstract: We modify Cheraghchi-Nakos [CN20] and Price-Scarlett's [PS20] fast binary splitting approach to nonadaptive group testing. We show that, to identify a uniformly random subset of $k$ infected persons among a population of $n$, it takes only $\ln(2 - 4\varepsilon)^{-2} k \ln n$ tests and decoding complexity $O(\varepsilon^{-2} k \ln n)$, for any small $\varepsilon > 0$, with vanishing error probabi…

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 6 pages, 3 figures, ISIT 2023

  4. arXiv:2403.19061  [pdf, other]

    cs.IT

    One Code Fits All: Strong stuck-at codes for versatile memory encoding

    Authors: Roni Con, Ryan Gabrys, Eitan Yaakobi

    Abstract: In this work we consider a generalization of the well-studied problem of coding for ``stuck-at'' errors, which we refer to as ``strong stuck-at'' codes. In the traditional framework of stuck-at codes, the task involves encoding a message into a one-dimensional binary vector. However, a certain number of the bits in this vector are 'frozen', meaning they are fixed at a predetermined value and canno…

    Submitted 27 March, 2024; originally announced March 2024.

  5. arXiv:2402.03987  [pdf, other]

    cs.IT

    Tail-Erasure-Correcting Codes

    Authors: Boaz Moav, Ryan Gabrys, Eitan Yaakobi

    Abstract: The increasing demand for data storage has prompted the exploration of new techniques, with molecular data storage being a promising alternative. In this work, we develop coding schemes for a new storage paradigm that can be represented as a collection of two-dimensional arrays. Motivated by error patterns observed in recent prototype architectures, our study focuses on correcting erasures in the…

    Submitted 6 February, 2024; originally announced February 2024.

  6. arXiv:2401.17649  [pdf, other]

    cs.IT

    Covering All Bases: The Next Inning in DNA Sequencing Efficiency

    Authors: Hadas Abraham, Ryan Gabrys, Eitan Yaakobi

    Abstract: DNA emerges as a promising medium for the exponential growth of digital data due to its density and durability. This study extends recent research by addressing the \emph{coverage depth problem} in practical scenarios, exploring optimal error-correcting code pairings with DNA storage systems to minimize coverage depth. Conducted within random access settings, the study provides theoretical analyse…

    Submitted 31 January, 2024; originally announced January 2024.

  7. arXiv:2401.15666  [pdf, other]

    cs.IT

    Error-Correcting Codes for Combinatorial Composite DNA

    Authors: Omer Sabary, Inbal Preuss, Ryan Gabrys, Zohar Yakhini, Leon Anavy, Eitan Yaakobi

    Abstract: Data storage in DNA is developing as a possible solution for archival digital data. Recently, to further increase the potential capacity of DNA-based data storage systems, the combinatorial composite DNA synthesis method was suggested. This approach extends the DNA alphabet by harnessing short DNA fragment reagents, known as shortmers. The shortmers are building blocks of the alphabet symbols, con…

    Submitted 26 May, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

  8. arXiv:2308.14558  [pdf, other]

    cs.IT math.CO

    Storage codes and recoverable systems on lines and grids

    Authors: Alexander Barg, Ohad Elishco, Ryan Gabrys, Geyang Wang, Eitan Yaakobi

    Abstract: A storage code is an assignment of symbols to the vertices of a connected graph $G(V,E)$ with the property that the value of each vertex is a function of the values of its neighbors, or more generally, of a certain neighborhood of the vertex in $G$. In this work we introduce a new construction method of storage codes, enabling one to construct new codes from known ones via an interleaving procedur…

    Submitted 28 August, 2023; originally announced August 2023.

  9. arXiv:2305.05656  [pdf, other]

    cs.DM cs.IT math.PR

    Cover Your Bases: How to Minimize the Sequencing Coverage in DNA Storage Systems

    Authors: Daniella Bar-Lev, Omer Sabary, Ryan Gabrys, Eitan Yaakobi

    Abstract: Although the expenses associated with DNA sequencing have been rapidly decreasing, the current cost of sequencing information stands at roughly $120/GB, which is dramatically more expensive than reading from existing archival storage solutions today. In this work, we aim to reduce not only the cost but also the latency of DNA storage by initiating the study of the DNA coverage depth problem, which…

    Submitted 29 November, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

  10. arXiv:2304.01365  [pdf, ps, other]

    cs.IT

    Finding a Burst of Positives via Nonadaptive Semiquantitative Group Testing

    Authors: Yun-Han Li, Ryan Gabrys, Jin Sima, Ilan Shomorony, Olgica Milenkovic

    Abstract: Motivated by testing for pathogenic diseases we consider a new nonadaptive group testing problem for which: (1) positives occur within a burst, capturing the fact that infected test subjects often come in clusters, and (2) that the test outcomes arise from semiquantitative measurements that provide coarse information about the number of positives in any tested group. Our model generalizes prior wo…

    Submitted 3 April, 2023; originally announced April 2023.

  11. arXiv:2210.11818  [pdf, ps, other]

    cs.IT

    Non-binary Codes for Correcting a Burst of at Most t Deletions

    Authors: Shuche Wang, Yuanyuan Tang, Jin Sima, Ryan Gabrys, Farzad Farnoud

    Abstract: The problem of correcting deletions has received significant attention, partly because of the prevalence of these errors in DNA data storage. In this paper, we study the problem of correcting a consecutive burst of at most $t$ deletions in non-binary sequences. We first propose a non-binary code correcting a burst of at most 2 deletions for $q$-ary alphabets. Afterwards, we extend this result to t…

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: 20 pages. The paper has been submitted to IEEE Transactions on Information Theory. Furthermore, the paper was presented in part at the ISIT2021 and Allerton2022

  12. arXiv:2208.02330  [pdf, other]

    cs.IT

    Low-redundancy codes for correcting multiple short-duplication and edit errors

    Authors: Yuanyuan Tang, Shuche Wang, Hao Lou, Ryan Gabrys, Farzad Farnoud

    Abstract: Due to its higher data density, longevity, energy efficiency, and ease of generating copies, DNA is considered a promising storage technology for satisfying future needs. However, a diverse set of errors including deletions, insertions, duplications, and substitutions may arise in DNA at different stages of data storage and retrieval. The current paper constructs error-correcting codes for simulta…

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: 21 pages. The paper has been submitted to IEEE Transactions on Information Theory. Furthermore, the paper was presented in part at the ISIT2021 and ISIT2022

  13. Accelerating Polarization via Alphabet Extension

    Authors: Iwan Duursma, Ryan Gabrys, Venkatesan Guruswami, Ting-Chun Lin, Hsin-Po Wang

    Abstract: Polarization is an unprecedented coding technique in that it not only achieves channel capacity, but also does so at a faster speed of convergence than any other coding technique. This speed is measured by the ``scaling exponent'' and its importance is three-fold. Firstly, estimating the scaling exponent is challenging and demands a deeper understanding of the dynamics of communication channels. S…

    Submitted 15 July, 2023; v1 submitted 10 July, 2022; originally announced July 2022.

    Comments: 22 pages, 4 figures. Accepted to RANDOM 2022. v2: 29 pages, 5 figures, 1 table; address comments from JSAIT

    MSC Class: 94B65

  14. arXiv:2204.11683  [pdf, other]

    cs.IT

    Sub-4.7 Scaling Exponent of Polar Codes

    Authors: Hsin-Po Wang, Ting-Chun Lin, Alexander Vardy, Ryan Gabrys

    Abstract: Polar code visibly approaches channel capacity in practice and is thereby a constituent code of the 5G standard. Compared to low-density parity-check code, however, the performance of short-length polar code has room for improvement that could hinder its adoption by a wider class of applications. As part of the program that addresses the performance issue at short length, it is crucial to underst…

    Submitted 25 April, 2022; originally announced April 2022.

    Comments: 15 pages, 13 figures, 1 table

    MSC Class: 94B65

  15. arXiv:2201.12671  [pdf, ps, other]

    math.CO

    The Gapped $k$-Deck Problem

    Authors: Rebecca Golm, Mina Nahvi, Ryan Gabrys, Olgica Milenkovic

    Abstract: The $k$-deck problem is concerned with finding the smallest positive integer $S(k)$ such that there exist at least two strings of length $S(k)$ that share the same $k$-deck, i.e., the multiset of subsequences of length $k$. We introduce the new problem of gapped $k$-deck reconstruction: For a given gap parameter $s$, we seek the smallest positive integer $G_s(k)$ such that there exist at least two…

    Submitted 17 May, 2022; v1 submitted 29 January, 2022; originally announced January 2022.
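
To make the $k$-deck definition in the abstract above concrete, the following is a minimal sketch that builds the multiset of length-$k$ subsequences of a string. The function name `k_deck` and the two example strings are illustrative assumptions, not taken from the paper.

```python
from collections import Counter
from itertools import combinations


def k_deck(s: str, k: int) -> Counter:
    """Return the k-deck of s: the multiset of all length-k subsequences."""
    # Each increasing choice of k positions yields one (not necessarily
    # contiguous) subsequence; Counter keeps the multiplicities.
    return Counter(
        "".join(s[i] for i in idx) for idx in combinations(range(len(s)), k)
    )


# Two distinct binary strings of length 4 (chosen for illustration).
a, b = "0110", "1001"
same_2_deck = k_deck(a, 2) == k_deck(b, 2)  # the 2-decks coincide
same_3_deck = k_deck(a, 3) == k_deck(b, 3)  # the 3-decks differ
```

In this example the two strings cannot be told apart from their 2-decks, while their 3-decks distinguish them.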

  16. arXiv:2201.09171  [pdf, other]

    math.CO cs.IT

    Balanced and Swap-Robust Trades for Dynamical Distributed Storage

    Authors: Chao Pan, Ryan Gabrys, Xujun Liu, Charles Colbourn, Olgica Milenkovic

    Abstract: Trades, introduced by Hedayat, are two sets of blocks of elements which may be exchanged (traded) without altering the counts of certain subcollections of elements within their constituent blocks. They are of importance in applications where certain combinations of elements dynamically become prohibited from being placed in the same group of elements, since in this case one can trade the offending…

    Submitted 13 May, 2022; v1 submitted 22 January, 2022; originally announced January 2022.

    Comments: 6 pages

  17. arXiv:2201.05440  [pdf, ps, other]

    q-bio.QM stat.AP

    Tropical Group Testing

    Authors: Hsin-Po Wang, Ryan Gabrys, Alexander Vardy

    Abstract: Polymerase chain reaction (PCR) testing is the gold standard for diagnosing COVID-19. PCR amplifies the virus DNA 40 times to produce measurements of viral loads that span seven orders of magnitude. Unfortunately, the outputs of these tests are imprecise and therefore quantitative group testing methods, which rely on precise measurements, are not applicable. Motivated by the ever-increasing demand…

    Submitted 17 January, 2022; v1 submitted 12 January, 2022; originally announced January 2022.

    Comments: 25 pages, 20 figures. v2 fixes typos

    MSC Class: 05B20; 15A80 (Primary)

  18. arXiv:2112.09971  [pdf, ps, other]

    cs.IT cs.DS math.CO

    Beyond Single-Deletion Correcting Codes: Substitutions and Transpositions

    Authors: Ryan Gabrys, Venkatesan Guruswami, João Ribeiro, Ke Wu

    Abstract: We consider the problem of designing low-redundancy codes in settings where one must correct deletions in conjunction with substitutions or adjacent transpositions; a combination of errors that is usually observed in DNA-based data storage. One of the most basic versions of this problem was settled more than 50 years ago by Levenshtein, or one substitution, with nearly optimal redundancy. However,…

    Submitted 18 December, 2021; originally announced December 2021.

    Comments: 33 pages, 7 figures

  19. arXiv:2110.02352  [pdf, other]

    cs.IT

    Reconstruction of Sets of Strings from Prefix/Suffix Compositions

    Authors: Ryan Gabrys, Srilakshmi Pattabiraman, Olgica Milenkovic

    Abstract: The problem of reconstructing strings from substring information has found many applications due to its importance in genomic data sequencing and DNA- and polymer-based data storage. One practically important and challenging paradigm requires reconstructing mixtures of strings based on the union of compositions of their prefixes and suffixes, generated by mass spectrometry devices. We describe new…

    Submitted 5 October, 2021; originally announced October 2021.

  20. arXiv:2102.04519  [pdf, ps, other]

    cs.IT cs.DM

    Semiquantitative Group Testing in at Most Two Rounds

    Authors: Mahdi Cheraghchi, Ryan Gabrys, Olgica Milenkovic

    Abstract: Semiquantitative group testing (SQGT) is a pooling method in which the test outcomes represent bounded intervals for the number of defectives. Alternatively, it may be viewed as an adder channel with quantized outputs. SQGT represents a natural choice for Covid-19 group testing as it allows for a straightforward interpretation of the cycle threshold values produced by polymerase chain reactions (P…

    Submitted 8 February, 2021; originally announced February 2021.
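
The "adder channel with quantized outputs" view in the abstract above can be sketched as follows: count the defectives in a pool, then report only which interval the count falls into. The function name `sqgt_outcome` and the threshold values are hypothetical and chosen only for illustration; the paper's actual test model may differ.

```python
import bisect


def sqgt_outcome(pool, defectives, thresholds):
    """Return the index of the interval containing the number of defectives.

    thresholds is a sorted list of interval lower bounds (an assumption of
    this sketch). With thresholds [1, 3, 6] the intervals are
    {0}, [1, 2], [3, 5], and [6, infinity).
    """
    # Adder channel: the number of defective items present in the pool.
    count = len(set(pool) & set(defectives))
    # Quantization: map the exact count to the index of its interval.
    return bisect.bisect_right(thresholds, count)


thresholds = [1, 3, 6]      # hypothetical interval boundaries
defectives = {2, 5, 7, 11}  # hypothetical defective items
outcome_small = sqgt_outcome([1, 2, 3], defectives, thresholds)
outcome_large = sqgt_outcome([2, 5, 7, 8], defectives, thresholds)
outcome_none = sqgt_outcome([1, 3, 4], defectives, thresholds)
```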

  21. arXiv:2011.05223  [pdf, other]

    q-bio.QM stat.ME

    AC-DC: Amplification Curve Diagnostics for Covid-19 Group Testing

    Authors: Ryan Gabrys, Srilakshmi Pattabiraman, Vishal Rana, João Ribeiro, Mahdi Cheraghchi, Venkatesan Guruswami, Olgica Milenkovic

    Abstract: The first part of the paper presents a review of the gold-standard testing protocol for Covid-19, real-time, reverse transcriptase PCR, and its properties and associated measurement data such as amplification curves that can guide the development of appropriate and accurate adaptive group testing protocols. The second part of the paper is concerned with examining various off-the-shelf group testin…

    Submitted 5 June, 2021; v1 submitted 10 November, 2020; originally announced November 2020.

  22. arXiv:2010.11116  [pdf, ps, other]

    cs.IT

    Reconstructing Mixtures of Coded Strings from Prefix and Suffix Compositions

    Authors: Ryan Gabrys, Srilakshmi Pattabiraman, Olgica Milenkovic

    Abstract: The problem of string reconstruction from substring information has found many applications due to its relevance in DNA- and polymer-based data storage. One practically important and challenging paradigm requires reconstructing mixtures of strings based on the union of compositions of their prefixes and suffixes, generated by mass spectrometry readouts. We describe new coding methods that allow fo…

    Submitted 21 October, 2020; originally announced October 2020.

  23. arXiv:2003.02121  [pdf, other]

    cs.IT

    Coding for Polymer-Based Data Storage

    Authors: Srilakshmi Pattabiraman, Ryan Gabrys, Olgica Milenkovic

    Abstract: Motivated by polymer-based data-storage platforms that use chains of binary synthetic polymers as the recording media and read the content via tandem mass spectrometers, we propose a new family of codes that allows for both unique string reconstruction and correction of multiple mass errors. We consider two approaches: The first approach pertains to asymmetric errors and it is based on introducing…

    Submitted 28 June, 2021; v1 submitted 2 March, 2020; originally announced March 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1904.09280, arXiv:2001.04967

  24. arXiv:2001.04967  [pdf, ps, other]

    cs.IT

    Mass Error-Correction Codes for Polymer-Based Data Storage

    Authors: Ryan Gabrys, Srilakshmi Pattabiraman, Olgica Milenkovic

    Abstract: We consider the problem of correcting mass readout errors in information encoded in binary polymer strings. Our work builds on results for string reconstruction problems using composition multisets [Acharya et al., 2015] and the unique string reconstruction framework proposed in [Pattabiraman et al., 2019]. Binary polymer-based data storage systems [Laure et al., 2016] operate by designing two mol…

    Submitted 14 January, 2020; originally announced January 2020.

  25. arXiv:1910.06501  [pdf, ps, other]

    cs.IT

    Optimal Codes Correcting a Single Indel / Edit for DNA-Based Data Storage

    Authors: Kui Cai, Yeow Meng Chee, Ryan Gabrys, Han Mao Kiah, Tuan Thanh Nguyen

    Abstract: An indel refers to a single insertion or deletion, while an edit refers to a single insertion, deletion or substitution. In this paper, we investigate codes that combat either a single indel or a single edit and provide linear-time algorithms that encode binary messages into these codes of length n. Over the quaternary alphabet, we provide two linear-time encoders. One corrects a single edit with…

    Submitted 14 October, 2019; originally announced October 2019.

    Comments: 15 pages
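
The distinction between an indel and an edit drawn in the abstract above can be illustrated by enumerating everything a single error can turn a word into. This is a brute-force sketch for small examples only; the function names and the default alphabet `"ACGT"` are assumptions made here, and the code is not the paper's encoder.

```python
def single_indel_ball(x: str, alphabet: str = "ACGT") -> set:
    """All strings obtained from x by exactly one insertion or one deletion."""
    deletions = {x[:i] + x[i + 1:] for i in range(len(x))}
    insertions = {
        x[:i] + c + x[i:] for i in range(len(x) + 1) for c in alphabet
    }
    return deletions | insertions


def single_edit_ball(x: str, alphabet: str = "ACGT") -> set:
    """All strings obtained from x by one insertion, deletion, or substitution."""
    substitutions = {
        x[:i] + c + x[i + 1:]
        for i in range(len(x))
        for c in alphabet
        if c != x[i]
    }
    return single_indel_ball(x, alphabet) | substitutions


indel_ball = single_indel_ball("AC")
edit_ball = single_edit_ball("AC")
```

A code corrects a single edit when the edit balls of distinct codewords are pairwise disjoint, which is a stronger requirement than disjoint indel balls because every edit ball contains the corresponding indel ball.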

  26. Transition Waste Optimization for Coded Elastic Computing

    Authors: Hoang Dau, Ryan Gabrys, Yu-Chih Huang, Chen Feng, Quang-Hung Luu, Eidah Alzahrani, Zahir Tari

    Abstract: Distributed computing, in which a resource-intensive task is divided into subtasks and distributed among different machines, plays a key role in solving large-scale problems. Coded computing is a recently emerging paradigm where redundancy for distributed computing is introduced to alleviate the impact of slow machines (stragglers) on the completion time. We investigate coded computing solutions o…

    Submitted 14 March, 2023; v1 submitted 2 October, 2019; originally announced October 2019.

    Comments: 24 pages, accepted by IEEE Transactions on Information Theory

  27. arXiv:1909.05694  [pdf, ps, other]

    cs.IT

    Repeat-Free Codes

    Authors: Ohad Elishco, Ryan Gabrys, Eitan Yaakobi, Muriel Médard

    Abstract: In this paper we consider the problem of encoding data into \textit{repeat-free} sequences, which are required to contain any $k$-tuple at most once (for predefined $k$). First, the capacity of the repeat-free constraint is calculated. Then, an efficient algorithm, which uses two bits of redundancy, is presented to encode length-$n$ sequences for $k=2+2\log (n)$. This algorithm is then…

    Submitted 21 June, 2021; v1 submitted 12 September, 2019; originally announced September 2019.

    Comments: 21 pages
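
The repeat-free constraint described in the abstract above can be checked with a single pass over the sequence. The sketch below assumes that a "$k$-tuple" means a window of $k$ consecutive symbols; the function name `is_repeat_free` is an illustrative choice, and this is only a constraint checker, not the paper's encoding algorithm.

```python
def is_repeat_free(seq: str, k: int) -> bool:
    """Return True if every length-k window of seq occurs at most once."""
    seen = set()
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        if window in seen:
            # The same k-tuple appears at two different positions.
            return False
        seen.add(window)
    return True


ok_k2 = is_repeat_free("0011", 2)    # windows 00, 01, 11 are distinct
bad_k2 = is_repeat_free("0101", 2)   # window 01 appears twice
ok_k3 = is_repeat_free("0101", 3)    # windows 010, 101 are distinct
```

The third example shows that a sequence violating the constraint for one value of $k$ can satisfy it for a larger one.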

  28. arXiv:1906.12073  [pdf, ps, other]

    cs.IT math.CO

    Access Balancing in Storage Systems by Labeling Partial Steiner Systems

    Authors: Yeow Meng Chee, Charles J. Colbourn, Hoang Dau, Ryan Gabrys, Alan C. H. Ling, Dylan Lusi, Olgica Milenkovic

    Abstract: Storage architectures ranging from minimum bandwidth regenerating encoded distributed storage systems to declustered-parity RAIDs can be designed using dense partial Steiner systems in order to support fast reads, writes, and recovery of failed storage units. In order to ensure good performance, popularities of the data items should be taken into account and the frequencies of accesses to the stor…

    Submitted 28 June, 2019; originally announced June 2019.

    Comments: 16 pages

  29. arXiv:1904.09280  [pdf, other]

    cs.IT

    Reconstruction and Error-Correction Codes for Polymer-Based Data Storage

    Authors: Srilakshmi Pattabiraman, Ryan Gabrys, Olgica Milenkovic

    Abstract: Motivated by polymer-based data-storage platforms that use chains of binary synthetic polymers as the recording media and read the content via tandem mass spectrometers, we propose a new family of codes that allows for unique string reconstruction and correction of one mass error. Our approach is based on introducing redundancy that scales logarithmically with the length of the string and allows f…

    Submitted 19 April, 2019; originally announced April 2019.

  30. arXiv:1903.09992  [pdf, ps, other]

    cs.IT math.CO

    Coded trace reconstruction

    Authors: Mahdi Cheraghchi, Ryan Gabrys, Olgica Milenkovic, João Ribeiro

    Abstract: Motivated by average-case trace reconstruction and coding for portable DNA-based storage systems, we initiate the study of \emph{coded trace reconstruction}, the design and analysis of high-rate efficiently encodable codes that can be efficiently decoded with high probability from few reads (also called \emph{traces}) corrupted by edit errors. Codes used in current portable DNA-based storage syste…

    Submitted 9 September, 2019; v1 submitted 24 March, 2019; originally announced March 2019.

    Comments: v2 and v3: added missing references; v4: added funding acknowledgment; v5: added references to concurrent, independent work; v6: added funding acknowledgment. 26 pages, no figures. A short version of this paper was presented at ITW 2019

  31. arXiv:1901.05559  [pdf, ps, other]

    cs.IT math.CO

    Set-Codes with Small Intersections and Small Discrepancies

    Authors: R. Gabrys, H. S. Dau, C. J. Colbourn, O. Milenkovic

    Abstract: We are concerned with the problem of designing large families of subsets over a common labeled ground set that have small pairwise intersections and the property that the maximum discrepancy of the label values within each of the sets is less than or equal to one. Our results, based on transversal designs, factorizations of packings and Latin rectangles, show that by jointly constructing the sets…

    Submitted 16 January, 2019; originally announced January 2019.

  32. arXiv:1809.04702  [pdf, other]

    cs.IT

    Reconciling Similar Sets of Data

    Authors: Ryan Gabrys, Farzad Farnoud

    Abstract: In this work, we consider the problem of synchronizing two sets of data where the size of the symmetric difference between the sets is small and, in addition, the elements in the symmetric difference are related through the Hamming distance metric. Upper and lower bounds are derived on the minimum amount of information exchange. Furthermore, explicit encoding and decoding algorithms are provided f…

    Submitted 12 September, 2018; originally announced September 2018.

  33. arXiv:1804.04548  [pdf, ps, other]

    cs.IT

    Unique Reconstruction of Coded Strings from Multiset Substring Spectra

    Authors: Ryan Gabrys, Olgica Milenkovic

    Abstract: The problem of reconstructing strings from their substring spectra has a long history and in its most simple incarnation asks for determining under which conditions the spectrum uniquely determines the string. We study the problem of coded string reconstruction from multiset substring spectra, where the strings are restricted to lie in some codebook. In particular, we consider binary codebooks tha…

    Submitted 22 April, 2019; v1 submitted 12 April, 2018; originally announced April 2018.

  34. arXiv:1712.07222  [pdf, other]

    cs.IT

    Codes Correcting Two Deletions

    Authors: Ryan Gabrys, Frederic Sala

    Abstract: In this work, we investigate the problem of constructing codes capable of correcting two deletions. In particular, we construct a code that requires approximately 8 log n + O(log log n) bits of redundancy, where n is the length of the code. To the best of the authors' knowledge, this represents the best known construction in that it requires the lowest number of redundant bits for a cod…

    Submitted 30 April, 2018; v1 submitted 19 December, 2017; originally announced December 2017.

  35. arXiv:1709.05214  [pdf, other]

    cs.IT

    Mutually Uncorrelated Primers for DNA-Based Data Storage

    Authors: S. M. Hossein Tabatabaei Yazdi, Han Mao Kiah, Ryan Gabrys, Olgica Milenkovic

    Abstract: We introduce the notion of weakly mutually uncorrelated (WMU) sequences, motivated by applications in DNA-based data storage systems and for synchronization of communication devices. WMU sequences are characterized by the property that no sufficiently long suffix of one sequence is the prefix of the same or another sequence. WMU sequences used for primer design in DNA-based data storage systems ar…

    Submitted 13 September, 2017; originally announced September 2017.

    Comments: 14 pages, 3 figures, 1 Table. arXiv admin note: text overlap with arXiv:1601.08176

  36. arXiv:1701.08111  [pdf, ps, other]

    cs.IT

    The Hybrid k-Deck Problem: Reconstructing Sequences from Short and Long Traces

    Authors: Ryan Gabrys, Olgica Milenkovic

    Abstract: We introduce a new variant of the $k$-deck problem, which in its traditional formulation asks for determining the smallest $k$ that allows one to reconstruct any binary sequence of length $n$ from the multiset of its $k$-length subsequences. In our version of the problem, termed the hybrid k-deck problem, one is given a certain number of special subsequences of the sequence of length $n - t$,…

    Submitted 27 January, 2017; originally announced January 2017.

  37. arXiv:1604.03000  [pdf, other]

    cs.IT

    Exact Reconstruction from Insertions in Synchronization Codes

    Authors: Frederic Sala, Ryan Gabrys, Clayton Schoeny, Lara Dolecek

    Abstract: This work studies problems in data reconstruction, an important area with numerous applications. In particular, we examine the reconstruction of binary and non-binary sequences from synchronization (insertion/deletion-correcting) codes. These sequences have been corrupted by a fixed number of symbol insertions (larger than the minimum edit distance of the code), yielding a number of distinct trace…

    Submitted 7 March, 2017; v1 submitted 11 April, 2016; originally announced April 2016.

    Comments: 18 pages, 3 figures. Accepted to IEEE Transactions on Information Theory

  38. arXiv:1602.06820  [pdf, ps, other]

    cs.IT

    Codes Correcting a Burst of Deletions or Insertions

    Authors: Clayton Schoeny, Antonia Wachter-Zeh, Ryan Gabrys, Eitan Yaakobi

    Abstract: This paper studies codes that correct bursts of deletions. Namely, a code will be called a $b$-burst-deletion-correcting code if it can correct a deletion of any $b$ consecutive bits. While the lower bound on the redundancy of such codes was shown by Levenshtein to be asymptotically $\log(n)+b-1$, the redundancy of the best code construction by Cheng et al. is $b(\log (n/b+1))$. In this paper we c…

    Submitted 12 May, 2016; v1 submitted 22 February, 2016; originally announced February 2016.
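
The definition of a $b$-burst-deletion-correcting code in the abstract above can be tested by brute force on small codes: a code qualifies exactly when no received word can arise from two different codewords. The following sketch uses hypothetical function names and toy codes; it is a checker for tiny examples, not any construction from the paper.

```python
def burst_deletion_ball(x: str, b: int) -> set:
    """All strings obtained from x by deleting exactly b consecutive symbols."""
    return {x[:i] + x[i + b:] for i in range(len(x) - b + 1)}


def corrects_burst_deletion(code, b: int) -> bool:
    """True if the burst-deletion balls of distinct codewords are disjoint."""
    owner = {}  # received word -> the codeword that produced it
    for c in code:
        for y in burst_deletion_ball(c, b):
            # setdefault records c on first sight; a different recorded
            # owner means two codewords are confusable after a burst.
            if owner.setdefault(y, c) != c:
                return False
    return True


good = corrects_burst_deletion(["0000", "1111"], 2)
bad = corrects_burst_deletion(["0011", "0001"], 2)  # both can yield "01"
```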

  39. arXiv:1601.06887  [pdf, other]

    cs.IT

    Balanced Permutation Codes

    Authors: Ryan Gabrys, Olgica Milenkovic

    Abstract: Motivated by charge balancing constraints for rank modulation schemes, we introduce the notion of balanced permutations and derive the capacity of balanced permutation codes. We also describe simple interleaving methods for permutation code constructions and show that they approach capacity.

    Submitted 25 January, 2016; originally announced January 2016.

  40. arXiv:1601.06885  [pdf, ps, other]

    cs.IT

    Codes in the Damerau Distance for DNA Storage

    Authors: Ryan Gabrys, Eitan Yaakobi, Olgica Milenkovic

    Abstract: Motivated by applications in DNA-based storage, we introduce the new problem of code design in the Damerau metric. The Damerau metric is a generalization of the Levenshtein distance which, in addition to deletions, insertions and substitution errors also accounts for adjacent transposition edits. We first provide constructions for codes that may correct either a single deletion or a single adjacen…

    Submitted 30 April, 2018; v1 submitted 25 January, 2016; originally announced January 2016.

  41. arXiv:1506.00740  [pdf, other]

    cs.IT

    Asymmetric Lee Distance Codes for DNA-Based Storage

    Authors: Ryan Gabrys, Han Mao Kiah, Olgica Milenkovic

    Abstract: We consider a new family of codes, termed asymmetric Lee distance codes, that arise in the design and implementation of DNA-based storage systems and systems with parallel string transmission protocols. The codewords are defined over a quaternary alphabet, although the results carry over to other alphabet sizes; furthermore, symbol confusability is dictated by their underlying binary representatio…

    Submitted 14 December, 2016; v1 submitted 1 June, 2015; originally announced June 2015.

  42. arXiv:1307.7087  [pdf, ps, other]

    cs.IT

    Correcting Grain-Errors in Magnetic Media

    Authors: Ryan Gabrys, Eitan Yaakobi, Lara Dolecek

    Abstract: This paper studies new bounds and constructions that are applicable to the combinatorial granular channel model previously introduced by Sharov and Roth. We derive new bounds on the maximum cardinality of a grain-error-correcting code and propose constructions of codes that correct grain-errors. We demonstrate that a permutation of the classical group codes (e.g., Constantin-Rao codes) can correct…

    Submitted 30 April, 2018; v1 submitted 26 July, 2013; originally announced July 2013.