Skip to main content

Showing 1–1 of 1 results for author: Lanman, N A

.
  1. arXiv:2311.02333  [pdf, other

    cs.LG q-bio.GN

    Understanding the Natural Language of DNA using Encoder-Decoder Foundation Models with Byte-level Precision

    Authors: Aditya Malusare, Harish Kothandaraman, Dipesh Tamboli, Nadia A. Lanman, Vaneet Aggarwal

    Abstract: This paper presents the Ensemble Nucleotide Byte-level Encoder-Decoder (ENBED) foundation model, analyzing DNA sequences at byte-level precision with an encoder-decoder Transformer architecture. ENBED uses a sub-quadratic implementation of attention to develop an efficient model capable of sequence-to-sequence transformations, generalizing previous genomic models with encoder-only or decoder-only… ▽ More

    Submitted 13 February, 2024; v1 submitted 4 November, 2023; originally announced November 2023.

    Comments: 9 pages