Skip to main content

Showing 1–1 of 1 results for author: Eguchi, R R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2004.03497  [pdf, other

    q-bio.BM cs.LG stat.ML

    ProGen: Language Modeling for Protein Generation

    Authors: Ali Madani, Bryan McCann, Nikhil Naik, Nitish Shirish Keskar, Namrata Anand, Raphael R. Eguchi, Po-Ssu Huang, Richard Socher

    Abstract: Generative modeling for protein engineering is key to solving fundamental problems in synthetic biology, medicine, and material science. We pose protein engineering as an unsupervised sequence generation problem in order to leverage the exponentially growing set of proteins that lack costly, structural annotations. We train a 1.2B-parameter language model, ProGen, on ~280M protein sequences condit… ▽ More

    Submitted 7 March, 2020; originally announced April 2020.