How to Hallucinate Functional Proteins

Costello, Zak; Martin, Hector Garcia

Quantitative Biology > Quantitative Methods

arXiv:1903.00458 (q-bio)

[Submitted on 1 Mar 2019]

Title:How to Hallucinate Functional Proteins

Authors:Zak Costello, Hector Garcia Martin

View PDF

Abstract:Here we present a novel approach to protein design and phenotypic inference using a generative model for protein sequences. BioSeqVAE, a variational autoencoder variant, can hallucinate syntactically valid protein sequences that are likely to fold and function. BioSeqVAE is trained on the entire known protein sequence space and learns to generate valid examples of protein sequences in an unsupervised manner. The model is validated by showing that its latent feature space is useful and that it accurately reconstructs sequences. Its usefulness is demonstrated with a selection of relevant downstream design tasks. This work is intended to serve as a computational first step towards a general purpose structure free protein design tool.

Subjects:	Quantitative Methods (q-bio.QM)
Cite as:	arXiv:1903.00458 [q-bio.QM]
	(or arXiv:1903.00458v1 [q-bio.QM] for this version)
	https://doi.org/10.48550/arXiv.1903.00458

Submission history

From: Zak Costello [view email]
[v1] Fri, 1 Mar 2019 18:39:00 UTC (768 KB)

Full-text links:

Access Paper:

view license

Current browse context:

q-bio.QM

< prev | next >

new | recent | 2019-03

Change to browse by:

q-bio

References & Citations

export BibTeX citation

Quantitative Biology > Quantitative Methods

Title:How to Hallucinate Functional Proteins

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Quantitative Biology > Quantitative Methods

Title:How to Hallucinate Functional Proteins

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators