-
ART: A machine learning Automated Recommendation Tool for synthetic biology
Authors:
Tijana Radivojević,
Zak Costello,
Kenneth Workman,
Hector Garcia Martin
Abstract:
Biology has changed radically in the last two decades, transitioning from a descriptive science into a design science. Synthetic biology allows us to bioengineer cells to synthesize novel valuable molecules such as renewable biofuels or anticancer drugs. However, traditional synthetic biology approaches involve ad-hoc engineering practices, which lead to long development times. Here, we present th…
▽ More
Biology has changed radically in the last two decades, transitioning from a descriptive science into a design science. Synthetic biology allows us to bioengineer cells to synthesize novel valuable molecules such as renewable biofuels or anticancer drugs. However, traditional synthetic biology approaches involve ad-hoc engineering practices, which lead to long development times. Here, we present the Automated Recommendation Tool (ART), a tool that leverages machine learning and probabilistic modeling techniques to guide synthetic biology in a systematic fashion, without the need for a full mechanistic understanding of the biological system. Using sampling-based optimization, ART provides a set of recommended strains to be built in the next engineering cycle, alongside probabilistic predictions of their production levels. We demonstrate the capabilities of ART on simulated data sets, as well as experimental data from real metabolic engineering projects producing renewable biofuels, hoppy flavored beer without hops, and fatty acids. Finally, we discuss the limitations of this approach, and the practical consequences of the underlying assumptions failing.
△ Less
Submitted 28 February, 2020; v1 submitted 25 November, 2019;
originally announced November 2019.
-
How to Hallucinate Functional Proteins
Authors:
Zak Costello,
Hector Garcia Martin
Abstract:
Here we present a novel approach to protein design and phenotypic inference using a generative model for protein sequences. BioSeqVAE, a variational autoencoder variant, can hallucinate syntactically valid protein sequences that are likely to fold and function. BioSeqVAE is trained on the entire known protein sequence space and learns to generate valid examples of protein sequences in an unsupervi…
▽ More
Here we present a novel approach to protein design and phenotypic inference using a generative model for protein sequences. BioSeqVAE, a variational autoencoder variant, can hallucinate syntactically valid protein sequences that are likely to fold and function. BioSeqVAE is trained on the entire known protein sequence space and learns to generate valid examples of protein sequences in an unsupervised manner. The model is validated by showing that its latent feature space is useful and that it accurately reconstructs sequences. Its usefulness is demonstrated with a selection of relevant downstream design tasks. This work is intended to serve as a computational first step towards a general purpose structure free protein design tool.
△ Less
Submitted 1 March, 2019;
originally announced March 2019.
-
From Global Linear Computations to Local Interaction Rules
Authors:
Zak Costello,
Magnus Egerstedt
Abstract:
A network of locally interacting agents can be thought of as performing a distributed computation. But not all computations can be faithfully distributed. This paper investigates which global, linear transformations can be computed using local rules, i.e., rules which rely solely on information from adjacent nodes in a network. The main result states that a linear transformation is computable in f…
▽ More
A network of locally interacting agents can be thought of as performing a distributed computation. But not all computations can be faithfully distributed. This paper investigates which global, linear transformations can be computed using local rules, i.e., rules which rely solely on information from adjacent nodes in a network. The main result states that a linear transformation is computable in finite time using local rules if and only if the transformation has positive determinant. An optimal control problem is solved for finding the local interaction rules, and simulations are performed to elucidate how optimal solutions can be obtained.
△ Less
Submitted 22 November, 2013;
originally announced November 2013.