-
Quantitatively visualizing bipartite datasets
Authors:
Tal Einav,
Yuehaw Khoo,
Amit Singer
Abstract:
As experiments continue to increase in size and scope, a fundamental challenge of subsequent analyses is to recast the wealth of information into an intuitive and readily-interpretable form. Often, each measurement only conveys the relationship between a pair of entries, and it is difficult to integrate these local interactions across a dataset to form a cohesive global picture. The classic locali…
▽ More
As experiments continue to increase in size and scope, a fundamental challenge of subsequent analyses is to recast the wealth of information into an intuitive and readily-interpretable form. Often, each measurement only conveys the relationship between a pair of entries, and it is difficult to integrate these local interactions across a dataset to form a cohesive global picture. The classic localization problem tackles this question, transforming local measurements into a global map that reveals the underlying structure of a system. Here, we examine the more challenging bipartite localization problem, where pairwise distances are only available for bipartite data comprising two classes of entries (such as antibody-virus interactions, drug-cell potency, or user-rating profiles). We modify previous algorithms to solve bipartite localization and examine how each method behaves in the presence of noise, outliers, and partially-observed data. As a proof of concept, we apply these algorithms to antibody-virus neutralization measurements to create a basis set of antibody behaviors, formalize how potently inhibiting some viruses necessitates weakly inhibiting other viruses, and quantify how often combinations of antibodies exhibit degenerate behavior.
△ Less
Submitted 26 July, 2022;
originally announced July 2022.
-
Using Interpretable Machine Learning to Massively Increase the Number of Antibody-Virus Interactions Across Studies
Authors:
Tal Einav,
Rong Ma
Abstract:
A central challenge in every field of biology is to use existing measurements to predict the outcomes of future experiments. In this work, we consider the wealth of antibody inhibition data against variants of the influenza virus. Due to this viru's genetic diversity and evolvability, the variants examined in one study will often have little-to-no overlap with other studies, making it difficult to…
▽ More
A central challenge in every field of biology is to use existing measurements to predict the outcomes of future experiments. In this work, we consider the wealth of antibody inhibition data against variants of the influenza virus. Due to this viru's genetic diversity and evolvability, the variants examined in one study will often have little-to-no overlap with other studies, making it difficult to discern common patterns or unify datasets for further analysis. To that end, we develop a computational framework that predicts how an antibody or serum would inhibit any variant from any other study. We use this framework to greatly expand seven influenza datasets utilizing hemagglutination inhibition, validating our method upon 200,000 existing measurements and predicting 2,000,000 new values along with their uncertainties. With these new values, we quantify the transferability between seven vaccination and infection studies in humans and ferrets, show that the serum potency is negatively correlated with breadth, and present a tool for pandemic preparedness. This data-driven approach does not require any information beyond each virus's name and measurements, and even datasets with as few as 5 viruses can be expanded, making this approach widely applicable. Future influenza studies using hemagglutination inhibition can directly utilize our curated datasets to predict newly measured antibody responses against ~80 H3N2 influenza viruses from 1968-2011, whereas immunological studies utilizing other viruses or a different assay only need a single partially-overlap** dataset to extend their work. In essence, this approach enables a shift in perspective when analyzing data from "what you see is what you get" into "what anyone sees is what everyone gets."
△ Less
Submitted 30 October, 2022; v1 submitted 10 June, 2022;
originally announced June 2022.
-
When Two are Better than One: Modeling the Mechanisms of Antibody Mixtures
Authors:
Tal Einav,
Jesse D Bloom
Abstract:
It is difficult to predict how antibodies will behave when mixed together, even after each has been independently characterized. Here, we present a statistical mechanical model for the activity of antibody mixtures that accounts for whether pairs of antibodies bind to distinct or overlap** epitopes. This model requires measuring $n$ individual antibodies and their $n(n-1)/2$ pairwise interaction…
▽ More
It is difficult to predict how antibodies will behave when mixed together, even after each has been independently characterized. Here, we present a statistical mechanical model for the activity of antibody mixtures that accounts for whether pairs of antibodies bind to distinct or overlap** epitopes. This model requires measuring $n$ individual antibodies and their $n(n-1)/2$ pairwise interactions to predict the $2^n$ potential combinations. We apply this model to epidermal growth factor receptor (EGFR) antibodies and find that the activity of antibody mixtures can be predicted without positing synergy at the molecular level. In addition, we demonstrate how the model can be used in reverse, where straightforward experiments measuring the activity of antibody mixtures can be used to infer the molecular interactions between antibodies. Lastly, we generalize this model to analyze engineered multidomain antibodies, where components of different antibodies are tethered together to form novel amalgams, and characterize how well it predicts recently designed influenza antibodies.
△ Less
Submitted 16 October, 2019;
originally announced October 2019.
-
The Energetics of Molecular Adaptation in Transcriptional Regulation
Authors:
Griffin Chure,
Manuel Razo-Mejia,
Nathan M. Belliveau,
Tal Einav,
Zofii Kaczmarek,
Stephanie L. Barnes,
Mitchell Lewis,
Rob Phillips
Abstract:
Mutation is a critical mechanism by which evolution explores the functional landscape of proteins. Despite our ability to experimentally inflict mutations at will, it remains difficult to link sequence-level perturbations to systems-level responses. Here, we present a framework centered on measuring changes in the free energy of the system to link individual mutations in an allosteric transcriptio…
▽ More
Mutation is a critical mechanism by which evolution explores the functional landscape of proteins. Despite our ability to experimentally inflict mutations at will, it remains difficult to link sequence-level perturbations to systems-level responses. Here, we present a framework centered on measuring changes in the free energy of the system to link individual mutations in an allosteric transcriptional repressor to the parameters which govern its response. We find the energetic effects of the mutations can be categorized into several classes which have characteristic curves as a function of the inducer concentration. We experimentally test these diagnostic predictions using the well-characterized LacI repressor of Escherichia coli, probing several mutations in the DNA binding and inducer binding domains. We find that the change in gene expression due to a point mutation can be captured by modifying only a subset of the model parameters that describe the respective domain of the wild-type protein. These parameters appear to be insulated, with mutations in the DNA binding domain altering only the DNA affinity and those in the inducer binding domain altering only the allosteric parameters. Changing these subsets of parameters tunes the free energy of the system in a way that is concordant with theoretical expectations. Finally, we show that the induction profiles and resulting free energies associated with pairwise double mutants can be predicted with quantitative accuracy given knowledge of the single mutants, providing an avenue for identifying and quantifying epistatic interactions.
△ Less
Submitted 15 May, 2019;
originally announced May 2019.
-
How the Avidity of Polymerase Binding to the -35/-10 Promoter Sites Affects Gene Expression
Authors:
Tal Einav,
Rob Phillips
Abstract:
Although the key promoter elements necessary to drive transcription in Escherichia coli have long been understood, we still cannot predict the behavior of arbitrary novel promoters, hampering our ability to characterize the myriad of sequenced regulatory architectures as well as to design novel synthetic circuits. This work builds upon a beautiful recent experiment by Urtecho et al. who measured t…
▽ More
Although the key promoter elements necessary to drive transcription in Escherichia coli have long been understood, we still cannot predict the behavior of arbitrary novel promoters, hampering our ability to characterize the myriad of sequenced regulatory architectures as well as to design novel synthetic circuits. This work builds upon a beautiful recent experiment by Urtecho et al. who measured the gene expression of over 10,000 promoters spanning all possible combinations of a small set of regulatory elements. Using this data, we demonstrate that a central claim in energy matrix models of gene expression - that each promoter element contributes independently and additively to gene expression - contradicts experimental measurements. We propose that a key missing ingredient from such models is the avidity between the -35 and -10 RNA polymerase binding sites and develop what we call a refined energy matrix model that incorporates this effect. We show that this the refined energy matrix model can characterize the full suite of gene expression data and explore several applications of this framework, namely, how multivalent binding at the -35 and -10 sites can buffer RNAP kinetics against mutations and how promoters that bind overly tightly to RNA polymerase can inhibit gene expression. The success of our approach suggests that avidity represents a key physical principle governing the interaction of RNA polymerase to its promoter.
△ Less
Submitted 3 April, 2019;
originally announced April 2019.
-
Combinatorial Control through Allostery
Authors:
Vahe Galstyan,
Luke Funk,
Tal Einav,
Rob Phillips
Abstract:
Many instances of cellular signaling and transcriptional regulation involve switch-like molecular responses to the presence or absence of input ligands. To understand how these responses come about and how they can be harnessed, we develop a statistical mechanical model to characterize the types of Boolean logic that can arise from allosteric molecules following the Monod-Wyman-Changeux (MWC) mode…
▽ More
Many instances of cellular signaling and transcriptional regulation involve switch-like molecular responses to the presence or absence of input ligands. To understand how these responses come about and how they can be harnessed, we develop a statistical mechanical model to characterize the types of Boolean logic that can arise from allosteric molecules following the Monod-Wyman-Changeux (MWC) model. Building upon previous work, we show how an allosteric molecule regulated by two inputs can elicit AND, OR, NAND and NOR responses, but is unable to realize XOR or XNOR gates. Next, we demonstrate the ability of an MWC molecule to perform ratiometric sensing - a response behavior where activity depends monotonically on the ratio of ligand concentrations. We then extend our analysis to more general schemes of combinatorial control involving either additional binding sites for the two ligands or an additional third ligand and show how these additions can cause a switch in the logic behavior of the molecule. Overall, our results demonstrate the wide variety of control schemes that biological systems can implement using simple mechanisms.
△ Less
Submitted 29 December, 2018;
originally announced December 2018.
-
Harnessing Avidity: Quantifying Entropic and Energetic Effects of Linker Length and Rigidity Required for Multivalent Binding of Antibodies to HIV-1 Spikes
Authors:
Tal Einav,
Shahrzad Yazdi,
Aaron Coey,
Pamela J. Bjorkman,
Rob Phillips
Abstract:
Due to the low density of envelope (Env) spikes on the surface of HIV-1, neutralizing IgG antibodies rarely bind bivalently using both antigen-binding arms (Fabs) to crosslink between spikes (inter-spike crosslinking), instead resorting to weaker monovalent binding that is more sensitive to Env mutations. Synthetic antibodies designed to bivalently bind a single Env trimer (intra-spike crosslinkin…
▽ More
Due to the low density of envelope (Env) spikes on the surface of HIV-1, neutralizing IgG antibodies rarely bind bivalently using both antigen-binding arms (Fabs) to crosslink between spikes (inter-spike crosslinking), instead resorting to weaker monovalent binding that is more sensitive to Env mutations. Synthetic antibodies designed to bivalently bind a single Env trimer (intra-spike crosslinking) were previously shown to exhibit increased neutralization potencies. In initial work, diFabs joined by varying lengths of rigid double-stranded DNA (dsDNA) were considered. Anticipating future experiments to improve synthetic antibodies, we investigate whether linkers with different rigidities could enhance diFab potency by modeling DNA-Fabs containing different combinations of rigid dsDNA and flexible single-stranded DNA (ssDNA) and characterizing their neutralization potential. Model predictions suggest that while a long flexible polymer may be capable of bivalent binding, it exhibits weak neutralization due to the large loss in entropic degrees of freedom when both Fabs are bound. In contrast, the strongest neutralization potencies are predicted to require a rigid linker that optimally spans the distance between two Fab binding sites on an Env trimer, and avidity can be further boosted by incorporating more Fabs into these constructs. These results inform the design of multivalent anti-HIV-1 therapeutics that utilize avidity effects to remain potent against HIV-1 in the face of the rapid mutation of Env spikes.
△ Less
Submitted 22 May, 2019; v1 submitted 1 September, 2018;
originally announced September 2018.
-
Analysis of Inducer and Operator Binding for Cyclic-AMP Receptor Protein Mutants
Authors:
Tal Einav,
Julia Duque,
Rob Phillips
Abstract:
Allosteric transcription factors undergo binding events both at their inducer binding sites as well as at distinct DNA binding domains, and it is often difficult to disentangle the structural and functional consequences of these two classes of interactions. In this work, we compare the ability of two statistical mechanical models - the Monod-Wyman-Changeux (MWC) and the Koshland-NĂ©methy-Filmer (KN…
▽ More
Allosteric transcription factors undergo binding events both at their inducer binding sites as well as at distinct DNA binding domains, and it is often difficult to disentangle the structural and functional consequences of these two classes of interactions. In this work, we compare the ability of two statistical mechanical models - the Monod-Wyman-Changeux (MWC) and the Koshland-NĂ©methy-Filmer (KNF) models of protein conformational change - to characterize the multi-step activation mechanism of the broadly acting cyclic-AMP receptor protein (CRP). We first consider the allosteric transition resulting from cyclic-AMP binding to CRP, then analyze how CRP binds to its operator, and finally investigate the ability of CRP to activate gene expression. In light of these models, we examine data from a beautiful recent experiment that created a single-chain version of the CRP homodimer, thereby enabling each subunit to be mutated separately. Using this construct, six mutants were created using all possible combinations of the wild type subunit, a D53H mutant subunit, and an S62F mutant subunit. We demonstrate that both the MWC and KNF models can explain the behavior of all six mutants using a small, self-consistent set of parameters. In comparing the results, we find that the MWC model slightly outperforms the KNF model in the quality of its fits, but more importantly the parameters inferred by the MWC model are more in line with structural knowledge of CRP. In addition, we discuss how the conceptual framework developed here for CRP enables us to not merely analyze data retrospectively, but has the predictive power to determine how combinations of mutations will interact, how double mutants will behave, and how each construct would regulate gene expression.
△ Less
Submitted 19 December, 2017;
originally announced December 2017.
-
Tuning transcriptional regulation through signaling: A predictive theory of allosteric induction
Authors:
Manuel Razo-Mejia,
Stephanie L. Barnes,
Nathan M. Belliveau,
Griffin Chure,
Tal Einav,
Mitchell Lewis,
Rob Phillips
Abstract:
Allosteric regulation is found across all domains of life, yet we still lack simple, predictive theories that directly link the experimentally tunable parameters of a system to its input-output response. To that end, we present a general theory of allosteric transcriptional regulation using the Monod-Wyman-Changeux model. We rigorously test this model using the ubiquitous simple repression motif i…
▽ More
Allosteric regulation is found across all domains of life, yet we still lack simple, predictive theories that directly link the experimentally tunable parameters of a system to its input-output response. To that end, we present a general theory of allosteric transcriptional regulation using the Monod-Wyman-Changeux model. We rigorously test this model using the ubiquitous simple repression motif in bacteria by first predicting the behavior of strains that span a large range of repressor copy numbers and DNA binding strengths and then constructing and measuring their response. Our model not only accurately captures the induction profiles of these strains but also enables us to derive analytic expressions for key properties such as the dynamic range and $[EC_{50}]$. Finally, we derive an expression for the free energy of allosteric repressors which enables us to collapse our experimental data onto a single master curve that captures the diverse phenomenology of the induction profiles.
△ Less
Submitted 21 June, 2017; v1 submitted 23 February, 2017;
originally announced February 2017.
-
Monod-Wyman-Changeux Analysis of Ligand-Gated Ion Channel Mutants
Authors:
Tal Einav,
Rob Phillips
Abstract:
We present a framework for computing the gating properties of ligand-gated ion channel mutants using the Monod-Wyman-Changeux (MWC) model of allostery. We derive simple analytic formulas for key functional properties such as the leakiness, dynamic range, half-maximal effective concentration, and effective Hill coefficient, and explore the full spectrum of phenotypes that are accessible through mut…
▽ More
We present a framework for computing the gating properties of ligand-gated ion channel mutants using the Monod-Wyman-Changeux (MWC) model of allostery. We derive simple analytic formulas for key functional properties such as the leakiness, dynamic range, half-maximal effective concentration, and effective Hill coefficient, and explore the full spectrum of phenotypes that are accessible through mutations. Specifically, we consider mutations in the channel pore of nicotinic acetylcholine receptor (nAChR) and the ligand binding domain of a cyclic nucleotide-gated (CNG) ion channel, demonstrating how each mutation can be characterized as only affecting a subset of the biophysical parameters. In addition, we show how the unifying perspective offered by the MWC model allows us, perhaps surprisingly, to collapse the plethora of dose-response data from different classes of ion channels into a universal family of curves.
△ Less
Submitted 22 January, 2017;
originally announced January 2017.
-
Statistical Mechanics of Allosteric Enzymes
Authors:
Tal Einav,
Linas Mazutis,
Rob Phillips
Abstract:
The concept of allostery in which macromolecules switch between two different conformations is a central theme in biological processes ranging from gene regulation to cell signaling to enzymology. Allosteric enzymes pervade metabolic processes, yet a simple and unified treatment of the effects of allostery in enzymes has been lacking. In this work, we take the first step towards this goal by model…
▽ More
The concept of allostery in which macromolecules switch between two different conformations is a central theme in biological processes ranging from gene regulation to cell signaling to enzymology. Allosteric enzymes pervade metabolic processes, yet a simple and unified treatment of the effects of allostery in enzymes has been lacking. In this work, we take the first step towards this goal by modeling allosteric enzymes and their interaction with two key molecular players - allosteric regulators and competitive inhibitors. We then apply this model to characterize existing data on enzyme activity, comment on how enzyme parameters (such as substrate binding affinity) can be experimentally tuned, and make novel predictions on how to control phenomena such as substrate inhibition.
△ Less
Submitted 14 January, 2017;
originally announced January 2017.