-
The Spanning Tree Model and the Assembly Kinetics of RNA Viruses
Authors:
Inbal Mizrahi,
Robijn Bruinsma,
Joseph Rudnick
Abstract:
Single-stranded (ss) RNA viruses self-assemble spontaneously in solutions that contain the viral RNA genome molecules and viral capsid proteins. The self-assembly of empty capsids can be understood on the basis of free energy minimization. However, during the self-assembly of complete viral particles in the cytoplasm of an infected cell, the viral genome molecules must be selected from a large pool of very similar host messenger RNA molecules, and it is not known whether this can also be understood by free energy minimization. We address this question using a simple mathematical model recently proposed for the assembly of small ssRNA viruses (submitted to PLOS Biocomputation). We present a statistical physics analysis of the properties of the model, finding an effective kinetic RNA selection mechanism, with selection taking place during the formation of the nucleation complex. Surprisingly, kinetic selectivity is greatly enhanced by a modest level of supersaturation and by reduced protein-to-RNA concentration ratios. The mechanism is related to the Hopfield kinetic proofreading scenario.
Submitted 19 March, 2022;
originally announced March 2022.
-
The Spanning Tree Model and the Assembly Kinetics of RNA Viruses
Authors:
Inbal Mizrahi,
Robijn Bruinsma,
Joseph Rudnick
Abstract:
Single-stranded (ss) RNA viruses self-assemble spontaneously in solutions that contain the viral RNA genome molecules and the viral capsid proteins. The self-assembly of empty capsids can be understood on the basis of free energy minimization of rather simple models. However, during the self-assembly of complete viral particles in the cytoplasm of an infected cell, the viral genome molecules must be selected from a large pool of very similar host messenger RNA molecules. It is known that the assembly process takes the form of preferential heterogeneous nucleation of capsid proteins on viral RNA molecules ("selective nucleation"). Recently, a simple mathematical model was proposed for the selective nucleation of small ssRNA viruses. In this paper we present a statistical physics analysis of the thermal equilibrium and kinetic properties of that model and show that it can account, at least qualitatively, for numerous observations of the self-assembly of small ssRNA viruses.
Submitted 11 August, 2021;
originally announced August 2021.
-
kNet: A Deep kNN Network To Handle Label Noise
Authors:
Itzik Mizrahi,
Shai Avidan
Abstract:
Deep Neural Networks require large amounts of labeled data for their training. Collecting this data at scale inevitably introduces label noise, hence the need for learning algorithms that are robust to it. In recent years, k Nearest Neighbors (kNN) emerged as a viable solution to this problem. Despite its success, kNN is not without its problems. Mainly, it requires a huge memory footprint to store all the training samples, and it needs an advanced data structure to allow for fast retrieval of the relevant examples, given a query sample. We propose a neural network, termed kNet, that learns to perform kNN. Once trained, we no longer need to store the training data, and processing a query sample is a simple matter of inference. To use kNet, we first train a preliminary network on the data set, and then train kNet on the penultimate layer of the preliminary network. We find that kNet gives a smooth approximation of kNN, and cannot handle the sharp label changes between samples that kNN can exhibit. This indicates that currently kNet is best suited to approximate kNN with a fairly large k. Experiments on two data sets show that this is the regime in which kNN works best, and can therefore be replaced by kNet. In practice, kNet consistently improves the results of all preliminary networks, in all label noise regimes, by up to 3%.
Submitted 20 July, 2021;
originally announced July 2021.
-
The Spanning Tree Model for the Assembly Kinetics of RNA Viruses
Authors:
Inbal Mizrahi,
Robijn Bruinsma,
Joseph Rudnick
Abstract:
We present a simple kinetic model for the assembly of small single-stranded RNA viruses that can be used to carry out analytical packaging contests between different types of RNA molecules. The RNA selection mechanism is purely kinetic and based on small differences between the assembly energy profiles. RNA molecules that win these packaging contests are characterized by having a minimum "Maximum Ladder Distance" and a maximum "Wrapping Number". The former is a topological invariant that measures the "branchiness" of the genome molecule, while the latter measures the ability of the genome molecule to maximally associate with the capsid proteins. The model can also be used to study the applicability of the theory of nucleation and growth to viral assembly, which breaks down with increasing strength of the RNA-protein interaction.
Submitted 7 February, 2021;
originally announced February 2021.
-
A search for technosignatures from TRAPPIST-1, LHS 1140, and 10 planetary systems in the Kepler field with the Green Bank Telescope at 1.15-1.73 GHz
Authors:
Pavlo Pinchuk,
Jean-Luc Margot,
Adam H. Greenberg,
Thomas Ayalde,
Chad Bloxham,
Arjun Boddu,
Luis Gerardo Chinchilla-Garcia,
Micah Cliffe,
Sara Gallagher,
Kira Hart,
Brayden Hesford,
Inbal Mizrahi,
Ruth Pike,
Dominic Rodger,
Bade Sayki,
Una Schneck,
Aysen Tan,
Yinxue "Yolanda" Xiao,
Ryan S. Lynch
Abstract:
As part of our ongoing search for technosignatures, we collected over three terabytes of data in May 2017 with the L-band receiver (1.15-1.73 GHz) of the 100 m diameter Green Bank Telescope. These observations focused primarily on planetary systems in the Kepler field, but also included scans of the recently discovered TRAPPIST-1 and LHS 1140 systems. We present the results of our search for narrowband signals in this data set with techniques that are generally similar to those described by Margot et al. (2018). Our improved data processing pipeline classified over $98\%$ of the $\sim$ 6 million detected signals as anthropogenic Radio Frequency Interference (RFI). Of the remaining candidates, 30 were detected outside of densely populated frequency regions attributable to RFI. These candidates were carefully examined and determined to be of terrestrial origin. We discuss the problems associated with the common practice of ignoring frequency space around candidate detections in radio technosignature detection pipelines. These problems include inaccurate estimates of figures of merit and unreliable upper limits on the prevalence of technosignatures. We present an algorithm that mitigates these problems and improves the efficiency of the search. Specifically, our new algorithm increases the number of candidate detections by a factor of more than four compared to Margot et al. (2018).
Submitted 13 January, 2019;
originally announced January 2019.