-
Sewage Pooling Test for SARS-CoV-2
Authors:
Ritam Guha,
Anik Sengupta,
Ankan Dutta
Abstract:
CoVID-19 is currently one of the biggest threats to mankind. To date, it is the reason for infections of over 35 lakhs and the death of over 2 lakh human beings. We propose a procedure to detect CoVID-19 affected localities using a sewage mass testing and pooling mechanism which has gained ground in recent times. The proposed method named Sewage Pooling Algorithm tests wastewater samples from sewa…
▽ More
CoVID-19 is currently one of the biggest threats to mankind. To date, it is the reason for infections of over 35 lakhs and the death of over 2 lakh human beings. We propose a procedure to detect CoVID-19 affected localities using a sewage mass testing and pooling mechanism which has gained ground in recent times. The proposed method named Sewage Pooling Algorithm tests wastewater samples from sewage systems to pinpoint the regions which are affected by maximum chances of the virus spread. The algorithm also uses a priority-based backtracking procedure to perform testing in sewage links depending on the probability of infection in the sub-areas. For places with very rare CoVID cases, we present a gradient-based search method to prune those areas. The proposed method has less human intervention and increases the effective tests/million people over current in-place methods.
△ Less
Submitted 13 May, 2020;
originally announced May 2020.
-
An Efficient Convolutional Neural Network for Coronary Heart Disease Prediction
Authors:
Aniruddha Dutta,
Tamal Batabyal,
Meheli Basu,
Scott T. Acton
Abstract:
This study proposes an efficient neural network with convolutional layers to classify significantly class-imbalanced clinical data. The data are curated from the National Health and Nutritional Examination Survey (NHANES) with the goal of predicting the occurrence of Coronary Heart Disease (CHD). While the majority of the existing machine learning models that have been used on this class of data a…
▽ More
This study proposes an efficient neural network with convolutional layers to classify significantly class-imbalanced clinical data. The data are curated from the National Health and Nutritional Examination Survey (NHANES) with the goal of predicting the occurrence of Coronary Heart Disease (CHD). While the majority of the existing machine learning models that have been used on this class of data are vulnerable to class imbalance even after the adjustment of class-specific weights, our simple two-layer CNN exhibits resilience to the imbalance with fair harmony in class-specific performance. In order to obtain significant improvement in classification accuracy under supervised learning settings, it is a common practice to train a neural network architecture with a massive data and thereafter, test the resulting network on a comparatively smaller amount of data. However, given a highly imbalanced dataset, it is often challenging to achieve a high class 1 (true CHD prediction rate) accuracy as the testing data size increases. We adopt a two-step approach: first, we employ least absolute shrinkage and selection operator (LASSO) based feature weight assessment followed by majority-voting based identification of important features. Next, the important features are homogenized by using a fully connected layer, a crucial step before passing the output of the layer to successive convolutional stages. We also propose a training routine per epoch, akin to a simulated annealing process, to boost the classification accuracy. Despite a 35:1 (Non-CHD:CHD) ratio in the NHANES dataset, the investigation confirms that our proposed CNN architecture has the classification power of 77% to correctly classify the presence of CHD and 81.8% the absence of CHD cases on a testing data, which is 85.70% of the total dataset. ( (<1920 characters)Please check the paper for full abstract)
△ Less
Submitted 22 April, 2020; v1 submitted 1 September, 2019;
originally announced September 2019.
-
Stochastic thermodynamics and modes of operation of a ribosome: a network theoretic perspective
Authors:
Annwesha Dutta,
Gunter M Schütz,
Debashish Chowdhury
Abstract:
The ribosome is one of the largest and most complex macromolecular machines in living cells. It polymerizes a protein in a step-by-step manner as directed by the corresponding nucleotide sequence on the template messenger RNA (mRNA) and this process is referred to as `translation' of the genetic message encoded in the sequence of mRNA transcript. In each successful chemo-mechanical cycle during th…
▽ More
The ribosome is one of the largest and most complex macromolecular machines in living cells. It polymerizes a protein in a step-by-step manner as directed by the corresponding nucleotide sequence on the template messenger RNA (mRNA) and this process is referred to as `translation' of the genetic message encoded in the sequence of mRNA transcript. In each successful chemo-mechanical cycle during the (protein) elongation stage, the ribosome elongates the protein by a single subunit, called amino acid, and steps forward on the template mRNA by three nucleotides called a codon. Therefore, a ribosome is also regarded as a molecular motor for which the mRNA serves as the track, its step size is that of a codon and two molecules of GTP and one molecule of ATP hydrolyzed in that cycle serve as its fuel. What adds further complexity is the existence of competing pathways leading to distinct cycles, branched pathways in each cycle and futile consumption of fuel that leads neither to elongation of the nascent protein nor forward step** of the ribosome on its track. We investigate a model formulated in terms of the network of discrete chemo-mechanical states of a ribosome during the elongation stage of translation. The model is analyzed using a combination of stochastic thermodynamic and kinetic analysis based on a graph-theoretic approach. We derive the exact solution of the corresponding master equations. We represent the steady state in terms of the cycles of the underlying network and discuss the energy transduction processes. We identify the various possible modes of operation of a ribosome in terms of its average velocity and mean rate of GTP hydrolysis. We also compute entropy production as functions of the rates of the interstate transitions and the thermodynamic cost for accuracy of the translation process.
△ Less
Submitted 11 February, 2020; v1 submitted 19 June, 2019;
originally announced June 2019.
-
A Biologically Motivated Asymmetric Exclusion Process: interplay of congestion in RNA polymerase traffic and slippage of nascent transcript
Authors:
Soumendu Ghosh,
Annwesha Dutta,
Shubhadeep Patra,
Jun Sato,
Katsuhiro Nishinari,
Debashish Chowdhury
Abstract:
We develope a theoretical framework, based on exclusion process, that is motivated by a biological phenomenon called transcript slippage (TS). In this model a discrete lattice represents a DNA strand while each of the particles that hop on it unidirectionally, from site to site, represents a RNA polymerase (RNAP). While walking like a molecular motor along a DNA track in a step-by-step manner, a R…
▽ More
We develope a theoretical framework, based on exclusion process, that is motivated by a biological phenomenon called transcript slippage (TS). In this model a discrete lattice represents a DNA strand while each of the particles that hop on it unidirectionally, from site to site, represents a RNA polymerase (RNAP). While walking like a molecular motor along a DNA track in a step-by-step manner, a RNAP simultaneously synthesizes a RNA chain; in each forward step it elongates the nascent RNA molecule by one unit, using the DNA track also as the template. At some special "slippery" position on the DNA, which we represent as a defect on the lattice, a RNAP can lose its grip on the nascent RNA and the latter's consequent slippage results in a final product that is either longer or shorter than the corresponding DNA template. We develope an exclusion model for RNAP traffic where the kinetics of the system at the defect site captures key features of TS events. We demonstrate the interplay of the crowding of RNAPs and TS. A RNAP has to wait at the defect site for longer period in a more congested RNAP traffic, thereby increasing the likelihood of its suffering a larger number of TS events. The qualitative trends of some of our results for a simple special case of our model are consistent with experimental observations. The general theoretical framework presented here will be useful for guiding future experimental queries and for analysis of the experimental data with more detailed versions of the same model.
△ Less
Submitted 24 April, 2019; v1 submitted 27 October, 2018;
originally announced October 2018.
-
A generalized Michaelis-Menten equation in protein synthesis: effects of mis-charged cognate tRNA and mis-reading of codon
Authors:
Annwesha Dutta,
Debashish Chowdhury
Abstract:
The sequence of amino acid monomers in the primary structure of a protein is decided by the corresponding sequence of codons (triplets of nucleic acid monomers) on the template messenger RNA (mRNA). The polymerization of a protein, by incorporation of the successive amino acid monomers, is carried out by a molecular machine called ribosome. We develop a stochastic kinetic model that captures the p…
▽ More
The sequence of amino acid monomers in the primary structure of a protein is decided by the corresponding sequence of codons (triplets of nucleic acid monomers) on the template messenger RNA (mRNA). The polymerization of a protein, by incorporation of the successive amino acid monomers, is carried out by a molecular machine called ribosome. We develop a stochastic kinetic model that captures the possibilities of mis-reading of mRNA codon and prior mis-charging of a tRNA. By a combination of analytical and numerical methods we obtain the distribution of the times taken for incorporation of the successive amino acids in the growing protein in this mathematical model. The corresponding exact analytical expression for the average rate of elongation of a nascent protein is a `biologically motivated' generalization of the {\it Michaelis-Menten formula} for the average rate of enzymatic reactions. This generalized Michaelis-Menten-like formula (and the exact analytical expressions for a few other quantities) that we report here display the interplay of four different branched pathways corresponding to selection of four different types of tRNA.
△ Less
Submitted 1 June, 2017; v1 submitted 6 December, 2015;
originally announced December 2015.