-
Predicting ATP binding sites in protein sequences using Deep Learning and Natural Language Processing
Authors:
Shreyas V,
Swati Agarwal
Abstract:
Predicting ATP-Protein Binding sites in genes is of great significance in the field of Biology and Medicine. The majority of research in this field has been conducted through time- and resource-intensive 'wet experiments' in laboratories. Over the years, researchers have been investigating computational methods computational methods to accomplish the same goals, utilising the strength of advanced…
▽ More
Predicting ATP-Protein Binding sites in genes is of great significance in the field of Biology and Medicine. The majority of research in this field has been conducted through time- and resource-intensive 'wet experiments' in laboratories. Over the years, researchers have been investigating computational methods computational methods to accomplish the same goals, utilising the strength of advanced Deep Learning and NLP algorithms. In this paper, we propose to develop methods to classify ATP-Protein binding sites. We conducted various experiments mainly using PSSMs and several word embeddings as features. We used 2D CNNs and LightGBM classifiers as our chief Deep Learning Algorithms. The MP3Vec and BERT models have also been subjected to testing in our study. The outcomes of our experiments demonstrated improvement over the state-of-the-art benchmarks.
△ Less
Submitted 2 February, 2024;
originally announced February 2024.
-
Forecasting COVID- 19 cases using Statistical Models and Ontology-based Semantic Modelling: A real time data analytics approach
Authors:
Sadhana Tiwari,
Ritesh Chandra,
Sonali Agarwal
Abstract:
SARS-COV-19 is the most prominent issue which many countries face today. The frequent changes in infections, recovered and deaths represents the dynamic nature of this pandemic. It is very crucial to predict the spreading rate of this virus for accurate decision making against fighting with the situation of getting infected through the virus, tracking and controlling the virus transmission in the…
▽ More
SARS-COV-19 is the most prominent issue which many countries face today. The frequent changes in infections, recovered and deaths represents the dynamic nature of this pandemic. It is very crucial to predict the spreading rate of this virus for accurate decision making against fighting with the situation of getting infected through the virus, tracking and controlling the virus transmission in the community. We develop a prediction model using statistical time series models such as SARIMA and FBProphet to monitor the daily active, recovered and death cases of COVID-19 accurately. Then with the help of various details across each individual patient (like height, weight, gender etc.), we designed a set of rules using Semantic Web Rule Language and some mathematical models for dealing with COVID19 infected cases on an individual basis. After combining all the models, a COVID-19 Ontology is developed and performs various queries using SPARQL query on designed Ontology which accumulate the risk factors, provide appropriate diagnosis, precautions and preventive suggestions for COVID Patients. After comparing the performance of SARIMA and FBProphet, it is observed that the SARIMA model performs better in forecasting of COVID cases. On individual basis COVID case prediction, approx. 497 individual samples have been tested and classified into five different levels of COVID classes such as Having COVID, No COVID, High Risk COVID case, Medium to High Risk case, and Control needed case.
△ Less
Submitted 31 January, 2023; v1 submitted 6 June, 2022;
originally announced June 2022.
-
How much complexity does an RNN architecture need to learn syntax-sensitive dependencies?
Authors:
Gantavya Bhatt,
Hritik Bansal,
Rishubh Singh,
Sumeet Agarwal
Abstract:
Long short-term memory (LSTM) networks and their variants are capable of encapsulating long-range dependencies, which is evident from their performance on a variety of linguistic tasks. On the other hand, simple recurrent networks (SRNs), which appear more biologically grounded in terms of synaptic connections, have generally been less successful at capturing long-range dependencies as well as the…
▽ More
Long short-term memory (LSTM) networks and their variants are capable of encapsulating long-range dependencies, which is evident from their performance on a variety of linguistic tasks. On the other hand, simple recurrent networks (SRNs), which appear more biologically grounded in terms of synaptic connections, have generally been less successful at capturing long-range dependencies as well as the loci of grammatical errors in an unsupervised setting. In this paper, we seek to develop models that bridge the gap between biological plausibility and linguistic competence. We propose a new architecture, the Decay RNN, which incorporates the decaying nature of neuronal activations and models the excitatory and inhibitory connections in a population of neurons. Besides its biological inspiration, our model also shows competitive performance relative to LSTMs on subject-verb agreement, sentence grammaticality, and language modeling tasks. These results provide some pointers towards probing the nature of the inductive biases required for RNN architectures to model linguistic phenomena successfully.
△ Less
Submitted 25 May, 2020; v1 submitted 17 May, 2020;
originally announced May 2020.
-
Do Deep Neural Networks Model Nonlinear Compositionality in the Neural Representation of Human-Object Interactions?
Authors:
Aditi Jha,
Sumeet Agarwal
Abstract:
Visual scene understanding often requires the processing of human-object interactions. Here we seek to explore if and how well Deep Neural Network (DNN) models capture features similar to the brain's representation of humans, objects, and their interactions. We investigate brain regions which process human-, object-, or interaction-specific information, and establish correspondences between them a…
▽ More
Visual scene understanding often requires the processing of human-object interactions. Here we seek to explore if and how well Deep Neural Network (DNN) models capture features similar to the brain's representation of humans, objects, and their interactions. We investigate brain regions which process human-, object-, or interaction-specific information, and establish correspondences between them and DNN features. Our results suggest that we can infer the selectivity of these regions to particular visual stimuli using DNN representations. We also map features from the DNN to the regions, thus linking the DNN representations to those found in specific parts of the visual cortex. In particular, our results suggest that a typical DNN representation contains encoding of compositional information for human-object interactions which goes beyond a linear combination of the encodings for the two components, thus suggesting that DNNs may be able to model this important property of biological vision.
△ Less
Submitted 6 November, 2019; v1 submitted 31 March, 2019;
originally announced April 2019.
-
Examining Representational Similarity in ConvNets and the Primate Visual Cortex
Authors:
Abhimanyu Dubey,
Jayadeva,
Sumeet Agarwal
Abstract:
We compare several ConvNets with different depth and regularization techniques with multi-unit macaque IT cortex recordings and assess the impact of the same on representational similarity with the primate visual cortex. We find that with increasing depth and validation performance, ConvNet features are closer to cortical IT representations.
We compare several ConvNets with different depth and regularization techniques with multi-unit macaque IT cortex recordings and assess the impact of the same on representational similarity with the primate visual cortex. We find that with increasing depth and validation performance, ConvNet features are closer to cortical IT representations.
△ Less
Submitted 12 September, 2016;
originally announced September 2016.
-
Fluctuations in network dynamics: SMAR1 can trigger apoptosis
Authors:
Md. Zubbair Malik,
Md. Jahoor Alam,
Romana Ishrat,
Subhash M. Agarwal,
R. K. Brojen Singh
Abstract:
SMAR1 is a sensitive signaling molecule in p53 regulatory network which can drive p53 network dynamics to three distinct states, namely, stabilized (two), damped and sustain oscillation states. In the interaction of p53 network with SMAR1, p53 network sees SMAR1 as a sub-network with its new complexes formed by SMAR1, where SMAR1 is the central node, and fluctuations in SMAR1 concentration is prop…
▽ More
SMAR1 is a sensitive signaling molecule in p53 regulatory network which can drive p53 network dynamics to three distinct states, namely, stabilized (two), damped and sustain oscillation states. In the interaction of p53 network with SMAR1, p53 network sees SMAR1 as a sub-network with its new complexes formed by SMAR1, where SMAR1 is the central node, and fluctuations in SMAR1 concentration is propagated as a stress signal throughout the network. Excess stress induced by SMAR1 can drive p53 network dynamics to amplitude death scenario which corresponds to apoptotic state. The permutation entropy calculated for normal state is minimum indicating self-organized behavior, whereas for apoptotic state, the value is maximum showing breakdown of self-organization. We also show that the regulation of SMAR1 togather with other signaling molecules p300 and HDAC1 in the p53 regulatory network can be engineered to extend the range of stress such that the system can be save from apoptosis.
△ Less
Submitted 9 October, 2015;
originally announced October 2015.
-
Molecular mechanisms, thermodynamics, and dissociation kinetics of knob-hole interactions in fibrin
Authors:
Olga Kononova,
Rustem I. Litvinov,
Artem Zhmurov,
Andrey Alekseenko,
Chai-Ho Cheng,
Silvi Agarwal,
Kenneth A. Marx,
John W. Weisel,
Valeri Barsegov
Abstract:
Polymerization of fibrin, the primary structural protein of blood clots and thrombi, occurs through binding of knobs 'A' and 'B' in the central nodule of fibrin monomer to complementary holes 'a' and 'b' in the beta- and gamma-nodules, respectively, of another monomer. We characterized the A:a and B:b knob-hole interactions under varying solution conditions using Molecular Dynamics simulations of…
▽ More
Polymerization of fibrin, the primary structural protein of blood clots and thrombi, occurs through binding of knobs 'A' and 'B' in the central nodule of fibrin monomer to complementary holes 'a' and 'b' in the beta- and gamma-nodules, respectively, of another monomer. We characterized the A:a and B:b knob-hole interactions under varying solution conditions using Molecular Dynamics simulations of the structural models of fibrin(ogen) fragment D complexed with synthetic peptides GPRP (knob 'A' mimetic) and GHRP (knob 'B' mimetic). The strength of A:a and B:b knob-hole complexes was roughly equal, decreasing with pulling force; yet, the dissociation kinetics were sensitive to variations in acidity (pH=5-7) and temperature (T=25-37 C). There were similar structural changes in holes 'a' and 'b' during forced dissociation of the knob-hole complexes: elongation of loop I, stretching of interior region, and translocation of the moveable flap. The disruption of the knob-hole interactions was not an "all-or-none" transition, as it occurred through distinct two-step or single-step pathways with or without intermediate states. The knob-hole bonds were stronger, tighter, and more brittle at pH=7 than at pH=5. The B:b knob-hole bonds were weaker, looser, and more compliant than the A:a knob-hole bonds at pH=7, but stronger, tighter, and less compliant at pH=5. Surprisingly, the knob-hole bonds were stronger, not weaker, at elevated temperature (T=37 C) compared to T=25 C due to the helix-to-coil transition in loop I, which helps stabilize the bonds. These results provide detailed qualitative and quantitative characteristics underlying the most significant non-covalent interactions involved in fibrin polymerization.
△ Less
Submitted 30 January, 2015;
originally announced January 2015.
-
Revisiting Date and Party Hubs: Novel Approaches to Role Assignment in Protein Interaction Networks
Authors:
Sumeet Agarwal,
Charlotte M. Deane,
Mason A. Porter,
Nick S. Jones
Abstract:
The idea of 'date' and 'party' hubs has been influential in the study of protein-protein interaction networks. Date hubs display low co-expression with their partners, whilst party hubs have high co-expression. It was proposed that party hubs are local coordinators whereas date hubs are global connectors. Here we show that the reported importance of date hubs to network connectivity can in fact be…
▽ More
The idea of 'date' and 'party' hubs has been influential in the study of protein-protein interaction networks. Date hubs display low co-expression with their partners, whilst party hubs have high co-expression. It was proposed that party hubs are local coordinators whereas date hubs are global connectors. Here we show that the reported importance of date hubs to network connectivity can in fact be attributed to a tiny subset of them. Crucially, these few, extremely central, hubs do not display particularly low expression correlation, undermining the idea of a link between this quantity and hub function. The date/party distinction was originally motivated by an approximately bimodal distribution of hub co-expression; we show that this feature is not always robust to methodological changes. Additionally, topological properties of hubs do not in general correlate with co-expression. Thus, we suggest that a date/party dichotomy is not meaningful and it might be more useful to conceive of roles for protein-protein interactions rather than individual proteins. We find significant correlations between interaction centrality and the functional similarity of the interacting proteins.
△ Less
Submitted 5 May, 2010; v1 submitted 2 November, 2009;
originally announced November 2009.