-
Interpretable Network Propagation with Application to Expanding the Repertoire of Human Proteins that Interact with SARS-CoV-2
Authors:
Jeffrey N. Law,
Kyle Akers,
Nure Tasnina,
Catherine M. Della Santina,
Shay Deutsch,
Meghana Kshirsagar,
Judith Klein-Seetharaman,
Mark Crovella,
Padmavathy Rajagopalan,
Simon Kasif,
T. M. Murali
Abstract:
Background: Network propagation has been widely used for nearly 20 years to predict gene functions and phenotypes. Despite the popularity of this approach, little attention has been paid to the question of provenance tracing in this context, e.g., determining how much any experimental observation in the input contributes to the score of every prediction.
Results: We design a network propagation…
▽ More
Background: Network propagation has been widely used for nearly 20 years to predict gene functions and phenotypes. Despite the popularity of this approach, little attention has been paid to the question of provenance tracing in this context, e.g., determining how much any experimental observation in the input contributes to the score of every prediction.
Results: We design a network propagation framework with two novel components and apply it to predict human proteins that directly or indirectly interact with SARS-CoV-2 proteins. First, we trace the provenance of each prediction to its experimentally validated sources, which in our case are human proteins experimentally determined to interact with viral proteins. Second, we design a technique that helps to reduce the manual adjustment of parameters by users. We find that for every top-ranking prediction, the highest contribution to its score arises from a direct neighbor in a human protein-protein interaction network. We further analyze these results to develop functional insights on SARS-CoV-2 that expand on known biology such as the connection between endoplasmic reticulum stress, HSPA5, and anti-clotting agents.
Conclusions: We examine how our provenance tracing method can be generalized to a broad class of network-based algorithms. We provide a useful resource for the SARS-CoV-2 community that implicates many previously undocumented proteins with putative functional relationships to viral infection. This resource includes potential drugs that can be opportunistically repositioned to target these proteins. We also discuss how our overall framework can be extended to other, newly-emerging viruses.
△ Less
Submitted 19 November, 2021; v1 submitted 2 June, 2020;
originally announced June 2020.
-
MT-CGCNN: Integrating Crystal Graph Convolutional Neural Network with Multitask Learning for Material Property Prediction
Authors:
Soumya Sanyal,
Janakiraman Balachandran,
Naganand Yadati,
Abhishek Kumar,
Padmini Rajagopalan,
Suchismita Sanyal,
Partha Talukdar
Abstract:
Develo** accurate, transferable and computationally inexpensive machine learning models can rapidly accelerate the discovery and development of new materials. Some of the major challenges involved in develo** such models are, (i) limited availability of materials data as compared to other fields, (ii) lack of universal descriptor of materials to predict its various properties. The limited avai…
▽ More
Develo** accurate, transferable and computationally inexpensive machine learning models can rapidly accelerate the discovery and development of new materials. Some of the major challenges involved in develo** such models are, (i) limited availability of materials data as compared to other fields, (ii) lack of universal descriptor of materials to predict its various properties. The limited availability of materials data can be addressed through transfer learning, while the generic representation was recently addressed by Xie and Grossman [1], where they developed a crystal graph convolutional neural network (CGCNN) that provides a unified representation of crystals. In this work, we develop a new model (MT-CGCNN) by integrating CGCNN with transfer learning based on multi-task (MT) learning. We demonstrate the effectiveness of MT-CGCNN by simultaneous prediction of various material properties such as Formation Energy, Band Gap and Fermi Energy for a wide range of inorganic crystals (46774 materials). MT-CGCNN is able to reduce the test error when employed on correlated properties by upto 8%. The model prediction has lower test error compared to CGCNN, even when the training data is reduced by 10%. We also demonstrate our model's better performance through prediction of end user scenario related to metal/non-metal classification. These results encourage further development of machine learning approaches which leverage multi-task learning to address the aforementioned challenges in the discovery of new materials. We make MT-CGCNN's source code available to encourage reproducible research.
△ Less
Submitted 14 November, 2018;
originally announced November 2018.
-
Application of Gist SVM in Cancer Detection
Authors:
S. Aruna,
S. P. Rajagopalan,
L. V. Nandakishore
Abstract:
In this paper, we study the application of GIST SVM in disease prediction (detection of cancer). Pattern classification problems can be effectively solved by Support vector machines. Here we propose a classifier which can differentiate patients having benign and malignant cancer cells. To improve the accuracy of classification, we propose to determine the optimal size of the training set and perfo…
▽ More
In this paper, we study the application of GIST SVM in disease prediction (detection of cancer). Pattern classification problems can be effectively solved by Support vector machines. Here we propose a classifier which can differentiate patients having benign and malignant cancer cells. To improve the accuracy of classification, we propose to determine the optimal size of the training set and perform feature selection. To find the optimal size of the training set, different sizes of training sets are experimented and the one with highest classification rate is selected. The optimal features are selected through their F-Scores.
△ Less
Submitted 6 March, 2012; v1 submitted 1 March, 2012;
originally announced March 2012.
-
The Expert System Designed to Improve Customer Satisfaction
Authors:
P. Isakki alias Devi,
S. P. Rajagopalan
Abstract:
Customer Relationship Management becomes a leading business strategy in highly competitive business environment. It aims to enhance the performance of the businesses by improving the customer satisfaction and loyalty. The objective of this paper is to improve customer satisfaction on product's colors and design with the help of the expert system developed by using Artificial Neural Networks. The e…
▽ More
Customer Relationship Management becomes a leading business strategy in highly competitive business environment. It aims to enhance the performance of the businesses by improving the customer satisfaction and loyalty. The objective of this paper is to improve customer satisfaction on product's colors and design with the help of the expert system developed by using Artificial Neural Networks. The expert system's role is to capture the knowledge of the experts and the data from the customer requirements, and then, process the collected data and form the appropriate rules for choosing product's colors and design. In order to identify the hidden pattern of the customer's needs, the Artificial Neural Networks technique has been applied to classify the colors and design based upon a list of selected information. Moreover, the expert system has the capability to make decisions in ranking the scores of the colors and design presented in the selection. In addition, the expert system has been validated with a different customer types.
△ Less
Submitted 9 December, 2011;
originally announced December 2011.