Search | arXiv e-print repository

Self-Supervised Learning Based Handwriting Verification

Authors: Mihir Chauhan, Mohammad Abuzar Shaikh, Bina Ramamurthy, Mingchen Gao, Siwei Lyu, Sargur Srihari

Abstract: We present SSL-HV: Self-Supervised Learning approaches applied to the task of Handwriting Verification. This task involves determining whether a given pair of handwritten images originate from the same or different writer distribution. We have compared the performance of multiple generative, contrastive SSL approaches against handcrafted feature extractors and supervised learning on CEDAR AND data… ▽ More We present SSL-HV: Self-Supervised Learning approaches applied to the task of Handwriting Verification. This task involves determining whether a given pair of handwritten images originate from the same or different writer distribution. We have compared the performance of multiple generative, contrastive SSL approaches against handcrafted feature extractors and supervised learning on CEDAR AND dataset. We show that ResNet based Variational Auto-Encoder (VAE) outperforms other generative approaches achieving 76.3% accuracy, while ResNet-18 fine-tuned using Variance-Invariance-Covariance Regularization (VICReg) outperforms other contrastive approaches achieving 78% accuracy. Using a pre-trained VAE and VICReg for the downstream task of writer verification we observed a relative improvement in accuracy of 6.7% and 9% over ResNet-18 supervised baseline with 10% writer labels. △ Less

Submitted 28 May, 2024; originally announced May 2024.

Comments: 14 pages, 6 figures, 2 tables

arXiv:2201.00627 [pdf, other]

Uncertainty Detection and Reduction in Neural Decoding of EEG Signals

Authors: Tiehang Duan, Zhenyi Wang, Sheng Liu, Sargur N. Srihari, Hui Yang

Abstract: EEG decoding systems based on deep neural networks have been widely used in decision making of brain computer interfaces (BCI). Their predictions, however, can be unreliable given the significant variance and noise in EEG signals. Previous works on EEG analysis mainly focus on the exploration of noise pattern in the source signal, while the uncertainty during the decoding process is largely unexpl… ▽ More EEG decoding systems based on deep neural networks have been widely used in decision making of brain computer interfaces (BCI). Their predictions, however, can be unreliable given the significant variance and noise in EEG signals. Previous works on EEG analysis mainly focus on the exploration of noise pattern in the source signal, while the uncertainty during the decoding process is largely unexplored. Automatically detecting and reducing such decoding uncertainty is important for BCI motor imagery applications such as robotic arm control etc. In this work, we proposed an uncertainty estimation and reduction model (UNCER) to quantify and mitigate the uncertainty during the EEG decoding process. It utilized a combination of dropout oriented method and Bayesian neural network for uncertainty estimation to incorporate both the uncertainty in the input signal and the uncertainty in the model parameters. We further proposed a data augmentation based approach for uncertainty reduction. The model can be integrated into current widely used EEG neural decoders without change of architecture. We performed extensive experiments for uncertainty estimation and its reduction in both intra-subject EEG decoding and cross-subject EEG decoding on two public motor imagery datasets, where the proposed model achieves significant improvement both on the quality of estimated uncertainty and the effectiveness of uncertainty reduction. △ Less

Submitted 1 October, 2022; v1 submitted 28 December, 2021; originally announced January 2022.

arXiv:2109.04993 [pdf, other]

LAViTeR: Learning Aligned Visual and Textual Representations Assisted by Image and Caption Generation

Authors: Mohammad Abuzar Shaikh, Zhanghexuan Ji, Dana Moukheiber, Yan Shen, Sargur Srihari, Mingchen Gao

Abstract: Pre-training visual and textual representations from large-scale image-text pairs is becoming a standard approach for many downstream vision-language tasks. The transformer-based models learn inter and intra-modal attention through a list of self-supervised learning tasks. This paper proposes LAViTeR, a novel architecture for visual and textual representation learning. The main module, Visual Text… ▽ More Pre-training visual and textual representations from large-scale image-text pairs is becoming a standard approach for many downstream vision-language tasks. The transformer-based models learn inter and intra-modal attention through a list of self-supervised learning tasks. This paper proposes LAViTeR, a novel architecture for visual and textual representation learning. The main module, Visual Textual Alignment (VTA) will be assisted by two auxiliary tasks, GAN-based image synthesis and Image Captioning. We also propose a new evaluation metric measuring the similarity between the learnt visual and textual embedding. The experimental results on two public datasets, CUB and MS-COCO, demonstrate superior visual and textual representation alignment in the joint feature embedding space △ Less

Submitted 19 October, 2021; v1 submitted 4 September, 2021; originally announced September 2021.

Comments: 14 pages, 10 Figures, 5 Tables

arXiv:2109.01949 [pdf, other]

Improving Joint Learning of Chest X-Ray and Radiology Report by Word Region Alignment

Authors: Zhanghexuan Ji, Mohammad Abuzar Shaikh, Dana Moukheiber, Sargur Srihari, Yifan Peng, Mingchen Gao

Abstract: Self-supervised learning provides an opportunity to explore unlabeled chest X-rays and their associated free-text reports accumulated in clinical routine without manual supervision. This paper proposes a Joint Image Text Representation Learning Network (JoImTeRNet) for pre-training on chest X-ray images and their radiology reports. The model was pre-trained on both the global image-sentence level… ▽ More Self-supervised learning provides an opportunity to explore unlabeled chest X-rays and their associated free-text reports accumulated in clinical routine without manual supervision. This paper proposes a Joint Image Text Representation Learning Network (JoImTeRNet) for pre-training on chest X-ray images and their radiology reports. The model was pre-trained on both the global image-sentence level and the local image region-word level for visual-textual matching. Both are bidirectionally constrained on Cross-Entropy based and ranking-based Triplet Matching Losses. The region-word matching is calculated using the attention mechanism without direct supervision about their map**. The pre-trained multi-modal representation learning paves the way for downstream tasks concerning image and/or text encoding. We demonstrate the representation learning quality by cross-modality retrievals and multi-label classifications on two datasets: OpenI-IU and MIMIC-CXR △ Less

Submitted 4 September, 2021; originally announced September 2021.

Comments: 10 Pages, 1 Figure, 3 Tables, Accepted in 12th Machine Learning in Medical Imaging (MLMI 2021) workshop

arXiv:2105.03358 [pdf, other]

Soft-Attention Improves Skin Cancer Classification Performance

Authors: Soumyya Kanti Datta, Mohammad Abuzar Shaikh, Sargur N. Srihari, Mingchen Gao

Abstract: In clinical applications, neural networks must focus on and highlight the most important parts of an input image. Soft-Attention mechanism enables a neural network toachieve this goal. This paper investigates the effectiveness of Soft-Attention in deep neural architectures. The central aim of Soft-Attention is to boost the value of important features and suppress the noise-inducing features. We co… ▽ More In clinical applications, neural networks must focus on and highlight the most important parts of an input image. Soft-Attention mechanism enables a neural network toachieve this goal. This paper investigates the effectiveness of Soft-Attention in deep neural architectures. The central aim of Soft-Attention is to boost the value of important features and suppress the noise-inducing features. We compare the performance of VGG, ResNet, InceptionResNetv2 and DenseNet architectures with and without the Soft-Attention mechanism, while classifying skin lesions. The original network when coupled with Soft-Attention outperforms the baseline[16] by 4.7% while achieving a precision of 93.7% on HAM10000 dataset [25]. Additionally, Soft-Attention coupling improves the sensitivity score by 3.8% compared to baseline[31] and achieves 91.6% on ISIC-2017 dataset [2]. The code is publicly available at github. △ Less

Submitted 4 June, 2021; v1 submitted 4 May, 2021; originally announced May 2021.

Comments: 8 pages, 9 figures, 4 tables

arXiv:2009.04532 [pdf, other]

doi 10.1109/ICFHR2020.2020.00074

Attention based Writer Independent Handwriting Verification

Authors: Mohammad Abuzar Shaikh, Tiehang Duan, Mihir Chauhan, Sargur Srihari

Abstract: The task of writer verification is to provide a likelihood score for whether the queried and known handwritten image samples belong to the same writer or not. Such a task calls for the neural network to make it's outcome interpretable, i.e. provide a view into the network's decision making process. We implement and integrate cross-attention and soft-attention mechanisms to capture the highly corre… ▽ More The task of writer verification is to provide a likelihood score for whether the queried and known handwritten image samples belong to the same writer or not. Such a task calls for the neural network to make it's outcome interpretable, i.e. provide a view into the network's decision making process. We implement and integrate cross-attention and soft-attention mechanisms to capture the highly correlated and salient points in feature space of 2D inputs. The attention maps serve as an explanation premise for the network's output likelihood score. The attention mechanism also allows the network to focus more on relevant areas of the input, thus improving the classification performance. Our proposed approach achieves a precision of 86\% for detecting intra-writer cases in CEDAR cursive "AND" dataset. Furthermore, we generate meaningful explanations for the provided decision by extracting attention maps from multiple levels of the network. △ Less

Submitted 30 September, 2020; v1 submitted 7 September, 2020; originally announced September 2020.

Comments: 7 pages, 6 figures, Published in 2020 17th International Conference on Frontiers in Handwriting Recognition (ICFHR)

arXiv:2003.06113 [pdf, ps, other]

Ultra Efficient Transfer Learning with Meta Update for Cross Subject EEG Classification

Authors: Tiehang Duan, Mihir Chauhan, Mohammad Abuzar Shaikh, Jun Chu, Sargur Srihari

Abstract: The pattern of Electroencephalogram (EEG) signal differs significantly across different subjects, and poses challenge for EEG classifiers in terms of 1) effectively adapting a learned classifier onto a new subject, 2) retaining knowledge of known subjects after the adaptation. We propose an efficient transfer learning method, named Meta UPdate Strategy (MUPS-EEG), for continuous EEG classification… ▽ More The pattern of Electroencephalogram (EEG) signal differs significantly across different subjects, and poses challenge for EEG classifiers in terms of 1) effectively adapting a learned classifier onto a new subject, 2) retaining knowledge of known subjects after the adaptation. We propose an efficient transfer learning method, named Meta UPdate Strategy (MUPS-EEG), for continuous EEG classification across different subjects. The model learns effective representations with meta update which accelerates adaptation on new subject and mitigate forgetting of knowledge on previous subjects at the same time. The proposed mechanism originates from meta learning and works to 1) find feature representation that is broadly suitable for different subjects, 2) maximizes sensitivity of loss function for fast adaptation on new subject. The method can be applied to all deep learning oriented models. Extensive experiments on two public datasets demonstrate the effectiveness of the proposed model, outperforming current state of the arts by a large margin in terms of both adapting on new subject and retain knowledge of learned subjects. △ Less

Submitted 1 March, 2021; v1 submitted 13 March, 2020; originally announced March 2020.

arXiv:1909.02548 [pdf, other]

Explanation based Handwriting Verification

Authors: Mihir Chauhan, Mohammad Abuzar Shaikh, Sargur N. Srihari

Abstract: Deep learning system have drawback that their output is not accompanied with ex-planation. In a domain such as forensic handwriting verification it is essential to provideexplanation to jurors. The goal of handwriting verification is to find a measure of confi-dence whether the given handwritten samples are written by the same or different writer.We propose a method to generate explanations for th… ▽ More Deep learning system have drawback that their output is not accompanied with ex-planation. In a domain such as forensic handwriting verification it is essential to provideexplanation to jurors. The goal of handwriting verification is to find a measure of confi-dence whether the given handwritten samples are written by the same or different writer.We propose a method to generate explanations for the confidence provided by convolu-tional neural network (CNN) which maps the input image to 15 annotations (features)provided by experts. Our system comprises of: (1) Feature learning network (FLN),a differentiable system, (2) Inference module for providing explanations. Furthermore,inference module provides two types of explanations: (a) Based on cosine similaritybetween categorical probabilities of each feature, (b) Based on Log-Likelihood Ratio(LLR) using directed probabilistic graphical model. We perform experiments using acombination of feature learning network (FLN) and each inference module. We evaluateour system using XAI-AND dataset, containing 13700 handwritten samples and 15 cor-responding expert examined features for each sample. The dataset is released for publicuse and the methods can be extended to provide explanations on other verification taskslike face verification and bio-medical comparison. This dataset can serve as the basis and benchmark for future research in explanation based handwriting verification. The code is available on github. △ Less

Submitted 14 August, 2019; originally announced September 2019.

Comments: Presented at BMVC 2019: Workshop on Interpretable and Explainable Machine Vision, Cardiff, UK

arXiv:1812.02621 [pdf, other]

doi 10.1109/ICFHR-2018.2018.00041

Hybrid Feature Learning for Handwriting Verification

Authors: Mohammad Abuzar Shaikh, Mihir Chauhan, Jun Chu, Sargur Srihari

Abstract: We propose an effective Hybrid Deep Learning (HDL) architecture for the task of determining the probability that a questioned handwritten word has been written by a known writer. HDL is an amalgamation of Auto-Learned Features (ALF) and Human-Engineered Features (HEF). To extract auto-learned features we use two methods: First, Two Channel Convolutional Neural Network (TC-CNN); Second, Two Channel… ▽ More We propose an effective Hybrid Deep Learning (HDL) architecture for the task of determining the probability that a questioned handwritten word has been written by a known writer. HDL is an amalgamation of Auto-Learned Features (ALF) and Human-Engineered Features (HEF). To extract auto-learned features we use two methods: First, Two Channel Convolutional Neural Network (TC-CNN); Second, Two Channel Autoencoder (TC-AE). Furthermore, human-engineered features are extracted by using two methods: First, Gradient Structural Concavity (GSC); Second, Scale Invariant Feature Transform (SIFT). Experiments are performed by complementing one of the HEF methods with one ALF method on 150000 pairs of samples of the word "AND" cropped from handwritten notes written by 1500 writers. Our results indicate that HDL architecture with AE-GSC achieves 99.7% accuracy on seen writer dataset and 92.16% accuracy on shuffled writer dataset which out performs CEDAR-FOX, as for unseen writer dataset, AE-SIFT performs comparable to this sophisticated handwriting comparison tool. △ Less

Submitted 18 November, 2018; originally announced December 2018.

Comments: Accepted and presented in International Conference on Frontiers in Handwriting Recognition (ICFHR) 2018

arXiv:1811.12500 [pdf, other]

Sequential Embedding Induced Text Clustering, a Non-parametric Bayesian Approach

Authors: Tiehang Duan, Qi Lou, Sargur N. Srihari, Xiaohui Xie

Abstract: Current state-of-the-art nonparametric Bayesian text clustering methods model documents through multinomial distribution on bags of words. Although these methods can effectively utilize the word burstiness representation of documents and achieve decent performance, they do not explore the sequential information of text and relationships among synonyms. In this paper, the documents are modeled as t… ▽ More Current state-of-the-art nonparametric Bayesian text clustering methods model documents through multinomial distribution on bags of words. Although these methods can effectively utilize the word burstiness representation of documents and achieve decent performance, they do not explore the sequential information of text and relationships among synonyms. In this paper, the documents are modeled as the joint of bags of words, sequential features and word embeddings. We proposed Sequential Embedding induced Dirichlet Process Mixture Model (SiDPMM) to effectively exploit this joint document representation in text clustering. The sequential features are extracted by the encoder-decoder component. Word embeddings produced by the continuous-bag-of-words (CBOW) model are introduced to handle synonyms. Experimental results demonstrate the benefits of our model in two major aspects: 1) improved performance across multiple diverse text datasets in terms of the normalized mutual information (NMI); 2) more accurate inference of ground truth cluster numbers with regularization effect on tiny outlier clusters. △ Less

Submitted 29 November, 2018; originally announced November 2018.

arXiv:1612.01075 [pdf, other]

Joint Visual Denoising and Classification using Deep Learning

Authors: Gang Chen, Yawei Li, Sargur N. Srihari

Abstract: Visual restoration and recognition are traditionally addressed in pipeline fashion, i.e. denoising followed by classification. Instead, observing correlations between the two tasks, for example clearer image will lead to better categorization and vice visa, we propose a joint framework for visual restoration and recognition for handwritten images, inspired by advances in deep autoencoder and multi… ▽ More Visual restoration and recognition are traditionally addressed in pipeline fashion, i.e. denoising followed by classification. Instead, observing correlations between the two tasks, for example clearer image will lead to better categorization and vice visa, we propose a joint framework for visual restoration and recognition for handwritten images, inspired by advances in deep autoencoder and multi-modality learning. Our model is a 3-pathway deep architecture with a hidden-layer representation which is shared by multi-inputs and outputs, and each branch can be composed of a multi-layer deep model. Thus, visual restoration and classification can be unified using shared representation via non-linear map**, and model parameters can be learnt via backpropagation. Using MNIST and USPS data corrupted with structured noise, the proposed framework performs at least 20\% better in classification than separate pipelines, as well as clearer recovered images. The noise model and the reproducible source code is available at {\url{https://github.com/ganggit/jointmodel}}. △ Less

Submitted 4 December, 2016; originally announced December 2016.

Comments: 5 pages, 7 figures, ICIP 2016

arXiv:1612.01072 [pdf, other]

Word Recognition with Deep Conditional Random Fields

Authors: Gang Chen, Yawei Li, Sargur N. Srihari

Abstract: Recognition of handwritten words continues to be an important problem in document analysis and recognition. Existing approaches extract hand-engineered features from word images--which can perform poorly with new data sets. Recently, deep learning has attracted great attention because of the ability to learn features from raw data. Moreover they have yielded state-of-the-art results in classificat… ▽ More Recognition of handwritten words continues to be an important problem in document analysis and recognition. Existing approaches extract hand-engineered features from word images--which can perform poorly with new data sets. Recently, deep learning has attracted great attention because of the ability to learn features from raw data. Moreover they have yielded state-of-the-art results in classification tasks including character recognition and scene recognition. On the other hand, word recognition is a sequential problem where we need to model the correlation between characters. In this paper, we propose using deep Conditional Random Fields (deep CRFs) for word recognition. Basically, we combine CRFs with deep learning, in which deep features are learned and sequences are labeled in a unified framework. We pre-train the deep structure with stacked restricted Boltzmann machines (RBMs) for feature learning and optimize the entire network with an online learning algorithm. The proposed model was evaluated on two datasets, and seen to perform significantly better than competitive baseline models. The source code is available at https://github.com/ganggit/deepCRFs. △ Less

Submitted 4 December, 2016; originally announced December 2016.

Comments: 5 pages, published in ICIP 2016. arXiv admin note: substantial text overlap with arXiv:1412.3397

arXiv:1611.02181 [pdf, other]

Using Social Dynamics to Make Individual Predictions: Variational Inference with a Stochastic Kinetic Model

Authors: Zhen Xu, Wen Dong, Sargur Srihari

Abstract: Social dynamics is concerned primarily with interactions among individuals and the resulting group behaviors, modeling the temporal evolution of social systems via the interactions of individuals within these systems. In particular, the availability of large-scale data from social networks and sensor networks offers an unprecedented opportunity to predict state-changing events at the individual le… ▽ More Social dynamics is concerned primarily with interactions among individuals and the resulting group behaviors, modeling the temporal evolution of social systems via the interactions of individuals within these systems. In particular, the availability of large-scale data from social networks and sensor networks offers an unprecedented opportunity to predict state-changing events at the individual level. Examples of such events include disease transmission, opinion transition in elections, and rumor propagation. Unlike previous research focusing on the collective effects of social systems, this study makes efficient inferences at the individual level. In order to cope with dynamic interactions among a large number of individuals, we introduce the stochastic kinetic model to capture adaptive transition probabilities and propose an efficient variational inference algorithm the complexity of which grows linearly --- rather than exponentially --- with the number of individuals. To validate this method, we have performed epidemic-dynamics experiments on wireless sensor network data collected from more than ten thousand people over three years. The proposed algorithm was used to track disease transmission and predict the probability of infection for each individual. Our results demonstrate that this method is more efficient than sampling while nonetheless achieving high accuracy. △ Less

Submitted 7 November, 2016; originally announced November 2016.

Comments: In proceedings of 29th Conference on Neural Information Processing Systems (NIPS 2016)

arXiv:1602.00096 [pdf]

Exploiting synthetic lethal vulnerabilities for cancer therapy

Authors: Sriganesh Srihari

Abstract: Synthetic lethality refers to a combination of two or more genetic events (typically affecting different genes) in which the co-occurrence of the events results in cell or organismal lethality, but the cell or organism remains viable when only one of the events occurs. Synthetic lethality has gained attention in the last few years for its value in selective killing of cancer cells: by targeting th… ▽ More Synthetic lethality refers to a combination of two or more genetic events (typically affecting different genes) in which the co-occurrence of the events results in cell or organismal lethality, but the cell or organism remains viable when only one of the events occurs. Synthetic lethality has gained attention in the last few years for its value in selective killing of cancer cells: by targeting the synthetic lethal partner of an altered gene in cancer, only the cancer cells can be killed while sparing normal cells. In a recent study, we showed that mutual exclusive combinations of genetic events in cancer hint at naturally occurring synthetic lethal combinations, and therefore by systematically mining for these combinations we can identify novel therapeutic targets for cancer. Based on this, we had identified a list of 718 genes that are mutually exclusive to six DNA-damage response genes in cancer. Here, we extend these results to identify a subset of 43 genes whose over-expression correlates with significantly poor survival in estrogen receptor-negative breast cancers, and thus provide a promising list of potential therapeutic targets and/or biomarkers. △ Less

Submitted 30 January, 2016; originally announced February 2016.

Comments: 5 Figures

MSC Class: 92C42

arXiv:1601.05117 [pdf]

doi 10.1039/C5MB00655D

Understanding the functional impact of copy number alterations in breast cancer using a network modeling approach

Authors: Sriganesh Srihari, Murugan Kalimutho, Samir Lal, Jitin Singla, Dhaval Patel, Peter T. Simpson, Kum Kum Khanna, Mark A. Ragan

Abstract: Copy number alterations (CNAs) are thought to account for 85% of the variation in gene expression observed among breast tumours. The expression of cis-associated genes is impacted by CNAs occurring at proximal loci of these genes, whereas the expression of trans-associated genes is impacted by CNAs occurring at distal loci. While a majority of these CNA-driven genes responsible for breast tumourig… ▽ More Copy number alterations (CNAs) are thought to account for 85% of the variation in gene expression observed among breast tumours. The expression of cis-associated genes is impacted by CNAs occurring at proximal loci of these genes, whereas the expression of trans-associated genes is impacted by CNAs occurring at distal loci. While a majority of these CNA-driven genes responsible for breast tumourigenesis are cis-associated, trans-associated genes are thought to further abet the development of cancer and influence disease outcomes in patients. Here we present a network-based approach that integrates copy-number and expression profiles to identify putative cis- and trans-associated genes in breast cancer pathogenesis. We validate these cis- and trans-associated genes by employing them to subtype a large cohort of breast tumours obtained from the METABRIC consortium, and demonstrate that these genes accurately reconstruct the ten subtypes of breast cancer. We observe that individual breast cancer subtypes are driven by distinct sets of cis- and trans-associated genes. Among the cis-associated genes, we recover several known drivers of breast cancer (e.g. CCND1, ERRB2, MDM2 and ZNF703) and some novel putative drivers (e.g. BRF2 and SF3B3). siRNA-mediated knockdown of BRF2 across a panel of breast cancer cell lines showed significant reduction specifically in cell proliferation in HER2+ lines, thereby indicating that BRF2 could be a context-dependent oncogene and potentially targetable in these lines. Among the trans-associated genes, we identify modules of immune-response (CD2, CD19, CD38 and CD79B), mitotic/cell-cycle kinases (e.g. AURKB, MELK, PLK1 and TTK), and DNA-damage response genes (e.g. RFC4 and FEN1). △ Less

Submitted 19 January, 2016; originally announced January 2016.

Comments: 23 pages, 2 tables, 7 figures

MSC Class: 92Bxx

arXiv:1510.00815 [pdf]

doi 10.1186/s13062-015-0086-1

Inferring synthetic lethal interactions from mutual exclusivity of genetic events in cancer

Authors: Sriganesh Srihari, Jitin Singla, Limsoon Wong, Mark A. Ragan

Abstract: Background: Synthetic lethality (SL) refers to the genetic interaction between two or more genes where only their co-alteration (e.g. by mutations, amplifications or deletions) results in cell death. In recent years, SL has emerged as an attractive therapeutic strategy against cancer: by targeting the SL partners of altered genes in cancer cells, these cells can be selectively killed while sparing… ▽ More Background: Synthetic lethality (SL) refers to the genetic interaction between two or more genes where only their co-alteration (e.g. by mutations, amplifications or deletions) results in cell death. In recent years, SL has emerged as an attractive therapeutic strategy against cancer: by targeting the SL partners of altered genes in cancer cells, these cells can be selectively killed while sparing the normal cells. Consequently, a number of studies have attempted prediction of SL interactions in human, a majority by extrapolating SL interactions inferred through large-scale screens in model organisms. However, these predicted SL interactions either do not hold in human cells or do not include genes that are (frequently) altered in human cancers, and are therefore not attractive in the context of cancer therapy. Results: Here, we develop a computational approach to infer SL interactions directly from frequently altered genes in human cancers. It is based on the observation that pairs of genes that are altered in a (significantly) mutually exclusive manner in cancers are likely to constitute lethal combinations. Using genomic copy-number and gene-expression data from four cancers, breast, prostate, ovarian and uterine (total 3980 samples) from The Cancer Genome Atlas, we identify 718 genes that are frequently amplified or upregulated, and are likely to be synthetic lethal with six key DNA-damage response (DDR) genes in these cancers. By comparing with published data on gene essentiality (~16000 genes) from ten DDR-deficient cancer cell lines, we show that our identified genes are enriched among the top quartile of essential genes in these cell lines, implying that our inferred genes are highly likely to be (synthetic) lethal upon knockdown in these cell lines. △ Less

Submitted 3 October, 2015; originally announced October 2015.

Comments: 35 pages, 7 figures

MSC Class: 92Bxx

Journal ref: Biology Direct 2015, 10:57

arXiv:1506.01863 [pdf]

Challenges and open problems in computational prediction of protein complexes: the case of membrane complexes

Authors: Sriganesh Srihari

Abstract: Identifying the entire set of complexes is essential not only to understand complex formations, but also to map the high level organisation of the cell. Computational prediction of protein complexes faces several challenges including the lack of sufficient protein interactions, presence of noise in protein interaction datasets and difficulty in predicting small and sparse complexes. These challeng… ▽ More Identifying the entire set of complexes is essential not only to understand complex formations, but also to map the high level organisation of the cell. Computational prediction of protein complexes faces several challenges including the lack of sufficient protein interactions, presence of noise in protein interaction datasets and difficulty in predicting small and sparse complexes. These challenges are covered in most reviews of complex prediction methods. However, an important challenge that needs to be addressed is the prediction of membrane complexes. These are often ignored because existing protein interaction detection techniques do not detect interactions between membrane proteins. But, recently there have been several new experimental techniques including MY2H that are capable of detecting membrane protein interactions. In the light of this new data, we discuss here new challenges and the kind of open problems that need to be solved to effectively detect membrane complexes. △ Less

Submitted 5 June, 2015; originally announced June 2015.

Comments: 7 pages

MSC Class: 92B05

arXiv:1505.05328 [pdf]

doi 10.1016/j.febslet.2015.04.026

Methods for protein complex prediction and their contributions towards understanding the organization, function and dynamics of complexes

Authors: Sriganesh Srihari, Chern Han Yong, Ashwini Patil, Limsoon Wong

Abstract: Complexes of physically interacting proteins constitute fundamental functional units responsible for driving biological processes within cells. A faithful reconstruction of the entire set of complexes is therefore essential to understand the functional organization of cells. In this review, we discuss the key contributions of computational methods developed till date (approximately between 2003 an… ▽ More Complexes of physically interacting proteins constitute fundamental functional units responsible for driving biological processes within cells. A faithful reconstruction of the entire set of complexes is therefore essential to understand the functional organization of cells. In this review, we discuss the key contributions of computational methods developed till date (approximately between 2003 and 2015) for identifying complexes from the network of interacting proteins (PPI network). We evaluate in depth the performance of these methods on PPI datasets from yeast, and highlight challenges faced by these methods, in particular detection of sparse and small or sub- complexes and discerning of overlap** complexes. We describe methods for integrating diverse information including expression profiles and 3D structures of proteins with PPI networks to understand the dynamics of complex formation, for instance, of time-based assembly of complex subunits and formation of fuzzy complexes from intrinsically disordered proteins. Finally, we discuss methods for identifying dysfunctional complexes in human diseases, an application that is proving invaluable to understand disease mechanisms and to discover novel therapeutic targets. We hope this review aptly commemorates a decade of research on computational prediction of complexes and constitutes a valuable reference for further advancements in this exciting area. △ Less

Submitted 20 May, 2015; originally announced May 2015.

Comments: 1 Table

MSC Class: 68

arXiv:1505.04846 [pdf]

doi 10.1093/bib/bbu020

An evaluation of DNA-damage response and cell-cycle pathways for breast cancer classification

Authors: Atefeh Taherian Fard, Sriganesh Srihari, Mark A. Ragan

Abstract: Accurate subty** or classification of breast cancer is important for ensuring proper treatment of patients and also for understanding the molecular mechanisms driving this disease. While there have been several gene signatures proposed in the literature to classify breast tumours, these signatures show very low overlaps, different classification performance, and not much relevance to the underly… ▽ More Accurate subty** or classification of breast cancer is important for ensuring proper treatment of patients and also for understanding the molecular mechanisms driving this disease. While there have been several gene signatures proposed in the literature to classify breast tumours, these signatures show very low overlaps, different classification performance, and not much relevance to the underlying biology of these tumours. Here we evaluate DNA-damage response (DDR) and cell cycle pathways, which are critical pathways implicated in a considerable proportion of breast tumours, for their usefulness and ability in breast tumour subty**. We think that subty** breast tumours based on these two pathways could lead to vital insights into molecular mechanisms driving these tumours. Here, we performed a systematic evaluation of DDR and cell-cycle pathways for subty** of breast tumours into the five known intrinsic subtypes. Homologous Recombination (HR) pathway showed the best performance in subty** breast tumours, indicating that HR genes are strongly involved in all breast tumours. Comparisons of pathway based signatures and two standard gene signatures supported the use of known pathways for breast tumour subty**. Further, the evaluation of these standard gene signatures showed that breast tumour subty**, prognosis and survival estimation are all closely related. Finally, we constructed an all-inclusive super-signature by combining (union of) all genes and performing a stringent feature selection, and found it to be reasonably accurate and robust in classification as well as prognostic value. Adopting DDR and cell cycle pathways for breast tumour subty** achieved robust and accurate breast tumour subty**, and constructing a super-signature which contains feature selected mix of genes from these molecular pathways as well as clinical aspects is valuable in clinical practice. △ Less

Submitted 18 May, 2015; originally announced May 2015.

Comments: 28 pages, 7 figures, 6 tables

MSC Class: 92-08

arXiv:1505.00925 [pdf, ps, other]

Parameterized Algorithms for Clustering PPI Networks

Authors: Sriganesh Srihari, Hon Wai Leong

Abstract: With the advent of high-throughput wet lab technologies the amount of protein interaction data available publicly has increased substantially, in turn spurring a plethora of computational methods for in silico knowledge discovery from this data. In this paper, we focus on parameterized methods for modeling and solving complex computational problems encountered in such knowledge discovery from prot… ▽ More With the advent of high-throughput wet lab technologies the amount of protein interaction data available publicly has increased substantially, in turn spurring a plethora of computational methods for in silico knowledge discovery from this data. In this paper, we focus on parameterized methods for modeling and solving complex computational problems encountered in such knowledge discovery from protein data. Specifically, we concentrate on three relevant problems today in proteomics, namely detection of lethal proteins, functional modules and alignments from protein interaction networks. We propose novel graph theoretic models for these problems and devise practical parameterized algorithms. At a broader level, we demonstrate how these methods can be viable alternatives for the several heurestic, randomized, approximation and sub-optimal methods by arriving at parameterized yet optimal solutions for these problems. We substantiate these theoretical results by experimenting on real protein interaction data of S. cerevisiae (budding yeast) and verifying the results using gene ontology. △ Less

Submitted 5 May, 2015; originally announced May 2015.

Comments: 10 pages, 4 tables

MSC Class: 92C42

arXiv:1503.07906 [pdf, other]

Generalized K-fan Multimodal Deep Model with Shared Representations

Authors: Gang Chen, Sargur N. Srihari

Abstract: Multimodal learning with deep Boltzmann machines (DBMs) is an generative approach to fuse multimodal inputs, and can learn the shared representation via Contrastive Divergence (CD) for classification and information retrieval tasks. However, it is a 2-fan DBM model, and cannot effectively handle multiple prediction tasks. Moreover, this model cannot recover the hidden representations well by sampl… ▽ More Multimodal learning with deep Boltzmann machines (DBMs) is an generative approach to fuse multimodal inputs, and can learn the shared representation via Contrastive Divergence (CD) for classification and information retrieval tasks. However, it is a 2-fan DBM model, and cannot effectively handle multiple prediction tasks. Moreover, this model cannot recover the hidden representations well by sampling from the conditional distribution when more than one modalities are missing. In this paper, we propose a K-fan deep structure model, which can handle the multi-input and muti-output learning problems effectively. In particular, the deep structure has K-branch for different inputs where each branch can be composed of a multi-layer deep model, and a shared representation is learned in an discriminative manner to tackle multimodal tasks. Given the deep structure, we propose two objective functions to handle two multi-input and multi-output tasks: joint visual restoration and labeling, and the multi-view multi-calss object recognition tasks. To estimate the model parameters, we initialize the deep model parameters with CD to maximize the joint distribution, and then we use backpropagation to update the model according to specific objective function. The experimental results demonstrate that the model can effectively leverages multi-source information and predict multiple tasks well over competitive baselines. △ Less

Submitted 26 March, 2015; originally announced March 2015.

Comments: 11 pages, 5 figures

MSC Class: 68T10 ACM Class: I.2.6

arXiv:1412.3397 [pdf, other]

Sequential Labeling with online Deep Learning

Authors: Gang Chen, Ran Xu, Sargur Srihari

Abstract: Deep learning has attracted great attention recently and yielded the state of the art performance in dimension reduction and classification problems. However, it cannot effectively handle the structured output prediction, e.g. sequential labeling. In this paper, we propose a deep learning structure, which can learn discriminative features for sequential labeling problems. More specifically, we add… ▽ More Deep learning has attracted great attention recently and yielded the state of the art performance in dimension reduction and classification problems. However, it cannot effectively handle the structured output prediction, e.g. sequential labeling. In this paper, we propose a deep learning structure, which can learn discriminative features for sequential labeling problems. More specifically, we add the inter-relationship between labels in our deep learning structure, in order to incorporate the context information from the sequential data. Thus, our model is more powerful than linear Conditional Random Fields (CRFs) because the objective function learns latent non-linear features so that target labeling can be better predicted. We pretrain the deep structure with stacked restricted Boltzmann machines (RBMs) for feature learning and optimize our objective function with online learning algorithm, a mixture of perceptron training and stochastic gradient descent. We test our model on different challenge tasks, and show that our model outperforms significantly over the completive baselines. △ Less

Submitted 3 May, 2015; v1 submitted 10 December, 2014; originally announced December 2014.

Comments: 9 pages, 1 figure

MSC Class: 68T10 ACM Class: I.2.6

arXiv:1408.1177 [pdf, ps, other]

Complex-based analysis of dysregulated cellular processes in cancer

Authors: Sriganesh Srihari, Piyush B. Madhamshettiwar, Sarah Song, Chao Liu, Peter T. Simpson, Kum Kum Khanna, Mark A. Ragan

Abstract: Background: Differential expression analysis of (individual) genes is often used to study their roles in diseases. However, diseases such as cancer are a result of the combined effect of multiple genes. Gene products such as proteins seldom act in isolation, but instead constitute stable multi-protein complexes performing dedicated functions. Therefore, complexes aggregate the effect of individual… ▽ More Background: Differential expression analysis of (individual) genes is often used to study their roles in diseases. However, diseases such as cancer are a result of the combined effect of multiple genes. Gene products such as proteins seldom act in isolation, but instead constitute stable multi-protein complexes performing dedicated functions. Therefore, complexes aggregate the effect of individual genes (proteins) and can be used to gain a better understanding of cancer mechanisms. Here, we observe that complexes show considerable changes in their expression, in turn directed by the concerted action of transcription factors (TFs), across cancer conditions. We seek to gain novel insights into cancer mechanisms through a systematic analysis of complexes and their transcriptional regulation. Results: We integrated large-scale protein-interaction (PPI) and gene-expression datasets to identify complexes that exhibit significant changes in their expression across different conditions in cancer. We devised a log-linear model to relate these changes to the differential regulation of complexes by TFs. The application of our model on two case studies involving pancreatic and familial breast tumour conditions revealed: (i) complexes in core cellular processes, especially those responsible for maintaining genome stability and cell proliferation (e.g. DNA damage repair and cell cycle) show considerable changes in expression; (ii) these changes include decrease and countering increase for different sets of complexes indicative of compensatory mechanisms coming into play in tumours; and (iii) TFs work in cooperative and counteractive ways to regulate these mechanisms. Such aberrant complexes and their regulating TFs play vital roles in the initiation and progression of cancer. △ Less

Submitted 6 August, 2014; originally announced August 2014.

Comments: 22 pages, BMC Systems Biology

MSC Class: 92Bxx

arXiv:1406.3407 [pdf, other]

Restricted Boltzmann Machine for Classification with Hierarchical Correlated Prior

Authors: Gang Chen, Sargur H. Srihari

Abstract: Restricted Boltzmann machines (RBM) and its variants have become hot research topics recently, and widely applied to many classification problems, such as character recognition and document categorization. Often, classification RBM ignores the interclass relationship or prior knowledge of sharing information among classes. In this paper, we are interested in RBM with the hierarchical prior over cl… ▽ More Restricted Boltzmann machines (RBM) and its variants have become hot research topics recently, and widely applied to many classification problems, such as character recognition and document categorization. Often, classification RBM ignores the interclass relationship or prior knowledge of sharing information among classes. In this paper, we are interested in RBM with the hierarchical prior over classes. We assume parameters for nearby nodes are correlated in the hierarchical tree, and further the parameters at each node of the tree be orthogonal to those at its ancestors. We propose a hierarchical correlated RBM for classification problem, which generalizes the classification RBM with sharing information among different classes. In order to reduce the redundancy between node parameters in the hierarchy, we also introduce orthogonal restrictions to our objective function. We test our method on challenge datasets, and show promising results compared to competitive baselines. △ Less

Submitted 20 April, 2015; v1 submitted 12 June, 2014; originally announced June 2014.

Comments: 13 pages, 5 figures

MSC Class: 68T10 ACM Class: I.2.6

arXiv:1310.3528 [pdf, other]

doi 10.1109/TCBB.2013.128

Evolution and Controllability of Cancer Networks: a Boolean Perspective

Authors: Sriganesh Srihari, Venkatesh Raman, Hon Wai Leong, Mark A. Ragan

Abstract: Cancer forms a robust system and progresses as stages over time typically with increasing aggressiveness and worsening prognosis. Characterizing these stages and identifying the genes driving transitions between them is critical to understand cancer progression and to develop effective anti-cancer therapies. Here, we propose a novel model of the 'cancer system' as a Boolean state space in which a… ▽ More Cancer forms a robust system and progresses as stages over time typically with increasing aggressiveness and worsening prognosis. Characterizing these stages and identifying the genes driving transitions between them is critical to understand cancer progression and to develop effective anti-cancer therapies. Here, we propose a novel model of the 'cancer system' as a Boolean state space in which a Boolean network, built from protein interaction and gene-expression data from different stages of cancer, transits between Boolean satisfiability states by "editing" interactions and "flip**" genes. The application of our model (called BoolSpace) on three case studies - pancreatic and breast tumours in human and post spinal-cord injury in rats - reveals valuable insights into the phenomenon of cancer progression. In particular, we notice that several of the genes flipped are serine/threonine kinases which act as natural cellular switches and that different sets of genes are flipped during the initial and final stages indicating a pattern to tumour progression. We hypothesize that robustness of cancer partly stems from "passing of the baton" between genes at different stages, and therefore an effective therapy should target a "cover set" of these genes. A C/C++ implementation of BoolSpace is freely available at: http://www.bioinformatics.org.au/tools-data △ Less

Submitted 13 October, 2013; originally announced October 2013.

Comments: 13 pages, 4 figures, Genome Informatics (GIW) Singapore, December 2013. To appear in IEEE/ACM Trans. Computational Biology and Bioinformatics

MSC Class: 68; 92

arXiv:1307.3856 [pdf]

doi 10.1186/1471-2105-14-S16-S8

Identifying conserved protein complexes between species by constructing interolog networks

Authors: Phi Vu Nguyen, Sriganesh Srihari, Hon Wai Leong

Abstract: Protein complexes conserved across species indicate processes that are core to cellular machinery (e.g. cell-cycle or DNA damage-repair complexes conserved across human and yeast). While numerous computational methods have been devised to identify complexes from the protein interaction (PPI) networks of individual species, these are severely limited by noise and errors (false positives) in current… ▽ More Protein complexes conserved across species indicate processes that are core to cellular machinery (e.g. cell-cycle or DNA damage-repair complexes conserved across human and yeast). While numerous computational methods have been devised to identify complexes from the protein interaction (PPI) networks of individual species, these are severely limited by noise and errors (false positives) in currently available datasets. Our analysis using human and yeast PPI networks revealed that these methods missed several important complexes including those conserved between the two species (e.g. the MLH1-MSH2-PMS2-PCNA mismatch-repair complex). Here, we note that much of the functionalities of yeast complexes have been conserved in human complexes not only through sequence conservation of proteins but also of critical functional domains. Therefore, integrating information of domain conservation might throw further light on conservation patterns between yeast and human complexes. △ Less

Submitted 15 July, 2013; originally announced July 2013.

Comments: 42 pages, 10 Tables, 11 Figures. To appear in BMC Bioinformatics InCoB 2013 Supplement

MSC Class: 68

Journal ref: BMC Bioinformatics 14(Suppl 16):S8 2013

arXiv:1304.5565 [pdf]

Computing Pathways to Systems Biology: Key Contributions of Computational Methods in Pathway Identification

Authors: Sriganesh Srihari, Mark A. Ragan

Abstract: Understanding large molecular networks consisting of entities such as genes, proteins or RNAs that interact in complex ways to drive the cellular machinery has been an active focus of systems biology. Computational approaches have played a key role in systems biology by complementing theoretical and experimental approaches. Here we roadmap some key contributions of computational methods developed… ▽ More Understanding large molecular networks consisting of entities such as genes, proteins or RNAs that interact in complex ways to drive the cellular machinery has been an active focus of systems biology. Computational approaches have played a key role in systems biology by complementing theoretical and experimental approaches. Here we roadmap some key contributions of computational methods developed over the last decade in the reconstruction of biological pathways. We position these contributions in a 'systems biology perspective' to reemphasize their roles in unraveling cellular mechanisms and to understand 'systems biology diseases' including cancer. △ Less

Submitted 19 April, 2013; originally announced April 2013.

Comments: 18 pages, 1 figure, survey article

MSC Class: 68

arXiv:1301.0363 [pdf, ps, other]

doi 10.1504/IJBRA.2012.048962

Employing functional interactions for characterization and detection of sparse complexes from yeast PPI networks

Authors: Sriganesh Srihari, Hon Wai Leong

Abstract: Over the last few years, several computational techniques have been devised to recover protein complexes from the protein interaction (PPI) networks of organisms. These techniques model "dense" subnetworks within PPI networks as complexes. However, our comprehensive evaluations revealed that these techniques fail to reconstruct many 'gold standard' complexes that are "sparse" in the networks (only… ▽ More Over the last few years, several computational techniques have been devised to recover protein complexes from the protein interaction (PPI) networks of organisms. These techniques model "dense" subnetworks within PPI networks as complexes. However, our comprehensive evaluations revealed that these techniques fail to reconstruct many 'gold standard' complexes that are "sparse" in the networks (only 71 recovered out of 123 known yeast complexes embedded in a network of 9704 interactions among 1622 proteins). In this work, we propose a novel index called Component-Edge (CE) score to quantitatively measure the notion of "complex derivability" from PPI networks. Using this index, we theoretically categorize complexes as "sparse" or "dense" with respect to a given network. We then devise an algorithm SPARC that selectively employs functional interactions to improve the CE scores of predicted complexes, and thereby elevates many of the "sparse" complexes to "dense". This empowers existing methods to detect these "sparse" complexes. We demonstrate that our approach is effective in reconstructing significantly many complexes missed previously (104 recovered out of the 123 known complexes or ~47% improvement). △ Less

Submitted 2 January, 2013; originally announced January 2013.

Comments: 18 pages, 9 Tables, 1 Figure

MSC Class: 92-08

Journal ref: Int J Bioinform Res Appl. 2012, 8(3-4):286-304

arXiv:1211.5625 [pdf, ps, other]

doi 10.1142/S021972001230002X

A survey of computational methods for protein complex prediction from protein interaction networks

Authors: Sriganesh Srihari, Hon Wai Leong

Abstract: Complexes of physically interacting proteins are one of the fundamental functional units responsible for driving key biological mechanisms within the cell. Their identification is therefore necessary not only to understand complex formation but also the higher level organization of the cell. With the advent of high-throughput techniques in molecular biology, significant amount of physical interact… ▽ More Complexes of physically interacting proteins are one of the fundamental functional units responsible for driving key biological mechanisms within the cell. Their identification is therefore necessary not only to understand complex formation but also the higher level organization of the cell. With the advent of high-throughput techniques in molecular biology, significant amount of physical interaction data has been cataloged from organisms such as yeast, which has in turn fueled computational approaches to systematically mine complexes from the network of physical interactions among proteins (PPI network). In this survey, we review, classify and evaluate some of the key computational methods developed till date for the identification of protein complexes from PPI networks. We present two insightful taxonomies that reflect how these methods have evolved over the years towards improving automated complex prediction. We also discuss some open challenges facing accurate reconstruction of complexes, the crucial ones being presence of high proportion of errors and noise in current high-throughput datasets and some key aspects overlooked by current complex detection methods. We hope this review will not only help to condense the history of computational complex detection for easy reference, but also provide valuable insights to drive further research in this area. △ Less

Submitted 23 November, 2012; originally announced November 2012.

Comments: 27 pages, 5 figures, 4 tables

MSC Class: 92-08

Journal ref: Srihari, S., Leong, HW., J Bioinform Comput Biol 11(2): 1230002, 2013

Showing 1–29 of 29 results for author: Srihari, S