Search | arXiv e-print repository

MolTC: Towards Molecular Relational Modeling In Language Models

Authors: Junfeng Fang, Shuai Zhang, Chang Wu, Zhengyi Yang, Zhiyuan Liu, Sihang Li, Kun Wang, Wenjie Du, Xiang Wang

Abstract: Molecular Relational Learning (MRL), aiming to understand interactions between molecular pairs, plays a pivotal role in advancing biochemical research. Recently, the adoption of large language models (LLMs), known for their vast knowledge repositories and advanced logical inference capabilities, has emerged as a promising way for efficient and effective MRL. Despite their potential, these methods… ▽ More Molecular Relational Learning (MRL), aiming to understand interactions between molecular pairs, plays a pivotal role in advancing biochemical research. Recently, the adoption of large language models (LLMs), known for their vast knowledge repositories and advanced logical inference capabilities, has emerged as a promising way for efficient and effective MRL. Despite their potential, these methods predominantly rely on the textual data, thus not fully harnessing the wealth of structural information inherent in molecular graphs. Moreover, the absence of a unified framework exacerbates the issue of information underutilization, as it hinders the sharing of interaction mechanism learned across diverse datasets. To address these challenges, this work proposes a novel LLM-based multi-modal framework for Molecular inTeraction prediction following Chain-of-Thought (CoT) theory, termed MolTC, which effectively integrate graphical information of two molecules in pair. To train MolTC efficiently, we introduce a Multi-hierarchical CoT concept to refine its training paradigm, and conduct a comprehensive Molecular Interactive Instructions dataset for the development of biochemical LLMs involving MRL. Our experiments, conducted across various datasets involving over 4,000,000 molecular pairs, exhibit the superiority of our method over current GNN and LLM-based baselines. Code is available at https://github.com/MangoKiller/MolTC. △ Less

Submitted 10 June, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

Comments: ACL 2024

arXiv:2401.09517 [pdf]

Dimensional Neuroimaging Endophenotypes: Neurobiological Representations of Disease Heterogeneity Through Machine Learning

Authors: Junhao Wen, Mathilde Antoniades, Zhijian Yang, Gyujoon Hwang, Ioanna Skampardoni, Rongguang Wang, Christos Davatzikos

Abstract: Machine learning has been increasingly used to obtain individualized neuroimaging signatures for disease diagnosis, prognosis, and response to treatment in neuropsychiatric and neurodegenerative disorders. Therefore, it has contributed to a better understanding of disease heterogeneity by identifying disease subtypes that present significant differences in various brain phenotypic measures. In thi… ▽ More Machine learning has been increasingly used to obtain individualized neuroimaging signatures for disease diagnosis, prognosis, and response to treatment in neuropsychiatric and neurodegenerative disorders. Therefore, it has contributed to a better understanding of disease heterogeneity by identifying disease subtypes that present significant differences in various brain phenotypic measures. In this review, we first present a systematic literature overview of studies using machine learning and multimodal MRI to unravel disease heterogeneity in various neuropsychiatric and neurodegenerative disorders, including Alzheimer disease, schizophrenia, major depressive disorder, autism spectrum disorder, multiple sclerosis, as well as their potential in transdiagnostic settings. Subsequently, we summarize relevant machine learning methodologies and discuss an emerging paradigm which we call dimensional neuroimaging endophenotype (DNE). DNE dissects the neurobiological heterogeneity of neuropsychiatric and neurodegenerative disorders into a low dimensional yet informative, quantitative brain phenotypic representation, serving as a robust intermediate phenotype (i.e., endophenotype) largely reflecting underlying genetics and etiology. Finally, we discuss the potential clinical implications of the current findings and envision future research avenues. △ Less

Submitted 17 January, 2024; originally announced January 2024.

arXiv:2311.04837 [pdf, other]

Identifying Semantic Component for Robust Molecular Property Prediction

Authors: Zijian Li, Zunhong Xu, Ruichu Cai, Zhenhui Yang, Yuguang Yan, Zhifeng Hao, Guangyi Chen, Kun Zhang

Abstract: Although graph neural networks have achieved great success in the task of molecular property prediction in recent years, their generalization ability under out-of-distribution (OOD) settings is still under-explored. Different from existing methods that learn discriminative representations for prediction, we propose a generative model with semantic-components identifiability, named SCI. We demonstr… ▽ More Although graph neural networks have achieved great success in the task of molecular property prediction in recent years, their generalization ability under out-of-distribution (OOD) settings is still under-explored. Different from existing methods that learn discriminative representations for prediction, we propose a generative model with semantic-components identifiability, named SCI. We demonstrate that the latent variables in this generative model can be explicitly identified into semantic-relevant (SR) and semantic-irrelevant (SI) components, which contributes to better OOD generalization by involving minimal change properties of causal mechanisms. Specifically, we first formulate the data generation process from the atom level to the molecular level, where the latent space is split into SI substructures, SR substructures, and SR atom variables. Sequentially, to reduce misidentification, we restrict the minimal changes of the SR atom variables and add a semantic latent substructure regularization to mitigate the variance of the SR substructure under augmented domain changes. Under mild assumptions, we prove the block-wise identifiability of the SR substructure and the comment-wise identifiability of SR atom variables. Experimental studies achieve state-of-the-art performance and show general improvement on 21 datasets in 3 mainstream benchmarks. Moreover, the visualization results of the proposed SCI method provide insightful case studies and explanations for the prediction results. The code is available at: https://github.com/DMIRLAB-Group/SCI. △ Less

Submitted 8 November, 2023; originally announced November 2023.

arXiv:2308.09725 [pdf]

doi 10.1145/3583780.3614970

MoCLIM: Towards Accurate Cancer Subty** via Multi-Omics Contrastive Learning with Omics-Inference Modeling

Authors: Ziwei Yang, Zheng Chen, Yasuko Matsubara, Yasushi Sakurai

Abstract: Precision medicine fundamentally aims to establish causality between dysregulated biochemical mechanisms and cancer subtypes. Omics-based cancer subty** has emerged as a revolutionary approach, as different level of omics records the biochemical products of multistep processes in cancers. This paper focuses on fully exploiting the potential of multi-omics data to improve cancer subty** outcome… ▽ More Precision medicine fundamentally aims to establish causality between dysregulated biochemical mechanisms and cancer subtypes. Omics-based cancer subty** has emerged as a revolutionary approach, as different level of omics records the biochemical products of multistep processes in cancers. This paper focuses on fully exploiting the potential of multi-omics data to improve cancer subty** outcomes, and hence developed MoCLIM, a representation learning framework. MoCLIM independently extracts the informative features from distinct omics modalities. Using a unified representation informed by contrastive learning of different omics modalities, we can well-cluster the subtypes, given cancer, into a lower latent space. This contrast can be interpreted as a projection of inter-omics inference observed in biological networks. Experimental results on six cancer datasets demonstrate that our approach significantly improves data fit and subty** performance in fewer high-dimensional cancer instances. Moreover, our framework incorporates various medical evaluations as the final component, providing high interpretability in medical analysis. △ Less

Submitted 24 August, 2023; v1 submitted 17 August, 2023; originally announced August 2023.

Comments: CIKM'23 Long/Full Papers

arXiv:2308.01941 [pdf]

doi 10.34133/icomputing.0055

Digital twin brain: a bridge between biological intelligence and artificial intelligence

Authors: Hui Xiong, Congying Chu, Lingzhong Fan, Ming Song, Jiaqi Zhang, Yawei Ma, Ruonan Zheng, Junyang Zhang, Zhengyi Yang, Tianzi Jiang

Abstract: In recent years, advances in neuroscience and artificial intelligence have paved the way for unprecedented opportunities for understanding the complexity of the brain and its emulation by computational systems. Cutting-edge advancements in neuroscience research have revealed the intricate relationship between brain structure and function, while the success of artificial neural networks highlights… ▽ More In recent years, advances in neuroscience and artificial intelligence have paved the way for unprecedented opportunities for understanding the complexity of the brain and its emulation by computational systems. Cutting-edge advancements in neuroscience research have revealed the intricate relationship between brain structure and function, while the success of artificial neural networks highlights the importance of network architecture. Now is the time to bring them together to better unravel how intelligence emerges from the brain's multiscale repositories. In this review, we propose the Digital Twin Brain (DTB) as a transformative platform that bridges the gap between biological and artificial intelligence. It consists of three core elements: the brain structure that is fundamental to the twinning process, bottom-layer models to generate brain functions, and its wide spectrum of applications. Crucially, brain atlases provide a vital constraint, preserving the brain's network organization within the DTB. Furthermore, we highlight open questions that invite joint efforts from interdisciplinary fields and emphasize the far-reaching implications of the DTB. The DTB can offer unprecedented insights into the emergence of intelligence and neurological disorders, which holds tremendous promise for advancing our understanding of both biological and artificial intelligence, and ultimately propelling the development of artificial general intelligence and facilitating precision mental healthcare. △ Less

Submitted 2 August, 2023; originally announced August 2023.

Journal ref: Intell Comput. 2023;2:0055

arXiv:2307.08848 [pdf]

Microbiome-derived bile acids contribute to elevated antigenic response and bone erosion in rheumatoid arthritis

Authors: Xiuli Su, Xiaona Li, Yanqin Bian, Qing Ren, Leiguang Li, Xiaohao Wu, Hemi Luan, Bing He, Xiaojuan He, Hui Feng, Xingye Cheng, Pan-Jun Kim, Leihan Tang, Ai** Lu, Lianbo Xiao, Liang Tian, Zhu Yang, Zongwei Cai

Abstract: Rheumatoid arthritis (RA) is a chronic, disabling and incurable autoimmune disease. It has been widely recognized that gut microbial dysbiosis is an important contributor to the pathogenesis of RA, although distinct alterations in microbiota have been associated with this disease. Yet, the metabolites that mediate the impacts of the gut microbiome on RA are less well understood. Here, with microbi… ▽ More Rheumatoid arthritis (RA) is a chronic, disabling and incurable autoimmune disease. It has been widely recognized that gut microbial dysbiosis is an important contributor to the pathogenesis of RA, although distinct alterations in microbiota have been associated with this disease. Yet, the metabolites that mediate the impacts of the gut microbiome on RA are less well understood. Here, with microbial profiling and non-targeted metabolomics, we revealed profound yet diverse perturbation of the gut microbiome and metabolome in RA patients in a discovery set. In the Bacteroides-dominated RA patients, differentiation of gut microbiome resulted in distinct bile acid profiles compared to healthy subjects. Predominated Bacteroides species expressing BSH and 7a-HSDH increased, leading to elevated secondary bile acid production in this subgroup of RA patients. Reduced serum fibroblast growth factor-19 and dysregulated bile acids were evidence of impaired farnesoid X receptor-mediated signaling in the patients. This gut microbiota-bile acid axis was correlated to ACPA. The patients from the validation sets demonstrated that ACPA-positive patients have more abundant bacteria expressing BSH and 7a-HSDH but less Clostridium scindens expressing 7a-dehydroxylation enzymes, together with dysregulated microbial bile acid metabolism and more severe bone erosion than ACPA-negative ones. Mediation analyses revealed putative causal relationships between the gut microbiome, bile acids, and ACPA-positive RA, supporting a potential causal effect of Bacteroides species in increasing levels of ACPA and bone erosion mediated via disturbing bile acid metabolism. These results provide insights into the role of gut dysbiosis in RA in a manifestation-specific manner, as well as the functions of bile acids in this gut-joint axis, which may be a potential intervention target for precisely controlling RA conditions. △ Less

Submitted 14 July, 2023; originally announced July 2023.

Comments: 38 pages, 6 figures

arXiv:2307.07443 [pdf, other]

Can Large Language Models Empower Molecular Property Prediction?

Authors: Chen Qian, Huayi Tang, Zhirui Yang, Hong Liang, Yong Liu

Abstract: Molecular property prediction has gained significant attention due to its transformative potential in multiple scientific disciplines. Conventionally, a molecule graph can be represented either as a graph-structured data or a SMILES text. Recently, the rapid development of Large Language Models (LLMs) has revolutionized the field of NLP. Although it is natural to utilize LLMs to assist in understa… ▽ More Molecular property prediction has gained significant attention due to its transformative potential in multiple scientific disciplines. Conventionally, a molecule graph can be represented either as a graph-structured data or a SMILES text. Recently, the rapid development of Large Language Models (LLMs) has revolutionized the field of NLP. Although it is natural to utilize LLMs to assist in understanding molecules represented by SMILES, the exploration of how LLMs will impact molecular property prediction is still in its early stage. In this work, we advance towards this objective through two perspectives: zero/few-shot molecular classification, and using the new explanations generated by LLMs as representations of molecules. To be specific, we first prompt LLMs to do in-context molecular classification and evaluate their performance. After that, we employ LLMs to generate semantically enriched explanations for the original SMILES and then leverage that to fine-tune a small-scale LM model for multiple downstream tasks. The experimental results highlight the superiority of text explanations as molecular representations across multiple benchmark datasets, and confirm the immense potential of LLMs in molecular property prediction tasks. Codes are available at \url{https://github.com/ChnQ/LLM4Mol}. △ Less

Submitted 14 July, 2023; originally announced July 2023.

arXiv:2306.09629 [pdf, other]

Fusing Structural and Functional Connectivities using Disentangled VAE for Detecting MCI

Authors: Qiankun Zuo, Yanfei Zhu, Libin Lu, Zhi Yang, Yuhui Li, Ning Zhang

Abstract: Brain network analysis is a useful approach to studying human brain disorders because it can distinguish patients from healthy people by detecting abnormal connections. Due to the complementary information from multiple modal neuroimages, multimodal fusion technology has a lot of potential for improving prediction performance. However, effective fusion of multimodal medical images to achieve compl… ▽ More Brain network analysis is a useful approach to studying human brain disorders because it can distinguish patients from healthy people by detecting abnormal connections. Due to the complementary information from multiple modal neuroimages, multimodal fusion technology has a lot of potential for improving prediction performance. However, effective fusion of multimodal medical images to achieve complementarity is still a challenging problem. In this paper, a novel hierarchical structural-functional connectivity fusing (HSCF) model is proposed to construct brain structural-functional connectivity matrices and predict abnormal brain connections based on functional magnetic resonance imaging (fMRI) and diffusion tensor imaging (DTI). Specifically, the prior knowledge is incorporated into the separators for disentangling each modality of information by the graph convolutional networks (GCN). And a disentangled cosine distance loss is devised to ensure the disentanglement's effectiveness. Moreover, the hierarchical representation fusion module is designed to effectively maximize the combination of relevant and effective features between modalities, which makes the generated structural-functional connectivity more robust and discriminative in the cognitive disease analysis. Results from a wide range of tests performed on the public Alzheimer's Disease Neuroimaging Initiative (ADNI) database show that the proposed model performs better than competing approaches in terms of classification evaluation. In general, the proposed HSCF model is a promising model for generating brain structural-functional connectivities and identifying abnormal brain connections as cognitive disease progresses. △ Less

Submitted 21 August, 2023; v1 submitted 16 June, 2023; originally announced June 2023.

Comments: 4 figures

arXiv:2306.04886 [pdf, other]

Multi-task Bioassay Pre-training for Protein-ligand Binding Affinity Prediction

Authors: Jiaxian Yan, Zhaofeng Ye, Ziyi Yang, Chengqiang Lu, Shengyu Zhang, Qi Liu, Jiezhong Qiu

Abstract: Protein-ligand binding affinity (PLBA) prediction is the fundamental task in drug discovery. Recently, various deep learning-based models predict binding affinity by incorporating the three-dimensional structure of protein-ligand complexes as input and achieving astounding progress. However, due to the scarcity of high-quality training data, the generalization ability of current models is still li… ▽ More Protein-ligand binding affinity (PLBA) prediction is the fundamental task in drug discovery. Recently, various deep learning-based models predict binding affinity by incorporating the three-dimensional structure of protein-ligand complexes as input and achieving astounding progress. However, due to the scarcity of high-quality training data, the generalization ability of current models is still limited. In addition, different bioassays use varying affinity measurement labels (i.e., IC50, Ki, Kd), and different experimental conditions inevitably introduce systematic noise, which poses a significant challenge to constructing high-precision affinity prediction models. To address these issues, we (1) propose Multi-task Bioassay Pre-training (MBP), a pre-training framework for structure-based PLBA prediction; (2) construct a pre-training dataset called ChEMBL-Dock with more than 300k experimentally measured affinity labels and about 2.8M docked three-dimensional structures. By introducing multi-task pre-training to treat the prediction of different affinity labels as different tasks and classifying relative rankings between samples from the same bioassay, MBP learns robust and transferrable structural knowledge from our new ChEMBL-Dock dataset with varied and noisy labels. Experiments substantiate the capability of MBP as a general framework that can improve and be tailored to mainstream structure-based PLBA prediction tasks. To the best of our knowledge, MBP is the first affinity pre-training model and shows great potential for future development. △ Less

Submitted 20 December, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

Comments: 21 pages, 7 figures

arXiv:2303.10533 [pdf]

A Radiomics-Incorporated Deep Ensemble Learning Model for Multi-Parametric MRI-based Glioma Segmentation

Authors: Yang Chen, Zhenyu Yang, **gtong Zhao, Justus Adamson, Yang Sheng, Fang-Fang Yin, Chunhao Wang

Abstract: We developed a deep ensemble learning model with a radiomics spatial encoding execution for improved glioma segmentation accuracy using multi-parametric MRI (mp-MRI). This model was developed using 369 glioma patients with a 4-modality mp-MRI protocol: T1, contrast-enhanced T1 (T1-Ce), T2, and FLAIR. In each modality volume, a 3D sliding kernel was implemented across the brain to capture image het… ▽ More We developed a deep ensemble learning model with a radiomics spatial encoding execution for improved glioma segmentation accuracy using multi-parametric MRI (mp-MRI). This model was developed using 369 glioma patients with a 4-modality mp-MRI protocol: T1, contrast-enhanced T1 (T1-Ce), T2, and FLAIR. In each modality volume, a 3D sliding kernel was implemented across the brain to capture image heterogeneity: fifty-six radiomic features were extracted within the kernel, resulting in a 4th order tensor. Each radiomic feature can then be encoded as a 3D image volume, namely a radiomic feature map (RFM). PCA was employed for data dimension reduction and the first 4 PCs were selected. Four deep neural networks as sub-models following the U-Net architecture were trained for the segmenting of a region-of-interest (ROI): each sub-model utilizes the mp-MRI and 1 of the 4 PCs as a 5-channel input for a 2D execution. The 4 softmax probability results given by the U-net ensemble were superimposed and binarized by Otsu method as the segmentation result. Three ensemble models were trained to segment enhancing tumor (ET), tumor core (TC), and whole tumor (WT). The adopted radiomics spatial encoding execution enriches the image heterogeneity information that leads to the successful demonstration of the proposed deep ensemble model, which offers a new tool for mp-MRI based medical image segmentation. △ Less

Submitted 18 March, 2023; originally announced March 2023.

arXiv:2301.10772 [pdf]

Gene-SGAN: a method for discovering disease subtypes with imaging and genetic signatures via multi-view weakly-supervised deep clustering

Authors: Zhijian Yang, Junhao Wen, Ahmed Abdulkadir, Yuhan Cui, Guray Erus, Elizabeth Mamourian, Randa Melhem, Dhivya Srinivasan, Sindhuja T. Govindarajan, Jiong Chen, Mohamad Habes, Colin L. Masters, Paul Maruff, Jurgen Fripp, Luigi Ferrucci, Marilyn S. Albert, Sterling C. Johnson, John C. Morris, Pamela LaMontagne, Daniel S. Marcus, Tammie L. S. Benzinger, David A. Wolk, Li Shen, **gxuan Bao, Susan M. Resnick , et al. (3 additional authors not shown)

Abstract: Disease heterogeneity has been a critical challenge for precision diagnosis and treatment, especially in neurologic and neuropsychiatric diseases. Many diseases can display multiple distinct brain phenotypes across individuals, potentially reflecting disease subtypes that can be captured using MRI and machine learning methods. However, biological interpretability and treatment relevance are limite… ▽ More Disease heterogeneity has been a critical challenge for precision diagnosis and treatment, especially in neurologic and neuropsychiatric diseases. Many diseases can display multiple distinct brain phenotypes across individuals, potentially reflecting disease subtypes that can be captured using MRI and machine learning methods. However, biological interpretability and treatment relevance are limited if the derived subtypes are not associated with genetic drivers or susceptibility factors. Herein, we describe Gene-SGAN - a multi-view, weakly-supervised deep clustering method - which dissects disease heterogeneity by jointly considering phenotypic and genetic data, thereby conferring genetic correlations to the disease subtypes and associated endophenotypic signatures. We first validate the generalizability, interpretability, and robustness of Gene-SGAN in semi-synthetic experiments. We then demonstrate its application to real multi-site datasets from 28,858 individuals, deriving subtypes of Alzheimer's disease and brain endophenotypes associated with hypertension, from MRI and SNP data. Derived brain phenotypes displayed significant differences in neuroanatomical patterns, genetic determinants, biological and clinical biomarkers, indicating potentially distinct underlying neuropathologic processes, genetic drivers, and susceptibility factors. Overall, Gene-SGAN is broadly applicable to disease subty** and endophenotype discovery, and is herein tested on disease-related, genetically-driven neuroimaging phenotypes. △ Less

Submitted 25 January, 2023; originally announced January 2023.

arXiv:2301.03424 [pdf, other]

An open unified deep graph learning framework for discovering drug leads

Authors: Yueming Yin, Haifeng Hu, Zhen Yang, Jitao Yang, Chun Ye, Jiansheng Wu, Wilson Wen Bin Goh

Abstract: Computational discovery of ideal lead compounds is a critical process for modern drug discovery. It comprises multiple stages: hit screening, molecular property prediction, and molecule optimization. Current efforts are disparate, involving the establishment of models for each stage, followed by multi-stage multi-model integration. However, this is non-ideal, as clumsy integration of incompatible… ▽ More Computational discovery of ideal lead compounds is a critical process for modern drug discovery. It comprises multiple stages: hit screening, molecular property prediction, and molecule optimization. Current efforts are disparate, involving the establishment of models for each stage, followed by multi-stage multi-model integration. However, this is non-ideal, as clumsy integration of incompatible models increases research overheads, and may even reduce success rates in drug discovery. Facilitating compatibilities requires establishing inherent model consistencies across lead discovery stages. Towards that effect, we propose an open deep graph learning (DGL) based pipeline: generative adversarial feature subspace enhancement (GAFSE), which first unifies the modeling of these stages into one learning framework. GAFSE also offers standardized modular design and streamlined interfaces for future expansions and community support. GAFSE combines adversarial/generative learning, graph attention network, graph reconstruction network, and optimizes the classification/regression loss, adversarial/generative loss, and reconstruction loss simultaneously. Convergence analysis theoretically guarantees model generalization performance. Exhaustive benchmarking demonstrates that the GAFSE pipeline achieves excellent performance across almost all lead discovery stages, while also providing valuable model interpretability. Hence, we believe this tool will enhance the efficiency and productivity of drug discovery researchers. △ Less

Submitted 20 January, 2023; v1 submitted 5 December, 2022; originally announced January 2023.

arXiv:2211.10419 [pdf, other]

A Neural Active Inference Model of Perceptual-Motor Learning

Authors: Zhizhuo Yang, Gabriel J. Diaz, Brett R. Fajen, Reynold Bailey, Alexander Ororbia

Abstract: The active inference framework (AIF) is a promising new computational framework grounded in contemporary neuroscience that can produce human-like behavior through reward-based learning. In this study, we test the ability for the AIF to capture the role of anticipation in the visual guidance of action in humans through the systematic investigation of a visual-motor task that has been well-explored… ▽ More The active inference framework (AIF) is a promising new computational framework grounded in contemporary neuroscience that can produce human-like behavior through reward-based learning. In this study, we test the ability for the AIF to capture the role of anticipation in the visual guidance of action in humans through the systematic investigation of a visual-motor task that has been well-explored -- that of intercepting a target moving over a ground plane. Previous research demonstrated that humans performing this task resorted to anticipatory changes in speed intended to compensate for semi-predictable changes in target speed later in the approach. To capture this behavior, our proposed "neural" AIF agent uses artificial neural networks to select actions on the basis of a very short term prediction of the information about the task environment that these actions would reveal along with a long-term estimate of the resulting cumulative expected free energy. Systematic variation revealed that anticipatory behavior emerged only when required by limitations on the agent's movement capabilities, and only when the agent was able to estimate accumulated free energy over sufficiently long durations into the future. In addition, we present a novel formulation of the prior function that maps a multi-dimensional world-state to a uni-dimensional distribution of free-energy. Together, these results demonstrate the use of AIF as a plausible model of anticipatory visually guided behavior in humans. △ Less

Submitted 16 November, 2022; originally announced November 2022.

Comments: 16 pages including references, 6 figures. Submitted to Frontiers in Computational Neuroscience

arXiv:2210.13225 [pdf, other]

Biologically Plausible Variational Policy Gradient with Spiking Recurrent Winner-Take-All Networks

Authors: Zhile Yang, Shangqi Guo, Ying Fang, Jian K. Liu

Abstract: One stream of reinforcement learning research is exploring biologically plausible models and algorithms to simulate biological intelligence and fit neuromorphic hardware. Among them, reward-modulated spike-timing-dependent plasticity (R-STDP) is a recent branch with good potential in energy efficiency. However, current R-STDP methods rely on heuristic designs of local learning rules, thus requirin… ▽ More One stream of reinforcement learning research is exploring biologically plausible models and algorithms to simulate biological intelligence and fit neuromorphic hardware. Among them, reward-modulated spike-timing-dependent plasticity (R-STDP) is a recent branch with good potential in energy efficiency. However, current R-STDP methods rely on heuristic designs of local learning rules, thus requiring task-specific expert knowledge. In this paper, we consider a spiking recurrent winner-take-all network, and propose a new R-STDP method, spiking variational policy gradient (SVPG), whose local learning rules are derived from the global policy gradient and thus eliminate the need for heuristic designs. In experiments of MNIST classification and Gym InvertedPendulum, our SVPG achieves good training performance, and also presents better robustness to various kinds of noises than conventional methods. △ Less

Submitted 21 October, 2022; originally announced October 2022.

Comments: Accepted to BMVC 2022

arXiv:2210.06512 [pdf]

Quantifying U-Net Uncertainty in Multi-Parametric MRI-based Glioma Segmentation by Spherical Image Projection

Authors: Zhenyu Yang, Kyle Lafata, Eugene Vaios, Zongsheng Hu, Trey Mullikin, Fang-Fang Yin, Chunhao Wang

Abstract: The projection of planar MRI data onto a spherical surface is equivalent to a nonlinear image transformation that retains global anatomical information. By incorporating this image transformation process in our proposed spherical projection-based U-Net (SPU-Net) segmentation model design, multiple independent segmentation predictions can be obtained from a single MRI. The final segmentation is the… ▽ More The projection of planar MRI data onto a spherical surface is equivalent to a nonlinear image transformation that retains global anatomical information. By incorporating this image transformation process in our proposed spherical projection-based U-Net (SPU-Net) segmentation model design, multiple independent segmentation predictions can be obtained from a single MRI. The final segmentation is the average of all available results, and the variation can be visualized as a pixel-wise uncertainty map. An uncertainty score was introduced to evaluate and compare the performance of uncertainty measurements. The proposed SPU-Net model was implemented on the basis of 369 glioma patients with MP-MRI scans (T1, T1-Ce, T2, and FLAIR). Three SPU-Net models were trained to segment enhancing tumor (ET), tumor core (TC), and whole tumor (WT), respectively. The SPU-Net model was compared with (1) the classic U-Net model with test-time augmentation (TTA) and (2) linear scaling-based U-Net (LSU-Net) segmentation models in terms of both segmentation accuracy (Dice coefficient, sensitivity, specificity, and accuracy) and segmentation uncertainty (uncertainty map and uncertainty score). The developed SPU-Net model successfully achieved low uncertainty for correct segmentation predictions (e.g., tumor interior or healthy tissue interior) and high uncertainty for incorrect results (e.g., tumor boundaries). This model could allow the identification of missed tumor targets or segmentation errors in U-Net. Quantitatively, the SPU-Net model achieved the highest uncertainty scores for three segmentation targets (ET/TC/WT): 0.826/0.848/0.936, compared to 0.784/0.643/0.872 using the U-Net with TTA and 0.743/0.702/0.876 with the LSU-Net (scaling factor = 2). The SPU-Net also achieved statistically significantly higher Dice coefficients, underscoring the improved segmentation accuracy. △ Less

Submitted 12 August, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

Comments: 31 pages, 9 figures, 1 table

arXiv:2209.13492 [pdf, other]

Unraveling Key Elements Underlying Molecular Property Prediction: A Systematic Study

Authors: Jianyuan Deng, Zhibo Yang, Hehe Wang, Iwao Ojima, Dimitris Samaras, Fusheng Wang

Abstract: Artificial intelligence (AI) has been widely applied in drug discovery with a major task as molecular property prediction. Despite booming techniques in molecular representation learning, key elements underlying molecular property prediction remain largely unexplored, which impedes further advancements in this field. Herein, we conduct an extensive evaluation of representative models using various… ▽ More Artificial intelligence (AI) has been widely applied in drug discovery with a major task as molecular property prediction. Despite booming techniques in molecular representation learning, key elements underlying molecular property prediction remain largely unexplored, which impedes further advancements in this field. Herein, we conduct an extensive evaluation of representative models using various representations on the MoleculeNet datasets, a suite of opioids-related datasets and two additional activity datasets from the literature. To investigate the predictive power in low-data and high-data space, a series of descriptors datasets of varying sizes are also assembled to evaluate the models. In total, we have trained 62,820 models, including 50,220 models on fixed representations, 4,200 models on SMILES sequences and 8,400 models on molecular graphs. Based on extensive experimentation and rigorous comparison, we show that representation learning models exhibit limited performance in molecular property prediction in most datasets. Besides, multiple key elements underlying molecular property prediction can affect the evaluation results. Furthermore, we show that activity cliffs can significantly impact model prediction. Finally, we explore into potential causes why representation learning models can fail and show that dataset size is essential for representation learning models to excel. △ Less

Submitted 2 September, 2023; v1 submitted 26 September, 2022; originally announced September 2022.

arXiv:2206.10801 [pdf, other]

Automated Cancer Subty** via Vector Quantization Mutual Information Maximization

Authors: Zheng Chen, Lingwei Zhu, Ziwei Yang, Takashi Matsubara

Abstract: Cancer subty** is crucial for understanding the nature of tumors and providing suitable therapy. However, existing labelling methods are medically controversial, and have driven the process of subty** away from teaching signals. Moreover, cancer genetic expression profiles are high-dimensional, scarce, and have complicated dependence, thereby posing a serious challenge to existing subty** mo… ▽ More Cancer subty** is crucial for understanding the nature of tumors and providing suitable therapy. However, existing labelling methods are medically controversial, and have driven the process of subty** away from teaching signals. Moreover, cancer genetic expression profiles are high-dimensional, scarce, and have complicated dependence, thereby posing a serious challenge to existing subty** models for outputting sensible clustering. In this study, we propose a novel clustering method for exploiting genetic expression profiles and distinguishing subtypes in an unsupervised manner. The proposed method adaptively learns categorical correspondence from latent representations of expression profiles to the subtypes output by the model. By maximizing the problem -- agnostic mutual information between input expression profiles and output subtypes, our method can automatically decide a suitable number of subtypes. Through experiments, we demonstrate that our proposed method can refine existing controversial labels, and, by further medical analysis, this refinement is proven to have a high correlation with cancer survival rates. △ Less

Submitted 14 November, 2022; v1 submitted 21 June, 2022; originally announced June 2022.

Comments: accepted by ECML-PKDD 2022

arXiv:2205.09548 [pdf, other]

ODBO: Bayesian Optimization with Search Space Prescreening for Directed Protein Evolution

Authors: Lixue Cheng, Ziyi Yang, Changyu Hsieh, Benben Liao, Shengyu Zhang

Abstract: Directed evolution is a versatile technique in protein engineering that mimics the process of natural selection by iteratively alternating between mutagenesis and screening in order to search for sequences that optimize a given property of interest, such as catalytic activity and binding affinity to a specified target. However, the space of possible proteins is too large to search exhaustively in… ▽ More Directed evolution is a versatile technique in protein engineering that mimics the process of natural selection by iteratively alternating between mutagenesis and screening in order to search for sequences that optimize a given property of interest, such as catalytic activity and binding affinity to a specified target. However, the space of possible proteins is too large to search exhaustively in the laboratory, and functional proteins are scarce in the vast sequence space. Machine learning (ML) approaches can accelerate directed evolution by learning to map protein sequences to functions without building a detailed model of the underlying physics, chemistry and biological pathways. Despite the great potentials held by these ML methods, they encounter severe challenges in identifying the most suitable sequences for a targeted function. These failures can be attributed to the common practice of adopting a high-dimensional feature representation for protein sequences and inefficient search methods. To address these issues, we propose an efficient, experimental design-oriented closed-loop optimization framework for protein directed evolution, termed ODBO, which employs a combination of novel low-dimensional protein encoding strategy and Bayesian optimization enhanced with search space prescreening via outlier detection. We further design an initial sample selection strategy to minimize the number of experimental samples for training ML models. We conduct and report four protein directed evolution experiments that substantiate the capability of the proposed framework for finding of the variants with properties of interest. We expect the ODBO framework to greatly reduce the experimental cost and time cost of directed evolution, and can be further generalized as a powerful tool for adaptive experimental design in a broader context. △ Less

Submitted 1 May, 2024; v1 submitted 19 May, 2022; originally announced May 2022.

Comments: 27 pages, 13 figures

arXiv:2204.09840 [pdf, other]

Multi-Tier Platform for Cognizing Massive Electroencephalogram

Authors: Zheng Chen, Lingwei Zhu, Ziwei Yang, Renyuan Zhang

Abstract: An end-to-end platform assembling multiple tiers is built for precisely cognizing brain activities. Being fed massive electroencephalogram (EEG) data, the time-frequency spectrograms are conventionally projected into the episode-wise feature matrices (seen as tier-1). A spiking neural network (SNN) based tier is designed to distill the principle information in terms of spike-streams from the rare… ▽ More An end-to-end platform assembling multiple tiers is built for precisely cognizing brain activities. Being fed massive electroencephalogram (EEG) data, the time-frequency spectrograms are conventionally projected into the episode-wise feature matrices (seen as tier-1). A spiking neural network (SNN) based tier is designed to distill the principle information in terms of spike-streams from the rare features, which maintains the temporal implication in the nature of EEGs. The proposed tier-3 transposes time- and space-domain of spike patterns from the SNN; and feeds the transposed pattern-matrices into an artificial neural network (ANN, Transformer specifically) known as tier-4, where a special spanning topology is proposed to match the two-dimensional input form. In this manner, cognition such as classification is conducted with high accuracy. For proof-of-concept, the sleep stage scoring problem is demonstrated by introducing multiple EEG datasets with the largest comprising 42,560 hours recorded from 5,793 subjects. From experiment results, our platform achieves the general cognition overall accuracy of 87% by leveraging sole EEG, which is 2% superior to the state-of-the-art. Moreover, our developed multi-tier methodology offers visible and graphical interpretations of the temporal characteristics of EEG by identifying the critical episodes, which is demanded in neurodynamics but hardly appears in conventional cognition scenarios. △ Less

Submitted 20 April, 2022; originally announced April 2022.

Comments: 7 pages, accepted by IJCAI 2022

arXiv:2204.02278 [pdf, other]

Cancer Subty** via Embedded Unsupervised Learning on Transcriptomics Data

Authors: Ziwei Yang, Lingwei Zhu, Zheng Chen, Ming Huang, Naoaki Ono, MD Altaf-Ul-Amin, Shigehiko Kanaya

Abstract: Cancer is one of the deadliest diseases worldwide. Accurate diagnosis and classification of cancer subtypes are indispensable for effective clinical treatment. Promising results on automatic cancer subty** systems have been published recently with the emergence of various deep learning methods. However, such automatic systems often overfit the data due to the high dimensionality and scarcity. In… ▽ More Cancer is one of the deadliest diseases worldwide. Accurate diagnosis and classification of cancer subtypes are indispensable for effective clinical treatment. Promising results on automatic cancer subty** systems have been published recently with the emergence of various deep learning methods. However, such automatic systems often overfit the data due to the high dimensionality and scarcity. In this paper, we propose to investigate automatic subty** from an unsupervised learning perspective by directly constructing the underlying data distribution itself, hence sufficient data can be generated to alleviate the issue of overfitting. Specifically, we bypass the strong Gaussianity assumption that typically exists but fails in the unsupervised learning subty** literature due to small-sized samples by vector quantization. Our proposed method better captures the latent space features and models the cancer subtype manifestation on a molecular basis, as demonstrated by the extensive experimental results. △ Less

Submitted 2 April, 2022; originally announced April 2022.

Comments: 4 pages, accepted for EMBC 2022

arXiv:2203.08648 [pdf, other]

Artificial Intelligence Enables Real-Time and Intuitive Control of Prostheses via Nerve Interface

Authors: Diu Khue Luu, Anh Tuan Nguyen, Ming Jiang, Markus W. Drealan, Jian Xu, Tong Wu, Wing-kin Tam, Wenfeng Zhao, Brian Z. H. Lim, Cynthia K. Overstreet, Qi Zhao, Jonathan Cheng, Edward W. Keefer, Zhi Yang

Abstract: Objective: The next generation prosthetic hand that moves and feels like a real hand requires a robust neural interconnection between the human minds and machines. Methods: Here we present a neuroprosthetic system to demonstrate that principle by employing an artificial intelligence (AI) agent to translate the amputee's movement intent through a peripheral nerve interface. The AI agent is designed… ▽ More Objective: The next generation prosthetic hand that moves and feels like a real hand requires a robust neural interconnection between the human minds and machines. Methods: Here we present a neuroprosthetic system to demonstrate that principle by employing an artificial intelligence (AI) agent to translate the amputee's movement intent through a peripheral nerve interface. The AI agent is designed based on the recurrent neural network (RNN) and could simultaneously decode six degree-of-freedom (DOF) from multichannel nerve data in real-time. The decoder's performance is characterized in motor decoding experiments with three human amputees. Results: First, we show the AI agent enables amputees to intuitively control a prosthetic hand with individual finger and wrist movements up to 97-98% accuracy. Second, we demonstrate the AI agent's real-time performance by measuring the reaction time and information throughput in a hand gesture matching task. Third, we investigate the AI agent's long-term uses and show the decoder's robust predictive performance over a 16-month implant duration. Conclusion & significance: Our study demonstrates the potential of AI-enabled nerve technology, underling the next generation of dexterous and intuitive prosthetic hands. △ Less

Submitted 16 March, 2022; originally announced March 2022.

arXiv:2203.00628 [pdf]

A Neural Ordinary Differential Equation Model for Visualizing Deep Neural Network Behaviors in Multi-Parametric MRI based Glioma Segmentation

Authors: Zhenyu Yang, Zongsheng Hu, Hangjie Ji, Kyle Lafata, Scott Floyd, Fang-Fang Yin, Chunhao Wang

Abstract: Purpose: To develop a neural ordinary differential equation (ODE) model for visualizing deep neural network (DNN) behavior during multi-parametric MRI (mp-MRI) based glioma segmentation as a method to enhance deep learning explainability. Methods: By hypothesizing that deep feature extraction can be modeled as a spatiotemporally continuous process, we designed a novel deep learning model, neural O… ▽ More Purpose: To develop a neural ordinary differential equation (ODE) model for visualizing deep neural network (DNN) behavior during multi-parametric MRI (mp-MRI) based glioma segmentation as a method to enhance deep learning explainability. Methods: By hypothesizing that deep feature extraction can be modeled as a spatiotemporally continuous process, we designed a novel deep learning model, neural ODE, in which deep feature extraction was governed by an ODE without explicit expression. The dynamics of 1) MR images after interactions with DNN and 2) segmentation formation can be visualized after solving ODE. An accumulative contribution curve (ACC) was designed to quantitatively evaluate the utilization of each MRI by DNN towards the final segmentation results. The proposed neural ODE model was demonstrated using 369 glioma patients with a 4-modality mp-MRI protocol: T1, contrast-enhanced T1 (T1-Ce), T2, and FLAIR. Three neural ODE models were trained to segment enhancing tumor (ET), tumor core (TC), and whole tumor (WT). The key MR modalities with significant utilization by DNN were identified based on ACC analysis. Segmentation results by DNN using only the key MR modalities were compared to the ones using all 4 MR modalities. Results: All neural ODE models successfully illustrated image dynamics as expected. ACC analysis identified T1-Ce as the only key modality in ET and TC segmentations, while both FLAIR and T2 were key modalities in WT segmentation. Compared to the U-Net results using all 4 MR modalities, Dice coefficient of ET (0.784->0.775), TC (0.760->0.758), and WT (0.841->0.837) using the key modalities only had minimal differences without significance. Conclusion: The neural ODE model offers a new tool for optimizing the deep learning model inputs with enhanced explainability. The presented methodology can be generalized to other medical image-related deep learning applications. △ Less

Submitted 23 March, 2022; v1 submitted 1 March, 2022; originally announced March 2022.

Comments: 30 pages, 7 figures, 2 tables

arXiv:2111.08008 [pdf, other]

doi 10.1093/bib/bbac050

SPLDExtraTrees: Robust machine learning approach for predicting kinase inhibitor resistance

Authors: Ziyi Yang, Zhaofeng Ye, Yijia Xiao, Changyu Hsieh, Shengyu Zhang

Abstract: Drug resistance is a major threat to the global health and a significant concern throughout the clinical treatment of diseases and drug development. The mutation in proteins that is related to drug binding is a common cause for adaptive drug resistance. Therefore, quantitative estimations of how mutations would affect the interaction between a drug and the target protein would be of vital signific… ▽ More Drug resistance is a major threat to the global health and a significant concern throughout the clinical treatment of diseases and drug development. The mutation in proteins that is related to drug binding is a common cause for adaptive drug resistance. Therefore, quantitative estimations of how mutations would affect the interaction between a drug and the target protein would be of vital significance for the drug development and the clinical practice. Computational methods that rely on molecular dynamics simulations, Rosetta protocols, as well as machine learning methods have been proven to be capable of predicting ligand affinity changes upon protein mutation. However, the severely limited sample size and heavy noise induced overfitting and generalization issues have impeded wide adoption of machine learning for studying drug resistance. In this paper, we propose a robust machine learning method, termed SPLDExtraTrees, which can accurately predict ligand binding affinity changes upon protein mutation and identify resistance-causing mutations. Especially, the proposed method ranks training data following a specific scheme that starts with easy-to-learn samples and gradually incorporates harder and diverse samples into the training, and then iterates between sample weight recalculations and model updates. In addition, we calculate additional physics-based structural features to provide the machine learning model with the valuable domain knowledge on proteins for this data-limited predictive tasks. The experiments substantiate the capability of the proposed method for predicting kinase inhibitor resistance under three scenarios, and achieves predictive accuracy comparable to that of molecular dynamics and Rosetta methods with much less computational costs. △ Less

Submitted 14 January, 2022; v1 submitted 15 November, 2021; originally announced November 2021.

Comments: 14 pages, 5 figures

MSC Class: machine learning

arXiv:2110.11347 [pdf]

Multidimensional representations in late-life depression: convergence in neuroimaging, cognition, clinical symptomatology and genetics

Authors: Junhao Wen, Cynthia H. Y. Fu, Duygu Tosun, Yogasudha Veturi, Zhijian Yang, Ahmed Abdulkadir, Elizabeth Mamourian, Dhivya Srinivasan, **gxuan Bao, Guray Erus, Haochang Shou, Mohamad Habes, Jimit Doshi, Erdem Varol, Scott R Mackin, Aristeidis Sotiras, Yong Fan, Andrew J. Saykin, Yvette I. Sheline, Li Shen, Marylyn D. Ritchie, David A. Wolk, Marilyn Albert, Susan M. Resnick, Christos Davatzikos

Abstract: Late-life depression (LLD) is characterized by considerable heterogeneity in clinical manifestation. Unraveling such heterogeneity would aid in elucidating etiological mechanisms and pave the road to precision and individualized medicine. We sought to delineate, cross-sectionally and longitudinally, disease-related heterogeneity in LLD linked to neuroanatomy, cognitive functioning, clinical sympto… ▽ More Late-life depression (LLD) is characterized by considerable heterogeneity in clinical manifestation. Unraveling such heterogeneity would aid in elucidating etiological mechanisms and pave the road to precision and individualized medicine. We sought to delineate, cross-sectionally and longitudinally, disease-related heterogeneity in LLD linked to neuroanatomy, cognitive functioning, clinical symptomatology, and genetic profiles. Multimodal data from a multicentre sample (N=996) were analyzed. A semi-supervised clustering method (HYDRA) was applied to regional grey matter (GM) brain volumes to derive dimensional representations. Two dimensions were identified, which accounted for the LLD-related heterogeneity in voxel-wise GM maps, white matter (WM) fractional anisotropy (FA), neurocognitive functioning, clinical phenotype, and genetics. Dimension one (Dim1) demonstrated relatively preserved brain anatomy without WM disruptions relative to healthy controls. In contrast, dimension two (Dim2) showed widespread brain atrophy and WM integrity disruptions, along with cognitive impairment and higher depression severity. Moreover, one de novo independent genetic variant (rs13120336) was significantly associated with Dim 1 but not with Dim 2. Notably, the two dimensions demonstrated significant SNP-based heritability of 18-27% within the general population (N=12,518 in UKBB). Lastly, in a subset of individuals having longitudinal measurements, Dim2 demonstrated a more rapid longitudinal decrease in GM and brain age, and was more likely to progress to Alzheimers disease, compared to Dim1 (N=1,413 participants and 7,225 scans from ADNI, BLSA, and BIOCARD datasets). △ Less

Submitted 25 October, 2021; v1 submitted 20 October, 2021; originally announced October 2021.

arXiv:2102.12582 [pdf]

Disentangling brain heterogeneity via semi-supervised deep-learning and MRI: dimensional representations of Alzheimer's Disease

Authors: Zhijian Yang, Ilya M. Nasrallah, Haochang Shou, Junhao Wen, Jimit Doshi, Mohamad Habes, Guray Erus, Ahmed Abdulkadir, Susan M. Resnick, David Wolk, Christos Davatzikos

Abstract: Heterogeneity of brain diseases is a challenge for precision diagnosis/prognosis. We describe and validate Smile-GAN (SeMI-supervised cLustEring-Generative Adversarial Network), a novel semi-supervised deep-clustering method, which dissects neuroanatomical heterogeneity, enabling identification of disease subtypes via their imaging signatures relative to controls. When applied to MRIs (2 studies;… ▽ More Heterogeneity of brain diseases is a challenge for precision diagnosis/prognosis. We describe and validate Smile-GAN (SeMI-supervised cLustEring-Generative Adversarial Network), a novel semi-supervised deep-clustering method, which dissects neuroanatomical heterogeneity, enabling identification of disease subtypes via their imaging signatures relative to controls. When applied to MRIs (2 studies; 2,832 participants; 8,146 scans) including cognitively normal individuals and those with cognitive impairment and dementia, Smile-GAN identified 4 neurodegenerative patterns/axes: P1, normal anatomy and highest cognitive performance; P2, mild/diffuse atrophy and more prominent executive dysfunction; P3, focal medial temporal atrophy and relatively greater memory impairment; P4, advanced neurodegeneration. Further application to longitudinal data revealed two distinct progression pathways: P1$\rightarrow$P2$\rightarrow$P4 and P1$\rightarrow$P3$\rightarrow$P4. Baseline expression of these patterns predicted the pathway and rate of future neurodegeneration. Pattern expression offered better yet complementary performance in predicting clinical progression, compared to amyloid/tau. These deep-learning derived biomarkers offer promise for precision diagnostics and targeted clinical trial recruitment. △ Less

Submitted 24 February, 2021; originally announced February 2021.

Comments: 37 pages, 11 figures

arXiv:2012.15418 [pdf]

EPIHC: Improving Enhancer-Promoter Interaction Prediction by using Hybrid features and Communicative learning

Authors: Shuai Liu, Xinran Xu, Zhihao Yang, Xiaohan Zhao, Wen Zhang

Abstract: Enhancer-promoter interactions (EPIs) regulate the expression of specific genes in cells, and EPIs are important for understanding gene regulation, cell differentiation and disease mechanisms. EPI identification through the wet experiments is costly and time-consuming, and computational methods are in demand. In this paper, we propose a deep neural network-based method EPIHC based on sequence-deri… ▽ More Enhancer-promoter interactions (EPIs) regulate the expression of specific genes in cells, and EPIs are important for understanding gene regulation, cell differentiation and disease mechanisms. EPI identification through the wet experiments is costly and time-consuming, and computational methods are in demand. In this paper, we propose a deep neural network-based method EPIHC based on sequence-derived features and genomic features for the EPI prediction. EPIHC extracts features from enhancer and promoter sequences respectively using convolutional neural networks (CNN), and then design a communicative learning module to captures the communicative information between enhancer and promoter sequences. EPIHC also take the genomic features of enhancers and promoters into account. At last, EPIHC combines sequence-derived features and genomic features to predict EPIs. The computational experiments show that EPIHC outperforms the existing state-of-the-art EPI prediction methods on the benchmark datasets and chromosome-split datasets, and the study reveal that the communicative learning module can bring explicit information about EPIs, which is ignore by CNN. Moreover, we consider two strategies to improve performances of EPIHC in the cross-cell line prediction, and experimental results show that EPIHC constructed on training cell lines exhibit improved performances for the other cell lines. △ Less

Submitted 30 December, 2020; originally announced December 2020.

Comments: 7 pages, 9 figures, 2 tables

arXiv:2007.00975 [pdf]

Molcontroller: a VMD Graphical User Interface for Manipulating Molecules

Authors: ChenChen Wu, Shengtang Liu, Shitong Zhang, Zaixing Yang

Abstract: Visual Molecular Dynamics (VMD) is one of the most widely used molecular graphics software in the community of theoretical simulations. So far, however, it still lacks a graphical user interface (GUI) for molecular manipulations when doing some modeling tasks. For instance, translation or rotation of a selected molecule(s) or part(s) of a molecule, which are currently only can be achieved using tc… ▽ More Visual Molecular Dynamics (VMD) is one of the most widely used molecular graphics software in the community of theoretical simulations. So far, however, it still lacks a graphical user interface (GUI) for molecular manipulations when doing some modeling tasks. For instance, translation or rotation of a selected molecule(s) or part(s) of a molecule, which are currently only can be achieved using tcl scripts. Here, we use tcl script develop a user-friendly GUI for VMD, named Molcontroller, which is featured by allowing users to quickly and conveniently perform various molecular manipulations. This GUI might be helpful for improving the modeling efficiency of VMD users. △ Less

Submitted 2 July, 2020; originally announced July 2020.

Comments: 7 pages, 3 figures

arXiv:2006.15255 [pdf, other]

Smile-GANs: Semi-supervised clustering via GANs for dissecting brain disease heterogeneity from medical images

Authors: Zhijian Yang, Junhao Wen, Christos Davatzikos

Abstract: Machine learning methods applied to complex biomedical data has enabled the construction of disease signatures of diagnostic/prognostic value. However, less attention has been given to understanding disease heterogeneity. Semi-supervised clustering methods can address this problem by estimating multiple transformations from a (e.g. healthy) control (CN) group to a patient (PT) group, seeking to ca… ▽ More Machine learning methods applied to complex biomedical data has enabled the construction of disease signatures of diagnostic/prognostic value. However, less attention has been given to understanding disease heterogeneity. Semi-supervised clustering methods can address this problem by estimating multiple transformations from a (e.g. healthy) control (CN) group to a patient (PT) group, seeking to capture the heterogeneity of underlying pathlogic processes. Herein, we propose a novel method, Smile-GANs (SeMi-supervIsed cLustEring via GANs), for semi-supervised clustering, and apply it to brain MRI scans. Smile-GANs first learns multiple distinct map**s by generating PT from CN, with each map** characterizing one relatively distinct pathological pattern. Moreover, a clustering model is trained interactively with map** functions to assign PT into corresponding subtype memberships. Using relaxed assumptions on PT/CN data distribution and imposing map** non-linearity, Smile-GANs captures heterogeneous differences in distribution between the CN and PT domains. We first validate Smile-GANs using simulated data, subsequently on real data, by demonstrating its potential in characterizing heterogeneity in Alzheimer's Disease (AD) and its prodromal phases. The model was first trained using baseline MRIs from the ADNI2 database and then applied to longitudinal data from ADNI1 and BLSA. Four robust subtypes with distinct neuroanatomical patterns were discovered: 1) normal brain, 2) diffuse atrophy atypical of AD, 3) focal medial temporal lobe atrophy, 4) typical-AD. Further longitudinal analyses discover two distinct progressive pathways from prodromal to full AD: i) subtypes 1 - 2 - 4, and ii) subtypes 1 - 3 - 4. Although demonstrated on an important biomedical problem, Smile-GANs is general and can find application in many biomedical and other domains. △ Less

Submitted 26 June, 2020; originally announced June 2020.

arXiv:2005.10951 [pdf, other]

A machine learning approach to using Quality-of-Life patient scores in guiding prostate radiation therapy dosing

Authors: Zhijian Yang, Daniel Olszewski, Chujun He, Giulia Pintea, Jun Lian, Tom Chou, Ronald Chen, Blerta Shtylla

Abstract: Thanks to advancements in diagnosis and treatment, prostate cancer patients have high long-term survival rates. Currently, an important goal is to preserve quality-of-life during and after treatment. The relationship between the radiation a patient receives and the subsequent side effects he experiences is complex and difficult to model or predict. Here, we use machine learning algorithms and stat… ▽ More Thanks to advancements in diagnosis and treatment, prostate cancer patients have high long-term survival rates. Currently, an important goal is to preserve quality-of-life during and after treatment. The relationship between the radiation a patient receives and the subsequent side effects he experiences is complex and difficult to model or predict. Here, we use machine learning algorithms and statistical models to explore the connection between radiation treatment and post-treatment gastro-urinary function. Since only a limited number of patient datasets are currently available, we used image flip** and curvature-based interpolation methods to generate more data in order to leverage transfer learning. Using interpolated and augmented data, we trained a convolutional autoencoder network to obtain near-optimal starting points for the weights. A convolutional neural network then analyzed the relationship between patient-reported quality-of-life and radiation. We also used analysis of variance and logistic regression to explore organ sensitivity to radiation and develop dosage thresholds for each organ region. Our findings show no connection between the bladder and quality-of-life scores. However, we found a connection between radiation applied to posterior and anterior rectal regions to changes in quality-of-life. Finally, we estimated radiation therapy dosage thresholds for each organ. Our analysis connects machine learning methods with organ sensitivity, thus providing a framework for informing cancer patient care using patient reported quality-of-life metrics. △ Less

Submitted 21 May, 2020; originally announced May 2020.

arXiv:2004.07985 [pdf, other]

doi 10.1142/S0217979220502884

Using single-cell entropy to describe the dynamics of reprogramming and differentiation of induced pluripotent stem cells

Authors: Yusong Ye, Zhuoqin Yang, **zhi Lei

Abstract: Induced pluripotent stem cells (iPSCs) provide a great model to study the process of reprogramming and differentiation of stem cells. Single-cell RNA sequencing (scRNA-seq) enables us to investigate the reprogramming process at single-cell level. Here, we introduce single-cell entropy (scEntropy) as a macroscopic variable to quantify the cellular transcriptome from scRNA-seq data during reprogramm… ▽ More Induced pluripotent stem cells (iPSCs) provide a great model to study the process of reprogramming and differentiation of stem cells. Single-cell RNA sequencing (scRNA-seq) enables us to investigate the reprogramming process at single-cell level. Here, we introduce single-cell entropy (scEntropy) as a macroscopic variable to quantify the cellular transcriptome from scRNA-seq data during reprogramming and differentiation of iPSCs. scEntropy measures the relative order parameter of genomic transcriptions at single cell level during the cell fate change process, which shows increasing during differentiation, and decreasing upon reprogramming. Moreover, based on the scEntropy dynamics, we construct a phenomenological stochastic differential equation model and the corresponding Fokker-Plank equation for cell state transitions during iPSC differentiation, which provide insights to infer cell fates changes and stem cell differentiation. This study is the first to introduce the novel concept of scEntropy to the biological process of iPSC, and suggests that the scEntropy can provide a suitable quantify to describe cell fate transition in differentiation and reprogramming of stem cells. △ Less

Submitted 16 April, 2020; originally announced April 2020.

Comments: 12 pages, 5 figures

arXiv:2004.04768 [pdf, other]

Towards Better Opioid Antagonists Using Deep Reinforcement Learning

Authors: Jianyuan Deng, Zhibo Yang, Yao Li, Dimitris Samaras, Fusheng Wang

Abstract: Naloxone, an opioid antagonist, has been widely used to save lives from opioid overdose, a leading cause for death in the opioid epidemic. However, naloxone has short brain retention ability, which limits its therapeutic efficacy. Develo** better opioid antagonists is critical in combating the opioid epidemic.Instead of exhaustively searching in a huge chemical space for better opioid antagonist… ▽ More Naloxone, an opioid antagonist, has been widely used to save lives from opioid overdose, a leading cause for death in the opioid epidemic. However, naloxone has short brain retention ability, which limits its therapeutic efficacy. Develo** better opioid antagonists is critical in combating the opioid epidemic.Instead of exhaustively searching in a huge chemical space for better opioid antagonists, we adopt reinforcement learning which allows efficient gradient-based search towards molecules with desired physicochemical and/or biological properties. Specifically, we implement a deep reinforcement learning framework to discover potential lead compounds as better opioid antagonists with enhanced brain retention ability. A customized multi-objective reward function is designed to bias the generation towards molecules with both sufficient opioid antagonistic effect and enhanced brain retention ability. Thorough evaluation demonstrates that with this framework, we are able to identify valid, novel and feasible molecules with multiple desired properties, which has high potential in drug discovery. △ Less

Submitted 26 March, 2020; originally announced April 2020.

Comments: 10 pages, 7 figures

arXiv:2003.06846 [pdf]

Propagation analysis and prediction of the COVID-19

Authors: Lixiang Li, Zihang Yang, Zhongkai Dang, Cui Meng, **gze Huang, Hao Tian Meng, Deyu Wang, Guanhua Chen, Jiaxuan Zhang, Haipeng Peng

Abstract: Based on the official data modeling, this paper studies the transmission process of the Corona Virus Disease 2019 (COVID-19). The error between the model and the official data curve is within 3%. At the same time, it realized forward prediction and backward inference of the epidemic situation, and the relevant analysis help relevant countries to make decisions. Based on the official data modeling, this paper studies the transmission process of the Corona Virus Disease 2019 (COVID-19). The error between the model and the official data curve is within 3%. At the same time, it realized forward prediction and backward inference of the epidemic situation, and the relevant analysis help relevant countries to make decisions. △ Less

Submitted 15 March, 2020; originally announced March 2020.

arXiv:2002.09283 [pdf]

doi 10.1038/s41597-022-01211-x

MODMA dataset: a Multi-modal Open Dataset for Mental-disorder Analysis

Authors: Hanshu Cai, Yiwen Gao, Shuting Sun, Na Li, Fuze Tian, Han Xiao, Jianxiu Li, Zhengwu Yang, Xiaowei Li, Qinglin Zhao, Zhenyu Liu, Zhijun Yao, Minqiang Yang, Hong Peng, **g Zhu, Xiaowei Zhang, Guo** Gao, Fang Zheng, Rui Li, Zhihua Guo, Rong Ma, **g Yang, Lan Zhang, Xi** Hu, Yumin Li , et al. (1 additional authors not shown)

Abstract: According to the World Health Organization, the number of mental disorder patients, especially depression patients, has grown rapidly and become a leading contributor to the global burden of disease. However, the present common practice of depression diagnosis is based on interviews and clinical scales carried out by doctors, which is not only labor-consuming but also time-consuming. One important… ▽ More According to the World Health Organization, the number of mental disorder patients, especially depression patients, has grown rapidly and become a leading contributor to the global burden of disease. However, the present common practice of depression diagnosis is based on interviews and clinical scales carried out by doctors, which is not only labor-consuming but also time-consuming. One important reason is due to the lack of physiological indicators for mental disorders. With the rising of tools such as data mining and artificial intelligence, using physiological data to explore new possible physiological indicators of mental disorder and creating new applications for mental disorder diagnosis has become a new research hot topic. However, good quality physiological data for mental disorder patients are hard to acquire. We present a multi-modal open dataset for mental-disorder analysis. The dataset includes EEG and audio data from clinically depressed patients and matching normal controls. All our patients were carefully diagnosed and selected by professional psychiatrists in hospitals. The EEG dataset includes not only data collected using traditional 128-electrodes mounted elastic cap, but also a novel wearable 3-electrode EEG collector for pervasive applications. The 128-electrodes EEG signals of 53 subjects were recorded as both in resting state and under stimulation; the 3-electrode EEG signals of 55 subjects were recorded in resting state; the audio data of 52 subjects were recorded during interviewing, reading, and picture description. We encourage other researchers in the field to use it for testing their methods of mental-disorder analysis. △ Less

Submitted 4 March, 2020; v1 submitted 20 February, 2020; originally announced February 2020.

Journal ref: Sci Data 9, 178 (2022)

arXiv:2002.06401 [pdf, ps, other]

doi 10.1089/cmb.2019.0413

DNA methylation heterogeneity induced by collaborations between enhancers

Authors: Yusong Ye, Zhuoqin Yang, **zhi Lei

Abstract: During mammalian embryo development, reprogramming of DNA methylation plays important roles in the erasure of parental epigenetic memory and the establishment of naïve pluripogent cells. Multiple enzymes that regulate the processes of methylation and demethylation work together to shape the pattern of genome-scale DNA methylation and guid the process of cell differentiation. Recent availability of… ▽ More During mammalian embryo development, reprogramming of DNA methylation plays important roles in the erasure of parental epigenetic memory and the establishment of naïve pluripogent cells. Multiple enzymes that regulate the processes of methylation and demethylation work together to shape the pattern of genome-scale DNA methylation and guid the process of cell differentiation. Recent availability of methylome information from single-cell whole genome bisulfite sequencing (scBS-seq) provides an opportunity to study DNA methylation dynamics in the whole genome in individual cells, which reveal the heterogeneous methylation distributions of enhancers in embryo stem cells (ESCs). In this study, we developed a computational model of enhancer methylation inheritance to study the dynamics of genome-scale DNA methylation reprogramming during exit from pluripotency. The model enables us to track genome-scale DNA methylation reprogramming at single-cell level during the embryo development process, and reproduce the DNA methylation heterogeneity reported by scBS-seq. Model simulations show that DNA methylation heterogeneity is an intrinsic property driven by cell division along the development process, and the collaboration between neighboring enhancers is required for heterogeneous methylation. Our study suggest that the mechanism of genome-scale oscillation proposed by Rulands et al. (2018) might not necessary to the DNA methylation during exit from pluripotency. △ Less

Submitted 15 February, 2020; originally announced February 2020.

Comments: 25 pages, 4 figures

Journal ref: Journal of Computational Biology, 2020

arXiv:2001.10530 [pdf]

doi 10.1111/jebm.12376

Preliminary prediction of the basic reproduction number of the Wuhan novel coronavirus 2019-nCoV

Authors: Tao Zhou, Quanhui Liu, Zimo Yang, **gyi Liao, Kexin Yang, Wei Bai, Xin Lü, Wei Zhang

Abstract: Objectives.--To estimate the basic reproduction number of the Wuhan novel coronavirus (2019-nCoV). Methods.--Based on the susceptible-exposed-infected-removed (SEIR) compartment model and the assumption that the infectious cases with symptoms occurred before January 25, 2020 are resulted from free propagation without intervention, we estimate the basic reproduction number of 2019-nCoV according to… ▽ More Objectives.--To estimate the basic reproduction number of the Wuhan novel coronavirus (2019-nCoV). Methods.--Based on the susceptible-exposed-infected-removed (SEIR) compartment model and the assumption that the infectious cases with symptoms occurred before January 25, 2020 are resulted from free propagation without intervention, we estimate the basic reproduction number of 2019-nCoV according to the reported confirmed cases and suspected cases, as well as the theoretical estimated number of infected cases by other research teams, together with some epidemiological determinants learned from the severe acute respiratory syndrome. Results The basic reproduction number falls between 2.8 to 3.3 by using the real-time reports on the number of 2019-nCoV infected cases from People's Daily in China, and falls between 3.2 and 3.9 on the basis of the predicted number of infected cases from colleagues. Conclusions.--The early transmission ability of 2019-nCoV is closed to or slightly higher than SARS. It is a controllable disease with moderate-high transmissibility. Timely and effective control measures are needed to suppress the further transmissions. Notes Added.--Using a newly reported epidemiological determinants for early 2019-nCoV, the estimated basic reproduction number is in the range [2.2,3.0]. △ Less

Submitted 31 January, 2020; v1 submitted 28 January, 2020; originally announced January 2020.

Comments: 8 pages, 1 table and 1 figure

Journal ref: Journal of Evidence Based Medicine (2020) 1

arXiv:2001.00114 [pdf]

Expertise and Task Pressure in fNIRS-based brain Connectomes

Authors: F. Deligianni, H. Singh, H. N. Modi, S. Jahani, M. Yucel, A. Darzi, D. R. Leff, G. Z. Yang

Abstract: Acquisition of bimanual motor skills, critical in several applications ranging from robotic teleoperations to surgery, is associated with a protracted learning curve. Brain connectivity based on functional Near Infrared Spectroscopy (fNIRS) data has shown promising results in distinguishing experts from novice surgeons. However, it is less well understood how expertise-related disparity in brain c… ▽ More Acquisition of bimanual motor skills, critical in several applications ranging from robotic teleoperations to surgery, is associated with a protracted learning curve. Brain connectivity based on functional Near Infrared Spectroscopy (fNIRS) data has shown promising results in distinguishing experts from novice surgeons. However, it is less well understood how expertise-related disparity in brain connectivity is modulated by dynamic temporal demands experienced during a surgical task. In this study, we use fNIRS to examine the interplay between frontal and motor brain regions in a cohort of surgical residents of varying expertise performing a laparoscopic surgical task under temporal demand. The results demonstrate that prefrontal-motor connectivity in senior residents is more resilient to time pressure. Furthermore, certain global characteristics of brain connectomes, such as the small-world index, may be used to detect the presence of an underlying stressor. △ Less

Submitted 31 December, 2019; originally announced January 2020.

arXiv:1810.05398 [pdf, other]

The good, the bad, and the ugly: Bayesian model selection produces spurious posterior probabilities for phylogenetic trees

Authors: Ziheng Yang, Tianqi Zhu

Abstract: The Bayesian method is noted to produce spuriously high posterior probabilities for phylogenetic trees in analysis of large datasets, but the precise reasons for this over-confidence are unknown. In general, the performance of Bayesian selection of misspecified models is poorly understood, even though this is of great scientific interest since models are never true in real data analysis. Here we c… ▽ More The Bayesian method is noted to produce spuriously high posterior probabilities for phylogenetic trees in analysis of large datasets, but the precise reasons for this over-confidence are unknown. In general, the performance of Bayesian selection of misspecified models is poorly understood, even though this is of great scientific interest since models are never true in real data analysis. Here we characterize the asymptotic behavior of Bayesian model selection and show that when the competing models are equally wrong, Bayesian model selection exhibits surprising and polarized behaviors in large datasets, supporting one model with full force while rejecting the others. If one model is slightly less wrong than the other, the less wrong model will eventually win when the amount of data increases, but the method may become overconfident before it becomes reliable. We suggest that this extreme behavior may be a major factor for the spuriously high posterior probabilities for evolutionary trees. The philosophical implications of our results to the application of Bayesian model selection to evaluate opposing scientific hypotheses are yet to be explored, as are the behaviors of non-Bayesian methods in similar situations. △ Less

Submitted 12 October, 2018; originally announced October 2018.

Comments: 6 pages, plus 3 pages of SI

Journal ref: PNAS 2018

arXiv:1809.05522 [pdf, other]

doi 10.1088/1741-2552/aae18d

Deep Compressive Autoencoder for Action Potential Compression in Large-Scale Neural Recording

Authors: Tong Wu, Wenfeng Zhao, Edward Keefer, Zhi Yang

Abstract: Understanding the coordinated activity underlying brain computations requires large-scale, simultaneous recordings from distributed neuronal structures at a cellular-level resolution. One major hurdle to design high-bandwidth, high-precision, large-scale neural interfaces lies in the formidable data streams that are generated by the recorder chip and need to be online transferred to a remote compu… ▽ More Understanding the coordinated activity underlying brain computations requires large-scale, simultaneous recordings from distributed neuronal structures at a cellular-level resolution. One major hurdle to design high-bandwidth, high-precision, large-scale neural interfaces lies in the formidable data streams that are generated by the recorder chip and need to be online transferred to a remote computer. The data rates can require hundreds to thousands of I/O pads on the recorder chip and power consumption on the order of Watts for data streaming alone. We developed a deep learning-based compression model to reduce the data rate of multichannel action potentials. The proposed model is built upon a deep compressive autoencoder (CAE) with discrete latent embeddings. The encoder is equipped with residual transformations to extract representative features from spikes, which are mapped into the latent embedding space and updated via vector quantization (VQ). The decoder network reconstructs spike waveforms from the quantized latent embeddings. Experimental results show that the proposed model consistently outperforms conventional methods by achieving much higher compression ratios (20-500x) and better or comparable reconstruction accuracies. Testing results also indicate that CAE is robust against a diverse range of imperfections, such as waveform variation and spike misalignment, and has minor influence on spike sorting accuracy. Furthermore, we have estimated the hardware cost and real-time performance of CAE and shown that it could support thousands of recording channels simultaneously without excessive power/heat dissipation. The proposed model can reduce the required data transmission bandwidth in large-scale recording experiments and maintain good signal qualities. The code of this work has been made available at https://github.com/tong-wu-umn/spike-compression-autoencoder △ Less

Submitted 16 September, 2018; v1 submitted 14 September, 2018; originally announced September 2018.

Comments: 19 pages, 13 figures

arXiv:1808.04113 [pdf]

doi 10.1073/pnas.1814006115

Quantitative and functional post-translational modification proteomics reveals that TREPH1 plays a role in plant thigmomorphogenesis

Authors: Kai Wang, Zhu Yang, Dong** Qing, Feng Ren, Shichang Liu, Qingsong Zheng, Jun Liu, Wei** Zhang, Chen Dai, Madeline Wu, E. Wassim Chehab, Janet Braam, Ning Li

Abstract: Plants can sense both intracellular and extracellular mechanical forces and can respond through morphological changes. The signaling components responsible for mechanotransduction of the touch response are largely unknown. Here, we performed a high-throughput SILIA (stable isotope labeling in Arabidopsis)-based quantitative phosphoproteomics analysis to profile changes in protein phosphorylation r… ▽ More Plants can sense both intracellular and extracellular mechanical forces and can respond through morphological changes. The signaling components responsible for mechanotransduction of the touch response are largely unknown. Here, we performed a high-throughput SILIA (stable isotope labeling in Arabidopsis)-based quantitative phosphoproteomics analysis to profile changes in protein phosphorylation resulting from 40 seconds of force stimulation in Arabidopsis thaliana. Of the 24 touch-responsive phosphopeptides identified, many were derived from kinases, phosphatases, cytoskeleton proteins, membrane proteins and ion transporters. TOUCH-REGULATED PHOSPHOPROTEIN1 (TREPH1) and MAP KINASE KINASE 2 (MKK2) and/or MKK1 became rapidly phosphorylated in touch-stimulated plants. Both TREPH1 and MKK2 are required for touch-induced delayed flowering, a major component of thigmomorphogenesis. The treph1-1 and mkk2 mutants also exhibited defects in touch-inducible gene expression. A non-phosphorylatable site-specific isoform of TREPH1 (S625A) failed to restore touch-induced flowering delay of treph1-1, indicating the necessity of S625 for TREPH1 function and providing evidence consistent with the possible functional relevance of the touch-regulated TREPH1 phosphorylation. Bioinformatic analysis and biochemical subcellular fractionation of TREPH1 protein indicate that it is a soluble protein. Altogether, these findings identify new protein players in Arabidopsis thigmomorphogenesis regulation, suggesting that protein phosphorylation may play a critical role in plant force responses. △ Less

Submitted 13 August, 2018; originally announced August 2018.

arXiv:1801.03268 [pdf]

Prognostication of chronic disorders of consciousness using brain functional networks and clinical characteristics

Authors: Ming Song, Yi Yang, Jianghong He, Zhengyi Yang, Shan Yu, Qiuyou Xie, Xiaoyu Xia, Yuanyuan Dang, Qiang Zhang, Xinhuai Wu, Yue Cui, Bing Hou, Ronghao Yu, Ruxiang Xu, Tianzi Jiang

Abstract: Disorders of consciousness are a heterogeneous mixture of different diseases or injuries. Although some indicators and models have been proposed for prognostication, any single method when used alone carries a high risk of false prediction. This study aimed to develop a multidomain prognostic model that combines resting state functional MRI with three clinical characteristics to predict one year o… ▽ More Disorders of consciousness are a heterogeneous mixture of different diseases or injuries. Although some indicators and models have been proposed for prognostication, any single method when used alone carries a high risk of false prediction. This study aimed to develop a multidomain prognostic model that combines resting state functional MRI with three clinical characteristics to predict one year outcomes at the single-subject level. The model discriminated between patients who would later recover consciousness and those who would not with an accuracy of around 90% on three datasets from two medical centers. It was also able to identify the prognostic importance of different predictors, including brain functions and clinical characteristics. To our knowledge, this is the first implementation reported of a multidomain prognostic model based on resting state functional MRI and clinical characteristics in chronic disorders of consciousness. We therefore suggest that this novel prognostic model is accurate, robust, and interpretable. △ Less

Submitted 6 September, 2018; v1 submitted 10 January, 2018; originally announced January 2018.

Comments: Although some prognostic indicators and models have been proposed for disorders of consciousness, each single method when used alone carries risks of false prediction. Song et al. report that a model combining resting state functional MRI with clinical characteristics provided accurate, robust, and interpretable prognostications. 52 pages, 1 table, 7 figures

arXiv:1512.03843 [pdf, other]

Efficient Bayesian species tree inference under the multi-species coalescent

Authors: Bruce Rannala, Ziheng Yang

Abstract: A method was developed for Bayesian inference of species phylogeny using the multi-species coalescent model. To improve the mixing properties of the Markov chain Monte Carlo (MCMC) algorithm that traverses the space of species trees, we implement two efficient MCMC proposals: the first is based on the Subtree Pruning and Regrafting (SPR) algorithm and the second is based on a novel node-slider alg… ▽ More A method was developed for Bayesian inference of species phylogeny using the multi-species coalescent model. To improve the mixing properties of the Markov chain Monte Carlo (MCMC) algorithm that traverses the space of species trees, we implement two efficient MCMC proposals: the first is based on the Subtree Pruning and Regrafting (SPR) algorithm and the second is based on a novel node-slider algorithm. Like the Nearest-Neighbor Interchange (NNI) algorithm we implemented previously, both algorithms propose changes to the species tree, while simultaneously altering the gene trees at multiple genetic loci to automatically avoid conflicts with the newly-proposed species tree. The method integrates over gene trees, naturally taking account of the uncertainty of gene tree topology and branch lengths given the sequence data. A simulation study was performed to examine the statistical properties of the new method. We found that it has excellent statistical performance, inferring the correct species tree with near certainty when analyzing 10 loci. The prior on species trees has some impact, particularly for small numbers of loci. An empirical dataset (for rattlesnakes) was reanalyzed. While the 18 nuclear loci and one mitochondrial locus support largely consistent species trees under the multi-species coalescent model estimates of parameters suggest drastically different evolutionary dynamics between the nuclear and mitochondrial loci. △ Less

Submitted 11 December, 2015; originally announced December 2015.

arXiv:1503.08261 [pdf, other]

Bifurcation analysis and potential landscape of the p53-Mdm2 oscillator regulated by the co-activator PDCD5

Authors: Yuanhong Bi, Zhuoqin Yang, Chang**g Zhuge, **zhi Lei

Abstract: Dynamics of p53 is known to play important roles in the regulation of cell fate decisions in response to various stresses, and PDCD5 functions as a co-activator of p53 to modulate the p53 dynamics. In the present paper, we investigate how p53 dynamics are modulated by PDCD5 during the DNA damage response using methods of bifurcation analysis and potential landscape. Our results reveal that p53 act… ▽ More Dynamics of p53 is known to play important roles in the regulation of cell fate decisions in response to various stresses, and PDCD5 functions as a co-activator of p53 to modulate the p53 dynamics. In the present paper, we investigate how p53 dynamics are modulated by PDCD5 during the DNA damage response using methods of bifurcation analysis and potential landscape. Our results reveal that p53 activities can display rich dynamics under different PDCD5 levels, including monostability, bistability with two stable steady states, oscillations, and co-existence of a stable steady state and an oscillatory state. Physical properties of the p53 oscillations are further shown by the potential landscape, in which the potential force attracts the system state to the limit cycle attractor, and the curl flux force drives the coherent oscillation along the cyclic. We also investigate the effect of PDCD5 efficiency on inducing the p53 oscillations. We show that Hopf bifurcation is induced by increasing the PDCD5 efficiency, and the system dynamics show clear transition features in both barrier height and energy dissipation when the efficiency is close to the bifurcation point. This study provides a global picture of how PDCD5 regulates p53 dynamics via the interaction with the p53-Mdm2 oscillator and can be helpful in understanding the complicate p53 dynamics in a more complete p53 pathway. △ Less

Submitted 27 March, 2015; originally announced March 2015.

Comments: 11 pages, 8 figures

arXiv:1305.0361 [pdf, ps, other]

doi 10.1038/srep03292

Braess's Paradox in Epidemic Game: Better Condition Results in Less Payoff

Authors: Hai-Feng Zhang, Zimo Yang, Zhi-Xi Wu, Bing-Hong Wang, Tao Zhou

Abstract: Facing the threats of infectious diseases, we take various actions to protect ourselves, but few studies considered an evolving system with competing strategies. In view of that, we propose an evolutionary epidemic model coupled with human behaviors, where individuals have three strategies: vaccination, self-protection and laissez faire, and could adjust their strategies according to their neighbo… ▽ More Facing the threats of infectious diseases, we take various actions to protect ourselves, but few studies considered an evolving system with competing strategies. In view of that, we propose an evolutionary epidemic model coupled with human behaviors, where individuals have three strategies: vaccination, self-protection and laissez faire, and could adjust their strategies according to their neighbors' strategies and payoffs at the beginning of each new season of epidemic spreading. We found a counter-intuitive phenomenon analogous to the well-known \emph{Braess's Paradox}, namely a better condition may lead to worse performance. Specifically speaking, increasing the successful rate of self-protection does not necessarily reduce the epidemic size or improve the system payoff. This phenomenon is insensitive to the network topologies, and can be well explained by a mean-field approximation. Our study demonstrates an important fact that a better condition for individuals may yield a worse outcome for the society. △ Less

Submitted 2 May, 2013; originally announced May 2013.

Comments: 17 pages, 5 figures

Journal ref: Scientific Reports,3, (2013), 3292

arXiv:1201.0153 [pdf, ps, other]

doi 10.1186/1471-2105-14-87

Empirical Bayes estimation of posterior probabilities of enrichment

Authors: Zhenyu Yang, Zuo**g Li, David R. Bickel

Abstract: To interpret differentially expressed genes or other discovered features, researchers conduct hypothesis tests to determine which biological categories such as those of the Gene Ontology (GO) are enriched in the sense of having differential representation among the discovered features. We study application of better estimators of the local false discovery rate (LFDR), a probability that the biolog… ▽ More To interpret differentially expressed genes or other discovered features, researchers conduct hypothesis tests to determine which biological categories such as those of the Gene Ontology (GO) are enriched in the sense of having differential representation among the discovered features. We study application of better estimators of the local false discovery rate (LFDR), a probability that the biological category has equivalent representation among the preselected features. We identified three promising estimators of the LFDR for detecting differential representation: a semiparametric estimator (SPE), a normalized maximum likelihood estimator (NMLE), and a maximum likelihood estimator (MLE). We found that the MLE performs at least as well as the SPE for on the order of 100 of GO categories even when the ideal number of components in its underlying mixture model is unknown. However, the MLE is unreliable when the number of GO categories is small compared to the number of PMM components. Thus, if the number of categories is on the order of 10, the SPE is a more reliable LFDR estimator. The NMLE depends not only on the data but also on a specified value of the prior probability of differential representation. It is therefore an appropriate LFDR estimator only when the number of GO categories is too small for application of the other methods. For enrichment detection, we recommend estimating the LFDR by the MLE given at least a medium number (~100) of GO categories, by the SPE given a small number of GO categories (~10), and by the NMLE given a very small number (~1) of GO categories. △ Less

Submitted 30 December, 2011; originally announced January 2012.

Comments: exhaustive revision of Zhenyu Yang and David R. Bickel, "Minimum Description Length Measures of Evidence for Enrichment" (December 2010). COBRA Preprint Series. Article 76. http://biostats.bepress.com/cobra/ps/art76

Journal ref: A comparative study of five estimators of the local false discovery rate," BMC Bioinformatics 14, art. 87 (2013)

arXiv:q-bio/0402019 [pdf, ps, other]

doi 10.1016/j.physa.2004.05.031

The Spread of Infectious Disease with Household-Structure on the Complex Networks

Authors: **gzhou Liu, **shan Wu, Z. R. Yang

Abstract: In this paper we study the household-structure SIS epidemic spreading on general complex networks. The household structure gives us the way to distinguish inner and the outer infection rate. Unlike household-structure models on homogenous networks, such as regular and random networks, here we consider heterogeneous networks with arbitrary degree distribution p(k). First we introduce the epidemic… ▽ More In this paper we study the household-structure SIS epidemic spreading on general complex networks. The household structure gives us the way to distinguish inner and the outer infection rate. Unlike household-structure models on homogenous networks, such as regular and random networks, here we consider heterogeneous networks with arbitrary degree distribution p(k). First we introduce the epidemic model. Then rate equations under mean field appropriation and computer simulations are used here to analyze our model. Some unique phenomena only existing in divergent network with household structure is found, while we also get some similar conclusions that some simple geometrical quantities of networks have important impression on infection property of infectous disease. It seems that in our model even when local cure rate is greater than inner infection rate in every household, disease still can spread on scale-free network. It implies that no disease is spreading in every single household, but for the whole network, disease is spreading. Since our society network seems like this structure, maybe this conclusion remind us that during disease spreading we should pay more attention on network structure than local cure condition. △ Less

Submitted 25 February, 2004; v1 submitted 9 February, 2004; originally announced February 2004.

Comments: 12 pages, 2 figures

arXiv:cond-mat/0009159 [pdf, ps, other]

doi 10.1103/PhysRevE.62.5923

New approach to Monte Carlo calculation of buckling of supercoiled DNA loops

Authors: Zhang Yang

Abstract: The short supercoiled circular DNA molecules are shown to be glassy systems and canonical Metropolis Monte Carlo simulations of the systems tend to get stuck in local metastable energy basins. A novel Monte Carlo algorithm is developed to alleviate the problem of ``ergodicity breaking'' of the glassy systems, in which the Markov process is driven by an explicitly analytic weight factor with enha… ▽ More The short supercoiled circular DNA molecules are shown to be glassy systems and canonical Metropolis Monte Carlo simulations of the systems tend to get stuck in local metastable energy basins. A novel Monte Carlo algorithm is developed to alleviate the problem of ``ergodicity breaking'' of the glassy systems, in which the Markov process is driven by an explicitly analytic weight factor with enhanced probability in both low- and high-energy regions. To characterize the degree of puckering of the supercoiled DNA loops, a new quantity of aplanarity is introduced as the shortest principal axis of configurational ellipsoid of DNA. With the suggested Monte Carlo method, the quantitative correlation between supercoiling degree and buckling of DNA is attained. With supercoiling stress increasing, the conformational transition from a circle to mono-, diplo- or triple interwound superhelical structure will take place in a successive but decreasingly abrupt mode. △ Less

Submitted 11 September, 2000; originally announced September 2000.

Comments: 4 pages, 3 PS figures, to appear at Phys. Rev. E as a Rapid Communication

Journal ref: Phys.Rev.E62:5923-5926,2000

arXiv:physics/9911074 [pdf, ps, other]

doi 10.1016/S0006-3495(00)76745-2

Monte Carlo implementation of supercoiled double-stranded DNA

Authors: Zhang Yang, Zhou Haijun, Ouyang Zhongcan

Abstract: Metropolis Monte Carlo simulation is used to investigate the elasticity of torsionally stressed double-stranded DNA, in which twist and supercoiling are incorporated as a natural result of base-stacking interaction and backbone bending constrained by hydrogen bonds formed between DNA complementary nucleotide bases. Three evident regimes are found in extension versus torsion and/or force versus e… ▽ More Metropolis Monte Carlo simulation is used to investigate the elasticity of torsionally stressed double-stranded DNA, in which twist and supercoiling are incorporated as a natural result of base-stacking interaction and backbone bending constrained by hydrogen bonds formed between DNA complementary nucleotide bases. Three evident regimes are found in extension versus torsion and/or force versus extension plots: a low-force regime in which over- and underwound molecules behave similarly under stretching; an intermediate-force regime in which chirality appears for negatively and positively supercoiled DNA and extension of underwound molecule is insensitive to the supercoiling degree of the polymer; and a large-force regime in which plectonemic DNA is fully converted to extended DNA and supercoiled DNA behaves quite like a torsionless molecule. The striking coincidence between theoretic calculations and recent experimental measurement of torsionally stretched DNA [Strick et al., Science {\bf 271}, 1835 (1996), Biophys. J. {\bf 74}, 2016 (1998)] strongly suggests that the interplay between base-stacking interaction and permanent hydrogen-bond constraint takes an important role in understanding the novel properties of elasticity of supercoiled DNA polymer. △ Less

Submitted 29 November, 1999; originally announced November 1999.

Comments: 21 pages, 6 PS figures. To appear at Biophys. J

arXiv:cond-mat/9901321 [pdf, ps, other]

Entropic Elasticity, Cooperative Extensibility and Supercoiling Property of DNA: A Unified Viewpoint

Authors: Zhou Haijun, Zhang Yang, Ou-Yang Zhong-can

Abstract: A unified model is constructed to study the recently observed DNA entropic elasticity, cooperative extensibility, and supercoiling property. With the introduction of a new structural parameter (the folding angle $φ$), bending deformations of sugar-phosphate backbones, steric effects of nucleotide basepairs, and short-range basestacking interactions are considered. The comprehensive agreement of… ▽ More A unified model is constructed to study the recently observed DNA entropic elasticity, cooperative extensibility, and supercoiling property. With the introduction of a new structural parameter (the folding angle $φ$), bending deformations of sugar-phosphate backbones, steric effects of nucleotide basepairs, and short-range basestacking interactions are considered. The comprehensive agreement of theoretical results with experimental observations on both torsionally relaxed and negatively supercoiled DNAs strongly indicates that, basestacking interactions, although short-ranged in nature, dominate the elasticity of DNA and hence are of vital biological significance. △ Less

Submitted 3 April, 1999; v1 submitted 27 January, 1999; originally announced January 1999.

Comments: 4 pages in Latex format, with 3 EPS figures included. A typographic mistake in Eq. (7) is corrected in this version. A slightly different version of this paper will appear in PRL

Showing 1–48 of 48 results for author: Yang, Z