Search | arXiv e-print repository

ProtFAD: Introducing function-aware domains as implicit modality towards protein function perception

Authors: Mingqing Wang, Zhiwei Nie, Yonghong He, Zhixiang Ren

Abstract: Protein function prediction is currently achieved by encoding its sequence or structure, where the sequence-to-function transcendence and high-quality structural data scarcity lead to obvious performance bottlenecks. Protein domains are "building blocks" of proteins that are functionally independent, and their combinations determine the diverse biological functions. However, most existing studies… ▽ More Protein function prediction is currently achieved by encoding its sequence or structure, where the sequence-to-function transcendence and high-quality structural data scarcity lead to obvious performance bottlenecks. Protein domains are "building blocks" of proteins that are functionally independent, and their combinations determine the diverse biological functions. However, most existing studies have yet to thoroughly explore the intricate functional information contained in the protein domains. To fill this gap, we propose a synergistic integration approach for a function-aware domain representation, and a domain-joint contrastive learning strategy to distinguish different protein functions while aligning the modalities. Specifically, we associate domains with the GO terms as function priors to pre-train domain embeddings. Furthermore, we partition proteins into multiple sub-views based on continuous joint domains for contrastive training under the supervision of a novel triplet InfoNCE loss. Our approach significantly and comprehensively outperforms the state-of-the-art methods on various benchmarks, and clearly differentiates proteins carrying distinct functions compared to the competitor. △ Less

Submitted 23 May, 2024; originally announced May 2024.

Comments: 16 pages, 6 figures, 5 tables

arXiv:2405.00577 [pdf]

Discovering robust biomarkers of neurological disorders from functional MRI using graph neural networks: A Review

Authors: Yi Hao Chan, Deepank Girish, Sukrit Gupta, **g Xia, Chockalingam Kasi, Yinan He, Conghao Wang, Jagath C. Rajapakse

Abstract: Graph neural networks (GNN) have emerged as a popular tool for modelling functional magnetic resonance imaging (fMRI) datasets. Many recent studies have reported significant improvements in disorder classification performance via more sophisticated GNN designs and highlighted salient features that could be potential biomarkers of the disorder. In this review, we provide an overview of how GNN and… ▽ More Graph neural networks (GNN) have emerged as a popular tool for modelling functional magnetic resonance imaging (fMRI) datasets. Many recent studies have reported significant improvements in disorder classification performance via more sophisticated GNN designs and highlighted salient features that could be potential biomarkers of the disorder. In this review, we provide an overview of how GNN and model explainability techniques have been applied on fMRI datasets for disorder prediction tasks, with a particular emphasis on the robustness of biomarkers produced for neurodegenerative diseases and neuropsychiatric disorders. We found that while most studies have performant models, salient features highlighted in these studies vary greatly across studies on the same disorder and little has been done to evaluate their robustness. To address these issues, we suggest establishing new standards that are based on objective evaluation metrics to determine the robustness of these potential biomarkers. We further highlight gaps in the existing literature and put together a prediction-attribution-evaluation framework that could set the foundations for future research on improving the robustness of potential biomarkers discovered via GNNs. △ Less

Submitted 1 May, 2024; originally announced May 2024.

arXiv:2401.11782 [pdf, other]

Temporal Interaction and its Role in the Evolution of Cooperation

Authors: Yujie He, Tianyu Ren, Xiao-Jun Zeng, Huawen Liang, Liukai Yu, Junjun Zheng

Abstract: This research investigates the impact of dynamic interactions with time-varying topologies on the evolution of cooperative behaviours in social dilemmas. Traditional research has focused on deterministic rules governing pairwise interactions, yet the impact of interaction frequency and synchronicity on cooperation remains underexplored. Addressing this gap, our work introduces two temporal interac… ▽ More This research investigates the impact of dynamic interactions with time-varying topologies on the evolution of cooperative behaviours in social dilemmas. Traditional research has focused on deterministic rules governing pairwise interactions, yet the impact of interaction frequency and synchronicity on cooperation remains underexplored. Addressing this gap, our work introduces two temporal interaction mechanisms to model the stochastic or periodic participation of individuals in these games, acknowledging real-life variances due to exogenous temporal factors and geographical time differences. We consider that the interaction state significantly influences both game payoff calculations and the strategy updating process, offering new insights into the emergence and sustainability of cooperation. Our results indicate that maximum game participation frequency is suboptimal under a stochastic interaction mechanism. Instead, an intermediate region of activation probability yields the highest cooperation level, especially under strong dilemma conditions. This suggests that a balance between inactivity security and interaction frequency is crucial. Furthermore, local synchronization of interactions within specific areas is shown to be beneficial, as time differences hinder the spread of cross-structures but promote the formation of dense cooperative clusters with smoother boundaries. Our findings provide an intuitive understanding of node-based temporality and probabilistic interactions, contributing to the broader discourse on resolving social dilemmas. △ Less

Submitted 5 February, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

Comments: 10 pages, 9 figures

arXiv:2307.06344 [pdf, other]

The Whole Pathological Slide Classification via Weakly Supervised Learning

Authors: Qiehe Sun, Jiawen Li, ** Xu, Junru Cheng, Tian Guan, Yonghong He

Abstract: Due to its superior efficiency in utilizing annotations and addressing gigapixel-sized images, multiple instance learning (MIL) has shown great promise as a framework for whole slide image (WSI) classification in digital pathology diagnosis. However, existing methods tend to focus on advanced aggregators with different structures, often overlooking the intrinsic features of H\&E pathological slide… ▽ More Due to its superior efficiency in utilizing annotations and addressing gigapixel-sized images, multiple instance learning (MIL) has shown great promise as a framework for whole slide image (WSI) classification in digital pathology diagnosis. However, existing methods tend to focus on advanced aggregators with different structures, often overlooking the intrinsic features of H\&E pathological slides. To address this limitation, we introduced two pathological priors: nuclear heterogeneity of diseased cells and spatial correlation of pathological tiles. Leveraging the former, we proposed a data augmentation method that utilizes stain separation during extractor training via a contrastive learning strategy to obtain instance-level representations. We then described the spatial relationships between the tiles using an adjacency matrix. By integrating these two views, we designed a multi-instance framework for analyzing H\&E-stained tissue images based on pathological inductive bias, encompassing feature extraction, filtering, and aggregation. Extensive experiments on the Camelyon16 breast dataset and TCGA-NSCLC Lung dataset demonstrate that our proposed framework can effectively handle tasks related to cancer detection and differentiation of subtypes, outperforming state-of-the-art medical image classification methods based on MIL. The code will be released later. △ Less

Submitted 12 July, 2023; originally announced July 2023.

arXiv:2210.13561 [pdf, other]

Flowers of immortality

Authors: Thomas Fink, Yang-Hui He

Abstract: There has been a recent surge of interest in what causes aging. This has been matched by unprecedented research investment in the field from tech companies. But, despite considerable effort from a broad range of researchers, we do not have a rigorous mathematical theory of programmed aging. To address this, we recently derived a mortality equation that governs the transition matrix of an evolving… ▽ More There has been a recent surge of interest in what causes aging. This has been matched by unprecedented research investment in the field from tech companies. But, despite considerable effort from a broad range of researchers, we do not have a rigorous mathematical theory of programmed aging. To address this, we recently derived a mortality equation that governs the transition matrix of an evolving population with a given maximum age. Here, we characterize the spectrum of eigenvalues of the solution to this equation. The eigenvalues fall into two classes. The complex and negative real eigenvalues, which we call the flower, are always contained in the unit circle in the complex plane. They play a negligible role in controlling the dynamics of an aging population. The positive real eigenvalues, which we call the stem, are the only eigenvalues which can exceed the unit circle. They control the most important properties of the dynamics. In particular, the spectral radius increases with the maximum allowed age. This suggests that programmed aging confers no advantage in a constant environment. However, the spectral gap, which governs the rate of convergence to equilibrium, decreases with the maximum allowed age. This opens the door to an evolutionary advantage in a changing environment. △ Less

Submitted 24 October, 2022; originally announced October 2022.

arXiv:2209.09700 [pdf]

Unresolved excess accumulation of myelin-derived cholesterol contributes to scar formation after spinal cord injury

Authors: Bolin Zheng, Yi**g He, Qing Zhao, Xu Zhu, Shuai Yin, Huiyi Yang, Zhaojie Wang, Liming Cheng

Abstract: Background: Spinal cord injury triggers complex pathological cascades, resulting in destructive tissue damage and incomplete tissue repair. Scar formation is generally considered as a barrier for regeneration in central nervous system (CNS), while the intrinsic mechanism of scar-forming after spinal cord injury has not been completed deciphered. Methods: We assessed cholesterol hemostasis in spina… ▽ More Background: Spinal cord injury triggers complex pathological cascades, resulting in destructive tissue damage and incomplete tissue repair. Scar formation is generally considered as a barrier for regeneration in central nervous system (CNS), while the intrinsic mechanism of scar-forming after spinal cord injury has not been completed deciphered. Methods: We assessed cholesterol hemostasis in spinal cord lesions and injured peripheral nerves using confocal reflection microscopy and real-time PCR analyses. The involvement of the proteins, which were predicted to promote cholesterol efflux in spinal cord lesions, were assessed with Liver X receptor (LXR) agonist and Apolipoprotein E (APOE) deficiency. The role of reverse cholesterol transport (RCT) in cholesterol clearance was examined in APOE KO mice injured sciatic nerves and myelin-overloaded macrophages in vitro. Finally, we determined the consequence of excess cholesterol accumulation in CNS by transplantation of myelin into neonatal spinal cord lesions. Results: We found that excess cholesterol accumulates in phagocytes and is inefficiently removed in spinal cord lesions in young-adult mice. Interestingly, we observed that excessive cholesterol also accumulates in injured peripheral nerves, but is subsequently removed by RCT. Meanwhile, preventing RCT led to macrophage accumulation and fibrosis in injured peripheral nerves. Furthermore, the neonatal mouse spinal cord lesions are devoid of myelin-derived lipids, and able to heal without excess cholesterol accumulation. We found that transplantation of myelin into neonatal lesions disrupts healing with excessive cholesterol accumulation, persistent macrophage activation and fibrosis, indicating myelin-derived cholesterol plays a critical role in impaired wound healing. △ Less

Submitted 20 September, 2022; originally announced September 2022.

arXiv:2209.04084 [pdf, other]

Polarization effects on fluorescence emission of zebrafish neurons using light-sheet microscopy

Authors: Hong Ye, Xin Xu, Jixiang Wang, **g Wang, Yi He, Yu Mu, Guohua Shi

Abstract: Light-sheet fluorescence microscopy (LSFM) makes use of a thin plane of light to optically section and image transparent tissues or organisms {\it{in vivo}}, which has the advantages of fast imaging speed and low phototoxicity. In this paper, we have employed light-sheet microscopy to investigate the polarization effects on fluorescence emission of zebrafish neurons via modifying the electric osci… ▽ More Light-sheet fluorescence microscopy (LSFM) makes use of a thin plane of light to optically section and image transparent tissues or organisms {\it{in vivo}}, which has the advantages of fast imaging speed and low phototoxicity. In this paper, we have employed light-sheet microscopy to investigate the polarization effects on fluorescence emission of zebrafish neurons via modifying the electric oscillation orientation of the excitation light. The intensity of the fluorescence emission from the excited zebrafish larvae follows a cosine square function with respect to the polarization state of the excitation light and reveals a 40$\%$ higher fluorescence emission when the polarization orientation is orthogonal to the illumination and detection axes. Through registration and subtraction of fluorescence images under different polarization states, we have demonstrated that most of the enhanced fluorescence signals are from the nerve cells rather than the extracellular substance. This provides us a way to distinguish the cell boundaries and observe the organism structures with improved contrast and resolution. △ Less

Submitted 8 September, 2022; originally announced September 2022.

arXiv:2205.03447 [pdf, ps, other]

Machine Learning-Friendly Biomedical Datasets for Equivalence and Subsumption Ontology Matching

Authors: Yuan He, Jiaoyan Chen, Hang Dong, Ernesto Jiménez-Ruiz, Ali Hadian, Ian Horrocks

Abstract: Ontology Matching (OM) plays an important role in many domains such as bioinformatics and the Semantic Web, and its research is becoming increasingly popular, especially with the application of machine learning (ML) techniques. Although the Ontology Alignment Evaluation Initiative (OAEI) represents an impressive effort for the systematic evaluation of OM systems, it still suffers from several limi… ▽ More Ontology Matching (OM) plays an important role in many domains such as bioinformatics and the Semantic Web, and its research is becoming increasingly popular, especially with the application of machine learning (ML) techniques. Although the Ontology Alignment Evaluation Initiative (OAEI) represents an impressive effort for the systematic evaluation of OM systems, it still suffers from several limitations including limited evaluation of subsumption map**s, suboptimal reference map**s, and limited support for the evaluation of ML-based systems. To tackle these limitations, we introduce five new biomedical OM tasks involving ontologies extracted from Mondo and UMLS. Each task includes both equivalence and subsumption matching; the quality of reference map**s is ensured by human curation, ontology pruning, etc.; and a comprehensive evaluation framework is proposed to measure OM performance from various perspectives for both ML-based and non-ML-based OM systems. We report evaluation results for OM systems of different types to demonstrate the usage of these resources, all of which are publicly available as part of the new BioML track at OAEI 2022. △ Less

Submitted 22 July, 2023; v1 submitted 6 May, 2022; originally announced May 2022.

Comments: Accepted paper (Best Resource Paper Candidate) in the 21st International Semantic Web Conference (ISWC-2022); Bio-ML Dataset: https://doi.org/10.5281/zenodo.6510086

arXiv:2203.06895 [pdf, other]

doi 10.1109/TCDS.2022.3174209

Topological EEG Nonlinear Dynamics Analysis for Emotion Recognition

Authors: Yan Yan, Xuankun Wu, Chengdong Li, Yini He, Zhicheng Zhang, Huihui Li, Ang Li, Lei Wang

Abstract: Emotional recognition through exploring the electroencephalography (EEG) characteristics has been widely performed in recent studies. Nonlinear analysis and feature extraction methods for understanding the complex dynamical phenomena are associated with the EEG patterns of different emotions. The phase space reconstruction is a typical nonlinear technique to reveal the dynamics of the brain neural… ▽ More Emotional recognition through exploring the electroencephalography (EEG) characteristics has been widely performed in recent studies. Nonlinear analysis and feature extraction methods for understanding the complex dynamical phenomena are associated with the EEG patterns of different emotions. The phase space reconstruction is a typical nonlinear technique to reveal the dynamics of the brain neural system. Recently, the topological data analysis (TDA) scheme has been used to explore the properties of space, which provides a powerful tool to think over the phase space. In this work, we proposed a topological EEG nonlinear dynamics analysis approach using the phase space reconstruction (PSR) technique to convert EEG time series into phase space, and the persistent homology tool explores the topological properties of the phase space. We perform the topological analysis of EEG signals in different rhythm bands to build emotion feature vectors, which shows high distinguishing ability. We evaluate the approach with two well-known benchmark datasets, the DEAP and DREAMER datasets. The recognition results achieved accuracies of 99.37% and 99.35% in arousal and valence classification tasks with DEAP, and 99.96%, 99.93%, and 99.95% in arousal, valence, and dominance classifications tasks with DREAMER, respectively. The performances are supposed to be outperformed current state-of-art approaches in DREAMER (improved by 1% to 10% depends on temporal length), while comparable to other related works evaluated in DEAP. The proposed work is the first investigation in the emotion recognition oriented EEG topological feature analysis, which brought a novel insight into the brain neural system nonlinear dynamics analysis and feature extraction. △ Less

Submitted 14 March, 2022; originally announced March 2022.

arXiv:2112.15552 [pdf, other]

doi 10.1109/JSSC.2021.3129993

Magnetoelectric Bio-Implants Powered and Programmed by a Single Transmitter for Coordinated Multisite Stimulation

Authors: Zhanghao Yu, Joshua C. Chen, Yan He, Fatima T. Alrashdan, Benjamin W. Avants, Amanda Singer, Jacob T. Robinson, Kaiyuan Yang

Abstract: This article presents a hardware platform including stimulating implants wirelessly powered and controlled by a shared transmitter (TX) for coordinated leadless multisite stimulation. The adopted novel single-TX, multiple-implant structure can flexibly deploy stimuli, improve system efficiency, easily scale stimulating channel quantity, and relieve efforts in device synchronization. In the propose… ▽ More This article presents a hardware platform including stimulating implants wirelessly powered and controlled by a shared transmitter (TX) for coordinated leadless multisite stimulation. The adopted novel single-TX, multiple-implant structure can flexibly deploy stimuli, improve system efficiency, easily scale stimulating channel quantity, and relieve efforts in device synchronization. In the proposed system, a wireless link leveraging magnetoelectric (ME) effect is co-designed with a robust and efficient system-on-chip (SoC) to enable reliable operation and individual programming of every implant. Each implant integrates a 0.8-mm2 chip, a 6-mm2 ME film, and an energy storage capacitor within a 6.2-mm3 size. ME power transfer is capable of safely transmitting milliwatt power to devices placed several centimeters away from the TX coil, maintaining good efficiency with size constraints, and tolerating 60 degree, 1.5-cm misalignment in angular and lateral movement. The SoC robustly operates with 2-V source amplitude variations that spans a 40-mm TX-implant distance change, realizes individual addressability through physical unclonable function (PUF) IDs, and achieves 90% efficiency for 1.5-3.5-V stimulation with fully programmable stimulation parameters. △ Less

Submitted 31 December, 2021; originally announced December 2021.

Comments: This paper has been published in IEEE Journal of Solid-State Circuits, 2021

Journal ref: IEEE Journal of Solid-State Circuits, 2021

arXiv:2107.02995 [pdf, other]

doi 10.1109/TBCAS.2020.3037862

MagNI: A Magnetoelectrically Powered and Controlled Wireless Neurostimulating Implant

Authors: Zhanghao Yu, Joshua C. Chen, Fatima T. Alrashdan, Benjamin W. Avants, Yan He, Amanda Singer, Jacob T. Robinson, Kaiyuan Yang

Abstract: This paper presents the first wireless and programmable neural stimulator leveraging magnetoelectric (ME) effects for power and data transfer. Thanks to low tissue absorption, low misalignment sensitivity and high power transfer efficiency, the ME effect enables safe delivery of high power levels (a few milliwatts) at low resonant frequencies (~250 kHz) to mm-sized implants deep inside the body (3… ▽ More This paper presents the first wireless and programmable neural stimulator leveraging magnetoelectric (ME) effects for power and data transfer. Thanks to low tissue absorption, low misalignment sensitivity and high power transfer efficiency, the ME effect enables safe delivery of high power levels (a few milliwatts) at low resonant frequencies (~250 kHz) to mm-sized implants deep inside the body (30-mm depth). The presented MagNI (Magnetoelectric Neural Implant) consists of a 1.5-mm$^2$ 180-nm CMOS chip, an in-house built 4x2 mm ME film, an energy storage capacitor, and on-board electrodes on a flexible polyimide substrate with a total volume of 8.2 mm$^3$ . The chip with a power consumption of 23.7 $μ$W includes robust system control and data recovery mechanisms under source amplitude variations (1-V variation tolerance). The system delivers fully-programmable bi-phasic current-controlled stimulation with patterns covering 0.05-to-1.5-mA amplitude, 64-to-512-$μ$s pulse width, and 0-to-200Hz repetition frequency for neurostimulation. △ Less

Submitted 6 July, 2021; originally announced July 2021.

Comments: This work has been accepted to 2020 IEEE Transactions on Biomedical Circuits and Systems (TBioCAS)

Journal ref: IEEE Transactions on Biomedical Circuits and Systems (TBioCAS), Volume: 14, Issue: 6, Pages: 1241-1252, Dec. 2020

arXiv:2104.11644 [pdf, other]

Quantitative map** of the brain's structural connectivity using diffusion MRI tractography: a review

Authors: Fan Zhang, Alessandro Daducci, Yong He, Simona Schiavi, Caio Seguin, Robert Smith, Chun-Hung Yeh, Tengda Zhao, Lauren J. O'Donnell

Abstract: Diffusion magnetic resonance imaging (dMRI) tractography is an advanced imaging technique that enables in vivo map** of the brain's white matter connections at macro scale. Over the last two decades, the study of brain connectivity using dMRI tractography has played a prominent role in the neuroimaging research landscape. In this paper, we provide a high-level overview of how tractography is use… ▽ More Diffusion magnetic resonance imaging (dMRI) tractography is an advanced imaging technique that enables in vivo map** of the brain's white matter connections at macro scale. Over the last two decades, the study of brain connectivity using dMRI tractography has played a prominent role in the neuroimaging research landscape. In this paper, we provide a high-level overview of how tractography is used to enable quantitative analysis of the brain's structural connectivity in health and disease. We first provide a review of methodology involved in three main processing steps that are common across most approaches for quantitative analysis of tractography, including methods for tractography correction, segmentation and quantification. For each step, we aim to describe methodological choices, their popularity, and potential pros and cons. We then review studies that have used quantitative tractography approaches to study the brain's white matter, focusing on applications in neurodevelopment, aging, neurological disorders, mental disorders, and neurosurgery. We conclude that, while there have been considerable advancements in methodological technologies and breadth of applications, there nevertheless remains no consensus about the "best" methodology in quantitative analysis of tractography, and researchers should remain cautious when interpreting results in research and clinical applications. △ Less

Submitted 23 April, 2021; originally announced April 2021.

arXiv:2012.10239 [pdf]

doi 10.1063/5.0041901

Computational interference microscopy enabled by deep learning

Authors: Yuheng Jiao, Yuchen R. He, Mikhail E. Kandel, Xiaojun Liu, Wenlong Lu, Gabriel Popescu

Abstract: Quantitative phase imaging (QPI) has been widely applied in characterizing cells and tissues. Spatial light interference microscopy (SLIM) is a highly sensitive QPI method, due to its partially coherent illumination and common path interferometry geometry. However, its acquisition rate is limited because of the four-frame phase-shifting scheme. On the other hand, off-axis methods like diffraction… ▽ More Quantitative phase imaging (QPI) has been widely applied in characterizing cells and tissues. Spatial light interference microscopy (SLIM) is a highly sensitive QPI method, due to its partially coherent illumination and common path interferometry geometry. However, its acquisition rate is limited because of the four-frame phase-shifting scheme. On the other hand, off-axis methods like diffraction phase microscopy (DPM), allows for single-shot QPI. However, the laser-based DPM system is plagued by spatial noise due to speckles and multiple reflections. In a parallel development, deep learning was proven valuable in the field of bioimaging, especially due to its ability to translate one form of contrast into another. Here, we propose using deep learning to produce synthetic, SLIM-quality, high-sensitivity phase maps from DPM, single-shot images as input. We used an inverted microscope with its two ports connected to the DPM and SLIM modules, such that we have access to the two types of images on the same field of view. We constructed a deep learning model based on U-net and trained on over 1,000 pairs of DPM and SLIM images. The model learned to remove the speckles in laser DPM and overcame the background phase noise in both the test set and new data. Furthermore, we implemented the neural network inference into the live acquisition software, which now allows a DPM user to observe in real-time an extremely low-noise phase image. We demonstrated this principle of computational interference microscopy (CIM) imaging using blood smears, as they contain both erythrocytes and leukocytes, in static and dynamic conditions. △ Less

Submitted 17 December, 2020; originally announced December 2020.

arXiv:2008.02241 [pdf]

Ontology-based annotation and analysis of COVID-19 phenotypes

Authors: Yang Wang, Fengwei Zhang, Hong Yu, Xianwei Ye, Yongqun He

Abstract: The epidemic of COVID-19 has caused an unpredictable and devastated disaster to the public health in different territories around the world. Common phenotypes include fever, cough, shortness of breath, and chills. With more cases investigated, other clinical phenotypes are gradually recognized, for example, loss of smell, and loss of tastes. Compared with discharged or cured patients, severe or di… ▽ More The epidemic of COVID-19 has caused an unpredictable and devastated disaster to the public health in different territories around the world. Common phenotypes include fever, cough, shortness of breath, and chills. With more cases investigated, other clinical phenotypes are gradually recognized, for example, loss of smell, and loss of tastes. Compared with discharged or cured patients, severe or died patients often have one or more comorbidities, such as hypertension, diabetes, and cardiovascular disease. In this study, we systematically collected and analyzed COVID-19-related clinical phenotypes from 70 articles. The commonly occurring 17 phenotypes were classified into different groups based on the Human Phenotype Ontology (HPO). Based on the HP classification, we systematically analyze three nervous phenotypes (loss of smell, loss of taste, and headache) and four abdominal phenotypes (nausea, vomiting, abdominal pain, and diarrhea) identified in patients, and found that patients from Europe and USA turned to have higher nervous phenotypes and abdominal phenotypes than patients from Asia. A total of 23 comorbidities were found to commonly exist among COVID-19 patients. Patients with these comorbidities such as diabetes and kidney failure had worse outcomes compared with those without these comorbidities. △ Less

Submitted 5 August, 2020; originally announced August 2020.

arXiv:2006.00639 [pdf]

Ontology-based systematic classification and analysis of coronaviruses, hosts, and host-coronavirus interactions towards deep understanding of COVID-19

Authors: Hong Yu, Li Li, Hsin-hui Huang, Yang Wang, Yingtong Liu, Edison Ong, Anthony Huffman, Tao Zeng, **gsong Zhang, Pengpai Li, Zhi** Liu, Xiangyan Zhang, Xianwei Ye, Samuel K. Handelman, Gerry Higgins, Gilbert S. Omenn, Brian Athey, Junguk Hur, Luonan Chen, Yongqun He

Abstract: Given the existing COVID-19 pandemic worldwide, it is critical to systematically study the interactions between hosts and coronaviruses including SARS-Cov, MERS-Cov, and SARS-CoV-2 (cause of COVID-19). We first created four host-pathogen interaction (HPI)-Outcome postulates, and generated a HPI-Outcome model as the basis for understanding host-coronavirus interactions (HCI) and their relations wit… ▽ More Given the existing COVID-19 pandemic worldwide, it is critical to systematically study the interactions between hosts and coronaviruses including SARS-Cov, MERS-Cov, and SARS-CoV-2 (cause of COVID-19). We first created four host-pathogen interaction (HPI)-Outcome postulates, and generated a HPI-Outcome model as the basis for understanding host-coronavirus interactions (HCI) and their relations with the disease outcomes. We hypothesized that ontology can be used as an integrative platform to classify and analyze HCI and disease outcomes. Accordingly, we annotated and categorized different coronaviruses, hosts, and phenotypes using ontologies and identified their relations. Various COVID-19 phenotypes are hypothesized to be caused by the backend HCI mechanisms. To further identify the causal HCI-outcome relations, we collected 35 experimentally-verified HCI protein-protein interactions (PPIs), and applied literature mining to identify additional host PPIs in response to coronavirus infections. The results were formulated in a logical ontology representation for integrative HCI-outcome understanding. Using known PPIs as baits, we also developed and applied a domain-inferred prediction method to predict new PPIs and identified their pathological targets on multiple organs. Overall, our proposed ontology-based integrative framework combined with computational predictions can be used to support fundamental understanding of the intricate interactions between human patients and coronaviruses (including SARS-CoV-2) and their association with various disease outcomes. △ Less

Submitted 31 May, 2020; originally announced June 2020.

Comments: 32 pages, 1 table, 6 figures

arXiv:2003.00125 [pdf]

doi 10.1063/5.0004723

Label-free colorectal cancer screening using deep learning and spatial light interference microscopy (SLIM)

Authors: **gfang "Kelly" Zhang, Yuchen R. He, Nahil Sobh, Gabriel Popescu

Abstract: Current pathology workflow involves staining of thin tissue slices, which otherwise would be transparent, followed by manual investigation under the microscope by a trained pathologist. While the hematoxylin and eosin (H&E) stain is well-established and a cost-effective method for visualizing histology slides, its color variability across preparations and subjectivity across clinicians remain unad… ▽ More Current pathology workflow involves staining of thin tissue slices, which otherwise would be transparent, followed by manual investigation under the microscope by a trained pathologist. While the hematoxylin and eosin (H&E) stain is well-established and a cost-effective method for visualizing histology slides, its color variability across preparations and subjectivity across clinicians remain unaddressed challenges. To mitigate these challenges, recently we have demonstrated that spatial light interference microscopy (SLIM) can provide a path to intrinsic, objective markers, that are independent of preparation and human bias. Additionally, the sensitivity of SLIM to collagen fibers yields information relevant to patient outcome, which is not available in H&E. Here, we show that deep learning and SLIM can form a powerful combination for screening applications: training on 1,660 SLIM images of colon glands and validating on 144 glands, we obtained a benign vs. cancer classification accuracy of 99%. We envision that the SLIM whole slide scanner presented here paired with artificial intelligence algorithms may prove valuable as a pre-screening method, economizing the clinician's time and effort. △ Less

Submitted 28 February, 2020; originally announced March 2020.

Comments: 17 pages, 6 figures

Journal ref: APL Photonics 5, 040805 (2020)

arXiv:1912.07434 [pdf, other]

doi 10.1007/s10237-019-01155-z

Gradient-enhanced continuum models of healing in damaged soft tissues

Authors: Yiqian He, Di Zuo, Klaus Hackl, Haitian Yang, S. Jamaleddin Mousavi, Stéphane Avril

Abstract: Healing of soft biological tissue is the process of self-recovering or self-repairing the injured or damaged extracellular matrix (ECM). Healing is assumed to be stress-driven, with the objective of returning to a homeostatic stress metrics in the tissue after replacing the damaged ECM with new undamaged one. However, based on the existence of intrinsic length-scales in soft tissues, it is thought… ▽ More Healing of soft biological tissue is the process of self-recovering or self-repairing the injured or damaged extracellular matrix (ECM). Healing is assumed to be stress-driven, with the objective of returning to a homeostatic stress metrics in the tissue after replacing the damaged ECM with new undamaged one. However, based on the existence of intrinsic length-scales in soft tissues, it is thought that computational models of healing should be non-local. In the present study, we introduce for the first time two gradient-enhanced con-stitutive healing models for soft tissues including non-local variables. The first model combines a continuum damage model with a temporally homogenized growth model, where the growth direction is determined according to local principal stress directions. The second one is based on a gradient-enhanced healing model with continuously recoverable damage variable. Both models are implemented in the finite-element package Abaqus by means of a user sub-routine UEL. Three two-dimensional situations simulating the healing process of soft tissues are modeled numerically with both models, and their application for simulation of balloon angioplasty is provided by illustrating the change of damage field and geometry in the media layer throughout the healing process. △ Less

Submitted 16 December, 2019; originally announced December 2019.

Journal ref: Biomechanics and Modeling in Mechanobiology, Springer Verlag, 2019, 18 (5), pp.1443-1460

arXiv:1909.10405 [pdf, other]

doi 10.1103/PhysRevE.103.052409

Dynamics of genetic code evolution: The emergence of universality

Authors: John-Antonio Argyriadis, Yang-Hui He, Vishnu Jejjala, Djordje Minic

Abstract: We study the dynamics of genetic code evolution. The model of Vetsigian et al. [1] and Vetsigian [2] uses the mechanism of horizontal gene transfer to demonstrate convergence of the genetic code to a near universal solution. We reproduce and analyze the algorithm as a dynamical system. All the parameters used in the model are varied to assess their impact on convergence and optimality score. We sh… ▽ More We study the dynamics of genetic code evolution. The model of Vetsigian et al. [1] and Vetsigian [2] uses the mechanism of horizontal gene transfer to demonstrate convergence of the genetic code to a near universal solution. We reproduce and analyze the algorithm as a dynamical system. All the parameters used in the model are varied to assess their impact on convergence and optimality score. We show that by allowing specific parameters to vary with time, the solution exhibits attractor dynamics. Finally, we study automorphisms of the genetic code arising due to this model. We use this to examine the scaling of the solutions in order to re-examine universality and find that there is a direct link to mutation rate. △ Less

Submitted 27 November, 2020; v1 submitted 23 September, 2019; originally announced September 2019.

Comments: 38 pages

Journal ref: Phys. Rev. E 103, 052409 (2021)

arXiv:1901.05051 [pdf, other]

doi 10.1371/journal.pone.0250227

Machine-learning a virus assembly fitness landscape

Authors: Pierre-Philippe Dechant, Yang-Hui He

Abstract: Realistic evolutionary fitness landscapes are notoriously difficult to construct. A recent cutting-edge model of virus assembly consists of a dodecahedral capsid with $12$ corresponding packaging signals in three affinity bands. This whole genome/phenotype space consisting of $3^{12}$ genomes has been explored via computationally expensive stochastic assembly models, giving a fitness landscape in… ▽ More Realistic evolutionary fitness landscapes are notoriously difficult to construct. A recent cutting-edge model of virus assembly consists of a dodecahedral capsid with $12$ corresponding packaging signals in three affinity bands. This whole genome/phenotype space consisting of $3^{12}$ genomes has been explored via computationally expensive stochastic assembly models, giving a fitness landscape in terms of the assembly efficiency. Using latest machine-learning techniques by establishing a neural network, we show that the intensive computation can be short-circuited in a matter of minutes to astounding accuracy. △ Less

Submitted 13 January, 2019; originally announced January 2019.

Comments: 13 pages, 4 figures

MSC Class: 68Txx; 97R40; 92B20; 92Bxx; 82Dxx; 82D80

Journal ref: PLoS ONE 16(5): e0250227 (2021)

arXiv:1811.07143 [pdf, other]

High Quality Prediction of Protein Q8 Secondary Structure by Diverse Neural Network Architectures

Authors: Iddo Drori, Isht Dwivedi, Pranav Shrestha, Jeffrey Wan, Yueqi Wang, Yunchu He, Anthony Mazza, Hugh Krogh-Freeman, Dimitri Leggas, Kendal Sandridge, Linyong Nan, Kaveri Thakoor, Chinmay Joshi, Sonam Goenka, Chen Keasar, Itsik Pe'er

Abstract: We tackle the problem of protein secondary structure prediction using a common task framework. This lead to the introduction of multiple ideas for neural architectures based on state of the art building blocks, used in this task for the first time. We take a principled machine learning approach, which provides genuine, unbiased performance measures, correcting longstanding errors in the applicatio… ▽ More We tackle the problem of protein secondary structure prediction using a common task framework. This lead to the introduction of multiple ideas for neural architectures based on state of the art building blocks, used in this task for the first time. We take a principled machine learning approach, which provides genuine, unbiased performance measures, correcting longstanding errors in the application domain. We focus on the Q8 resolution of secondary structure, an active area for continuously improving methods. We use an ensemble of strong predictors to achieve accuracy of 70.7% (on the CB513 test set using the CB6133filtered training set). These results are statistically indistinguishable from those of the top existing predictors. In the spirit of reproducible research we make our data, models and code available, aiming to set a gold standard for purity of training and testing sets. Such good practices lower entry barriers to this domain and facilitate reproducible, extendable research. △ Less

Submitted 17 November, 2018; originally announced November 2018.

Comments: NIPS 2018 Workshop on Machine Learning for Molecules and Materials, 10 pages

arXiv:1701.05567 [pdf]

doi 10.1038/nmeth.4405

Convolutional Neural Networks for Automated Annotation of Cellular Cryo-Electron Tomograms

Authors: Muyuan Chen, Wei Dai, Ying Sun, Darius Jonasch, Cynthia Y He, Michael F. Schmid, Wah Chiu, Steven J Ludtke

Abstract: Cellular Electron Cryotomography (CryoET) offers the ability to look inside cells and observe macromolecules frozen in action. A primary challenge for this technique is identifying and extracting the molecular components within the crowded cellular environment. We introduce a method using neural networks to dramatically reduce the time and human effort required for subcellular annotation and featu… ▽ More Cellular Electron Cryotomography (CryoET) offers the ability to look inside cells and observe macromolecules frozen in action. A primary challenge for this technique is identifying and extracting the molecular components within the crowded cellular environment. We introduce a method using neural networks to dramatically reduce the time and human effort required for subcellular annotation and feature extraction. Subsequent subtomogram classification and averaging yields in-situ structures of molecular components of interest. △ Less

Submitted 11 June, 2017; v1 submitted 19 January, 2017; originally announced January 2017.

Comments: 21 pages, 8 figures

Journal ref: Nature Methods volume 14, 983-985 (2017)

arXiv:1611.08310 [pdf]

White matter deficits underlie the loss of consciousness level and predict recovery outcome in disorders of consciousness

Authors: Xuehai Wu, Jiaying Zhang, Zaixu Cui, Weijun Tang, Chunhong Shao, ** Hu, Jianhong Zhu, Liangfu Zhou, Yao Zhao, Lu Lu, Gang Chen, Georg Northoff, Gaolang Gong, Ying Mao, Yong He

Abstract: This study aimed to identify white matter (WM) deficits underlying the loss of consciousness in disorder of consciousness (DOC) patients using Diffusion Tensor Imaging (DTI) and to demonstrate the potential value of DTI parameters in predicting recovery outcomes of DOC patients. With 30 DOC patients (8 comatose, 8 unresponsive wakefulness syndrome/vegetative state, and 14 minimal conscious state)… ▽ More This study aimed to identify white matter (WM) deficits underlying the loss of consciousness in disorder of consciousness (DOC) patients using Diffusion Tensor Imaging (DTI) and to demonstrate the potential value of DTI parameters in predicting recovery outcomes of DOC patients. With 30 DOC patients (8 comatose, 8 unresponsive wakefulness syndrome/vegetative state, and 14 minimal conscious state) and 25 patient controls, we performed group comparison of DTI parameters across 48 core WM regions of interest (ROIs) using Analysis of Covariance. Compared with controls, DOC patients had decreased Fractional anisotropy (FA) and increased diffusivities in widespread WM area.The corresponding DTI parameters of those WM deficits in DOC patients significantly correlated with the consciousness level evaluated by Coma Recovery Scale Revised (CRS-R) and Glasgow Coma Scale (GCS). As for predicting the recovery outcomes (i.e., regaining consciousness or not, grouped by their Glasgow Outcome Scale more than 2 or not) at 3 months post scan, radial diffusivity of left superior cerebellar peduncle and FA of right sagittal stratum reached an accuracy of 87.5% and 75% respectively. Our findings showed multiple WM deficits underlying the loss of consciousness level, and demonstrated the potential value of these WM areas in predicting the recovery outcomes of DOC patients who have lost awareness of the environment and themselves. △ Less

Submitted 24 November, 2016; originally announced November 2016.

arXiv:1610.06945 [pdf]

Decreased aneurysmal subarachnoid hemorrhage incidence rate in elderly population than in middle aged population: a retrospective analysis of 8,144 cases in Mainland China

Authors: Yi Xiang J Wang, Lihong Zhang, Lin Zhao, Jian He, Xian-Jun Zeng, Heng Liu, Yun-jun Yang, Shang-Wei Ding, Zhong-Fei Xu, Yong-Min He, Lin Yang, Lan Sun, Ke-jie Mu, Bai-Song Wang, Xiao-Hong Xu, Zhong-You Ji, Jian-hua Liu, **-Zhou Fang, Rui Hou, Feng Fan, Guang Ming Peng, Sheng-Hong Ju

Abstract: Purpose: Rupture of an intracranial aneurysm is the most common cause of subarachnoid haemorrhage (SAH), which is a life-threatening acute cerebrovascular event that typically affects working-age people. This study aims to investigate the aneurysmal SAH incidence rate in elderly population than in middle aged population in China. Materials and methods: Aneurysmal SAH cases were collected retrospec… ▽ More Purpose: Rupture of an intracranial aneurysm is the most common cause of subarachnoid haemorrhage (SAH), which is a life-threatening acute cerebrovascular event that typically affects working-age people. This study aims to investigate the aneurysmal SAH incidence rate in elderly population than in middle aged population in China. Materials and methods: Aneurysmal SAH cases were collected retrospectively from the archives of 21 hospitals in Mainland China. All the cases collected were from September 2016 and backward consecutively for a period of time up to 8 years. SAH was initially diagnosed by brain computed tomography, and CT angiography (CTA) or digital subtraction angiography (DSA) was followed and SAH was confirmed to be due to cerebral aneurysm. When for cases multiple bleeding occurred, the age of the first SAH was used in this study. The toltal incidence from all hospital at each age were summed together for females and males; then adjusted by the total population number at each age for females and males. The total population data was from the 2010 population census of the People's Republic of China. Results: In total there were 8,144 cases, with 4,861 females and 3,283 males. Our analysis shows for both females and males the relative aneurysmal SAH rate started to decrease after around 65 years old. The males the relative aneurysmal SAH rate might have started to decrease after around 55 years old. Conclusion: In contrast to previous reports, our data demonstrated a decreased aneurysmal subarachnoid hemorrhage incidence rate in elderly population than in middle aged population. Our data therefore support the hypothesis that aneurysms do not grow progressively once they form but probably either rupture or stabilize and that very elderly patients are at a reduced risk of rupture compared with atients who are younger with the same-sized aneurysms. △ Less

Submitted 19 October, 2016; originally announced October 2016.

Comments: Total 16 pages, 3 figures

arXiv:1609.03372 [pdf]

Differentially Categorized Structural Connectome Hubs are Involved in Differential Microstructural Basis and Functional Implications and Contribute to Individual Identification

Authors: Xindi Wang, Qixiang Lin, Mingrui Xia, Yong He

Abstract: Human brain structural networks contain sets of centrally embedded hub regions that enable efficient information communication. However, it remains largely unknown about categories of structural brain hubs and their microstructural, functional and cognitive characteristics as well as contributions to individual identification. Here, we employed three multi-modal imaging data sets with structural M… ▽ More Human brain structural networks contain sets of centrally embedded hub regions that enable efficient information communication. However, it remains largely unknown about categories of structural brain hubs and their microstructural, functional and cognitive characteristics as well as contributions to individual identification. Here, we employed three multi-modal imaging data sets with structural MRI, diffusion MRI and resting-state functional MRI to construct individual structural brain networks, identify brain hubs based on eight commonly used graph-nodal metrics, and perform comprehensive validation analysis. We found three categories of structural hubs in the brain networks, namely, aggregated, distributed and connector hubs. Spatially, these distinct categories of hubs were primarily located in the default-mode system and additionally in the visual and limbic systems for aggregated hubs, in the frontoparietal system for distributed hubs, and in the sensorimotor and ventral attention systems for connector hubs. Importantly, these three categories of hubs exhibited various distinct characteristics, with the highest level of microstructural organization in the aggregated hubs, the largest wiring cost and topological vulnerability in the distributed hubs, and the highest functional associations and cognitive flexibility in the connector hubs, although they behaved better regarding these characteristics compared to non-hubs. Finally, all three categories of hub indices displayed high across-session spatial similarities and acted as a structural fingerprint with high predictive rates (100%, 100% and 84.2%) for individual identification. Collectively, our findings highlighted three categories of brain hubs with differential microstructural, functional and cognitive associations, which may shed light on the topological mechanisms of the human connectome. △ Less

Submitted 12 September, 2016; originally announced September 2016.

Comments: 32 text pages, 6 figures, 1 table (Supplementary Information: 20 text pages, 9 figures, 1 table)

arXiv:1511.06427 [pdf, other]

doi 10.1016/j.neuroimage.2017.08.044

Fluctuations between high- and low-modularity topology in time-resolved functional connectivity

Authors: Makoto Fukushima, Richard F. Betzel, Ye He, Marcel A. de Reus, Martijn P. van den Heuvel, Xi-Nian Zuo, Olaf Sporns

Abstract: Modularity is an important topological attribute for functional brain networks. Recent studies have reported that modularity of functional networks varies not only across individuals being related to demographics and cognitive performance, but also within individuals co-occurring with fluctuations in network properties of functional connectivity, estimated over short time intervals. However, chara… ▽ More Modularity is an important topological attribute for functional brain networks. Recent studies have reported that modularity of functional networks varies not only across individuals being related to demographics and cognitive performance, but also within individuals co-occurring with fluctuations in network properties of functional connectivity, estimated over short time intervals. However, characteristics of these time-resolved functional networks during periods of high and low modularity have remained largely unexplored. In this study we investigate spatiotemporal properties of time-resolved networks in the high and low modularity periods during rest, with a particular focus on their spatial connectivity patterns, temporal homogeneity and test-retest reliability. We show that spatial connectivity patterns of time-resolved networks in the high and low modularity periods are represented by increased and decreased dissociation of the default mode network module from task-positive network modules, respectively. We also find that the instances of time-resolved functional connectivity sampled from within the high (low) modularity period are relatively homogeneous (heterogeneous) over time, indicating that during the low modularity period the default mode network interacts with other networks in a variable manner. We confirmed that the occurrence of the high and low modularity periods varies across individuals with moderate inter-session test-retest reliability and that it is correlated with previously-reported individual differences in the modularity of functional connectivity estimated over longer timescales. Our findings illustrate how time-resolved functional networks are spatiotemporally organized during periods of high and low modularity, allowing one to trace individual differences in long-timescale modularity to the variable occurrence of network configurations at shorter timescales. △ Less

Submitted 22 August, 2017; v1 submitted 19 November, 2015; originally announced November 2015.

Comments: Reorganized the paper; to appear in NeuroImage; arXiv abstract shortened to fit within character limits

Journal ref: NeuroImage, vol. 180, pp. 406-416, 2018

arXiv:1511.06352 [pdf, other]

doi 10.1016/j.neuroimage.2015.12.001

Dynamic fluctuations coincide with periods of high and low modularity in resting-state functional brain networks

Authors: Richard F. Betzel, Makoto Fukushima, Ye He, Xi-Nian Zuo, Olaf Sporns

Abstract: We investigate the relationship of resting-state fMRI functional connectivity estimated over long periods of time with time-varying functional connectivity estimated over shorter time intervals. We show that using Pearson's correlation to estimate functional connectivity implies that the range of fluctuations of functional connections over short time scales is subject to statistical constraints im… ▽ More We investigate the relationship of resting-state fMRI functional connectivity estimated over long periods of time with time-varying functional connectivity estimated over shorter time intervals. We show that using Pearson's correlation to estimate functional connectivity implies that the range of fluctuations of functional connections over short time scales is subject to statistical constraints imposed by their connectivity strength over longer scales. We present a method for estimating time-varying functional connectivity that is designed to mitigate this issue and allows us to identify episodes where functional connections are unexpectedly strong or weak. We apply this method to data recorded from $N=80$ participants, and show that the number of unexpectedly strong/weak connections fluctuates over time, and that these variations coincide with intermittent periods of high and low modularity in time-varying functional connectivity. We also find that during periods of relative quiescence regions associated with default mode network tend to join communities with attentional, control, and primary sensory systems. In contrast, during periods where many connections are unexpectedly strong/weak, default mode regions dissociate and form distinct modules. Finally, we go on to show that, while all functional connections can at times manifest stronger (more positively correlated) or weaker (more negatively correlated) than expected, a small number of connections, mostly within the visual and somatomotor networks, do so a disproportional number of times. Our statistical approach allows the detection of functional connections that fluctuate more or less than expected based on their long-time averages and may be of use in future studies characterizing the spatio-temporal patterns of time-varying functional connectivity △ Less

Submitted 19 November, 2015; originally announced November 2015.

Comments: 47 Pages, 8 Figures, 4 Supplementary Figures

arXiv:1510.08045 [pdf, other]

Functional brain modules reconfigure at multiple scales across the human lifespan

Authors: Richard F. Betzel, Bratislav Mišić, Ye He, Jeffrey Rumschlag, Xi-Nian Zuo, Olaf Sporns

Abstract: The human brain is a complex network of interconnected brain regions organized into functional modules with distinct roles in cognition and behavior. An important question concerns the persistence and stability of these modules over the human lifespan. Here we use graph-theoretic analysis to algorithmically uncover the brain's intrinsic modular organization across multiple spatial scales ranging f… ▽ More The human brain is a complex network of interconnected brain regions organized into functional modules with distinct roles in cognition and behavior. An important question concerns the persistence and stability of these modules over the human lifespan. Here we use graph-theoretic analysis to algorithmically uncover the brain's intrinsic modular organization across multiple spatial scales ranging from small communities comprised of only a few brain regions to large communities made up of many regions. We find that at coarse scales modules become progressively more segregated, while at finer scales segregation decreases. Module composition also exhibits scale-specific and age-dependent changes. At coarse scales, the module assignments of regions normally associated with control, default mode, attention, and visual networks are highly flexible. At fine scales the most flexible regions are associated with the default mode network. Finally, we show that, with age, some regions in the default mode network, specifically retrosplenial cortex, maintain a greater proportion of functional connections to their own module, while regions associated with somatomotor and saliency/ventral attention networks distribute their links more evenly across modules. △ Less

Submitted 27 October, 2015; originally announced October 2015.

Comments: 56 pages, 7 figures, 6 supplemental figures

arXiv:1506.06795 [pdf, other]

doi 10.1016/j.neuroimage.2015.09.041

Generative models of the human connectome

Authors: Richard F. Betzel, Andrea Avena-Koenigsberger, Joaquín Goñi, Ye He, Marcel A. de Reus, Alessandra Griffa, Petra E. Vértes, Bratislav Mišić, Jean-Philippe Thiran, Patric Hagmann, Martijn van den Heuvel, Xi-Nian Zuo, Edward T. Bullmore, Olaf Sporns

Abstract: The human connectome represents a network map of the brain's wiring diagram and the pattern into which its connections are organized is thought to play an important role in cognitive function. The generative rules that shape the topology of the human connectome remain incompletely understood. Earlier work in model organisms has suggested that wiring rules based on geometric relationships (distance… ▽ More The human connectome represents a network map of the brain's wiring diagram and the pattern into which its connections are organized is thought to play an important role in cognitive function. The generative rules that shape the topology of the human connectome remain incompletely understood. Earlier work in model organisms has suggested that wiring rules based on geometric relationships (distance) can account for many but likely not all topological features. Here we systematically explore a family of generative models of the human connectome that yield synthetic networks designed according to different wiring rules combining geometric and a broad range of topological factors. We find that a combination of geometric constraints with a homophilic attachment mechanism can create synthetic networks that closely match many topological characteristics of individual human connectomes, including features that were not included in the optimization of the generative model itself. We use these models to investigate a lifespan dataset and show that, with age, the model parameters undergo progressive changes, suggesting a rebalancing of the generative factors underlying the connectome across the lifespan. △ Less

Submitted 19 September, 2015; v1 submitted 22 June, 2015; originally announced June 2015.

Comments: 38 pages, 5 figures + 19 supplemental figures, 1 table

arXiv:1404.7766 [pdf]

Genome-wide Scan of Archaic Hominin Introgressions in Eurasians Reveals Complex Admixture History

Authors: Ya Hu, Yi Wang, Qiliang Ding, Yungang He, Minxian Wang, Jiucun Wang, Shuhua Xu, Li **

Abstract: Introgressions from Neanderthals and Denisovans were detected in modern humans. Introgressions from other archaic hominins were also implicated, however, identification of which poses a great technical challenge. Here, we introduced an approach in identifying introgressions from all possible archaic hominins in Eurasian genomes, without referring to archaic hominin sequences. We focused on mutatio… ▽ More Introgressions from Neanderthals and Denisovans were detected in modern humans. Introgressions from other archaic hominins were also implicated, however, identification of which poses a great technical challenge. Here, we introduced an approach in identifying introgressions from all possible archaic hominins in Eurasian genomes, without referring to archaic hominin sequences. We focused on mutations emerged in archaic hominins after their divergence from modern humans (denoted as archaic-specific mutations), and identified introgressive segments which showed significant enrichment of archaic-specific mutations over the rest of the genome. Furthermore, boundaries of introgressions were identified using a dynamic programming approach to partition whole genome into segments which contained different levels of archaic-specific mutations. We found that detected introgressions shared more archaic-specific mutations with Altai Neanderthal than they shared with Denisovan, and 60.3% of archaic hominin introgressions were from Neanderthals. Furthermore, we detected more introgressions from two unknown archaic hominins whom diverged with modern humans approximately 859 and 3,464 thousand years ago. The latter unknown archaic hominin contributed to the genomes of the common ancestors of modern humans and Neanderthals. In total, archaic hominin introgressions comprised 2.4% of Eurasian genomes. Above results suggested a complex admixture history among hominins. The proposed approach could also facilitate admixture research across species. △ Less

Submitted 30 April, 2014; originally announced April 2014.

Comments: 42 Pages, 1 Table, 4 Figures, 1 Supplementary Table, and 10 Supplementary Figures

arXiv:1311.3355 [pdf]

HINO: a BFO-aligned ontology representing human molecular interactions and pathways

Authors: Yongqun He, Zoushuang Xiang

Abstract: Many database resources, such as Reactome, collect manually annotated reactions, interactions, and pathways from peer-reviewed publications. The interactors (e.g., a protein), interactions, and pathways in these data resources are often represented as instances in using BioPAX, a standard pathway data exchange format. However, these interactions are better represented as classes (or universals) si… ▽ More Many database resources, such as Reactome, collect manually annotated reactions, interactions, and pathways from peer-reviewed publications. The interactors (e.g., a protein), interactions, and pathways in these data resources are often represented as instances in using BioPAX, a standard pathway data exchange format. However, these interactions are better represented as classes (or universals) since they always occur given appropriate conditions. This study aims to represent various human interaction pathways and networks as classes via a formal ontology aligned with the Basic Formal Ontology (BFO). Towards this goal, the Human Interaction Network Ontology (HINO) was generated by extending the BFO-aligned Interaction Network Ontology (INO). All human pathways and associated processes and interactors listed in Reactome and represented in BioPAX were first converted to ontology classes by aligning them under INO. Related terms and associated relations and hierarchies from external ontologies (e.g., CHEBI and GO) were also retrieved and imported into HINO. HINO ontology terms were resolved in the linked ontology data server Ontobee. The RDF triples stored in the RDF triple store are queryable through a SPARQL program. Such an ontology system supports advanced pathway data integration and applications. △ Less

Submitted 13 November, 2013; originally announced November 2013.

Comments: 7 pages, 5 figures

arXiv:1310.3897 [pdf]

doi 10.1371/journal.pone.0105691

Y Chromosomes of 40% Chinese Are Descendants of Three Neolithic Super-grandfathers

Authors: Shi Yan, Chuan-Chao Wang, Hong-Xiang Zheng, Wei Wang, Zhen-Dong Qin, Lan-Hai Wei, Yi Wang, Xue-Dong Pan, Wen-Qing Fu, Yun-Gang He, Li-Jun Xiong, Wen-Fei **, Shi-Lin Li, Yu An, Hui Li, Li **

Abstract: Demographic change of human populations is one of the central questions for delving into the past of human beings. To identify major population expansions related to male lineages, we sequenced 78 East Asian Y chromosomes at 3.9 Mbp of the non-recombining region (NRY), discovered >4,000 new SNPs, and identified many new clades. The relative divergence dates can be estimated much more precisely usi… ▽ More Demographic change of human populations is one of the central questions for delving into the past of human beings. To identify major population expansions related to male lineages, we sequenced 78 East Asian Y chromosomes at 3.9 Mbp of the non-recombining region (NRY), discovered >4,000 new SNPs, and identified many new clades. The relative divergence dates can be estimated much more precisely using molecular clock. We found that all the Paleolithic divergences were binary; however, three strong star-like Neolithic expansions at ~6 kya (thousand years ago) (assuming a constant substitution rate of 1e-9/bp/year) indicates that ~40% of modern Chinese are patrilineal descendants of only three super-grandfathers at that time. This observation suggests that the main patrilineal expansion in China occurred in the Neolithic Era and might be related to the development of agriculture. △ Less

Submitted 14 October, 2013; originally announced October 2013.

Comments: 29 pages of article text including 1 article figure, 9 pages of SI text, and 2 SI figures. 5 SI tables are in a separate ancillary file

Journal ref: Plos ONE 9(8): e105691 (2014)

arXiv:1304.5031 [pdf]

Tumor can originate from not only rare cancer stem cells

Authors: Min Hu, Yu-Fei He

Abstract: Tumors are believed to consist of a heterogeneous population of tumor cells originating from rare cancer stem cells (CSCs). However, emerging evidences show that tumor may also originate from non-CSCs. Here, we give evidences supporting that the number of tumorigenic tumor cells is higher than the number of CSCs and tumor can also derive from non-CSCs. First, we applied an idealized mathematical m… ▽ More Tumors are believed to consist of a heterogeneous population of tumor cells originating from rare cancer stem cells (CSCs). However, emerging evidences show that tumor may also originate from non-CSCs. Here, we give evidences supporting that the number of tumorigenic tumor cells is higher than the number of CSCs and tumor can also derive from non-CSCs. First, we applied an idealized mathematical model and theoretically calculated that non-CSCs could initiate tumor if their proliferation potential was adequate. Next, we demonstrated by experimental studies that 17.7%, 38.6% and 5.2% of tumor cells in murine B16 solid melanoma, H22 hepatoma and Lewis lung carcinoma, respectively, were potentially tumorigenic. We propose that the rare CSCs, if exist, are not the only origination of a tumor. △ Less

Submitted 18 April, 2013; originally announced April 2013.

Comments: This work was finished 4 years ago and is still valuable nowadays

arXiv:1105.4425 [pdf]

Functional State Dependence of Picosecond Protein Dynamics

Authors: J. Y. Chen, D. K. George, Yunfen He, J. R. Knab, A. G. Markelz

Abstract: We examine temperature dependent picosecond dynamics as a function of structure and function for lysozyme and cytochrome c using temperature dependent terahertz permittivity measurements. A double Arrhenius temperature dependence with activation energies E1 ~ 0.1 kJ/mol and E2 ~10 kJ/mol fits the native state response. The higher activation energy is consistent with the so-called protein dynamical… ▽ More We examine temperature dependent picosecond dynamics as a function of structure and function for lysozyme and cytochrome c using temperature dependent terahertz permittivity measurements. A double Arrhenius temperature dependence with activation energies E1 ~ 0.1 kJ/mol and E2 ~10 kJ/mol fits the native state response. The higher activation energy is consistent with the so-called protein dynamical transition associated with beta relaxations at the solvent-protein interface. The lower activation energy is consistent with correlated structural motions. When the structure is removed by denaturing the lower activation energy process is no longer present. Additionally the lower activation energy process is diminished with ligand binding, but not for changes in internal oxidation state. We suggest that the lower energy activation process is associated with collective structural motions that are no longer accessible with denaturing or binding. △ Less

Submitted 23 May, 2011; originally announced May 2011.

Comments: 4 main pages, 3 figures, 1 table, 2 supplemental tables

arXiv:1006.1314 [pdf]

doi 10.1016/j.bpj.2010.12.3731

Evidence Of Protein Collective Motions On The Picosecond Time Scale

Authors: Yunfen He, **g-Yin Chen, Joseph R. Knab, Wenjun Zheng, Andrea G. Markelz

Abstract: We investigate the presence of structural collective motions on a picosecond time scale for the heme protein, cytochrome c, as a function of oxidation and hydration, using terahertz (THz) time-domain spectroscopy and molecular dynamics simulations. The THz response dramatically increases with oxidation, with the largest increase for lowest hydrations and highest frequencies. For both oxidation sta… ▽ More We investigate the presence of structural collective motions on a picosecond time scale for the heme protein, cytochrome c, as a function of oxidation and hydration, using terahertz (THz) time-domain spectroscopy and molecular dynamics simulations. The THz response dramatically increases with oxidation, with the largest increase for lowest hydrations and highest frequencies. For both oxidation states the THz response rapidly increases with hydration saturating above ~25% (g H2O/g protein). Quasi-harmonic vibrational modes and dipole-dipole correlation functions are calculated from molecular dynamics trajectories. The collective mode density of states alone reproduces the measured hydration dependence providing strong evidence of the existence of these motions. The large oxidation dependence is reproduced only by the dipole-dipole correlation function, indicating the contrast arises from diffusive motions consistent with structural changes occurring in the vicinity of a buried internal water molecule. △ Less

Submitted 7 June, 2010; originally announced June 2010.

Showing 1–34 of 34 results for author: He, Y