-
A diffusion MRI tractography atlas for concurrent white matter mapping across Eastern and Western populations
Authors:
Yijie Li,
Wei Zhang,
Ye Wu,
Li Yin,
Ce Zhu,
Yuqian Chen,
Suheyla Cetin-Karayumak,
Kang Ik K Cho,
Leo R. Zekelman,
Jarrett Rushmore,
Yogesh Rathi,
Nikos Makris,
Lauren J. O'Donnell,
Fan Zhang
Abstract:
The study of brain differences across Eastern and Western populations provides vital insights for understanding potential cultural and genetic influences on cognition and mental health. Diffusion MRI (dMRI) tractography is an important tool in assessing white matter (WM) connectivity and brain tissue microstructure across different populations. However, a comprehensive investigation into WM fiber tracts between Eastern and Western populations is hindered by the lack of a cross-population WM atlas and the large site-specific variability of dMRI data. This study presents a dMRI tractography atlas, namely the East-West WM Atlas, for concurrent WM mapping between Eastern and Western populations and creates a large, harmonized dMRI dataset (n=306) based on the Human Connectome Project and the Chinese Human Connectome Project. The curated WM atlas, as well as subject-specific data including the harmonized dMRI data, the whole brain tractography data, and parcellated WM fiber tracts and their diffusion measures, are publicly released. This resource is a valuable addition that facilitates the exploration of brain commonalities and differences across diverse cultural backgrounds.
Submitted 6 April, 2024;
originally announced April 2024.
-
Deep learning radiomics for assessment of gastroesophageal varices in people with compensated advanced chronic liver disease
Authors:
Lan Wang,
Ruiling He,
Lili Zhao,
Jia Wang,
Zhengzi Geng,
Tao Ren,
Guo Zhang,
Peng Zhang,
Kaiqiang Tang,
Chaofei Gao,
Fei Chen,
Liting Zhang,
Yonghe Zhou,
Xin Li,
Fanbin He,
Hui Huan,
Wenjuan Wang,
Yunxiao Liang,
Juan Tang,
Fang Ai,
Tingyu Wang,
Liyun Zheng,
Zhongwei Zhao,
Jiansong Ji,
Wei Liu
, et al. (22 additional authors not shown)
Abstract:
Objective: Bleeding from gastroesophageal varices (GEV) is a medical emergency associated with high mortality. We aim to construct an artificial intelligence-based model of two-dimensional shear wave elastography (2D-SWE) of the liver and spleen to precisely assess the risk of GEV and high-risk gastroesophageal varices (HRV).
Design: A prospective multicenter study was conducted in patients with compensated advanced chronic liver disease. 305 patients were enrolled from 12 hospitals, and finally 265 patients were included, with 1136 liver stiffness measurement (LSM) images and 1042 spleen stiffness measurement (SSM) images generated by 2D-SWE. We leveraged deep learning methods to uncover associations between image features and patient risk, and thus constructed models to predict GEV and HRV.
Results: A multi-modality Deep Learning Risk Prediction model (DLRP) was constructed to assess GEV and HRV, based on LSM and SSM images, and clinical information. Validation analysis revealed that the AUCs of DLRP were 0.91 for GEV (95% CI 0.90 to 0.93, p < 0.05) and 0.88 for HRV (95% CI 0.86 to 0.89, p < 0.01), which were significantly and robustly better than canonical risk indicators, including the value of LSM and SSM. Moreover, DLRP was better than the models using individual parameters, including LSM and SSM images. In HRV prediction, the 2D-SWE images of SSM outperformed those of LSM (p < 0.01).
Conclusion: DLRP shows excellent performance in predicting GEV and HRV over canonical risk indicators LSM and SSM. Additionally, the 2D-SWE images of SSM provided more information for better accuracy in predicting HRV than the LSM.
Submitted 12 June, 2023;
originally announced June 2023.
-
Estimation of genome size using k-mer frequencies from corrected long reads
Authors:
Hengchao Wang,
Bo Liu,
Yan Zhang,
Fan Jiang,
Yuwei Ren,
Lijuan Yin,
Hangwei Liu,
Sen Wang,
Wei Fan
Abstract:
Third-generation long-read sequencing technologies, such as PacBio and Nanopore, have great advantages over second-generation Illumina sequencing in de novo assembly studies. However, due to the inherent low base accuracy, third-generation sequencing data cannot be used for k-mer counting and estimating genomic profile based on k-mer frequencies. Thus, in current genome projects, second-generation data is also necessary for accurately determining genome size and other genomic characteristics. We show that corrected third-generation data can be used to count k-mer frequencies and estimate genome size reliably, in place of second-generation data. Therefore, future genome projects can depend on only one sequencing technology to finish both assembly and k-mer analysis, which will largely decrease sequencing cost in both time and money. Moreover, we present a fast, lightweight tool, kmerfreq, and use it to perform all the k-mer counting tasks in this work. We have demonstrated that corrected third-generation sequencing data can be used to estimate genome size and developed a new open-source C/C++ k-mer counting tool, kmerfreq, which is freely available at https://github.com/fanagislab/kmerfreq.
Submitted 26 March, 2020;
originally announced March 2020.
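The abstract above relies on k-mer frequencies to estimate genome size. As a rough illustration of the general idea only (not the kmerfreq implementation, whose internals are not described here), the textbook estimate divides the total number of counted k-mers by the depth at the main peak of the k-mer frequency histogram. The function names and the `min_depth` cutoff below are assumptions made for this sketch, and it skips reverse-complement canonicalization, which real tools usually apply.

```python
from collections import Counter


def count_kmers(reads, k):
    """Count every substring of length k across all reads."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def estimate_genome_size(counts, min_depth=2):
    """Estimate genome size as (total k-mers) / (peak k-mer depth).

    min_depth is a hypothetical cutoff that discards very rare k-mers,
    which in real data mostly come from sequencing errors.
    """
    # Histogram: depth -> number of distinct k-mers observed at that depth.
    histogram = Counter(counts.values())
    usable = {depth: n for depth, n in histogram.items() if depth >= min_depth}
    if not usable:
        raise ValueError("no k-mers at or above min_depth")
    # The most common depth approximates the sequencing depth of unique regions.
    peak_depth = max(usable, key=lambda depth: usable[depth])
    total_kmers = sum(depth * n for depth, n in usable.items())
    return total_kmers / peak_depth
```

On error-free reads that cover a repeat-free genome uniformly, this returns the number of distinct k-mers in the genome, which is close to the genome length when the genome is much longer than k.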
-
A framework to decipher the genetic architecture of combinations of complex diseases: applications in cardiovascular medicine
Authors:
Liangying Yin,
Carlos Kwan-long Chau,
Yu-Ping Lin,
Pak-Chung Sham,
Hon-Cheong So
Abstract:
Genome-wide association studies (GWAS) have proven to be highly useful in revealing the genetic basis of complex diseases. At present, most GWAS are studies of a particular single disease diagnosis against controls. However, in practice, an individual is often affected by more than one condition/disorder. For example, patients with coronary artery disease (CAD) are often comorbid with diabetes mellitus (DM). Along a similar line, it is often clinically meaningful to study patients with one disease but without a comorbidity. For example, obese DM may have different pathophysiology from non-obese DM.
Here we developed a statistical framework to uncover susceptibility variants for comorbid disorders (or a disorder without comorbidity), using GWAS summary statistics only. In essence, we mimicked a case-control GWAS in which the cases are affected with comorbidities or a disease without a relevant comorbid condition (in either case, we may consider the cases as those affected by a specific subtype of disease, as characterized by the presence or absence of comorbid conditions). We extended our methodology to deal with continuous traits with clinically meaningful categories (e.g. lipids). In addition, we illustrated how the analytic framework may be extended to more than two traits. We verified the feasibility and validity of our method by applying it to simulated scenarios and four cardiometabolic (CM) traits. We also analyzed the genes, pathways, and cell-types/tissues involved in CM disease subtypes. LD-score regression analysis revealed that some subtypes may indeed be biologically distinct with low genetic correlations. Further Mendelian randomization analysis found differential causal effects of different subtypes on relevant complications. We believe the findings are of both scientific and clinical value, and the proposed method may open a new avenue to analyzing GWAS data.
Submitted 29 December, 2020; v1 submitted 18 March, 2020;
originally announced March 2020.
-
Analysis of genetic differences between psychiatric disorders: Exploring pathways and cell-types/tissues involved and ability to differentiate the disorders by polygenic scores
Authors:
Shitao Rao,
Liangying Yin,
Yong Xiang,
Hon-Cheong So
Abstract:
Although displaying genetic correlations, psychiatric disorders are clinically defined as categorical entities as they each have distinguishing clinical features and may involve different treatments. Identifying differential genetic variations between these disorders may reveal how the disorders differ biologically and help to guide more personalized treatment.
Here we presented a comprehensive analysis to identify genetic markers differentially associated with various psychiatric disorders/traits based on GWAS summary statistics, covering 18 psychiatric traits/disorders and 26 comparisons. We also conducted a comprehensive analysis to unravel the genes, pathways and SNP functional categories involved, and the cell types and tissues implicated. We also assessed how well one could distinguish between psychiatric disorders by polygenic risk scores (PRS).
SNP-based heritabilities (h2SNP) were significantly larger than zero for most comparisons. Based on current GWAS data, PRS have mostly modest power to distinguish between psychiatric disorders. For example, we estimated that the AUCs for distinguishing schizophrenia from major depressive disorder (MDD), bipolar disorder (BPD) from MDD and schizophrenia from BPD were 0.694, 0.602 and 0.618 respectively, while the maximum AUCs (based on h2SNP) were 0.763, 0.749 and 0.726 respectively. We also uncovered differences between each pair of studied traits in terms of their genetic correlations with comorbid traits. For example, clinically-defined MDD appeared to be more strongly genetically correlated with other psychiatric disorders and heart disease, when compared to non-clinically-defined depression in UK Biobank.
Our findings highlight genetic differences between psychiatric disorders and the mechanisms involved. PRS may aid differential diagnosis of selected psychiatric disorders in the future with larger GWAS samples.
Submitted 20 May, 2021; v1 submitted 16 March, 2020;
originally announced March 2020.
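The AUC figures quoted in the abstract above summarize how well a score separates two groups. As a minimal sketch of the standard rank-based definition (not the authors' pipeline, and with a hypothetical function name), the AUC is the probability that a randomly chosen member of one group scores higher than a randomly chosen member of the other, with ties counted as one half:

```python
def auc(scores_group_a, scores_group_b):
    """Rank-based AUC: P(score_a > score_b) + 0.5 * P(score_a == score_b).

    scores_group_a and scores_group_b are illustrative inputs, e.g. polygenic
    scores for individuals with disorder A and with disorder B.
    """
    if not scores_group_a or not scores_group_b:
        raise ValueError("both groups must be non-empty")
    wins = 0.0
    for a in scores_group_a:
        for b in scores_group_b:
            if a > b:
                wins += 1.0
            elif a == b:
                wins += 0.5
    return wins / (len(scores_group_a) * len(scores_group_b))
```

A value of 0.5 means the score carries no discriminating information, and 1.0 means perfect separation. The pairwise loop is quadratic in sample size, which is acceptable for a sketch but not for large cohorts.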
-
Towards Kinetic Modeling of Global Metabolic Networks with Incomplete Experimental Input on Kinetic Parameters
Authors:
P. Ao,
L. W. Lee,
Me Lidstrom,
L. Yin,
X. M. Zhu
Abstract:
This is the first report, to our knowledge, on a systematic method for constructing a large-scale kinetic metabolic model with incomplete information on kinetic parameters, and its initial application to the modeling of central metabolism of Methylobacterium extorquens AM1, a methylotrophic and environmentally important bacterium, with all necessary constraints. Through a systematic and consistent procedure of finding a set of parameters in the physiological range we overcome an outstanding difficulty in large-scale kinetic modeling: the requirement for a massive number of enzymatic reaction parameters. We are able to construct the kinetic model based on general biological considerations and incomplete experimental kinetic parameters. The success of our approach with incomplete input information is guaranteed by two known principles in biology, the robustness of the system and the cooperation among its various parts. (Will be pleased to be informed on other methodologies dealing with same type of problems: [email protected])
Submitted 1 August, 2008;
originally announced August 2008.
-
A Generic Rate Equation for modeling Enzymatic Reactions under Living Conditions
Authors:
L. W. Lee,
L. Yin,
X. M. Zhu,
P. Ao
Abstract:
Based on our experience in kinetic modeling of coupled multiple metabolic pathways we propose a generic rate equation for the dynamical modeling of metabolic kinetics. Its symmetric form makes the kinetic parameters (or functions) easy to relate to values in databases and to use in computation. In addition, this form works for an arbitrary number of substrates and products with different stoichiometry. We explicitly show how to obtain such a rate equation exactly for various binding mechanisms. Hence the proposed rate equation is formally rigorous. Various features of such a generic rate equation are discussed. For irreversible reactions, the product inhibition which directly arises from the enzymatic reaction is eliminated in a natural way. We also discuss how to include the effects of modifiers and cooperativity.
Submitted 17 December, 2007; v1 submitted 11 September, 2007;
originally announced September 2007.
-
Efficiency, Robustness and Stochasticity of Gene Regulatory Networks in Systems Biology: lambda Switch as a Working Example
Authors:
X. Zhu,
L. Yin,
L. Hood,
D. Galas,
P. Ao
Abstract:
Phage lambda is one of the most studied biological models in modern molecular biology. Over the past 50 years quantitative experimental knowledge on this biological model has been accumulated at all levels: physics, chemistry, genomics, proteomics, functions, and more. All its components are known in great detail. The theoretical task has been to integrate its components to make the organism work quantitatively in a harmonious manner. This would test our biological understanding and would lay a solid foundation for further explorations and applications, an obvious goal of systems biology. One of the outstanding challenges in doing so has been the so-called stability puzzle of the lambda switch: the biologically observed robustness and its difficult mathematical reconstruction based on known experimental values. In this chapter we review the recent theoretical and experimental efforts on tackling this problem. An emphasis is put on the minimum quantitative modeling where a successful numerical agreement between experiments and modeling has been achieved. A novel method, tentatively named stochastic dynamical structure analysis, that emerged from this study is also discussed within a broad modeling perspective.
Submitted 8 February, 2006; v1 submitted 2 December, 2005;
originally announced December 2005.
-
Robustness, Stability and Efficiency of Phage lambda Gene Regulatory Network: Dynamical Structure Analysis
Authors:
X. -M. Zhu,
L. Yin,
L. Hood,
P. Ao
Abstract:
Based on our physical and biological studies we have recently developed a mathematical framework for the analysis of nonlinear dynamics. We call this framework the dynamical structure analysis. It has four dynamical elements: potential landscape, transverse matrix, descendant matrix, and stochastic drive. In particular, the importance and the existence of the potential landscape are emphasized.
The dynamical structure analysis is illustrated in detail by the study of stability, robustness, and efficiency of the simplest gene regulatory network of phage lambda.
Submitted 14 March, 2004;
originally announced March 2004.
-
Calculating Biological Behaviors of Epigenetic States in Phage lambda Life Cycle
Authors:
X. -M. Zhu,
L. Yin,
L. Hood,
P. Ao
Abstract:
The gene regulatory network of lambda phage is one of the best studied model systems in molecular biology. More than 50 years of experimental study has provided a tremendous amount of data at all levels: physics, chemistry, DNA, protein, and function. However, its stability and robustness for both wild type and mutants has been a notorious theoretical/mathematical problem. In this paper we report our successful calculation of the properties of this gene regulatory network. We believe it is the first of its kind. Our success is of course built upon numerous previous theoretical attempts, but the following 3 features make our modeling unique:
1) A new modeling method particularly suitable for stability and robustness study;
2) Paying close attention to the well-known difference between in vivo and in vitro conditions;
3) Allowing a more important role for noise and stochastic effects to play.
The last two points have been discussed by two of us (Ao and Yin, cond-mat/0307747), which we believe would be enough to make some of the previous theoretical attempts successful, too. We hope the present work will stimulate further interest in the emerging field of gene regulatory networks.
Submitted 3 March, 2004;
originally announced March 2004.
-
Towards the Understanding of Stability Puzzles in Phage lambda
Authors:
P. Ao,
L. Yin
Abstract:
We discuss two aspects of integrative modeling of network dynamics in biology, the in vivo and in vitro difference and the modeling of noise, using phage lambda as an example. We believe those two aspects have not been seriously considered, and including them may be enough to solve the outstanding stability and robustness puzzle in gene regulatory network dynamics.
Submitted 30 July, 2003;
originally announced July 2003.