Search | arXiv e-print repository

Generative Plant Growth Simulation from Sequence-Informed Environmental Conditions

Authors: Mohamed Debbagh, Yixue Liu, Zhouzhou Zheng, Xintong Jiang, Shangpeng Sun, Mark Lefsrud

Abstract: A plant growth simulation can be characterized as a reconstructed visual representation of a plant or plant system. The phenotypic characteristics and plant structures are controlled by the scene environment and other contextual attributes. Considering the temporal dependencies and compounding effects of various factors on growth trajectories, we formulate a probabilistic approach to the simulatio… ▽ More A plant growth simulation can be characterized as a reconstructed visual representation of a plant or plant system. The phenotypic characteristics and plant structures are controlled by the scene environment and other contextual attributes. Considering the temporal dependencies and compounding effects of various factors on growth trajectories, we formulate a probabilistic approach to the simulation task by solving a frame synthesis and pattern recognition problem. We introduce a sequence-informed plant growth simulation framework (SI-PGS) that employs a conditional generative model to implicitly learn a distribution of possible plant representations within a dynamic scene from a fusion of low dimensional temporal sensor and context data. Methods such as controlled latent sampling and recurrent output connections are used to improve coherence in the plant structures between frames of predictions. In this work, we demonstrate that SI-PGS is able to capture temporal dependencies and continuously generate realistic frames of plant growth. △ Less

Submitted 27 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

arXiv:2404.04299 [pdf, other]

GENEVIC: GENetic data Exploration and Visualization via Intelligent interactive Console

Authors: Anindita Nath, Savannah Mwesigwa, Yulin Dai, Xiaoqian Jiang, Zhongming Zhao

Abstract: Summary: The vast generation of genetic data poses a significant challenge in efficiently uncovering valuable knowledge. Introducing GENEVIC, an AI-driven chat framework that tackles this challenge by bridging the gap between genetic data generation and biomedical knowledge discovery. Leveraging generative AI, notably ChatGPT, it serves as a biologist's 'copilot'. It automates the analysis, retrie… ▽ More Summary: The vast generation of genetic data poses a significant challenge in efficiently uncovering valuable knowledge. Introducing GENEVIC, an AI-driven chat framework that tackles this challenge by bridging the gap between genetic data generation and biomedical knowledge discovery. Leveraging generative AI, notably ChatGPT, it serves as a biologist's 'copilot'. It automates the analysis, retrieval, and visualization of customized domain-specific genetic information, and integrates functionalities to generate protein interaction networks, enrich gene sets, and search scientific literature from PubMed, Google Scholar, and arXiv, making it a comprehensive tool for biomedical research. In its pilot phase, GENEVIC is assessed using a curated database that ranks genetic variants associated with Alzheimer's disease, schizophrenia, and cognition, based on their effect weights from the Polygenic Score Catalog, thus enabling researchers to prioritize genetic variants in complex diseases. GENEVIC's operation is user-friendly, accessible without any specialized training, secured by Azure OpenAI's HIPAA-compliant infrastructure, and evaluated for its efficacy through real-time query testing. As a prototype, GENEVIC is set to advance genetic research, enabling informed biomedical decisions. Availability and implementation: GENEVIC is publicly accessible at https://genevic-anath2024.streamlit.app. The underlying code is open-source and available via GitHub at https://github.com/anath2110/GENEVIC.git. △ Less

Submitted 4 April, 2024; originally announced April 2024.

arXiv:2308.03278 [pdf]

Key Gene Mining in Transcriptional Regulation for Specific Biological Processes with Small Sample Sizes Using Multi-network pipeline Transformer

Authors: Kerui Huang, Jianhong Tian, Lei Sun, Li Zeng, Peng Xie, Aihua Deng, ** Mo, Zhibo Zhou, Ming Jiang, Yun Wang, Xiaocheng Jiang

Abstract: Gene mining is an important topic in the field of life sciences, but traditional machine learning methods cannot consider the regulatory relationships between genes. Deep learning methods perform poorly in small sample sizes. This study proposed a deep learning method, called TransGeneSelector, that can mine critical regulatory genes involved in certain life processes using a small-sample transcri… ▽ More Gene mining is an important topic in the field of life sciences, but traditional machine learning methods cannot consider the regulatory relationships between genes. Deep learning methods perform poorly in small sample sizes. This study proposed a deep learning method, called TransGeneSelector, that can mine critical regulatory genes involved in certain life processes using a small-sample transcriptome dataset. The method combines a WGAN-GP data augmentation network, a sample filtering network, and a Transformer classifier network, which successfully classified the state (germinating or dry seeds) of Arabidopsis thaliana seed in a dataset of 79 samples, showing performance comparable to that of Random Forests. Further, through the use of SHapley Additive exPlanations method, TransGeneSelector successfully mined genes involved in seed germination. Through the construction of gene regulatory networks and the enrichment analysis of KEGG, as well as RT-qPCR quantitative analysis, it was confirmed that these genes are at a more upstream regulatory level than those Random Forests mined, and the top 11 genes that were uniquely mined by TransGeneSelector were found to be related to the KAI2 signaling pathway, which is of great regulatory importance for germination-related genes. This study provides a practical tool for life science researchers to mine key genes from transcriptome data. △ Less

Submitted 6 August, 2023; originally announced August 2023.

Comments: 34 pages,6 figures

arXiv:2304.10946 [pdf, other]

CancerGPT: Few-shot Drug Pair Synergy Prediction using Large Pre-trained Language Models

Authors: Tianhao Li, Sandesh Shetty, Advaith Kamath, Ajay Jaiswal, Xianqian Jiang, Ying Ding, Ye** Kim

Abstract: Large pre-trained language models (LLMs) have been shown to have significant potential in few-shot learning across various fields, even with minimal training data. However, their ability to generalize to unseen tasks in more complex fields, such as biology, has yet to be fully evaluated. LLMs can offer a promising alternative approach for biological inference, particularly in cases where structure… ▽ More Large pre-trained language models (LLMs) have been shown to have significant potential in few-shot learning across various fields, even with minimal training data. However, their ability to generalize to unseen tasks in more complex fields, such as biology, has yet to be fully evaluated. LLMs can offer a promising alternative approach for biological inference, particularly in cases where structured data and sample size are limited, by extracting prior knowledge from text corpora. Our proposed few-shot learning approach uses LLMs to predict the synergy of drug pairs in rare tissues that lack structured data and features. Our experiments, which involved seven rare tissues from different cancer types, demonstrated that the LLM-based prediction model achieved significant accuracy with very few or zero samples. Our proposed model, the CancerGPT (with $\sim$ 124M parameters), was even comparable to the larger fine-tuned GPT-3 model (with $\sim$ 175B parameters). Our research is the first to tackle drug pair synergy prediction in rare tissues with limited data. We are also the first to utilize an LLM-based prediction model for biological reaction prediction tasks. △ Less

Submitted 17 April, 2023; originally announced April 2023.

arXiv:2302.01117 [pdf, other]

PASSerRank: Prediction of Allosteric Sites with Learning to Rank

Authors: Hao Tian, Sian Xiao, Xi Jiang, Peng Tao

Abstract: Allostery plays a crucial role in regulating protein activity, making it a highly sought-after target in drug development. One of the major challenges in allosteric drug research is the identification of allosteric sites. In recent years, many computational models have been developed for accurate allosteric site prediction. Most of these models focus on designing a general rule that can be applied… ▽ More Allostery plays a crucial role in regulating protein activity, making it a highly sought-after target in drug development. One of the major challenges in allosteric drug research is the identification of allosteric sites. In recent years, many computational models have been developed for accurate allosteric site prediction. Most of these models focus on designing a general rule that can be applied to pockets of proteins from various families. In this study, we present a new approach using the concept of Learning to Rank (LTR). The LTR model ranks pockets based on their relevance to allosteric sites, i.e., how well a pocket meets the characteristics of known allosteric sites. The model outperforms other common machine learning models with higher F1 score and Matthews correlation coefficient. After the training and validation on two datasets, the Allosteric Database (ASD) and CASBench, the LTR model was able to rank an allosteric pocket in the top 3 positions for 83.6% and 80.5% of test proteins, respectively. The trained model is available on the PASSer platform (https://passer.smu.edu) to aid in drug discovery research. △ Less

Submitted 28 April, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

arXiv:2210.00395 [pdf, other]

Federated Generalized Linear Mixed Models for Collaborative Genome-wide Association Studies

Authors: Wentao Li, Han Chen, Xiaoqian Jiang, Arif Harmanci

Abstract: As the sequencing costs are decreasing, there is great incentive to perform large scale association studies to increase power of detecting new variants. Federated association testing among different institutions is a viable solution for increasing sample sizes by sharing the intermediate testing statistics that are aggregated by a central server. There are, however, standing challenges to performi… ▽ More As the sequencing costs are decreasing, there is great incentive to perform large scale association studies to increase power of detecting new variants. Federated association testing among different institutions is a viable solution for increasing sample sizes by sharing the intermediate testing statistics that are aggregated by a central server. There are, however, standing challenges to performing federated association testing. Association tests are known to be confounded by numerous factors such as population stratification, which can be especially important in multiancestral studies and in admixed populations among different sites. Furthermore, disease etiology should be considered via flexible models to avoid biases in the significance of the genetic effect. A rising challenge for performing large scale association studies is the privacy of participants and related ethical concerns of stigmatization and marginalization. Here, we present dMEGA, a flexible and efficient method for performing federated generalized linear mixed model based association testing among multiple sites while underlying genotype and phenotype data are not explicitly shared. dMEGA first utilizes a reference projection to estimate population-based covariates without sharing genotype dataset among sites. Next, dMEGA uses Laplacian approximation for the parameter likelihoods and decomposes parameter estimation into efficient local-gradient updates among sites. We use simulated and real datasets to demonstrate the accuracy and efficiency of dMEGA. Overall, dMEGA's formulation is flexible to integrate fixed and random effects in a federated setting. △ Less

Submitted 1 October, 2022; originally announced October 2022.

arXiv:2204.13040 [pdf, other]

LAST: Latent Space Assisted Adaptive Sampling for Protein Trajectories

Authors: Hao Tian, Xi Jiang, Sian Xiao, Hunter La Force, Eric C. Larson, Peng Tao

Abstract: Molecular dynamics (MD) simulation is widely used to study protein conformations and dynamics. However, conventional simulation suffers from being trapped in some local energy minima that are hard to escape. Thus, most computational time is spent sampling in the already visited regions. This leads to an inefficient sampling process and further hinders the exploration of protein movements in afford… ▽ More Molecular dynamics (MD) simulation is widely used to study protein conformations and dynamics. However, conventional simulation suffers from being trapped in some local energy minima that are hard to escape. Thus, most computational time is spent sampling in the already visited regions. This leads to an inefficient sampling process and further hinders the exploration of protein movements in affordable simulation time. The advancement of deep learning provides new opportunities for protein sampling. Variational autoencoders are a class of deep learning models to learn a low-dimensional representation (referred to as the latent space) that can capture the key features of the input data. Based on this characteristic, we proposed a new adaptive sampling method, latent space assisted adaptive sampling for protein trajectories (LAST), to accelerate the exploration of protein conformational space. This method comprises cycles of (i) variational autoencoders training, (ii) seed structure selection on the latent space and (iii) conformational sampling through additional MD simulations. The proposed approach is validated through the sampling of four structures of two protein systems: two metastable states of E. Coli adenosine kinase (ADK) and two native states of Vivid (VVD). In all four conformations, seed structures were shown to lie on the boundary of conformation distributions. Moreover, large conformational changes were observed in a shorter simulation time when compared with conventional MD (cMD) simulations in both systems. In metastable ADK simulations, LAST explored two transition paths toward two stable states while cMD became trapped in an energy basin. In VVD light state simulations, LAST was three times faster than cMD simulation with a similar conformational space. △ Less

Submitted 27 April, 2022; originally announced April 2022.

arXiv:2107.06773 [pdf]

Relational graph convolutional networks for predicting blood-brain barrier penetration of drug molecules

Authors: Yan Ding, Xiaoqian Jiang, Ye** Kim

Abstract: Evaluating the blood-brain barrier (BBB) permeability of drug molecules is a critical step in brain drug development. Traditional methods for the evaluation require complicated in vitro or in vivo testing. Alternatively, in silico predictions based on machine learning have proved to be a cost-efficient way to complement the in vitro and in vivo methods. However, the performance of the established… ▽ More Evaluating the blood-brain barrier (BBB) permeability of drug molecules is a critical step in brain drug development. Traditional methods for the evaluation require complicated in vitro or in vivo testing. Alternatively, in silico predictions based on machine learning have proved to be a cost-efficient way to complement the in vitro and in vivo methods. However, the performance of the established models has been limited by their incapability of dealing with the interactions between drugs and proteins, which play an important role in the mechanism behind the BBB penetrating behaviors. To address this limitation, we employed the relational graph convolutional network (RGCN) to handle the drug-protein interactions as well as the properties of each individual drug. The RGCN model achieved an overall accuracy of 0.872, an AUROC of 0.919 and an AUPRC of 0.838 for the testing dataset with the drug-protein interactions and the Mordred descriptors as the input. Introducing drug-drug similarity to connect structurally similar drugs in the data graph further improved the testing results, giving an overall accuracy of 0.876, an AUROC of 0.926 and an AUPRC of 0.865. In particular, the RGCN model was found to greatly outperform the LightGBM base model when evaluated with the drugs whose BBB penetration was dependent on drug-protein interactions. Our model is expected to provide high-confidence predictions of BBB permeability for drug prioritization in the experimental screening of BBB-penetrating drugs. △ Less

Submitted 6 April, 2022; v1 submitted 4 July, 2021; originally announced July 2021.

arXiv:2104.13957 [pdf, other]

A Bayesian Modified Ising Model for Identifying Spatially Variable Genes from Spatial Transcriptomics Data

Authors: Xi Jiang, Qiwei Li, Guanghua Xiao

Abstract: A recent technology breakthrough in spatial molecular profiling has enabled the comprehensive molecular characterizations of single cells while preserving spatial information. It provides new opportunities to delineate how cells from different origins form tissues with distinctive structures and functions. One immediate question in spatial molecular profiling data analysis is to identify genes who… ▽ More A recent technology breakthrough in spatial molecular profiling has enabled the comprehensive molecular characterizations of single cells while preserving spatial information. It provides new opportunities to delineate how cells from different origins form tissues with distinctive structures and functions. One immediate question in spatial molecular profiling data analysis is to identify genes whose expressions exhibit spatially correlated patterns, called spatially variable genes. Most current methods to identify spatially variable genes are built upon the geostatistical model with Gaussian process to capture the spatial patterns, which rely on ad hoc kernels that could limit the models' ability to identify complex spatial patterns. In order to overcome this challenge and capture more types of spatial patterns, we introduce a Bayesian approach to identify spatially variable genes via a modified Ising model. The key idea is to use the energy interaction parameter of the Ising model to characterize spatial expression patterns. We use auxiliary variable Markov chain Monte Carlo algorithms to sample from the posterior distribution with an intractable normalizing constant in the model. Simulation studies using both simulated and synthetic data showed that the energy-based modeling approach led to higher accuracy in detecting spatially variable genes than those kernel-based methods. When applied to two real spatial transcriptomics datasets, the proposed method discovered novel spatial patterns that shed light on the biological mechanisms. In summary, the proposed method presents a new perspective for analyzing spatial transcriptomics data. △ Less

Submitted 5 October, 2021; v1 submitted 28 April, 2021; originally announced April 2021.

Comments: Version 3

arXiv:2009.10931 [pdf]

doi 10.1038/s41598-021-02353-5

Drug repurposing for COVID-19 using graph neural network and harmonizing multiple evidence

Authors: Kanglin Hsieh, Yinyin Wang, Luyao Chen, Zhongming Zhao, Sean Savitz, Xiaoqian Jiang, **g Tang, Ye** Kim

Abstract: Amid the pandemic of 2019 novel coronavirus disease (COVID-19) infected by SARS-CoV-2, a vast amount of drug research for prevention and treatment has been quickly conducted, but these efforts have been unsuccessful thus far. Our objective is to prioritize repurposable drugs using a drug repurposing pipeline that systematically integrates multiple SARS-CoV-2 and drug interactions, deep graph neura… ▽ More Amid the pandemic of 2019 novel coronavirus disease (COVID-19) infected by SARS-CoV-2, a vast amount of drug research for prevention and treatment has been quickly conducted, but these efforts have been unsuccessful thus far. Our objective is to prioritize repurposable drugs using a drug repurposing pipeline that systematically integrates multiple SARS-CoV-2 and drug interactions, deep graph neural networks, and in-vitro/population-based validations. We first collected all the available drugs (n= 3,635) involved in COVID-19 patient treatment through CTDbase. We built a SARS-CoV-2 knowledge graph based on the interactions among virus baits, host genes, pathways, drugs, and phenotypes. A deep graph neural network approach was used to derive the candidate representation based on the biological interactions. We prioritized the candidate drugs using clinical trial history, and then validated them with their genetic profiles, in vitro experimental efficacy, and electronic health records. We highlight the top 22 drugs including Azithromycin, Atorvastatin, Aspirin, Acetaminophen, and Albuterol. We further pinpointed drug combinations that may synergistically target COVID-19. In summary, we demonstrated that the integration of extensive interactions, deep neural networks, and rigorous validation can facilitate the rapid identification of candidate drugs for COVID-19 treatment. This is a post-peer-review, pre-copyedit version of an article published in Scientific Reports The final authenticated version is available online at: https://www.nature.com/articles/s41598-021-02353-5 △ Less

Submitted 1 February, 2022; v1 submitted 23 September, 2020; originally announced September 2020.

Comments: 13 pages

Journal ref: Sci Rep 11, 23179 (2021)

arXiv:2008.05909 [pdf]

Population stratification enables modeling effects of reopening policies on mortality and hospitalization rates

Authors: Tongtong Huang, Yan Chu, Shayan Shams, Ye** Kim, Genevera Allen, Ananth V Annapragada, Devika Subramanian, Ioannis Kakadiaris, Assaf Gottlieb, Xiaoqian Jiang

Abstract: Objective: We study the influence of local reopening policies on the composition of the infectious population and their impact on future hospitalization and mortality rates. Materials and Methods: We collected datasets of daily reported hospitalization and cumulative morality of COVID 19 in Houston, Texas, from May 1, 2020 until June 29, 2020. These datasets are from multiple sources (USA FACTS, S… ▽ More Objective: We study the influence of local reopening policies on the composition of the infectious population and their impact on future hospitalization and mortality rates. Materials and Methods: We collected datasets of daily reported hospitalization and cumulative morality of COVID 19 in Houston, Texas, from May 1, 2020 until June 29, 2020. These datasets are from multiple sources (USA FACTS, Southeast Texas Regional Advisory Council COVID 19 report, TMC daily news, and New York Times county level mortality reporting). Our model, risk stratified SIR HCD uses separate variables to model the dynamics of local contact (e.g., work from home) and high contact (e.g., work on site) subpopulations while sharing parameters to control their respective $R_0(t)$ over time. Results: We evaluated our models forecasting performance in Harris County, TX (the most populated county in the Greater Houston area) during the Phase I and Phase II reopening. Not only did our model outperform other competing models, it also supports counterfactual analysis to simulate the impact of future policies in a local setting, which is unique among existing approaches. Discussion: Local mortality and hospitalization are significantly impacted by quarantine and reopening policies. No existing model has directly accounted for the effect of these policies on local trends in infections, hospitalizations, and deaths in an explicit and explainable manner. Our work is an attempt to close this important technical gap to support decision making. Conclusion: Despite several limitations, we think it is a timely effort to rethink about how to best model the dynamics of pandemics under the influence of reopening policies. △ Less

Submitted 10 August, 2020; originally announced August 2020.

arXiv:1912.06686 [pdf, other]

doi 10.1038/s41386-021-01020-7

Systematic Misestimation of Machine Learning Performance in Neuroimaging Studies of Depression

Authors: Claas Flint, Micah Cearns, Nils Opel, Ronny Redlich, David M. A. Mehler, Daniel Emden, Nils R. Winter, Ramona Leenings, Simon B. Eickhoff, Tilo Kircher, Axel Krug, Igor Nenadic, Volker Arolt, Scott Clark, Bernhard T. Baune, Xiaoyi Jiang, Udo Dannlowski, Tim Hahn

Abstract: We currently observe a disconcerting phenomenon in machine learning studies in psychiatry: While we would expect larger samples to yield better results due to the availability of more data, larger machine learning studies consistently show much weaker performance than the numerous small-scale studies. Here, we systematically investigated this effect focusing on one of the most heavily studied ques… ▽ More We currently observe a disconcerting phenomenon in machine learning studies in psychiatry: While we would expect larger samples to yield better results due to the availability of more data, larger machine learning studies consistently show much weaker performance than the numerous small-scale studies. Here, we systematically investigated this effect focusing on one of the most heavily studied questions in the field, namely the classification of patients suffering from major depressive disorder (MDD) and healthy control (HC) based on neuroimaging data. Drawing upon structural magnetic resonance imaging (MRI) data from a balanced sample of $N = 1,868$ MDD patients and HC from our recent international Predictive Analytics Competition (PAC), we first trained and tested a classification model on the full dataset which yielded an accuracy of $61\,\%$. Next, we mimicked the process by which researchers would draw samples of various sizes ($N = 4$ to $N = 150$) from the population and showed a strong risk of misestimation. Specifically, for small sample sizes ($N = 20$), we observe accuracies of up to $95\,\%$. For medium sample sizes ($N = 100$) accuracies up to $75\,\%$ were found. Importantly, further investigation showed that sufficiently large test sets effectively protect against performance misestimation whereas larger datasets per se do not. While these results question the validity of a substantial part of the current literature, we outline the relatively low-cost remedy of larger test sets, which is readily available in most cases. △ Less

Submitted 3 May, 2021; v1 submitted 13 December, 2019; originally announced December 2019.

Journal ref: Neuropsychopharmacology 46 (2021) 1510-1517

arXiv:1911.10617 [pdf, other]

doi 10.1038/s41386-020-0666-3

Biological sex classification with structural MRI data shows increased misclassification in transgender women

Authors: Claas Flint, Katharina Förster, Sophie A. Koser, Carsten Konrad, Pienie Zwitserlood, Klaus Berger, Marco Hermesdorf, Tilo Kircher, Igor Nenadic, Axel Krug, Bernhard T. Baune, Katharina Dohm, Ronny Redlich, Nils Opel, Volker Arolt, Tim Hahn, Xiaoyi Jiang, Udo Dannlowski, Dominik Grotegerd

Abstract: Transgender individuals (TIs) show brain structural alterations that differ from their biological sex as well as their perceived gender. To substantiate evidence that the brain structure of TIs differs from male and female, we use a combined multivariate and univariate approach. Gray matter segments resulting from voxel-based morphometry preprocessing of $N = 1753$ cisgender (CG) healthy participa… ▽ More Transgender individuals (TIs) show brain structural alterations that differ from their biological sex as well as their perceived gender. To substantiate evidence that the brain structure of TIs differs from male and female, we use a combined multivariate and univariate approach. Gray matter segments resulting from voxel-based morphometry preprocessing of $N = 1753$ cisgender (CG) healthy participants were used to train ($N=1402$) and validate (20 % hold-out; $N = 351$) a support-vector machine classifying the biological sex. As a second validation, we classified $N = 1104$ patients with depression. A third validation was performed using the matched CG sample of the transgender women (TWs) application-sample. Subsequently, the classifier was applied to $N = 26$ TWs. Finally, we compared brain volumes of CG-men, women and TW-pre/post treatment (cross-sex hormone treatment) in a univariate analysis controlling for sexual orientation, age and total brain volume. The application of our biological sex classifier to the transgender sample resulted in a significantly lower true positive rate (TPR) (TPR-male = 56.0 %). The TPR did not differ between CG-individuals with (TPR-male = 86.9 %) and without depression (TPR-male = 88.5 %). The univariate analysis of the transgender application-sample revealed that TW-pre/post treatment show brain structural differences from CG-women and CG-men in the putamen and insula, as well as the whole-brain analysis. Our results support the hypothesis that brain structure in TW differs from brain structure of their biological sex (male) as well as their perceived gender (female). This finding substantiates evidence that TIs show specific brain structural alterations leading to a different pattern of brain structure than CG-individuals. △ Less

Submitted 22 April, 2020; v1 submitted 24 November, 2019; originally announced November 2019.

Comments: Content adapted to the publication at Neuropsychopharmacology

Journal ref: Neuropsychopharmacology 45 (2020) 1758-1765

arXiv:1905.07818 [pdf]

Magnetic resonance imaging of mean cell size in human breast tumors

Authors: Junzhong Xu, Xiaoyu Jiang, Hua Li, Lori R. Arlinghaus, Eliot T. McKinley, Sean P. Devan, Benjamin M. Hardy, Hakmook Kang, Anuradha B. Chakravarthy, John C. Gore

Abstract: Purpose: Cell size is a fundamental characteristic of all tissues, and changes in cell size in cancer reflect tumor status and response to treatments, such as apoptosis and cell cycle arrest. Unfortunately, cell size can only be obtained by pathologic evaluation of the tumor in the current standard of care. Previous imaging approaches can be implemented on only animal MRI scanners or require relat… ▽ More Purpose: Cell size is a fundamental characteristic of all tissues, and changes in cell size in cancer reflect tumor status and response to treatments, such as apoptosis and cell cycle arrest. Unfortunately, cell size can only be obtained by pathologic evaluation of the tumor in the current standard of care. Previous imaging approaches can be implemented on only animal MRI scanners or require relatively long acquisition times that are undesirable for clinical imaging. There is a need to develop cell size imaging for clinics. Experimental Design: We propose a new method, IMPULSED (Imaging Microstructural Parameters Using Limited Spectrally Edited Diffusion) that can characterize mean cell sizes in solid tumors. We report the use of combined sequences with different gradient waveforms on human MRI and analytical equations that link DWI signals of real gradient waveforms and specific microstructural parameters such as cell size. We also describe comprehensive validations using computer simulations, cell experiments in vitro, and animal experiments in vivo and finally demonstrate applications in pre-operative breast cancer patients. Results: With fast acquisitions (~ 7 mins), IMPULSED can provide high-resolution (1.3 mm in-plane) map** of mean cell size of human tumors in vivo on currently-available 3T MRI scanners. All validations suggest IMPULSED provide accurate and reliable measurements of mean cell size. Conclusion: The proposed IMPULSED method can assess cell size variations in the tumor of breast cancer patients, which may have the potential to assess early response to neoadjuvant therapy. △ Less

Submitted 19 May, 2019; originally announced May 2019.

arXiv:1905.05861 [pdf]

From Brain Imaging to Graph Analysis: a study on ADNI's patient cohort

Authors: Rui Zhang, Luca Giancardo, Danilo A. Pena, Ye** Kim, Hanghang Tong, Xiaoqian Jiang

Abstract: In this paper, we studied the association between the change of structural brain volumes to the potential development of Alzheimer's disease (AD). Using a simple abstraction technique, we converted regional cortical and subcortical volume differences over two time points for each study subject into a graph. We then obtained substructures of interest using a graph decomposition algorithm in order t… ▽ More In this paper, we studied the association between the change of structural brain volumes to the potential development of Alzheimer's disease (AD). Using a simple abstraction technique, we converted regional cortical and subcortical volume differences over two time points for each study subject into a graph. We then obtained substructures of interest using a graph decomposition algorithm in order to extract pivotal nodes via multi-view feature selection. Intensive experiments using robust classification frameworks were conducted to evaluate the performance of using the brain substructures obtained under different thresholds. The results indicated that compact substructures acquired by examining the differences between patient groups were sufficient to discriminate between AD and healthy controls with an area under the receiver operating curve of 0.72. △ Less

Submitted 14 May, 2019; originally announced May 2019.

arXiv:1905.05827 [pdf]

Discriminative Sleep Patterns of Alzheimer's Disease via Tensor Factorization

Authors: Ye** Kim, Xiaoqian Jiang, Luyao Chen, Xiao** Li, Licong Cui

Abstract: Sleep change is commonly reported in Alzheimer's disease (AD) patients and their brain wave studies show decrease in dreaming and non-dreaming stages. Although sleep disturbance is generally considered as a consequence of AD, it might also be a risk factor of AD as new biological evidence shows. Leveraging National Sleep Research Resource (NSRR), we built a unique cohort of 83 cases and 331 contro… ▽ More Sleep change is commonly reported in Alzheimer's disease (AD) patients and their brain wave studies show decrease in dreaming and non-dreaming stages. Although sleep disturbance is generally considered as a consequence of AD, it might also be a risk factor of AD as new biological evidence shows. Leveraging National Sleep Research Resource (NSRR), we built a unique cohort of 83 cases and 331 controls with clinical variables and EEG signals. Supervised tensor factorization method was applied for this temporal dataset to extract discriminative sleep patterns. Among the 30 patterns extracted, we identified 5 significant patterns (4 patterns for AD likely and 1 pattern for normal ones) and their visual patterns provide interesting linkage to sleep with repeated wakefulness, insomnia, epileptic seizure, and etc. This study is preliminary but findings are interesting, which is a first step to provide quantifiable evidences to measure sleep as a risk factor of AD. △ Less

Submitted 14 May, 2019; originally announced May 2019.

Report number: PMC7153114

arXiv:1905.03978 [pdf]

Tumor Microenvironment-based Gene Signatures Divides Novel Immune and Stromal Subgroup Classification of Lung Adenocarcinoma

Authors: Zihang Zeng, Jiali Li, Nannan Zhang, Xue** Jiang, Yan** Gao, Liexi Xu, Xingyu Liu, Jiarui Chen, Yuke Gao, Linzhi Han, Jiangbo Ren, Yan Gong, Conghua Xie

Abstract: Tumor microenvironment has complex effects on tumorigenesis and metastasis. However, there is still a lack of comprehensive understanding of the relationship among molecular and cellular characteristics in tumor microenvironment, clinical prognosis and immunotherpy response. In this study, the immune and stromal (non-immune) signatures of tumor microenvironment were integrated to identify novel su… ▽ More Tumor microenvironment has complex effects on tumorigenesis and metastasis. However, there is still a lack of comprehensive understanding of the relationship among molecular and cellular characteristics in tumor microenvironment, clinical prognosis and immunotherpy response. In this study, the immune and stromal (non-immune) signatures of tumor microenvironment were integrated to identify novel subgroups of lung adenocarcinoma by eigendecomposition and extraction algorithms of bioinformatics and machine learning, such as non-negative matrix factorization and multitask learning. Tumors were classified into 4 groups according to the activation of immunity and stroma by novel signatures. The 4 groups had different mutation landscape, molecular, cellular characteristics and prognosis, which have been validation in 6 independent data sets containing 1551 patients. High-immune and low-stromal activation group links to high immunocyte infiltration, high immunocompetence, low fibroblasts, endothelial cells, collagen, laminin, tumor mutation burden, and better overall survival. We developed a novel model based on tumor microenvironment by integrating immune and stromal activation, namely PMBT (prognostic model based on tumor microenvironment). The PMBT showed the value to predict overall survival and immunotherapy responses. △ Less

Submitted 10 May, 2019; originally announced May 2019.

arXiv:1811.02757 [pdf, other]

Early Prediction of Acute Kidney Injury in Critical Care Setting Using Clinical Notes

Authors: Yikuan Li, Liang Yao, Chengsheng Mao, Anand Srivastava, Xiaoqian Jiang, Yuan Luo

Abstract: Acute kidney injury (AKI) in critically ill patients is associated with significant morbidity and mortality. Development of novel methods to identify patients with AKI earlier will allow for testing of novel strategies to prevent or reduce the complications of AKI. We developed data-driven prediction models to estimate the risk of new AKI onset. We generated models from clinical notes within the f… ▽ More Acute kidney injury (AKI) in critically ill patients is associated with significant morbidity and mortality. Development of novel methods to identify patients with AKI earlier will allow for testing of novel strategies to prevent or reduce the complications of AKI. We developed data-driven prediction models to estimate the risk of new AKI onset. We generated models from clinical notes within the first 24 hours following intensive care unit (ICU) admission extracted from Medical Information Mart for Intensive Care III (MIMIC-III). From the clinical notes, we generated clinically meaningful word and concept representations and embeddings, respectively. Five supervised learning classifiers and knowledge-guided deep learning architecture were used to construct prediction models. The best configuration yielded a competitive AUC of 0.779. Our work suggests that natural language processing of clinical notes can be applied to assist clinicians in identifying the risk of incident AKI onset in critically ill patients upon admission to the ICU. △ Less

Submitted 9 November, 2018; v1 submitted 6 November, 2018; originally announced November 2018.

Comments: 4 pages, 3 figures, accepted by BIBM 2018

arXiv:physics/0211086 [pdf, ps, other]

Method for Observing Intravascular BongHan Duct

Authors: Xiaowen Jiang, Hee-kyeong Kim, Hak-soo Shin, Byong-chon Lee, Chunho Choi, Kyung-soon Soh, Byeung-soo Cheun, Ku-youn Baik, Kwang-sup Soh

Abstract: A method for observing intra blood vessel ducts which are threadlike bundle of tubules which form a part of the BongHan duct system. By injecting 10% dextrose solution at a vena femoralis one makes the intravascular BongHan duct thicker and stronger to be easily detectable after incision of vessels. The duct is semi-transparent, soft and elastic, and composed of smaller tubules whose diameters a… ▽ More A method for observing intra blood vessel ducts which are threadlike bundle of tubules which form a part of the BongHan duct system. By injecting 10% dextrose solution at a vena femoralis one makes the intravascular BongHan duct thicker and stronger to be easily detectable after incision of vessels. The duct is semi-transparent, soft and elastic, and composed of smaller tubules whose diameters are of 10$μ$m order, which is in agreement with BongHan theory. △ Less

Submitted 19 November, 2002; v1 submitted 19 November, 2002; originally announced November 2002.

arXiv:physics/0211085 [pdf, ps, other]

Threadlike bundle of tubules running inside blood vessels: New anatomical structure

Authors: Xiaowen Jiang, Hee-kyeong Kim, Hak-soo Shin, Byong-chon Lee, Chunho Choi, Kyung-soon Soh, Byeung-soo Cheun, Ku-youn Baik, Kwang-sup Soh

Abstract: According to current anatomy, the arteries and veins do not have threadlike structures running inside the vessels. Despite such prevailing knowledge here we report on observation of a novel structure inside the blood vessels of rats and rabbits, which is a semi-transparent elastic bundle of tubules whose diameters are of 10$μ$m order. This is a rediscovery of the Bong Han ducts1,2 which have not… ▽ More According to current anatomy, the arteries and veins do not have threadlike structures running inside the vessels. Despite such prevailing knowledge here we report on observation of a novel structure inside the blood vessels of rats and rabbits, which is a semi-transparent elastic bundle of tubules whose diameters are of 10$μ$m order. This is a rediscovery of the Bong Han ducts1,2 which have not been confirmed because the observing method was not known. We found a new procedure of observing the intra blood vessel ducts (IBVD) which are too thin, fragile, and semi-transparent to be detected in ordinary surgical operation. The method we contrived is to let blood be coagulated around the IBVD so that they become thick and strong by intravenous injection of 10 per cent dextrose solution at the vena femoralis. A piece of thickened IBVD sample is treated with urokinase to remove blood clots and the thin thread of IBVD is embedded inside of a string of fibrin △ Less

Submitted 19 November, 2002; v1 submitted 19 November, 2002; originally announced November 2002.

Showing 1–20 of 20 results for author: Jiang, X