-
DrugCLIP: Contrastive Drug-Disease Interaction For Drug Repurposing
Authors:
Yingzhou Lu,
Yaojun Hu,
Chenhao Li
Abstract:
Bringing a novel drug from the original idea to market typically requires more than ten years and billions of dollars. To alleviate this heavy burden, a natural idea is to reuse approved drugs to treat new diseases, a process known as drug repurposing or drug repositioning. Machine learning methods have exhibited huge potential in automating drug repurposing. However, they still encounter some challenges, such as a lack of labels and multimodal feature representation. To address these issues, we design DrugCLIP, a cutting-edge contrastive learning method, to learn drug-disease interactions without negative labels. Additionally, we have curated a drug repurposing dataset based on real-world clinical trial records. Thorough empirical studies are conducted to validate the effectiveness of the proposed DrugCLIP method.
Submitted 2 July, 2024;
originally announced July 2024.
-
Individual brain parcellation: Review of methods, validations and applications
Authors:
Chengyi Li,
Shan Yu,
Yue Cui
Abstract:
Individual brains vary greatly in morphology, connectivity and organization. The applicability of group-level parcellations is limited by the rapid development of precision medicine today because they do not take into account the variation of parcels at the individual level. Accurate mapping of brain functional regions at the individual level is pivotal for a comprehensive understanding of the variations in brain function and behaviors, early and precise identification of brain abnormalities, as well as personalized treatments for neuropsychiatric disorders. With the development of neuroimaging and machine learning techniques, studies on individual brain parcellation are booming. In this paper, we offer an overview of recent advances in the methodologies of individual brain parcellation, including optimization- and learning-based methods. Comprehensive evaluation metrics to validate individual brain mapping have been introduced. We also review the studies of how individual brain mapping promotes neuroscience research and clinical medicine. Finally, we summarize the major challenges and important future directions of individualized brain parcellation. Collectively, we intend to offer a thorough overview of individual brain parcellation methods, validations, and applications, and to highlight the current challenges, which create an urgent demand for platforms that integrate datasets, methods, and validations.
Submitted 1 July, 2024;
originally announced July 2024.
-
D-CoRP: Differentiable Connectivity Refinement for Functional Brain Networks
Authors:
Haoyu Hu,
Hongrun Zhang,
Chao Li
Abstract:
Brain networks are an important tool for understanding the brain, offering insights for scientific research and clinical diagnosis. Existing models for brain networks typically focus primarily on brain regions or overlook the complexity of brain connectivities. MRI-derived brain network data is commonly susceptible to connectivity noise, underscoring the necessity of incorporating connectivities into the modeling of brain networks. To address this gap, we introduce a differentiable module for refining brain connectivity. We develop a multivariate optimization based on information bottleneck theory to address the complexity of the brain network and filter noisy or redundant connections. Our method also functions as a flexible plugin that is adaptable to most graph neural networks. Our extensive experimental results show that the proposed method can significantly improve the performance of various baseline models and outperform other state-of-the-art methods, indicating the effectiveness and generalizability of the proposed method in refining brain network connectivity. The code will be released for public availability.
Submitted 28 May, 2024;
originally announced May 2024.
-
Cross-modal Diffusion Modelling for Super-resolved Spatial Transcriptomics
Authors:
Xiaofei Wang,
Xingxu Huang,
Stephen J. Price,
Chao Li
Abstract:
The recent advancement of spatial transcriptomics (ST) allows the characterization of spatial gene expression within tissue for discovery research. However, current ST platforms suffer from low resolution, hindering in-depth understanding of spatial gene expression. Super-resolution approaches promise to enhance ST maps by integrating histology images with gene expressions of profiled tissue spots. However, current super-resolution methods are limited by restoration uncertainty and mode collapse. Although diffusion models have shown promise in capturing complex interactions between multi-modal conditions, it remains a challenge to integrate histology images and gene expression for super-resolved ST maps. This paper proposes a cross-modal conditional diffusion model for super-resolving ST maps with the guidance of histology images. Specifically, we design a multi-modal disentangling network with cross-modal adaptive modulation to utilize complementary information from histology images and spatial gene expression. Moreover, we propose a dynamic cross-attention modelling strategy to extract hierarchical cell-to-tissue information from histology images. Lastly, we propose a co-expression-based gene-correlation graph network to model the co-expression relationship of multiple genes. Experiments show that our method outperforms other state-of-the-art methods in ST super-resolution on three public datasets.
Submitted 27 May, 2024; v1 submitted 19 April, 2024;
originally announced April 2024.
-
Utilizing Computer Vision for Continuous Monitoring of Vaccine Side Effects in Experimental Mice
Authors:
Chuang Li,
Shuai Shao,
Willian Mikason,
Rubing Lin,
Yantong Liu
Abstract:
The demand for improved efficiency and accuracy in vaccine safety assessments is increasing. Here, we explore the application of computer vision technologies to automate the monitoring of experimental mice for potential side effects after vaccine administration. Traditional observation methods are labor-intensive and lack the capability for continuous monitoring. By deploying a computer vision system, our research aims to improve the efficiency and accuracy of vaccine safety assessments. The methodology involves training machine learning models on annotated video data of mouse behaviors pre- and post-vaccination. Preliminary results indicate that computer vision can effectively identify subtle changes that signal possible side effects. Therefore, our approach has the potential to significantly enhance the monitoring process in vaccine trials in animals, providing a practical solution to the limitations of human observation.
Submitted 3 April, 2024;
originally announced April 2024.
-
Molecular Generative Adversarial Network with Multi-Property Optimization
Authors:
Huidong Tang,
Chen Li,
Sayaka Kamei,
Yoshihiro Yamanishi,
Yasuhiko Morimoto
Abstract:
Deep generative models, such as generative adversarial networks (GANs), have been employed for $de~novo$ molecular generation in drug discovery. Most prior studies have utilized reinforcement learning (RL) algorithms, particularly Monte Carlo tree search (MCTS), to handle the discrete nature of molecular representations in GANs. However, due to the inherent instability in training GANs and RL models, along with the high computational cost associated with MCTS sampling, MCTS RL-based GANs struggle to scale to large chemical databases. To tackle these challenges, this study introduces a novel GAN based on actor-critic RL with instant and global rewards, called InstGAN, to generate molecules at the token level with multi-property optimization. Furthermore, maximized information entropy is leveraged to alleviate mode collapse. The experimental results demonstrate that InstGAN outperforms other baselines, achieves comparable performance to state-of-the-art models, and efficiently generates molecules with multi-property optimization. The source code will be released upon acceptance of the paper.
Submitted 29 March, 2024;
originally announced April 2024.
-
Enabling self-identification in intelligent agent: insights from computational psychoanalysis
Authors:
Lingyu Li,
Chunbo Li
Abstract:
Building upon a prior framework of computational Lacanian psychoanalysis with the theory of active inference, this paper aims to further explore the concept of self-identification and its potential applications. Beginning with two classic paradigms in psychology, mirror self-recognition and the rubber hand illusion, we suggest that imaginary identification is characterized by an integrated body schema with minimal free energy. Next, we briefly survey three dimensions of symbolic identification (sociological, psychoanalytic, and linguistic) and the corresponding active inference accounts. To provide intuition, we employ a convolutional neural network (CNN) and a multi-layer perceptron (MLP) supervised by ChatGPT, respectively, to showcase optimization of free energy during the motor skill and language mastery underlying identification formation. We then introduce Lacan's Graph II of desire, unifying imaginary and symbolic identification, and propose an illustrative model called FreeAgent. In concluding remarks, we discuss some key issues in the potential of computational Lacanian psychoanalysis to advance mental health and artificial intelligence, including digital twin mind, large language models as avatars of the Lacanian Other, and the feasibility of human-level artificial general intelligence with self-awareness in the context of post-structuralism.
Submitted 12 March, 2024;
originally announced March 2024.
-
Recent progress in the physical principles of dynamic ground self-righting
Authors:
Chen Li
Abstract:
Animals and robots must self-right on the ground after overturning. Biology research has described various strategies and motor patterns in many species. Robotics research has devised many strategies. However, we do not understand well the physical principles of how the need to generate mechanical energy to overcome the potential energy barrier governs behavioral strategies and 3-D body rotations given the morphology. Here I review the progress I led on this question by studying cockroaches self-righting on level, flat, solid, low-friction ground, integrating biology experiments, robotic modeling, and physics modeling.
Submitted 10 June, 2024; v1 submitted 26 February, 2024;
originally announced February 2024.
-
Evaluating Cognitive and Neuropsychological Assessments -- A Comprehensive Review
Authors:
Chuang Li,
Rubing Lin,
Yantong Liu,
Yichen Wei
Abstract:
Cognitive impairments in older adults represent a significant public health concern, necessitating accurate diagnostic and monitoring strategies. In this study, the principal cognitive and neuropsychological evaluations employed for the diagnosis and longitudinal observation of cognitive deficits in the elderly are investigated. An analytical review of instruments including the Mini-Mental State Examination (MMSE), Digit Symbol Substitution Test (DSST), Montreal Cognitive Assessment (MoCA), and Trail Making Test (TMT) is conducted. This examination encompasses an assessment of each instrument's methodology, efficacy, advantages, and limitations. The objective is to enhance comprehension of these assessments for the early identification and effective management of conditions such as dementia and mild cognitive impairment, thereby contributing to the advancement of cognitive health within the geriatric population.
Submitted 22 February, 2024;
originally announced February 2024.
-
scInterpreter: Training Large Language Models to Interpret scRNA-seq Data for Cell Type Annotation
Authors:
Cong Li,
Meng Xiao,
Pengfei Wang,
Guihai Feng,
Xin Li,
Yuanchun Zhou
Abstract:
Despite the inherent limitations of existing Large Language Models in directly reading and interpreting single-cell omics data, they demonstrate significant potential and flexibility as foundation models. This research focuses on how to train and adapt a Large Language Model so that it can interpret and distinguish cell types in single-cell RNA sequencing data. Our preliminary results indicate that these foundation models excel in accurately categorizing known cell types, demonstrating the potential of Large Language Models as effective tools for uncovering new biological insights.
Submitted 18 February, 2024;
originally announced February 2024.
-
Immunogenic cell death triggered by pathogen ligands via host germ line-encoded receptors
Authors:
Chuang Li,
Yichen Wei,
Chao Qin,
Shifan Chen,
Xiaolong Shao
Abstract:
The strategic induction of cell death serves as a crucial immune defense mechanism for the eradication of pathogenic infections within host cells. Investigating the molecular mechanisms underlying immunogenic cell pathways has significantly enhanced our understanding of the host's immunity. This review provides a comprehensive overview of the immunogenic cell death mechanisms triggered by pathogen infections, focusing on the critical role of pattern recognition receptors. In response to infections, host cells dictate a variety of cell death pathways, including apoptosis, pyroptosis, necrosis, and lysosomal cell death, which are essential for amplifying immune responses and controlling pathogen dissemination. Key components of these mechanisms are host cellular receptors that recognize pathogen-associated ligands. These receptors activate downstream signaling cascades, leading to the expression of immunoregulatory genes and the production of antimicrobial cytokines and chemokines. Particularly, the inflammasome, a multi-protein complex, plays a pivotal role in these responses by processing pro-inflammatory cytokines and inducing pyroptotic cell death. Pathogens, in turn, have evolved strategies to manipulate these cell death pathways, either by inhibiting them to facilitate their replication or by triggering them to evade host defenses. This dynamic interplay between host immune mechanisms and pathogen strategies highlights the intricate co-evolution of microbial virulence and host immunity.
Submitted 6 February, 2024;
originally announced February 2024.
-
Multi-Region Markovian Gaussian Process: An Efficient Method to Discover Directional Communications Across Multiple Brain Regions
Authors:
Weihan Li,
Chengrui Li,
Yule Wang,
Anqi Wu
Abstract:
Studying the complex interactions between different brain regions is crucial in neuroscience. Various statistical methods have explored the latent communication across multiple brain regions. Two main categories are the Gaussian Process (GP) and Linear Dynamical System (LDS), each with unique strengths. The GP-based approach effectively discovers latent variables with frequency bands and communication directions. Conversely, the LDS-based approach is computationally efficient but lacks powerful expressiveness in latent representation. In this study, we merge both methodologies by creating an LDS mirroring a multi-output GP, termed Multi-Region Markovian Gaussian Process (MRM-GP). Our work establishes a connection between an LDS and a multi-output GP that explicitly models frequencies and phase delays within the latent space of neural recordings. Consequently, the model achieves a linear inference cost over time points and provides an interpretable low-dimensional representation, revealing communication directions across brain regions and separating oscillatory communications into different frequency bands.
Submitted 30 May, 2024; v1 submitted 4 February, 2024;
originally announced February 2024.
-
A Differentiable Partially Observable Generalized Linear Model with Forward-Backward Message Passing
Authors:
Chengrui Li,
Weihan Li,
Yule Wang,
Anqi Wu
Abstract:
The partially observable generalized linear model (POGLM) is a powerful tool for understanding neural connectivity under the assumption that hidden neurons exist. With spike trains recorded only from visible neurons, existing works use variational inference (VI) to learn the POGLM, while noting the difficulty of learning this latent variable model. There are two main issues: (1) the sampled Poisson hidden spike count hinders the use of the pathwise gradient estimator in VI; and (2) the existing design of the variational model is neither expressive nor time-efficient, which further affects the performance. For (1), we propose a new differentiable POGLM, which enables the pathwise gradient estimator, a better choice than the score function gradient estimator used in existing works. For (2), we propose the forward-backward message-passing sampling scheme for the variational model. Comprehensive experiments show that our differentiable POGLMs with our forward-backward message passing achieve better performance on one synthetic and two real-world datasets. Furthermore, our new method yields more interpretable parameters, underscoring its significance in neuroscience.
Submitted 7 February, 2024; v1 submitted 2 February, 2024;
originally announced February 2024.
-
Quantifying energy landscape of oscillatory systems: Explosion, pre-solution, and diffusion decomposition
Authors:
Shirui Bian,
Ruisong Zhou,
Wei Lin,
Chunhe Li
Abstract:
The energy landscape theory finds both extensive and intensive application in studying the stochastic dynamics of physical and biological systems. Although the weighted summation of the Gaussian approximation (WSGA) approach has been proposed for quantifying the energy landscape in multistable systems by solving the diffusion equation approximately from moment equations, we still lack an accurate approach for quantifying the energy landscape of periodic oscillatory systems. To address this challenge, we propose an approach called the diffusion decomposition of the Gaussian approximation (DDGA). Using typical oscillatory systems as examples, we demonstrate the efficacy of the proposed DDGA in quantifying the energy landscape of oscillatory systems and the corresponding stochastic dynamics, in comparison with existing approaches. By further applying the DDGA to a high-dimensional cell cycle network, we are able to uncover more intricate biological mechanisms in the cell cycle, which cannot be discerned using previously developed approaches.
Submitted 12 January, 2024;
originally announced January 2024.
-
GenoCraft: A Comprehensive, User-Friendly Web-Based Platform for High-Throughput Omics Data Analysis and Visualization
Authors:
Yingzhou Lu,
Minjie Shen,
Yue Zhao,
Chenhao Li,
Fan Meng,
Xiao Wang,
David Herrington,
Yue Wang,
Tim Fu,
Capucine Van Rechem
Abstract:
The surge in high-throughput omics data has reshaped the landscape of biological research, underlining the need for powerful, user-friendly data analysis and interpretation tools. This paper presents GenoCraft, a web-based comprehensive software solution designed to handle the entire pipeline of omics data processing. GenoCraft offers a unified platform featuring advanced bioinformatics tools, covering all aspects of omics data analysis. It encompasses a range of functionalities, such as normalization, quality control, differential analysis, network analysis, pathway analysis, and diverse visualization techniques. This software makes state-of-the-art omics data analysis more accessible to a wider range of users. With GenoCraft, researchers and data scientists have access to an array of cutting-edge bioinformatics tools under a user-friendly interface, making it a valuable resource for managing and analyzing large-scale omics data. The API with an interactive web interface is publicly available at https://genocraft.stanford.edu/. We also release all the code at https://github.com/futianfan/GenoCraft.
Submitted 21 December, 2023;
originally announced December 2023.
-
scBiGNN: Bilevel Graph Representation Learning for Cell Type Classification from Single-cell RNA Sequencing Data
Authors:
Rui Yang,
Wenrui Dai,
Chenglin Li,
Junni Zou,
Dapeng Wu,
Hongkai Xiong
Abstract:
Single-cell RNA sequencing (scRNA-seq) technology provides high-throughput gene expression data to study the cellular heterogeneity and dynamics of complex organisms. Graph neural networks (GNNs) have been widely used for automatic cell type classification, which is a fundamental problem to solve in scRNA-seq analysis. However, existing methods do not sufficiently exploit both gene-gene and cell-cell relationships, and thus the true potential of GNNs is not realized. In this work, we propose a bilevel graph representation learning method, named scBiGNN, to simultaneously mine the relationships at both gene and cell levels for more accurate single-cell classification. Specifically, scBiGNN comprises two GNN modules to identify cell types. A gene-level GNN is established to adaptively learn gene-gene interactions and cell representations via the self-attention mechanism, and a cell-level GNN builds on the cell-cell graph that is constructed from the cell representations generated by the gene-level GNN. To tackle the scalability issue for processing a large number of cells, scBiGNN adopts an Expectation Maximization (EM) framework in which the two modules are alternately trained via the E-step and M-step to learn from each other. Through this interaction, the gene- and cell-level structural information is integrated to gradually enhance the classification performance of both GNN modules. Experiments on benchmark datasets demonstrate that our scBiGNN outperforms a variety of existing methods for cell type classification from scRNA-seq data.
Submitted 15 December, 2023;
originally announced December 2023.
-
Morphological Profiling for Drug Discovery in the Era of Deep Learning
Authors:
Qiaosi Tang,
Ranjala Ratnayake,
Gustavo Seabra,
Zhe Jiang,
Ruogu Fang,
Lina Cui,
Yousong Ding,
Tamer Kahveci,
Jiang Bian,
Chenglong Li,
Hendrik Luesch,
Yanjun Li
Abstract:
Morphological profiling is a valuable tool in phenotypic drug discovery. The advent of high-throughput automated imaging has enabled the capturing of a wide range of morphological features of cells or organisms in response to perturbations at single-cell resolution. Concurrently, significant advances in machine learning and deep learning, especially in computer vision, have led to substantial improvements in analyzing large-scale high-content images at high throughput. These efforts have facilitated understanding of compound mechanism-of-action (MOA), drug repurposing, and characterization of cell morphodynamics under perturbation, ultimately contributing to the development of novel therapeutics. In this review, we provide a comprehensive overview of the recent advances in the field of morphological profiling. We summarize the image profiling analysis workflow, survey a broad spectrum of analysis strategies encompassing feature engineering- and deep learning-based approaches, and introduce publicly available benchmark datasets. We place a particular emphasis on the application of deep learning in this pipeline, covering cell segmentation, image representation learning, and multimodal learning. Additionally, we illuminate the application of morphological profiling in phenotypic drug discovery and highlight potential challenges and opportunities in this field.
Submitted 15 January, 2024; v1 submitted 13 December, 2023;
originally announced December 2023.
-
Noise-induced stochastic Nash equilibrium
Authors:
Cong Li,
Tianjiao Feng,
Xiudeng Zheng,
Sabin Lessard,
Yi Tao
Abstract:
In order to better understand the impact of environmental stochastic fluctuations on the evolution of animal behavior, we introduce the concept of a stochastic Nash equilibrium (SNE) that extends the classical concept of a Nash equilibrium (NE). Based on a stochastic stability analysis of a linear evolutionary game with temporally varying payoffs, we address the question of the existence of a SNE, either weak when the geometric mean payoff against it is the same for all other strategies or strong when it is strictly smaller for all other strategies, and its relationship with a stochastically evolutionarily stable (SES) strategy. While a strong SNE is always SES, this is not necessarily the case for a weak SNE. We give conditions for a completely mixed weak SNE not to be SES and to coexist with at least two strong SNE. More importantly, we show that a pair of two completely mixed strong SNE can emerge as the noise level increases. This not only indicates that a noise-induced SNE may possess some properties that a NE cannot possess, such as being completely mixed and strong, but also illustrates the complexity of evolutionary game dynamics in a stochastic environment.
Submitted 25 October, 2023;
originally announced October 2023.
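The weak/strong distinction in this abstract rests on comparing geometric mean payoffs under temporally varying conditions. The following is only a minimal sketch of that quantity, not the paper's stability analysis; the function name and the payoff values are hypothetical and chosen for illustration.

```python
import math

def geometric_mean(payoffs):
    """Geometric mean of strictly positive payoffs, computed in log space."""
    if not payoffs or any(p <= 0 for p in payoffs):
        raise ValueError("payoffs must be a non-empty sequence of positive numbers")
    return math.exp(sum(math.log(p) for p in payoffs) / len(payoffs))

# Hypothetical payoff series with the same arithmetic mean (2.0).
steady = [2.0, 2.0, 2.0, 2.0]   # no environmental noise
noisy = [1.0, 3.0, 1.0, 3.0]    # fluctuating environment

# Fluctuations lower the geometric mean even when the arithmetic mean is unchanged.
print(geometric_mean(steady), geometric_mean(noisy))
```

Because the geometric mean penalizes variance, two strategies with equal average payoff can rank differently once noise is introduced, which is the kind of effect a noise-dependent equilibrium concept has to account for.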
-
One-hot Generalized Linear Model for Switching Brain State Discovery
Authors:
Chengrui Li,
Soon Ho Kim,
Chris Rodgers,
Hannah Choi,
Anqi Wu
Abstract:
Exposing meaningful and interpretable neural interactions is critical to understanding neural circuits. Inferred neural interactions from neural signals primarily reflect functional interactions. In a long experiment, subject animals may experience different stages defined by the experiment, stimuli, or behavioral states, and hence functional interactions can change over time. To model dynamically changing functional interactions, prior work employs state-switching generalized linear models with hidden Markov models (i.e., HMM-GLMs). However, we argue they lack biological plausibility, as functional interactions are shaped and confined by the underlying anatomical connectome. Here, we propose a novel prior-informed state-switching GLM. We introduce both a Gaussian prior and a one-hot prior over the GLM in each state. The priors are learnable. We will show that the learned prior should capture the state-constant interaction, shedding light on the underlying anatomical connectome and revealing more likely physical neuron interactions. The state-dependent interaction modeled by each GLM offers traceability to capture functional variations across multiple brain states. Our methods effectively recover true interaction structures in simulated data, achieve the highest predictive likelihood with real neural datasets, and render interaction structures and hidden states more interpretable when applied to real neural data.
Submitted 23 October, 2023;
originally announced October 2023.
-
Schizophrenia research under the framework of predictive coding: body, language, and others
Authors:
Lingyu Li,
Chunbo Li
Abstract:
Although there have been so many studies on schizophrenia under the framework of predictive coding, works focusing on treatment are very preliminary. A model-oriented, operationalist, and comprehensive understanding of schizophrenia would promote the therapy turn of further research. We summarize predictive coding models of embodiment, co-occurrence of over- and under-weighting priors, subjective time processing, language production or comprehension, self-or-other inference, and social interaction. Corresponding impairments and clinical manifestations of schizophrenia are reviewed under these models at the same time. Finally, we discuss why and how to inaugurate a therapy turn of further research under the framework of predictive coding.
Submitted 13 September, 2023;
originally announced September 2023.
-
An active inference model of Lacanian psychoanalysis
Authors:
Lingyu Li,
Chunbo Li
Abstract:
There has been a growing interest in exploring behavior, brain, and mind through the lens of complex systems theory. However, a unified and computational model that comprehensively encapsulates the properties of the human mind remains elusive. To address this gap, we propose a recurrent generative model drawing upon Lacanian psychoanalysis and active inference. We conceptualize the mechanism of desire as partial generalized synchronization, and then apply the model to suicidal dynamics to illustrate the theoretical and practical implications of our model. This work on computational psychoanalysis reveals its potential in unraveling complex mental phenomena.
Submitted 13 September, 2023;
originally announced September 2023.
-
Meta predictive learning model of languages in neural circuits
Authors:
Chan Li,
Junbin Qiu,
Haiping Huang
Abstract:
Large language models based on self-attention mechanisms have achieved astonishing performance not only in natural language itself, but also in a variety of tasks of different nature. However, regarding processing language, our human brain may not operate using the same principle. This has prompted a debate on the connection between brain computation and the artificial self-supervision adopted in large language models. One of the most influential hypotheses in brain computation is the predictive coding framework, which proposes to minimize the prediction error by local learning. However, the role of predictive coding and the associated credit assignment in language processing remains unknown. Here, we propose a mean-field learning model within the predictive coding framework, assuming that the synaptic weight of each connection follows a spike and slab distribution, and only the distribution, rather than specific weights, is trained. This meta predictive learning is successfully validated on classifying handwritten digits where pixels are input to the network in sequence, and moreover on the toy and real language corpus. Our model reveals that most of the connections become deterministic after learning, while the output connections have a higher level of variability. The performance of the resulting network ensemble changes continuously with data load, further improving with more training data, in analogy with the emergent behavior of large language models. Therefore, our model provides a starting point to investigate the connection among brain computation, next-token prediction and general intelligence.
Submitted 9 October, 2023; v1 submitted 7 September, 2023;
originally announced September 2023.
-
Extraction and Recovery of Spatio-Temporal Structure in Latent Dynamics Alignment with Diffusion Models
Authors:
Yule Wang,
Zijing Wu,
Chengrui Li,
Anqi Wu
Abstract:
In the field of behavior-related brain computation, it is necessary to align raw neural signals against the drastic domain shift among them. A foundational framework within neuroscience research posits that trial-based neural population activities rely on low-dimensional latent dynamics, thus focusing on the latter greatly facilitates the alignment procedure. Despite this field's progress, existing methods ignore the intrinsic spatio-temporal structure during the alignment phase. Hence, their solutions usually lead to poor quality in latent dynamics structures and overall performance. To tackle this problem, we propose an alignment method ERDiff, which leverages the expressivity of the diffusion model to preserve the spatio-temporal structure of latent dynamics. Specifically, the latent dynamics structures of the source domain are first extracted by a diffusion model. Then, under the guidance of this diffusion model, such structures are well-recovered through a maximum likelihood alignment procedure in the target domain. We first demonstrate the effectiveness of our proposed method on a synthetic dataset. Then, when applied to neural recordings from the non-human primate motor cortex, under both cross-day and inter-subject settings, our method consistently manifests its capability of preserving the spatiotemporal structure of latent dynamics and outperforms existing approaches in alignment goodness-of-fit and neural decoding performance.
Submitted 8 March, 2024; v1 submitted 9 June, 2023;
originally announced June 2023.
-
Causal Intervention for Measuring Confidence in Drug-Target Interaction Prediction
Authors:
Wenting Ye,
Chen Li,
Yang Xie,
Wen Zhang,
Hong-Yu Zhang,
Bowen Wang,
Debo Cheng,
Zaiwen Feng
Abstract:
Identifying and discovering drug-target interactions (DTIs) are vital steps in drug discovery and development. They play a crucial role in assisting scientists in finding new drugs and accelerating the drug development process. Recently, knowledge graph and knowledge graph embedding (KGE) models have made rapid advancements and demonstrated impressive performance in drug discovery. However, such models lack authenticity and accuracy in drug target identification, leading to an increased misjudgment rate and reduced drug development efficiency. To address these issues, we focus on the problem of drug-target interactions, with knowledge mapping as the core technology. Specifically, a causal intervention-based confidence measure is employed to assess the triplet score to improve the accuracy of the drug-target interaction prediction model. Experimental results demonstrate that the developed confidence measurement method based on causal intervention can significantly enhance the accuracy of DTI link prediction, particularly for high-precision models. The predicted results are more valuable in guiding the design and development of subsequent drug development experiments, thereby significantly improving the efficiency of drug development.
Submitted 14 November, 2023; v1 submitted 31 May, 2023;
originally announced June 2023.
-
Energy landscape reveals the underlying mechanism of cancer-adipose conversion with gene network models
Authors:
Zihao Chen,
Jia Lu,
Xing-Ming Zhao,
Haiyang Yu,
Chunhe Li
Abstract:
Cancer is a systemic heterogeneous disease involving complex molecular networks. Tumor formation involves epithelial-mesenchymal transition (EMT), which promotes both metastasis and plasticity of cancer cells. Recent experiments proposed that cancer cells can be transformed into adipocytes with combination drugs. However, the underlying mechanisms for how these drugs work from a molecular network perspective remain elusive. To reveal the mechanism of cancer-adipose conversion (CAC), we adopt a systems biology approach by combining mathematical modeling and molecular experiments based on the underlying molecular regulatory network. We identified four types of attractors which correspond to epithelial (E), mesenchymal (M), adipose (A) and partial/intermediate EMT (P) cell states on the CAC landscape. Landscape and transition path results illustrate that the intermediate states play critical roles in the cancer-to-adipose transition. Through a landscape control strategy, we identified two new therapeutic strategies for drug combinations to promote CAC. We further verified these predictions by molecular experiments in different cell lines. Our combined computational and experimental approach provides a powerful tool to explore molecular mechanisms for cell fate transitions in cancer networks. Our results revealed the underlying mechanism for intermediate cell states governing the CAC, and identified new potential drug combinations to induce cancer adipogenesis.
Submitted 21 May, 2023;
originally announced May 2023.
-
Machine learning traction force maps of cell monolayers
Authors:
Changhao Li,
Luyi Feng,
Yang Jeong Park,
Jian Yang,
Ju Li,
Sulin Zhang
Abstract:
Cellular force transmission across a hierarchy of molecular switchers is central to mechanobiological responses. However, current cellular force microscopies suffer from low throughput and resolution. Here we introduce and train a generative adversarial network (GAN) to paint out traction force maps of cell monolayers with high fidelity to the experimental traction force microscopy (TFM). The GAN analyzes traction force maps as an image-to-image translation problem, where its generative and discriminative neural networks are simultaneously cross-trained by hybrid experimental and numerical datasets. In addition to capturing the colony-size and substrate-stiffness dependent traction force maps, the trained GAN predicts asymmetric traction force patterns for multicellular monolayers seeding on substrates with stiffness gradient, implicating collective durotaxis. Further, the neural network can extract the experimentally inaccessible, hidden relationship between substrate stiffness and cell contractility, which underlies cellular mechanotransduction. Trained solely on datasets for epithelial cells, the GAN can be extrapolated to other contractile cell types using only a single scaling factor. The digital TFM serves as a high-throughput tool for mapping out cellular forces of cell monolayers and paves the way toward data-driven discoveries in cell mechanobiology.
Submitted 19 April, 2023;
originally announced April 2023.
-
EquiPocket: an E(3)-Equivariant Geometric Graph Neural Network for Ligand Binding Site Prediction
Authors:
Yang Zhang,
Zhewei Wei,
Ye Yuan,
Chongxuan Li,
Wenbing Huang
Abstract:
Predicting the binding sites of target proteins plays a fundamental role in drug discovery. Most existing deep-learning methods consider a protein as a 3D image by spatially clustering its atoms into voxels and then feed the voxelized protein into a 3D CNN for prediction. However, the CNN-based methods encounter several critical issues: 1) defective in representing irregular protein structures; 2) sensitive to rotations; 3) insufficient to characterize the protein surface; 4) unaware of protein size shift. To address the above issues, this work proposes EquiPocket, an E(3)-equivariant Graph Neural Network (GNN) for binding site prediction, which comprises three modules: the first one to extract local geometric information for each surface atom, the second one to model both the chemical and spatial structure of protein and the last one to capture the geometry of the surface via equivariant message passing over the surface atoms. We further propose a dense attention output layer to alleviate the effect incurred by variable protein size. Extensive experiments on several representative benchmarks demonstrate the superiority of our framework to the state-of-the-art methods.
Submitted 8 June, 2024; v1 submitted 23 February, 2023;
originally announced February 2023.
-
Music Enhances Activity in the Hypothalamus, Brainstem, and Anterior Cerebellum during Script-Driven Imagery of Affective Scenes
Authors:
Chia-Wei Li,
Tzu-Han Cheng,
Chen-Gia Tsai
Abstract:
Music is frequently used to establish atmosphere and to enhance/alter emotion in dramas and films. During music listening, visual imagery is a common mechanism underlying emotion induction. The present functional magnetic resonance imaging (fMRI) study examined the neural substrates of the emotional processing of music and imagined scene. A factorial design was used with factors emotion valence (positive; negative) and music (withoutMUSIC: script-driven imagery of emotional scenes; withMUSIC: script-driven imagery of emotional scenes and simultaneously listening to affectively congruent music). The baseline condition was imagery of neutral scenes in the absence of music. Eleven females and five males participated in this fMRI study. The contrasts of positive and negative withoutMUSIC conditions minus the baseline (imagery of neutral scenes) showed no significant activation. When comparing the withMUSIC to withoutMUSIC conditions, activity in a number of emotion-related regions was observed, including the temporal pole (TP), amygdala, hippocampus, hypothalamus, anterior ventral tegmental area (VTA), locus coeruleus, and anterior cerebellum. We hypothesized that the TP may integrate music and the imagined scene to extract socioemotional significance, initiating the subcortical structures to generate subjective feelings and bodily responses. For the withMUSIC conditions, negative emotions were associated with enhanced activation in the posterior VTA compared to positive emotions. Our findings replicated and extended previous research which suggests that different subregions of the VTA are sensitive to rewarding and aversive stimuli. Taken together, this study suggests that emotional music embedded in an imagined scenario is a salient social signal that prompts preparation of approach/avoidance behaviours and emotional responses in listeners.
Submitted 25 January, 2023;
originally announced January 2023.
-
Reward prediction errors arising from switches between major and minor modes in music: An fMRI study
Authors:
Chen-Gia Tsai,
Yi-Fan Fu,
Chia-Wei Li
Abstract:
Evidence has accumulated that prediction error processing plays a role in the enjoyment of music listening. The present study examined listeners' neural responses to the signed reward prediction errors (RPEs) arising from switches between major and minor modes in music. We manipulated the final chord of J. S. Bach's keyboard pieces so that each major-mode passage ended with either the major (Major-Major) or minor (Major-Minor) tonic chord, and each minor-mode passage ended with either the minor (Minor-Minor) or major (Minor-Major) tonic chord. In Western music, the major and minor modes have positive and negative connotations, respectively. Therefore, the outcome of the final chord in Major-Minor stimuli was associated with negative RPE, whereas that in Minor-Major was associated with positive RPE. Twenty-three musically experienced adults underwent functional magnetic resonance imaging while listening to Major-Major, Major-Minor, Minor-Minor, and Minor-Major stimuli. We found that activity in the subgenual anterior cingulate cortex (extending into the ventromedial prefrontal cortex) during the final chord for Major-Major was significantly higher than that for Major-Minor. Conversely, a frontoparietal network for Major-Minor exhibited significantly increased activity compared to Major-Major. The contrasts between Minor-Minor and Minor-Major yielded regions implicated in interoception. We discuss our results in relation to executive functions and the emotional connotations of major versus minor mode.
Submitted 23 December, 2022;
originally announced December 2022.
-
Statistical mechanics of continual learning: variational principle and mean-field potential
Authors:
Chan Li,
Zhenye Huang,
Wenxuan Zou,
Haiping Huang
Abstract:
An obstacle to artificial general intelligence is set by continual learning of multiple tasks of different nature. Recently, various heuristic tricks, both from machine learning and from neuroscience angles, were proposed, but they lack a unified theory ground. Here, we focus on continual learning in single-layered and multi-layered neural networks of binary weights. A variational Bayesian learning setting is thus proposed, where the neural networks are trained in a field-space, rather than gradient-ill-defined discrete-weight space, and furthermore, weight uncertainty is naturally incorporated, and modulates synaptic resources among tasks. From a physics perspective, we translate the variational continual learning into Franz-Parisi thermodynamic potential framework, where previous task knowledge acts as a prior and a reference as well. We thus interpret the continual learning of the binary perceptron in a teacher-student setting as a Franz-Parisi potential computation. The learning performance can then be analytically studied with mean-field order parameters, whose predictions coincide with numerical experiments using stochastic gradient descent methods. Based on the variational principle and Gaussian field approximation of internal preactivations in hidden layers, we also derive the learning algorithm considering weight uncertainty, which solves the continual learning with binary weights using multi-layered neural networks, and performs better than the currently available metaplasticity algorithm. Our proposed principled frameworks also connect to elastic weight consolidation, weight-uncertainty modulated learning, and neuroscience inspired metaplasticity, providing a theory-grounded method for the real-world multi-task learning with deep networks.
Submitted 20 June, 2023; v1 submitted 6 December, 2022;
originally announced December 2022.
-
The Far Side of Failure: Investigating the Impact of Speech Recognition Errors on Subsequent Dementia Classification
Authors:
Changye Li,
Trevor Cohen,
Serguei Pakhomov
Abstract:
Linguistic anomalies detectable in spontaneous speech have shown promise for various clinical applications including screening for dementia and other forms of cognitive impairment. The feasibility of deploying automated tools that can classify language samples obtained from speech in large-scale clinical settings depends on the ability to capture and automatically transcribe the speech for subsequent analysis. However, the impressive performance of self-supervised learning (SSL) automatic speech recognition (ASR) models with curated speech data is not apparent with challenging speech samples from clinical settings. One of the key questions for successfully applying ASR models for clinical applications is whether imperfect transcripts they generate provide sufficient information for downstream tasks to operate at an acceptable level of accuracy. In this study, we examine the relationship between the errors produced by several deep learning ASR systems and their impact on the downstream task of dementia classification. One of our key findings is that, paradoxically, ASR systems with relatively high error rates can produce transcripts that result in better downstream classification accuracy than classification based on verbatim transcripts.
Submitted 11 November, 2022;
originally announced November 2022.
-
In vivo labeling and quantitative imaging of neurons using MRI
Authors:
Shana Li,
Xiang Xu,
Canjun Li,
Ziyan Xu,
Qiong Ye,
Yan Zhang,
Chunlei Cang,
Jie Wen
Abstract:
The mammalian brain is a complex organ that contains billions of neurons. These neurons form various neural circuits that control perception, cognition, emotion and behavior. Developing in vivo neuronal labeling and imaging techniques is crucial for studying the structure and function of neural circuits. In vivo techniques can provide true physiological information that cannot be provided by ex vivo methods. In this study, we describe a new strategy for in vivo neuronal labeling and quantification using MRI. To demonstrate the ability of this new method, we used a neurotropic virus to deliver the oatp1a1 gene to the target neural circuit. OATP1A1 protein is expressed on the neuronal membrane and can increase the uptake of a specific MRI contrast agent (Gd-EOB-DTPA). By using T1-weighted images for observation, labeled neurons "light up" on MRI. We further use a dynamic-contrast-enhancement based method to obtain measures that provide quantitative information of labeled neurons in vivo.
Submitted 12 November, 2022;
originally announced November 2022.
-
Biofilms as self-shaping growing nematics
Authors:
Japinder Nijjer,
Mrityunjay Kothari,
Changhao Li,
Thomas Henzel,
Qiuting Zhang,
Jung-Shen B. Tai,
Shuang Zhou,
Sulin Zhang,
Tal Cohen,
Jing Yan
Abstract:
Active nematics are the nonequilibrium analog of passive liquid crystals in which anisotropic units consume free energy to drive emergent behavior. Similar to liquid crystal (LC) molecules in displays, ordering and dynamics in active nematics are sensitive to boundary conditions; however, unlike passive liquid crystals, active nematics, such as those composed of living matter, have the potential to regulate their boundaries through self-generated stresses. Here, using bacterial biofilms confined by a hydrogel as a model system, we show how a three-dimensional, living nematic can actively shape itself and its boundary in order to regulate its internal architecture through growth-induced stresses. We show that biofilms exhibit a sharp transition in shape from domes to lenses upon changing environmental stiffness or cell-substrate friction, which is explained by a theoretical model considering the competition between confinement and interfacial forces. The growth mode defines the progression of the boundary, which in turn determines the trajectories and spatial distribution of cell lineages. We further demonstrate that the evolving boundary defines the orientational ordering of cells and the emergence of topological defects in the interior of the biofilm. Our findings reveal novel self-organization phenomena in confined active matter and provide strategies for guiding the development of programmed microbial consortia with emergent material properties.
Submitted 7 October, 2022;
originally announced October 2022.
-
Equivariant Energy-Guided SDE for Inverse Molecular Design
Authors:
Fan Bao,
Min Zhao,
Zhongkai Hao,
Peiyao Li,
Chongxuan Li,
Jun Zhu
Abstract:
Inverse molecular design is critical in material science and drug discovery, where the generated molecules should satisfy certain desirable properties. In this paper, we propose equivariant energy-guided stochastic differential equations (EEGSDE), a flexible framework for controllable 3D molecule generation under the guidance of an energy function in diffusion models. Formally, we show that EEGSDE naturally exploits the geometric symmetry in 3D molecular conformation, as long as the energy function is invariant to orthogonal transformations. Empirically, under the guidance of designed energy functions, EEGSDE significantly improves the baseline on QM9, in inverse molecular design targeted to quantum properties and molecular structures. Furthermore, EEGSDE is able to generate molecules with multiple target properties by combining the corresponding energy functions linearly.
Submitted 28 February, 2023; v1 submitted 30 September, 2022;
originally announced September 2022.
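The EEGSDE abstract states two properties that a small numerical check can make concrete: energy functions are combined linearly for multiple targets, and the energy should be invariant to orthogonal transformations. The sketch below is not the authors' implementation; the two energy functions, their weights, and the coordinates are hypothetical stand-ins built only from distances, which is one simple way to obtain such invariance.

```python
import math

def centroid(points):
    """Mean position of a list of 3D points."""
    n = len(points)
    return tuple(sum(p[k] for p in points) / n for k in range(3))

def energy_spread(points):
    """Hypothetical energy: mean squared distance to the centroid."""
    c = centroid(points)
    return sum(math.dist(p, c) ** 2 for p in points) / len(points)

def energy_pair(points, target=1.5):
    """Hypothetical energy: mean squared deviation of pairwise distances from a target."""
    pairs = [(i, j) for i in range(len(points)) for j in range(i + 1, len(points))]
    return sum((math.dist(points[i], points[j]) - target) ** 2 for i, j in pairs) / len(pairs)

def combined_energy(points, weights=(1.0, 0.5)):
    """Linear combination of the two energies (weights are assumptions)."""
    return weights[0] * energy_spread(points) + weights[1] * energy_pair(points)

def rotate_z_and_reflect(points, angle):
    """Apply an orthogonal map: rotation about the z axis followed by z -> -z."""
    c, s = math.cos(angle), math.sin(angle)
    return [(c * x - s * y, s * x + c * y, -z) for x, y, z in points]

# Hypothetical atom coordinates.
atoms = [(0.0, 0.0, 0.0), (1.0, 0.2, -0.3), (-0.5, 1.1, 0.4), (0.3, -0.8, 1.2)]
moved = rotate_z_and_reflect(atoms, angle=0.7)

# The combined energy agrees before and after the transformation, up to rounding.
print(abs(combined_energy(atoms) - combined_energy(moved)) < 1e-9)
```

Since each term depends only on distances, any weighted sum of them inherits the invariance, which is why a linear combination is a convenient way to target several properties at once.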
-
Quantifying the attractor landscape and transition path of distributed working memory from large-scale brain network
Authors:
Leijun Ye,
Chunhe Li
Abstract:
Many cognitive processes, including working memory, recruit multiple distributed interacting brain regions to encode information. How to understand the underlying cognition function mechanism of working memory is a challenging problem, which involves neural circuit configuration from multiple brain regions as well as stochastic transition dynamics between brain states. The energy landscape idea provides a tool to study the global stability and stochastic transition dynamics in the distributed cognitive function system. However, how to quantify the energy landscape in a realistic large-scale brain network remains unclear. Here, based on an anatomically constrained computational model of large-scale macaque cortex, we quantified the underlying multistable attractor landscape of distributed working memory. In the absence of external stimulation, the landscape exhibits three stable attractors, a spontaneous state, and two memory states. In the attractor landscape framework, the working memory function is governed by the change of landscape topography and the switch of system state according to the task requirement. The barrier height inferred from landscape topography quantifies the global stability of memory state and robustness to non-selective random fluctuations and distractor stimuli. The kinetic transition path identified by the minimum action path approach reveals that the spontaneous state serves as an intermediate state during the switch between the two memory states, the memory stored in the cortical area with higher hierarchy is more stable, and information flow follows the direction of hierarchical structure. These results provide new insights into the underlying mechanism of distributed working memory function, and the landscape and kinetic path approach can be applied to other cognitive function-related problems in brain networks.
Submitted 11 September, 2022;
originally announced September 2022.
-
Emergence of hierarchical modes from deep learning
Authors:
Chan Li,
Haiping Huang
Abstract:
Large-scale deep neural networks are expensive to train, yet training yields weight matrices that are hard to interpret. Here, we propose a mode decomposition learning that can interpret the weight matrices as a hierarchy of latent modes. These modes are akin to patterns in physics studies of memory networks, but the minimal number of modes increases only logarithmically with the network width, and even becomes a constant when the width grows further. The mode decomposition learning not only saves a significant amount of training cost, but also explains the network performance with the leading modes, displaying a striking piecewise power-law behavior. The modes specify a progressively compact latent space across the network hierarchy, yielding more disentangled subspaces compared to standard training. Our mode decomposition learning is also studied in an analytic on-line learning setting, which reveals multi-stage learning dynamics with a continuous specialization of hidden nodes. Therefore, the proposed mode decomposition learning points to a cheap and interpretable route towards the magical deep learning.
Submitted 27 February, 2023; v1 submitted 21 August, 2022;
originally announced August 2022.
-
Simulation of snakes using vertical body bending to traverse terrain with large height variation
Authors:
Yifeng Zhang,
Qihan Xuan,
Qiyuan Fu,
Chen Li
Abstract:
Snakes move across various terrains by bending their elongated bodies. Recent studies discovered that snakes can use vertical bending to traverse terrain of large height variation, such as horizontally oriented cylinders, a wedge (Jurestovsky, Usher, Astley, 2021, J. Exp. Biol.), and uneven terrain (Fu & Li, 2020, Roy. Soc. Open Sci.; Fu, Astley, Li, 2022, Bioinspiration & Biomimetics). Here, to understand how vertical bending generates propulsion, we developed a dynamic simulation of a snake traversing a wedge (height = 0.05 body length, slope = 27 degrees) and a half cylindrical obstacle (height = 0.1 body length). By propagating down the body an internal torque profile with a maximum around the obstacle, the simulated snake moved forward as observed in the animal. Remarkably, even when frictional drag is low (snake-terrain kinetic friction coefficient of 0.20), the body must push against the wedge with a pressure 5 times that from body weight to generate sufficient propulsion to move forward. This indicates that snakes are highly capable of bending vertically to push against the environment to generate propulsion. Testing different controllers revealed that contact force feedback further helps generate and maintain propulsion effectively under unknown terrain perturbations.
Submitted 26 July, 2022;
originally announced July 2022.
-
How is model-related uncertainty quantified and reported in different disciplines?
Authors:
Emily G. Simmonds,
Kwaku Peprah Adjei,
Christoffer Wold Andersen,
Janne Cathrin Hetle Aspheim,
Claudia Battistin,
Nicola Bulso,
Hannah Christensen,
Benjamin Cretois,
Ryan Cubero,
Ivan A. Davidovich,
Lisa Dickel,
Benjamin Dunn,
Etienne Dunn-Sigouin,
Karin Dyrstad,
Sigurd Einum,
Donata Giglio,
Haakon Gjerlow,
Amelie Godefroidt,
Ricardo Gonzalez-Gil,
Soledad Gonzalo Cogno,
Fabian Grosse,
Paul Halloran,
Mari F. Jensen,
John James Kennedy,
Peter Egge Langsaether
, et al. (18 additional authors not shown)
Abstract:
How do we know how much we know? Quantifying uncertainty associated with our modelling work is the only way we can answer how much we know about any phenomenon. With quantitative science now highly influential in the public sphere and the results from models translating into action, we must support our conclusions with sufficient rigour to produce useful, reproducible results. Incomplete consideration of model-based uncertainties can lead to false conclusions with real world impacts. Despite these potentially damaging consequences, uncertainty consideration is incomplete both within and across scientific fields. We take a unique interdisciplinary approach and conduct a systematic audit of model-related uncertainty quantification from seven scientific fields, spanning the biological, physical, and social sciences. Our results show no single field is achieving complete consideration of model uncertainties, but together we can fill the gaps. We propose opportunities to improve the quantification of uncertainty through use of a source framework for uncertainty consideration, model type specific guidelines, improved presentation, and shared best practice. We also identify shared outstanding challenges (uncertainty in input data, balancing trade-offs, error propagation, and defining how much uncertainty is required). Finally, we make nine concrete recommendations for current practice (following good practice guidelines and an uncertainty checklist, presenting uncertainty numerically, and propagating model-related uncertainty into conclusions), future research priorities (uncertainty in input data, quantifying uncertainty in complex models, and the importance of missing uncertainty in different contexts), and general research standards across the sciences (transparency about study limitations and dedicated uncertainty sections of manuscripts).
Submitted 1 July, 2022; v1 submitted 24 June, 2022;
originally announced June 2022.
-
Fractional SEIR Model and Data-Driven Predictions of COVID-19 Dynamics of Omicron Variant
Authors:
Min Cai,
George Em Karniadakis,
Changpin Li
Abstract:
We study the dynamic evolution of COVID-19 caused by the Omicron variant via a fractional susceptible-exposed-infected-removed (SEIR) model. Preliminary data suggest that the symptoms of Omicron infection are not prominent and the transmission is therefore more concealed, which causes a relatively slow increase in the detected cases of new infections at the beginning of the pandemic. To characterize these specific dynamics, the Caputo-Hadamard fractional derivative is adopted to refine the classical SEIR model. Based on the reported data, we infer the fractional order, time-dependent parameters, as well as unobserved dynamics of the fractional SEIR model via fractional physics-informed neural networks (fPINNs). Then, we make short-time predictions using the learned fractional SEIR model.
Submitted 23 May, 2022;
originally announced May 2022.
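For orientation, the classical integer-order SEIR model that the abstract above takes as its starting point can be sketched as follows. This is only an illustration of the baseline compartment model, not the authors' fractional (Caputo-Hadamard) formulation or their fPINN inference. The rate values `beta`, `sigma`, `gamma`, the initial state, and the fixed-step RK4 integrator are all assumptions chosen for the example.

```python
def seir_rhs(state, beta, sigma, gamma):
    """Right-hand side of the classical (integer-order) SEIR equations."""
    s, e, i, r = state
    n = s + e + i + r
    new_exposed = beta * s * i / n      # S -> E flow (mass-action contact)
    return (-new_exposed,               # dS/dt
            new_exposed - sigma * e,    # dE/dt
            sigma * e - gamma * i,      # dI/dt
            gamma * i)                  # dR/dt


def rk4_step(state, dt, beta, sigma, gamma):
    """Advance the state by one fourth-order Runge-Kutta step."""
    def shifted(base, slope, h):
        return tuple(b + h * k for b, k in zip(base, slope))

    k1 = seir_rhs(state, beta, sigma, gamma)
    k2 = seir_rhs(shifted(state, k1, dt / 2), beta, sigma, gamma)
    k3 = seir_rhs(shifted(state, k2, dt / 2), beta, sigma, gamma)
    k4 = seir_rhs(shifted(state, k3, dt), beta, sigma, gamma)
    return tuple(x + dt / 6 * (a + 2 * b + 2 * c + d)
                 for x, a, b, c, d in zip(state, k1, k2, k3, k4))


def simulate(state, days, dt=0.1, beta=0.5, sigma=0.2, gamma=0.1):
    """Integrate the model; the default rates are hypothetical, not fitted."""
    steps = int(round(days / dt))
    trajectory = [state]
    for _ in range(steps):
        state = rk4_step(state, dt, beta, sigma, gamma)
        trajectory.append(state)
    return trajectory


# Hypothetical population of 10,000 with 10 initially infectious individuals.
trajectory = simulate((9990.0, 0.0, 10.0, 0.0), days=160)
```

The four flows sum to zero, so the total population is conserved along the trajectory; a fractional-order variant would replace the ordinary time derivative and needs a different integrator.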
-
Brain Cortical Functional Gradients Predict Cortical Folding Patterns via Attention Mesh Convolution
Authors:
Li Yang,
Zhibin He,
Changhe Li,
Junwei Han,
Dajiang Zhu,
Tianming Liu,
Tuo Zhang
Abstract:
Since gyri and sulci, two basic anatomical building blocks of cortical folding patterns, were suggested to bear different functional roles, a precise mapping from brain function to gyro-sulcal patterns can provide profound insights into both biological and artificial neural networks. However, a generic theory and an effective computational model are still lacking, due to the highly nonlinear relation between them, huge inter-individual variabilities and a sophisticated description of brain function regions/networks distribution as mosaics, such that spatial patterning of them has not been considered. We adopted brain functional gradients derived from resting-state fMRI to embed the "gradual" change of functional connectivity patterns, and developed a novel attention mesh convolution model to predict cortical gyro-sulcal segmentation maps on individual brains. The convolution on mesh considers the spatial organization of functional gradients and folding patterns on a cortical sheet and the newly designed channel attention block enhances the interpretability of the contribution of different functional gradients to cortical folding prediction. Experiments show that our model outperforms other state-of-the-art models in prediction performance. In addition, we found that the dominant functional gradients contribute less to folding prediction. On the activation maps of the last layer, some well-studied cortical landmarks are found on the borders of, rather than within, the highly activated regions. These results and findings suggest that a specifically designed artificial neural network can improve the precision of the mapping between brain functions and cortical folding patterns, and can provide valuable insight into the brain anatomy-function relation for neuroscience.
Submitted 21 May, 2022;
originally announced May 2022.
-
Topological EEG Nonlinear Dynamics Analysis for Emotion Recognition
Authors:
Yan Yan,
Xuankun Wu,
Chengdong Li,
Yini He,
Zhicheng Zhang,
Huihui Li,
Ang Li,
Lei Wang
Abstract:
Emotion recognition through exploring electroencephalography (EEG) characteristics has been widely performed in recent studies. Nonlinear analysis and feature extraction methods for understanding the complex dynamical phenomena are associated with the EEG patterns of different emotions. The phase space reconstruction is a typical nonlinear technique to reveal the dynamics of the brain neural system. Recently, the topological data analysis (TDA) scheme has been used to explore the properties of space, which provides a powerful tool to examine the phase space. In this work, we propose a topological EEG nonlinear dynamics analysis approach that uses the phase space reconstruction (PSR) technique to convert EEG time series into phase space and the persistent homology tool to explore the topological properties of the phase space. We perform the topological analysis of EEG signals in different rhythm bands to build emotion feature vectors, which show high distinguishing ability. We evaluate the approach with two well-known benchmark datasets, the DEAP and DREAMER datasets. The recognition results achieved accuracies of 99.37% and 99.35% in arousal and valence classification tasks with DEAP, and 99.96%, 99.93%, and 99.95% in arousal, valence, and dominance classification tasks with DREAMER, respectively. The performance exceeds that of current state-of-the-art approaches on DREAMER (improved by 1% to 10%, depending on temporal length), while being comparable to other related works evaluated on DEAP. The proposed work is the first investigation of emotion-recognition-oriented EEG topological feature analysis, which brings a novel insight into the nonlinear dynamics analysis and feature extraction of the brain neural system.
Submitted 14 March, 2022;
originally announced March 2022.
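The phase space reconstruction step named in the abstract above is commonly done by time-delay embedding, which can be sketched as follows. This is a minimal sketch assuming a plain delay embedding; the embedding dimension `dim`, the delay `tau`, and the synthetic sine signal are illustrative choices, not values from the paper, and the persistent homology stage is not shown.

```python
import math


def delay_embed(series, dim, tau):
    """Map a 1-D time series to points (x[t], x[t+tau], ..., x[t+(dim-1)*tau])."""
    n_points = len(series) - (dim - 1) * tau
    if n_points <= 0:
        raise ValueError("series too short for this dim and tau")
    return [tuple(series[t + k * tau] for k in range(dim))
            for t in range(n_points)]


# Synthetic stand-in for one EEG channel: a sine wave with period 50 samples.
signal = [math.sin(2 * math.pi * t / 50) for t in range(200)]

# Each tuple is one point of the reconstructed phase space (a point cloud).
cloud = delay_embed(signal, dim=3, tau=5)
```

The resulting point cloud is the kind of object a persistent homology library would take as input; for a periodic signal such as this one the points trace out a closed loop.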
-
Application of neural-network hybrid models in estimating the infection functions of nonlinear epidemic models
Authors:
Chentong Li,
Changsheng Zhou,
Junmin Liu,
Yao Rong
Abstract:
Hybrid neural network models combine the advantages of a neural network's fitting functionality with differential equation models to reflect actual physical processes and are widely used in analyzing time-series data. Most related studies have focused on linear hybrid models, and only a few have examined nonlinear problems. In this work, we use a hybrid nonlinear epidemic neural network as the entry point to study its power in predicting the correct infection function of an epidemic model. To achieve this goal, we combine the bifurcation theory of the nonlinear differential model with the mean-squared error loss and design a novel loss function to ensure model trainability. Furthermore, we find the unique existence conditions supporting ordinary differential equations to estimate the correct infection function. Using the Runge-Kutta method, we perform numerical experiments on our proposed model and verify its soundness. We also apply it to real COVID-19 data to accurately discover the change law of its infectivity.
Submitted 5 March, 2022;
originally announced March 2022.
-
A Comprehensive Survey with Quantitative Comparison of Image Analysis Methods for Microorganism Biovolume Measurements
Authors:
Jiawei Zhang,
Chen Li,
Md Mamunur Rahaman,
Yudong Yao,
Pingli Ma,
Jinghua Zhang,
Xin Zhao,
Tao Jiang,
Marcin Grzegorzek
Abstract:
With accelerating urbanization and rising living standards, microorganisms play increasingly important roles in industrial production, bio-technique, and food safety testing. Microorganism biovolume measurement is one of the essential parts of microbial analysis. However, traditional manual measurement methods are time-consuming and make it challenging to measure the characteristics precisely. With the development of digital image processing techniques, the characteristics of the microbial population can be detected and quantified, so that changing trends can be adjusted in time and provide a basis for improvement. Applications of microorganism biovolume measurement methods have been developing since the 1980s. More than 62 articles are reviewed in this study, and the articles are grouped by digital image segmentation methods with periods. This study has high research significance and application value, and microbial researchers can refer to it for a comprehensive understanding of microorganism biovolume measurements using digital image analysis methods and their potential applications.
Submitted 2 May, 2022; v1 submitted 17 February, 2022;
originally announced February 2022.
-
Collaborative learning of images and geometrics for predicting isocitrate dehydrogenase status of glioma
Authors:
Yiran Wei,
Chao Li,
Xi Chen,
Carola-Bibiane Schönlieb,
Stephen J. Price
Abstract:
The isocitrate dehydrogenase (IDH) gene mutation status is an important biomarker for glioma patients. The gold standard of IDH mutation detection requires tumour tissue obtained via invasive approaches and is usually expensive. Recent advancement in radiogenomics provides a non-invasive approach for predicting IDH mutation based on MRI. Meanwhile, tumor geometrics encompass crucial information for tumour phenotyping. Here we propose a collaborative learning framework that learns both tumor images and tumor geometrics using convolutional neural networks (CNN) and graph neural networks (GNN), respectively. Our results show that the proposed model outperforms the baseline model of 3D-DenseNet121. Further, the collaborative learning model achieves better performance than either the CNN or the GNN alone. The model interpretation shows that the CNN and GNN could identify common and unique regions of interest for IDH mutation prediction. In conclusion, collaborating image and geometric learners provides a novel approach for predicting genotype and characterising glioma.
Submitted 14 January, 2022;
originally announced January 2022.
-
Snakes combine vertical and lateral bending to traverse uneven terrain
Authors:
Qiyuan Fu,
Henry C. Astley,
Chen Li
Abstract:
Terrestrial locomotion requires generating appropriate ground reaction forces which depend on substrate geometry and physical properties. The richness of positions and orientations of terrain features in the 3-D world gives limbless animals like snakes, which can bend their bodies, the versatility to generate forces from different contact areas for propulsion. Despite many previous studies of how snakes use lateral body bending for propulsion on relatively flat surfaces with lateral contact points, little is known about whether and how much snakes use vertical body bending in combination with lateral bending in 3-D terrain. This gap has contributed to snake robots being inferior to animals in stability, efficiency, and versatility when traversing complex 3-D environments. Here, to begin to elucidate this, we studied how the generalist corn snake traversed an uneven arena of blocks of random height variation 5 times its body height. The animal traversed the uneven terrain with perfect stability by propagating 3-D bending down its body with little transverse motion (11° slip angle). Although the animal preferred moving through valleys with higher neighboring blocks, it did not prefer lateral bending. Among body-terrain contact regions that potentially provide propulsion, 52% were formed by vertical body bending and 48% by lateral bending. The combination of vertical and lateral bending may dramatically expand the sources of propulsive forces available to limbless locomotors by utilizing various asperities available in 3-D terrain. Direct measurements of contact forces are necessary to further understand how snakes coordinate 3-D bending along the entire body via sensory feedback to propel through 3-D terrain. These studies will open a path to new propulsive mechanisms for snake robots, potentially increasing their performance and versatility in 3-D terrain.
Submitted 15 August, 2022; v1 submitted 15 December, 2021;
originally announced December 2021.
-
A terrain treadmill to study animal locomotion through large obstacles
Authors:
Ratan Othayoth,
Blake Strebel,
Yuanfeng Han,
Evains Francois,
Chen Li
Abstract:
A major challenge to understanding locomotion in complex 3-D terrain with large obstacles is to create tools for controlled, systematic lab experiments. Existing terrain arenas only allow observations at small spatiotemporal scales (~10 body lengths, ~10 stride cycles). Here, we create a terrain treadmill to enable high-resolution observations of animal locomotion through large obstacles over large spatiotemporal scales. An animal moves through modular obstacles on an inner sphere, while a rigidly attached, concentric, transparent outer sphere rotates with the opposite velocity via closed-loop feedback to keep the animal on top. During sustained locomotion, a discoid cockroach moved through pillar obstacles for 25 minutes ($\approx$2500 strides) over 67 m ($\approx$1500 body lengths), and was contained within a radius of 4 cm (0.9 body length) for 83% of the duration, even at speeds of up to 10 body lengths/s. The treadmill enabled observation of diverse locomotor behaviors and quantification of animal-obstacle interaction.
Submitted 14 December, 2021;
originally announced December 2021.
-
The PAT model of population dynamics
Authors:
Z. C. Feng,
Y. Charles Li
Abstract:
We introduce a population-age-time (PAT) model which describes the temporal evolution of the population distribution in age. The surprising result is that the qualitative nature of the population distribution dynamics is robust with respect to the birth rate and death rate distributions in age, and initial conditions. When the number of children born per woman is 2, the population distribution approaches an asymptotically steady state of a kink shape; thus the total population approaches a constant. When the number of children born per woman is greater than 2, the total population increases without bound; and when the number of children born per woman is less than 2, the total population decreases to zero.
Submitted 26 November, 2021;
originally announced November 2021.
-
Quantifying the COVID19 infection risk due to droplet/aerosol inhalation
Authors:
Rahul Bale,
Akiyoshi Iida,
Masashi Yamakawa,
ChungGang Li,
Makoto Tsubokura
Abstract:
The dose-response model has been widely used for quantifying the risk of infection of airborne diseases like COVID-19. The model has been used in the room-average analysis of infection risk and analysis using passive scalars as a proxy for aerosol transport. However, it has not been employed for risk estimation in numerical simulations of droplet dispersion. In this work, we develop a framework for the evaluation of the probability of infection in droplet dispersion simulations using the dose-response model. We introduce a version of the model that can incorporate the higher transmissibility of variant strains of SARS-CoV2 and the effect of vaccination in evaluating the probability of infection. Numerical simulations of droplet dispersion during speech are carried out to investigate the infection risk over space and time using the model. The advantage of droplet dispersion simulations for risk evaluation is demonstrated through the analysis of the effect of humidity on infection risk.
Submitted 8 September, 2022; v1 submitted 6 October, 2021;
originally announced October 2021.
-
Predicting isocitrate dehydrogenase mutation status in glioma using structural brain networks and graph neural networks
Authors:
Yiran Wei,
Yonghao Li,
Xi Chen,
Carola-Bibiane Schönlieb,
Chao Li,
Stephen J. Price
Abstract:
Glioma is a common malignant brain tumor with distinct survival among patients. The isocitrate dehydrogenase (IDH) gene mutation provides critical diagnostic and prognostic value for glioma. It is of crucial significance to non-invasively predict IDH mutation based on pre-treatment MRI. Machine learning/deep learning models show reasonable performance in predicting IDH mutation using MRI. However, most models neglect the systematic brain alterations caused by tumor invasion, where widespread infiltration along white matter tracts is a hallmark of glioma. Structural brain network provides an effective tool to characterize brain organisation, which could be captured by the graph neural networks (GNN) to more accurately predict IDH mutation.
Here we propose a method to predict IDH mutation using GNN, based on the structural brain network of patients. Specifically, we firstly construct a network template of healthy subjects, consisting of atlases of edges (white matter tracts) and nodes (cortical/subcortical brain regions) to provide regions of interest (ROIs). Next, we employ autoencoders to extract the latent multi-modal MRI features from the ROIs of edges and nodes in patients, to train a GNN architecture for predicting IDH mutation. The results show that the proposed method outperforms the baseline models using the 3D-CNN and 3D-DenseNet. In addition, model interpretation suggests its ability to identify the tracts infiltrated by tumor, corresponding to clinical prior knowledge. In conclusion, integrating brain networks with GNN offers a new avenue to study brain lesions using computational neuroscience and computer vision approaches.
Submitted 10 September, 2021; v1 submitted 4 September, 2021;
originally announced September 2021.
-
Identifying the fragment structure of the organic compounds by deeply learning the original NMR data
Authors:
Chongcan Li,
Yong Cong,
Weihua Deng
Abstract:
We preprocess the raw NMR spectrum and extract key characteristic features using two different methodologies, equidistant sampling and peak sampling, for subsequent substructure pattern recognition. These may also provide an alternative strategy to address the imbalance issue of the NMR dataset that is frequently encountered in dataset collection for statistical modeling. We establish two conventional models, SVM and KNN, to assess the capability of the two feature selection methods. Our results show that the models using the features selected by peak sampling outperform those using equidistant sampling. We then build a Recurrent Neural Network (RNN) model trained on Data B collected from peak sampling. Furthermore, we illustrate the easier optimization of hyperparameters and the better generalization ability of the RNN deep learning model through a detailed comparison with the traditional machine learning SVM and KNN models.
Submitted 25 July, 2021;
originally announced July 2021.