Search | arXiv e-print repository

Global Human-guided Counterfactual Explanations for Molecular Properties via Reinforcement Learning

Authors: Danqing Wang, Antonis Antoniades, Kha-Dinh Luong, Edwin Zhang, Mert Kosan, Jiachen Li, Ambuj Singh, William Yang Wang, Lei Li

Abstract: Counterfactual explanations of Graph Neural Networks (GNNs) offer a powerful way to understand data that can naturally be represented by a graph structure. Furthermore, in many domains, it is highly desirable to derive data-driven global explanations or rules that can better explain the high-level properties of the models and data in question. However, evaluating global counterfactual explanations… ▽ More Counterfactual explanations of Graph Neural Networks (GNNs) offer a powerful way to understand data that can naturally be represented by a graph structure. Furthermore, in many domains, it is highly desirable to derive data-driven global explanations or rules that can better explain the high-level properties of the models and data in question. However, evaluating global counterfactual explanations is hard in real-world datasets due to a lack of human-annotated ground truth, which limits their use in areas like molecular sciences. Additionally, the increasing scale of these datasets provides a challenge for random search-based methods. In this paper, we develop a novel global explanation model RLHEX for molecular property prediction. It aligns the counterfactual explanations with human-defined principles, making the explanations more interpretable and easy for experts to evaluate. RLHEX includes a VAE-based graph generator to generate global explanations and an adapter to adjust the latent representation space to human-defined principles. Optimized by Proximal Policy Optimization (PPO), the global explanations produced by RLHEX cover 4.12% more input graphs and reduce the distance between the counterfactual explanation set and the input set by 0.47% on average across three molecular datasets. RLHEX provides a flexible framework to incorporate different human-designed principles into the counterfactual explanation generation process, aligning these explanations with domain expertise. The code and data are released at https://github.com/dqwang122/RLHEX. △ Less

Submitted 19 June, 2024; originally announced June 2024.

Comments: Accepted by KDD 2024

arXiv:2405.06693 [pdf, other]

SurfPro: Functional Protein Design Based on Continuous Surface

Authors: Zhenqiao Song, Tinglin Huang, Lei Li, Wengong **

Abstract: How can we design proteins with desired functions? We are motivated by a chemical intuition that both geometric structure and biochemical properties are critical to a protein's function. In this paper, we propose SurfPro, a new method to generate functional proteins given a desired surface and its associated biochemical properties. SurfPro comprises a hierarchical encoder that progressively models… ▽ More How can we design proteins with desired functions? We are motivated by a chemical intuition that both geometric structure and biochemical properties are critical to a protein's function. In this paper, we propose SurfPro, a new method to generate functional proteins given a desired surface and its associated biochemical properties. SurfPro comprises a hierarchical encoder that progressively models the geometric shape and biochemical features of a protein surface, and an autoregressive decoder to produce an amino acid sequence. We evaluate SurfPro on a standard inverse folding benchmark CATH 4.2 and two functional protein design tasks: protein binder design and enzyme design. Our SurfPro consistently surpasses previous state-of-the-art inverse folding methods, achieving a recovery rate of 57.78% on CATH 4.2 and higher success rates in terms of protein-protein binding and enzyme-substrate interaction scores. △ Less

Submitted 17 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

arXiv:2404.07325 [pdf]

Assessing Engraftment Following Fecal Microbiota Transplant

Authors: Chloe Herman, Bridget M. Barker, Thais F. Bartelli, Vidhi Chandra, Rosa Krajmalnik-Brown, Mary Jewell, Le Li, Chen Liao, Florencia McAllister, Khemlal Nirmalkar, Joao B. Xavier, J. Gregory Caporaso

Abstract: Fecal Microbiota Transplant (FMT) is an FDA approved treatment for recurrent Clostridium difficile infections, and is being explored for other clinical applications, from alleviating digestive and neurological disorders, to priming the microbiome for cancer treatment, and restoring microbiomes impacted by cancer treatment. Quantifying the extent of engraftment following an FMT is important in de… ▽ More Fecal Microbiota Transplant (FMT) is an FDA approved treatment for recurrent Clostridium difficile infections, and is being explored for other clinical applications, from alleviating digestive and neurological disorders, to priming the microbiome for cancer treatment, and restoring microbiomes impacted by cancer treatment. Quantifying the extent of engraftment following an FMT is important in determining if a recipient didn't respond because the engrafted microbiome didn't produce the desired outcomes (a successful FMT, but negative treatment outcome), or the microbiome didn't engraft (an unsuccessful FMT and negative treatment outcome). The lack of a consistent methodology for quantifying FMT engraftment extent hinders the assessment of FMT success and its relation to clinical outcomes, and presents challenges for comparing FMT results and protocols across studies. Here we review 46 studies of FMT in humans and model organisms and group their approaches for assessing the extent to which an FMT engrafts into three criteria: 1) Chimeric Asymmetric Community Coalescence investigates microbiome shifts following FMT engraftment. 2) Donated Microbiome Indicator Features tracks donated microbiome features as a signal of engraftment with methods such as differential abundance testing based on the current sample collection, or tracking changes in feature abundances that have been previously identified. 3) Temporal Stability examines how resistant post-FMT recipient's microbiomes are to reverting back to their baseline microbiome. Investigated together, these criteria provide a clear assessment of microbiome engraftment. We discuss the pros and cons of each of these criteria, providing illustrative examples of their application. We also introduce key terminology and recommendations on how FMT studies can be analyzed for rigorous engraftment extent assessment. △ Less

Submitted 10 April, 2024; originally announced April 2024.

Comments: 18 pages, 6 figures, 2 supplemental tables

arXiv:2403.10581 [pdf, other]

Large Language Model-informed ECG Dual Attention Network for Heart Failure Risk Prediction

Authors: Chen Chen, Lei Li, Marcel Beetz, Abhirup Banerjee, Ramneek Gupta, Vicente Grau

Abstract: Heart failure (HF) poses a significant public health challenge, with a rising global mortality rate. Early detection and prevention of HF could significantly reduce its impact. We introduce a novel methodology for predicting HF risk using 12-lead electrocardiograms (ECGs). We present a novel, lightweight dual-attention ECG network designed to capture complex ECG features essential for early HF ris… ▽ More Heart failure (HF) poses a significant public health challenge, with a rising global mortality rate. Early detection and prevention of HF could significantly reduce its impact. We introduce a novel methodology for predicting HF risk using 12-lead electrocardiograms (ECGs). We present a novel, lightweight dual-attention ECG network designed to capture complex ECG features essential for early HF risk prediction, despite the notable imbalance between low and high-risk groups. This network incorporates a cross-lead attention module and twelve lead-specific temporal attention modules, focusing on cross-lead interactions and each lead's local dynamics. To further alleviate model overfitting, we leverage a large language model (LLM) with a public ECG-Report dataset for pretraining on an ECG-report alignment task. The network is then fine-tuned for HF risk prediction using two specific cohorts from the UK Biobank study, focusing on patients with hypertension (UKB-HYP) and those who have had a myocardial infarction (UKB-MI).The results reveal that LLM-informed pre-training substantially enhances HF risk prediction in these cohorts. The dual-attention design not only improves interpretability but also predictive accuracy, outperforming existing competitive methods with C-index scores of 0.6349 for UKB-HYP and 0.5805 for UKB-MI. This demonstrates our method's potential in advancing HF risk assessment with clinical complex ECG data. △ Less

Submitted 22 March, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

Comments: Under journal revision

arXiv:2403.07664 [pdf]

Enabling self-identification in intelligent agent: insights from computational psychoanalysis

Authors: Lingyu Li, Chunbo Li

Abstract: Building upon prior framework of computational Lacanian psychoanalysis with the theory of active inference, this paper aims to further explore the concept of self-identification and its potential applications. Beginning with two classic paradigms in psychology, mirror self-recognition and rubber hand illusion, we suggest that imaginary identification is characterized by an integrated body schema w… ▽ More Building upon prior framework of computational Lacanian psychoanalysis with the theory of active inference, this paper aims to further explore the concept of self-identification and its potential applications. Beginning with two classic paradigms in psychology, mirror self-recognition and rubber hand illusion, we suggest that imaginary identification is characterized by an integrated body schema with minimal free energy. Next, we briefly survey three dimensions of symbolic identification (sociological, psychoanalytic, and linguistical) and corresponding active inference accounts. To provide intuition, we respectively employ a convolutional neural network (CNN) and a multi-layer perceptron (MLP) supervised by ChatGPT to showcase optimization of free energy during motor skill and language mastery underlying identification formation. We then introduce Lacan's Graph II of desire, unifying imaginary and symbolic identification, and propose an illustrative model called FreeAgent. In concluding remarks, we discuss some key issues in the potential of computational Lacanian psychoanalysis to advance mental health and artificial intelligence, including digital twin mind, large language models as avatars of the Lacanian Other, and the feasibility of human-level artificial general intelligence with self-awareness in the context of post-structuralism. △ Less

Submitted 12 March, 2024; originally announced March 2024.

Comments: 18 pages, 3 figures

arXiv:2402.14315 [pdf, other]

Structure-Based Drug Design via 3D Molecular Generative Pre-training and Sampling

Authors: Yuwei Yang, Siqi Ouyang, Xueyu Hu, Mingyue Zheng, Hao Zhou, Lei Li

Abstract: Structure-based drug design aims at generating high affinity ligands with prior knowledge of 3D target structures. Existing methods either use conditional generative model to learn the distribution of 3D ligands given target binding sites, or iteratively modify molecules to optimize a structure-based activity estimator. The former is highly constrained by data quantity and quality, which leaves op… ▽ More Structure-based drug design aims at generating high affinity ligands with prior knowledge of 3D target structures. Existing methods either use conditional generative model to learn the distribution of 3D ligands given target binding sites, or iteratively modify molecules to optimize a structure-based activity estimator. The former is highly constrained by data quantity and quality, which leaves optimization-based approaches more promising in practical scenario. However, existing optimization-based approaches choose to edit molecules in 2D space, and use molecular docking to estimate the activity using docking predicted 3D target-ligand complexes. The misalignment between the action space and the objective hinders the performance of these models, especially for those employ deep learning for acceleration. In this work, we propose MolEdit3D to combine 3D molecular generation with optimization frameworks. We develop a novel 3D graph editing model to generate molecules using fragments, and pre-train this model on abundant 3D ligands for learning target-independent properties. Then we employ a target-guided self-learning strategy to improve target-related properties using self-sampled molecules. MolEdit3D achieves state-of-the-art performance on majority of the evaluation metrics, and demonstrate strong capability of capturing both target-dependent and -independent properties. △ Less

Submitted 15 March, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

arXiv:2402.12993 [pdf, other]

An Autonomous Large Language Model Agent for Chemical Literature Data Mining

Authors: Kexin Chen, Hanqun Cao, Junyou Li, Yuyang Du, Menghao Guo, Xin Zeng, Lanqing Li, Jiezhong Qiu, Pheng Ann Heng, Guangyong Chen

Abstract: Chemical synthesis, which is crucial for advancing material synthesis and drug discovery, impacts various sectors including environmental science and healthcare. The rise of technology in chemistry has generated extensive chemical data, challenging researchers to discern patterns and refine synthesis processes. Artificial intelligence (AI) helps by analyzing data to optimize synthesis and increase… ▽ More Chemical synthesis, which is crucial for advancing material synthesis and drug discovery, impacts various sectors including environmental science and healthcare. The rise of technology in chemistry has generated extensive chemical data, challenging researchers to discern patterns and refine synthesis processes. Artificial intelligence (AI) helps by analyzing data to optimize synthesis and increase yields. However, AI faces challenges in processing literature data due to the unstructured format and diverse writing style of chemical literature. To overcome these difficulties, we introduce an end-to-end AI agent framework capable of high-fidelity extraction from extensive chemical literature. This AI agent employs large language models (LLMs) for prompt generation and iterative optimization. It functions as a chemistry assistant, automating data collection and analysis, thereby saving manpower and enhancing performance. Our framework's efficacy is evaluated using accuracy, recall, and F1 score of reaction condition data, and we compared our method with human experts in terms of content correctness and time efficiency. The proposed approach marks a significant advancement in automating chemical literature extraction and demonstrates the potential for AI to revolutionize data management and utilization in chemistry. △ Less

Submitted 20 February, 2024; originally announced February 2024.

arXiv:2402.04286 [pdf]

Progress and Opportunities of Foundation Models in Bioinformatics

Authors: Qing Li, Zhihang Hu, Yixuan Wang, Lei Li, Yimin Fan, Irwin King, Le Song, Yu Li

Abstract: Bioinformatics has witnessed a paradigm shift with the increasing integration of artificial intelligence (AI), particularly through the adoption of foundation models (FMs). These AI techniques have rapidly advanced, addressing historical challenges in bioinformatics such as the scarcity of annotated data and the presence of data noise. FMs are particularly adept at handling large-scale, unlabeled… ▽ More Bioinformatics has witnessed a paradigm shift with the increasing integration of artificial intelligence (AI), particularly through the adoption of foundation models (FMs). These AI techniques have rapidly advanced, addressing historical challenges in bioinformatics such as the scarcity of annotated data and the presence of data noise. FMs are particularly adept at handling large-scale, unlabeled data, a common scenario in biological contexts due to the time-consuming and costly nature of experimentally determining labeled data. This characteristic has allowed FMs to excel and achieve notable results in various downstream validation tasks, demonstrating their ability to represent diverse biological entities effectively. Undoubtedly, FMs have ushered in a new era in computational biology, especially in the realm of deep learning. The primary goal of this survey is to conduct a systematic investigation and summary of FMs in bioinformatics, tracing their evolution, current research status, and the methodologies employed. Central to our focus is the application of FMs to specific biological problems, aiming to guide the research community in choosing appropriate FMs for their research needs. We delve into the specifics of the problem at hand including sequence analysis, structure prediction, function annotation, and multimodal integration, comparing the structures and advancements against traditional methods. Furthermore, the review analyses challenges and limitations faced by FMs in biology, such as data noise, model explainability, and potential biases. Finally, we outline potential development paths and strategies for FMs in future biological research, setting the stage for continued innovation and application in this rapidly evolving field. This comprehensive review serves not only as an academic resource but also as a roadmap for future explorations and applications of FMs in biology. △ Less

Submitted 5 February, 2024; originally announced February 2024.

Comments: 27 pages, 3 figures, 2 tables

MSC Class: cs.CL; 92-02 ACM Class: I.2.1

arXiv:2401.12974 [pdf, other]

SegmentAnyBone: A Universal Model that Segments Any Bone at Any Location on MRI

Authors: Hanxue Gu, Roy Colglazier, Haoyu Dong, Jikai Zhang, Yaqian Chen, Zafer Yildiz, Yuwen Chen, Lin Li, Jichen Yang, Jay Willhite, Alex M. Meyer, Brian Guo, Yashvi Atul Shah, Emily Luo, Shipra Rajput, Sally Kuehn, Clark Bulleit, Kevin A. Wu, Jisoo Lee, Brandon Ramirez, Darui Lu, Jay M. Levin, Maciej A. Mazurowski

Abstract: Magnetic Resonance Imaging (MRI) is pivotal in radiology, offering non-invasive and high-quality insights into the human body. Precise segmentation of MRIs into different organs and tissues would be highly beneficial since it would allow for a higher level of understanding of the image content and enable important measurements, which are essential for accurate diagnosis and effective treatment pla… ▽ More Magnetic Resonance Imaging (MRI) is pivotal in radiology, offering non-invasive and high-quality insights into the human body. Precise segmentation of MRIs into different organs and tissues would be highly beneficial since it would allow for a higher level of understanding of the image content and enable important measurements, which are essential for accurate diagnosis and effective treatment planning. Specifically, segmenting bones in MRI would allow for more quantitative assessments of musculoskeletal conditions, while such assessments are largely absent in current radiological practice. The difficulty of bone MRI segmentation is illustrated by the fact that limited algorithms are publicly available for use, and those contained in the literature typically address a specific anatomic area. In our study, we propose a versatile, publicly available deep-learning model for bone segmentation in MRI across multiple standard MRI locations. The proposed model can operate in two modes: fully automated segmentation and prompt-based segmentation. Our contributions include (1) collecting and annotating a new MRI dataset across various MRI protocols, encompassing over 300 annotated volumes and 8485 annotated slices across diverse anatomic regions; (2) investigating several standard network architectures and strategies for automated segmentation; (3) introducing SegmentAnyBone, an innovative foundational model-based approach that extends Segment Anything Model (SAM); (4) comparative analysis of our algorithm and previous approaches; and (5) generalization analysis of our algorithm across different anatomical locations and MRI sequences, as well as an external dataset. We publicly release our model at https://github.com/mazurowski-lab/SegmentAnyBone. △ Less

Submitted 23 January, 2024; originally announced January 2024.

Comments: 15 pages, 15 figures

arXiv:2312.16769 [pdf, other]

Estimation and Inference for High-dimensional Multi-response Growth Curve Model

Authors: Xin Zhou, Yin Xia, Lexin Li

Abstract: A growth curve model (GCM) aims to characterize how an outcome variable evolves, develops and grows as a function of time, along with other predictors. It provides a particularly useful framework to model growth trend in longitudinal data. However, the estimation and inference of GCM with a large number of response variables faces numerous challenges, and remains underdeveloped. In this article, w… ▽ More A growth curve model (GCM) aims to characterize how an outcome variable evolves, develops and grows as a function of time, along with other predictors. It provides a particularly useful framework to model growth trend in longitudinal data. However, the estimation and inference of GCM with a large number of response variables faces numerous challenges, and remains underdeveloped. In this article, we study the high-dimensional multivariate-response linear GCM, and develop the corresponding estimation and inference procedures. Our proposal is far from a straightforward extension, and involves several innovative components. Specifically, we introduce a Kronecker product structure, which allows us to effectively decompose a very large covariance matrix, and to pool the correlated samples to improve the estimation accuracy. We devise a highly non-trivial multi-step estimation approach to estimate the individual covariance components separately and effectively. We also develop rigorous statistical inference procedures to test both the global effects and the individual effects, and establish the size and power properties, as well as the proper false discovery control. We demonstrate the effectiveness of the new method through both intensive simulations, and the analysis of a longitudinal neuroimaging data for Alzheimer's disease. △ Less

Submitted 27 December, 2023; originally announced December 2023.

arXiv:2310.20309 [pdf, other]

Tensor formalism for predicting synaptic connections with ensemble modeling or optimization

Authors: Tirthabir Biswas, Tianzhi Lambus Li, James E. Fitzgerald

Abstract: Theoretical neuroscientists often try to understand how the structure of a neural network relates to its function by focusing on structural features that would either follow from optimization or occur consistently across possible implementations. Both optimization theories and ensemble modeling approaches have repeatedly proven their worth, and it would simplify theory building considerably if pre… ▽ More Theoretical neuroscientists often try to understand how the structure of a neural network relates to its function by focusing on structural features that would either follow from optimization or occur consistently across possible implementations. Both optimization theories and ensemble modeling approaches have repeatedly proven their worth, and it would simplify theory building considerably if predictions from both theory types could be derived and tested simultaneously. Here we show how tensor formalism from theoretical physics can be used to unify and solve many optimization and ensemble modeling approaches to predicting synaptic connectivity from neuronal responses. We specifically focus on analyzing the solution space of synaptic weights that allow a threshold-linear neural network to respond in a prescribed way to a limited number of input conditions. For optimization purposes, we compute the synaptic weight vector that minimizes an arbitrary quadratic loss function. For ensemble modeling, we identify synaptic weight features that occur consistently across all solutions bounded by an arbitrary ellipsoid. We derive a common solution to this suite of nonlinear problems by showing how each of them reduces to an equivalent linear problem that can be solved analytically. Although identifying the equivalent linear problem is nontrivial, our tensor formalism provides an elegant geometrical perspective that allows us to solve the problem approximately in an analytical way or exactly using numeric methods. The final algorithm is applicable to a wide range of interesting neuroscience problems, and the associated geometric insights may carry over to other scientific problems that require constrained optimization. △ Less

Submitted 18 June, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

Comments: 31 pages, 6 figures, 2 tables

arXiv:2310.14559 [pdf, other]

Branch-and-Price for Prescriptive Contagion Analytics

Authors: Alexandre Jacquillat, Michael Lingzhi Li, Martin Ramé, Kai Wang

Abstract: Predictive contagion models are ubiquitous in epidemiology, social sciences, engineering, and management. This paper formulates a prescriptive contagion analytics model where a decision-maker allocates shared resources across multiple segments of a population, each governed by continuous-time dynamics. We define four real-world problems under this umbrella: vaccine distribution, vaccination center… ▽ More Predictive contagion models are ubiquitous in epidemiology, social sciences, engineering, and management. This paper formulates a prescriptive contagion analytics model where a decision-maker allocates shared resources across multiple segments of a population, each governed by continuous-time dynamics. We define four real-world problems under this umbrella: vaccine distribution, vaccination centers deployment, content promotion, and congestion mitigation. These problems feature a large-scale mixed-integer non-convex optimization structure with constraints governed by ordinary differential equations, combining the challenges of discrete optimization, non-linear optimization, and continuous-time system dynamics. This paper develops a branch-and-price methodology for prescriptive contagion analytics based on: (i) a set partitioning reformulation; (ii) a column generation decomposition; (iii) a state-clustering algorithm for discrete-decision continuous-state dynamic programming; and (iv) a tri-partite branching scheme to circumvent non-linearities. Extensive experiments show that the algorithm scales to very large and otherwise-intractable instances, outperforming state-of-the-art benchmarks. Our methodology provides practical benefits in contagion systems; in particular, it can increase the effectiveness of a vaccination campaign by an estimated 12-70%, resulting in 7,000 to 12,000 extra saved lives over a three-month horizon mirroring the COVID-19 pandemic. We provide an open-source implementation of the methodology in an online repository to enable replication. △ Less

Submitted 23 October, 2023; originally announced October 2023.

arXiv:2310.02546 [pdf, other]

Joint Design of Protein Sequence and Structure based on Motifs

Authors: Zhenqiao Song, Yunlong Zhao, Yufei Song, Wenxian Shi, Yang Yang, Lei Li

Abstract: Designing novel proteins with desired functions is crucial in biology and chemistry. However, most existing work focus on protein sequence design, leaving protein sequence and structure co-design underexplored. In this paper, we propose GeoPro, a method to design protein backbone structure and sequence jointly. Our motivation is that protein sequence and its backbone structure constrain each other… ▽ More Designing novel proteins with desired functions is crucial in biology and chemistry. However, most existing work focus on protein sequence design, leaving protein sequence and structure co-design underexplored. In this paper, we propose GeoPro, a method to design protein backbone structure and sequence jointly. Our motivation is that protein sequence and its backbone structure constrain each other, and thus joint design of both can not only avoid nonfolding and misfolding but also produce more diverse candidates with desired functions. To this end, GeoPro is powered by an equivariant encoder for three-dimensional (3D) backbone structure and a protein sequence decoder guided by 3D geometry. Experimental results on two biologically significant metalloprotein datasets, including $β$-lactamases and myoglobins, show that our proposed GeoPro outperforms several strong baselines on most metrics. Remarkably, our method discovers novel $β$-lactamases and myoglobins which are not present in protein data bank (PDB) and UniProt. These proteins exhibit stable folding and active site environments reminiscent of those of natural proteins, demonstrating their excellent potential to be biologically functional. △ Less

Submitted 3 October, 2023; originally announced October 2023.

arXiv:2309.06772 [pdf]

Schizophrenia research under the framework of predictive coding: body, language, and others

Authors: Lingyu Li, Chunbo Li

Abstract: Although there have been so many studies on schizophrenia under the framework of predictive coding, works focusing on treatment are very preliminary. A model-oriented, operationalist, and comprehensive understanding of schizophrenia would promote the therapy turn of further research. We summarize predictive coding models of embodiment, co-occurrence of over- and under-weighting priors, subjective… ▽ More Although there have been so many studies on schizophrenia under the framework of predictive coding, works focusing on treatment are very preliminary. A model-oriented, operationalist, and comprehensive understanding of schizophrenia would promote the therapy turn of further research. We summarize predictive coding models of embodiment, co-occurrence of over- and under-weighting priors, subjective time processing, language production or comprehension, self-or-other inference, and social interaction. Corresponding impairments and clinical manifestations of schizophrenia are reviewed under these models at the same time. Finally, we discuss why and how to inaugurate a therapy turn of further research under the framework of predictive coding. △ Less

Submitted 13 September, 2023; originally announced September 2023.

arXiv:2309.06707 [pdf]

An active inference model of Lacanian psychoanalysis

Authors: Lingyu Li, Chunbo Li

Abstract: There has been a growing interest in exploring behavior, brain, and mind through the lens of complex systems theory. However, a unified and computational model that comprehensively encapsulates the properties of the human mind remains elusive. To address this gap, we propose a recurrent generative model drawing upon with Lacanian psychoanalysis and active inference. We conceptualize mechanism of d… ▽ More There has been a growing interest in exploring behavior, brain, and mind through the lens of complex systems theory. However, a unified and computational model that comprehensively encapsulates the properties of the human mind remains elusive. To address this gap, we propose a recurrent generative model drawing upon with Lacanian psychoanalysis and active inference. We conceptualize mechanism of desire as partial generalized synchronization, and then apply the model to suicidal dynamics to illustrate the theoretical and practical implications of our model. This work on computational psychoanalysis reveals its potential in unraveling complex mental phenomena. △ Less

Submitted 13 September, 2023; originally announced September 2023.

arXiv:2309.01384 [pdf]

Deep Learning Approach for Large-Scale, Real-Time Quantification of Green Fluorescent Protein-Labeled Biological Samples in Microreactors

Authors: Yuanyuan Wei, Sai Mu Dalike Abaxi, Nawaz Mehmood, Luoquan Li, Fuyang Qu, Guangyao Cheng, Dehua Hu, Yi-** Ho, Scott Wu Yuan, Ho-Pui Ho

Abstract: Absolute quantification of biological samples entails determining expression levels in precise numerical copies, offering enhanced accuracy and superior performance for rare templates. However, existing methodologies suffer from significant limitations: flow cytometers are both costly and intricate, while fluorescence imaging relying on software tools or manual counting is time-consuming and prone… ▽ More Absolute quantification of biological samples entails determining expression levels in precise numerical copies, offering enhanced accuracy and superior performance for rare templates. However, existing methodologies suffer from significant limitations: flow cytometers are both costly and intricate, while fluorescence imaging relying on software tools or manual counting is time-consuming and prone to inaccuracies. In this study, we have devised a comprehensive deep-learning-enabled pipeline that enables the automated segmentation and classification of GFP (green fluorescent protein)-labeled microreactors, facilitating real-time absolute quantification. Our findings demonstrate the efficacy of this technique in accurately predicting the sizes and occupancy status of microreactors using standard laboratory fluorescence microscopes, thereby providing precise measurements of template concentrations. Notably, our approach exhibits an analysis speed of quantifying over 2,000 microreactors (across 10 images) within remarkably 2.5 seconds, and a dynamic range spanning from 56.52 to 1569.43 copies per micron-liter. Furthermore, our Deep-dGFP algorithm showcases remarkable generalization capabilities, as it can be directly applied to various GFP-labeling scenarios, including droplet-based, microwell-based, and agarose-based biological applications. To the best of our knowledge, this represents the first successful implementation of an all-in-one image analysis algorithm in droplet digital PCR (polymerase chain reaction), microwell digital PCR, droplet single-cell sequencing, agarose digital PCR, and bacterial quantification, without necessitating any transfer learning steps, modifications, or retraining procedures. We firmly believe that our Deep-dGFP technique will be readily embraced by biomedical laboratories and holds potential for further development in related clinical applications. △ Less

Submitted 4 September, 2023; originally announced September 2023.

Comments: 23 pages, 6 figures, 1 table

arXiv:2308.12624 [pdf, other]

doi 10.1088/1367-2630/acf33a

Predator-prey survival pressure is sufficient to evolve swarming behaviors

Authors: Jianan Li, Liang Li, Shiyu Zhao

Abstract: The comprehension of how local interactions arise in global collective behavior is of utmost importance in both biological and physical research. Traditional agent-based models often rely on static rules that fail to capture the dynamic strategies of the biological world. Reinforcement learning has been proposed as a solution, but most previous methods adopt handcrafted reward functions that impli… ▽ More The comprehension of how local interactions arise in global collective behavior is of utmost importance in both biological and physical research. Traditional agent-based models often rely on static rules that fail to capture the dynamic strategies of the biological world. Reinforcement learning has been proposed as a solution, but most previous methods adopt handcrafted reward functions that implicitly or explicitly encourage the emergence of swarming behaviors. In this study, we propose a minimal predator-prey coevolution framework based on mixed cooperative-competitive multiagent reinforcement learning, and adopt a reward function that is solely based on the fundamental survival pressure, that is, prey receive a reward of $-1$ if caught by predators while predators receive a reward of $+1$. Surprisingly, our analysis of this approach reveals an unexpectedly rich diversity of emergent behaviors for both prey and predators, including flocking and swirling behaviors for prey, as well as dispersion tactics, confusion, and marginal predation phenomena for predators. Overall, our study provides novel insights into the collective behavior of organisms and highlights the potential applications in swarm robotics. △ Less

Submitted 24 August, 2023; originally announced August 2023.

arXiv:2307.11870 [pdf, other]

Conditional Temporal Attention Networks for Neonatal Cortical Surface Reconstruction

Authors: Qiang Ma, Liu Li, Vanessa Kyriakopoulou, Joseph Hajnal, Emma C. Robinson, Bernhard Kainz, Daniel Rueckert

Abstract: Cortical surface reconstruction plays a fundamental role in modeling the rapid brain development during the perinatal period. In this work, we propose Conditional Temporal Attention Network (CoTAN), a fast end-to-end framework for diffeomorphic neonatal cortical surface reconstruction. CoTAN predicts multi-resolution stationary velocity fields (SVF) from neonatal brain magnetic resonance images (M… ▽ More Cortical surface reconstruction plays a fundamental role in modeling the rapid brain development during the perinatal period. In this work, we propose Conditional Temporal Attention Network (CoTAN), a fast end-to-end framework for diffeomorphic neonatal cortical surface reconstruction. CoTAN predicts multi-resolution stationary velocity fields (SVF) from neonatal brain magnetic resonance images (MRI). Instead of integrating multiple SVFs, CoTAN introduces attention mechanisms to learn a conditional time-varying velocity field (CTVF) by computing the weighted sum of all SVFs at each integration step. The importance of each SVF, which is estimated by learned attention maps, is conditioned on the age of the neonates and varies with the time step of integration. The proposed CTVF defines a diffeomorphic surface deformation, which reduces mesh self-intersection errors effectively. It only requires 0.21 seconds to deform an initial template mesh to cortical white matter and pial surfaces for each brain hemisphere. CoTAN is validated on the Develo** Human Connectome Project (dHCP) dataset with 877 3D brain MR images acquired from preterm and term born neonates. Compared to state-of-the-art baselines, CoTAN achieves superior performance with only 0.12mm geometric error and 0.07% self-intersecting faces. The visualization of our attention maps illustrates that CoTAN indeed learns coarse-to-fine surface deformations automatically without intermediate supervision. △ Less

Submitted 21 July, 2023; originally announced July 2023.

Comments: Accepted by the 26th International Conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2023

arXiv:2307.08848 [pdf]

Microbiome-derived bile acids contribute to elevated antigenic response and bone erosion in rheumatoid arthritis

Authors: Xiuli Su, Xiaona Li, Yanqin Bian, Qing Ren, Leiguang Li, Xiaohao Wu, Hemi Luan, Bing He, Xiaojuan He, Hui Feng, Xingye Cheng, Pan-Jun Kim, Leihan Tang, Ai** Lu, Lianbo Xiao, Liang Tian, Zhu Yang, Zongwei Cai

Abstract: Rheumatoid arthritis (RA) is a chronic, disabling and incurable autoimmune disease. It has been widely recognized that gut microbial dysbiosis is an important contributor to the pathogenesis of RA, although distinct alterations in microbiota have been associated with this disease. Yet, the metabolites that mediate the impacts of the gut microbiome on RA are less well understood. Here, with microbi… ▽ More Rheumatoid arthritis (RA) is a chronic, disabling and incurable autoimmune disease. It has been widely recognized that gut microbial dysbiosis is an important contributor to the pathogenesis of RA, although distinct alterations in microbiota have been associated with this disease. Yet, the metabolites that mediate the impacts of the gut microbiome on RA are less well understood. Here, with microbial profiling and non-targeted metabolomics, we revealed profound yet diverse perturbation of the gut microbiome and metabolome in RA patients in a discovery set. In the Bacteroides-dominated RA patients, differentiation of gut microbiome resulted in distinct bile acid profiles compared to healthy subjects. Predominated Bacteroides species expressing BSH and 7a-HSDH increased, leading to elevated secondary bile acid production in this subgroup of RA patients. Reduced serum fibroblast growth factor-19 and dysregulated bile acids were evidence of impaired farnesoid X receptor-mediated signaling in the patients. This gut microbiota-bile acid axis was correlated to ACPA. The patients from the validation sets demonstrated that ACPA-positive patients have more abundant bacteria expressing BSH and 7a-HSDH but less Clostridium scindens expressing 7a-dehydroxylation enzymes, together with dysregulated microbial bile acid metabolism and more severe bone erosion than ACPA-negative ones. Mediation analyses revealed putative causal relationships between the gut microbiome, bile acids, and ACPA-positive RA, supporting a potential causal effect of Bacteroides species in increasing levels of ACPA and bone erosion mediated via disturbing bile acid metabolism. These results provide insights into the role of gut dysbiosis in RA in a manifestation-specific manner, as well as the functions of bile acids in this gut-joint axis, which may be a potential intervention target for precisely controlling RA conditions. △ Less

Submitted 14 July, 2023; originally announced July 2023.

Comments: 38 pages, 6 figures

arXiv:2305.11908 [pdf, other]

Sequential Best-Arm Identification with Application to Brain-Computer Interface

Authors: Xin Zhou, Botao Hao, Jian Kang, Tor Lattimore, Lexin Li

Abstract: A brain-computer interface (BCI) is a technology that enables direct communication between the brain and an external device or computer system. It allows individuals to interact with the device using only their thoughts, and holds immense potential for a wide range of applications in medicine, rehabilitation, and human augmentation. An electroencephalogram (EEG) and event-related potential (ERP)-b… ▽ More A brain-computer interface (BCI) is a technology that enables direct communication between the brain and an external device or computer system. It allows individuals to interact with the device using only their thoughts, and holds immense potential for a wide range of applications in medicine, rehabilitation, and human augmentation. An electroencephalogram (EEG) and event-related potential (ERP)-based speller system is a type of BCI that allows users to spell words without using a physical keyboard, but instead by recording and interpreting brain signals under different stimulus presentation paradigms. Conventional non-adaptive paradigms treat each word selection independently, leading to a lengthy learning process. To improve the sampling efficiency, we cast the problem as a sequence of best-arm identification tasks in multi-armed bandits. Leveraging pre-trained large language models (LLMs), we utilize the prior knowledge learned from previous tasks to inform and facilitate subsequent tasks. To do so in a coherent way, we propose a sequential top-two Thompson sampling (STTS) algorithm under the fixed-confidence setting and the fixed-budget setting. We study the theoretical property of the proposed algorithm, and demonstrate its substantial empirical improvement through both synthetic data analysis as well as a P300 BCI speller simulator example. △ Less

Submitted 17 May, 2023; originally announced May 2023.

arXiv:2305.00386 [pdf, other]

Importance Weighted Expectation-Maximization for Protein Sequence Design

Authors: Zhenqiao Song, Lei Li

Abstract: Designing protein sequences with desired biological function is crucial in biology and chemistry. Recent machine learning methods use a surrogate sequence-function model to replace the expensive wet-lab validation. How can we efficiently generate diverse and novel protein sequences with high fitness? In this paper, we propose IsEM-Pro, an approach to generate protein sequences towards a given fitn… ▽ More Designing protein sequences with desired biological function is crucial in biology and chemistry. Recent machine learning methods use a surrogate sequence-function model to replace the expensive wet-lab validation. How can we efficiently generate diverse and novel protein sequences with high fitness? In this paper, we propose IsEM-Pro, an approach to generate protein sequences towards a given fitness criterion. At its core, IsEM-Pro is a latent generative model, augmented by combinatorial structure features from a separately learned Markov random fields (MRFs). We develop an Monte Carlo Expectation-Maximization method (MCEM) to learn the model. During inference, sampling from its latent space enhances diversity while its MRFs features guide the exploration in high fitness regions. Experiments on eight protein sequence design tasks show that our IsEM-Pro outperforms the previous best methods by at least 55% on average fitness score and generates more diverse and novel protein sequences. △ Less

Submitted 28 June, 2024; v1 submitted 30 April, 2023; originally announced May 2023.

arXiv:2302.03227 [pdf, other]

Automatic Sleep Stage Classification with Cross-modal Self-supervised Features from Deep Brain Signals

Authors: Chen Gong, Yue Chen, Yanan Sui, Luming Li

Abstract: The detection of human sleep stages is widely used in the diagnosis and intervention of neurological and psychiatric diseases. Some patients with deep brain stimulator implanted could have their neural activities recorded from the deep brain. Sleep stage classification based on deep brain recording has great potential to provide more precise treatment for patients. The accuracy and generalizabilit… ▽ More The detection of human sleep stages is widely used in the diagnosis and intervention of neurological and psychiatric diseases. Some patients with deep brain stimulator implanted could have their neural activities recorded from the deep brain. Sleep stage classification based on deep brain recording has great potential to provide more precise treatment for patients. The accuracy and generalizability of existing sleep stage classifiers based on local field potentials are still limited. We proposed an applicable cross-modal transfer learning method for sleep stage classification with implanted devices. This end-to-end deep learning model contained cross-modal self-supervised feature representation, self-attention, and classification framework. We tested the model with deep brain recording data from 12 patients with Parkinson's disease. The best total accuracy reached 83.2% for sleep stage classification. Results showed speech self-supervised features catch the conversion pattern of sleep stages effectively. We provide a new method on transfer learning from acoustic signals to local field potentials. This method supports an effective solution for the insufficient scale of clinical data. This sleep stage classification model could be adapted to chronic and continuous monitor sleep for Parkinson's patients in daily life, and potentially utilized for more precise treatment in deep brain-machine interfaces, such as closed-loop deep brain stimulation. △ Less

Submitted 6 February, 2023; originally announced February 2023.

Comments: 4 pages, 5 figures, 11th International IEEE EMBS Conference on Neural Engineering (NER)

arXiv:2212.09450 [pdf, other]

doi 10.1145/3580305.3599249

Accelerating Antimicrobial Peptide Discovery with Latent Structure

Authors: Danqing Wang, Zeyu Wen, Fei Ye, Lei Li, Hao Zhou

Abstract: Antimicrobial peptides (AMPs) are promising therapeutic approaches against drug-resistant pathogens. Recently, deep generative models are used to discover new AMPs. However, previous studies mainly focus on peptide sequence attributes and do not consider crucial structure information. In this paper, we propose a latent sequence-structure model for designing AMPs (LSSAMP). LSSAMP exploits multi-sca… ▽ More Antimicrobial peptides (AMPs) are promising therapeutic approaches against drug-resistant pathogens. Recently, deep generative models are used to discover new AMPs. However, previous studies mainly focus on peptide sequence attributes and do not consider crucial structure information. In this paper, we propose a latent sequence-structure model for designing AMPs (LSSAMP). LSSAMP exploits multi-scale vector quantization in the latent space to represent secondary structures (e.g. alpha helix and beta sheet). By sampling in the latent space, LSSAMP can simultaneously generate peptides with ideal sequence attributes and secondary structures. Experimental results show that the peptides generated by LSSAMP have a high probability of antimicrobial activity. Our wet laboratory experiments verified that two of the 21 candidates exhibit strong antimicrobial activity. The code is released at https://github.com/dqwang122/LSSAMP. △ Less

Submitted 20 August, 2023; v1 submitted 28 November, 2022; originally announced December 2022.

Comments: KDD 2023

arXiv:2210.02881 [pdf, other]

Antibody Representation Learning for Drug Discovery

Authors: Lin Li, Esther Gupta, John Spaeth, Leslie Shing, Tristan Bepler, Rajmonda Sulo Caceres

Abstract: Therapeutic antibody development has become an increasingly popular approach for drug development. To date, antibody therapeutics are largely developed using large scale experimental screens of antibody libraries containing hundreds of millions of antibody sequences. The high cost and difficulty of develo** therapeutic antibodies create a pressing need for computational methods to predict antibo… ▽ More Therapeutic antibody development has become an increasingly popular approach for drug development. To date, antibody therapeutics are largely developed using large scale experimental screens of antibody libraries containing hundreds of millions of antibody sequences. The high cost and difficulty of develo** therapeutic antibodies create a pressing need for computational methods to predict antibody properties and create bespoke designs. However, the relationship between antibody sequence and activity is a complex physical process and traditional iterative design approaches rely on large scale assays and random mutagenesis. Deep learning methods have emerged as a promising way to learn antibody property predictors, but predicting antibody properties and target-specific activities depends critically on the choice of antibody representations and data linking sequences to properties is often limited. Existing works have not yet investigated the value, limitations and opportunities of these methods in application to antibody-based drug discovery. In this paper, we present results on a novel SARS-CoV-2 antibody binding dataset and an additional benchmark dataset. We compare three classes of models: conventional statistical sequence models, supervised learning on each dataset independently, and fine-tuning an antibody specific pre-trained language model. Experimental results suggest that self-supervised pretraining of feature representation consistently offers significant improvement in over previous approaches. We also investigate the impact of data size on the model performance, and discuss challenges and opportunities that the machine learning community can address to advance in silico engineering and design of therapeutic antibodies. △ Less

Submitted 5 October, 2022; originally announced October 2022.

arXiv:2209.07921 [pdf, other]

ImDrug: A Benchmark for Deep Imbalanced Learning in AI-aided Drug Discovery

Authors: Lanqing Li, Liang Zeng, Ziqi Gao, Shen Yuan, Yatao Bian, Bingzhe Wu, Hengtong Zhang, Yang Yu, Chan Lu, Zhipeng Zhou, Hongteng Xu, Jia Li, Peilin Zhao, Pheng-Ann Heng

Abstract: The last decade has witnessed a prosperous development of computational methods and dataset curation for AI-aided drug discovery (AIDD). However, real-world pharmaceutical datasets often exhibit highly imbalanced distribution, which is overlooked by the current literature but may severely compromise the fairness and generalization of machine learning applications. Motivated by this observation, we… ▽ More The last decade has witnessed a prosperous development of computational methods and dataset curation for AI-aided drug discovery (AIDD). However, real-world pharmaceutical datasets often exhibit highly imbalanced distribution, which is overlooked by the current literature but may severely compromise the fairness and generalization of machine learning applications. Motivated by this observation, we introduce ImDrug, a comprehensive benchmark with an open-source Python library which consists of 4 imbalance settings, 11 AI-ready datasets, 54 learning tasks and 16 baseline algorithms tailored for imbalanced learning. It provides an accessible and customizable testbed for problems and solutions spanning a broad spectrum of the drug discovery pipeline such as molecular modeling, drug-target interaction and retrosynthesis. We conduct extensive empirical studies with novel evaluation metrics, to demonstrate that the existing algorithms fall short of solving medicinal and pharmaceutical challenges in the data imbalance scenario. We believe that ImDrug opens up avenues for future research and development, on real-world challenges at the intersection of AIDD and deep imbalanced learning. △ Less

Submitted 17 October, 2022; v1 submitted 16 September, 2022; originally announced September 2022.

Comments: 29 pages, 7 figures, 8 tables, a machine learning benchmark submission

arXiv:2209.07405 [pdf]

Widely Used and Fast De Novo Drug Design by a Protein Sequence-Based Reinforcement Learning Model

Authors: Yaqin Li, Lingli Li, Yong** Xu, Yi Yu

Abstract: De novo molecular design has facilitated the exploration of large chemical space to accelerate drug discovery. Structure-based de novo method can overcome the data scarcity of active ligands by incorporating drug-target interaction into deep generative architectures. However, these strategies are bottlenecked by the small fraction of experimentally determined protein or complex structures. In addi… ▽ More De novo molecular design has facilitated the exploration of large chemical space to accelerate drug discovery. Structure-based de novo method can overcome the data scarcity of active ligands by incorporating drug-target interaction into deep generative architectures. However, these strategies are bottlenecked by the small fraction of experimentally determined protein or complex structures. In addition, the cost of molecular generation is computationally expensive due to 3D representations of both molecule and protein. Here, we demonstrate a widely used and fast protein sequence-based reinforcement learning (RL) model for drug discovery. In the generative model, one of the reward components, a binding affinity predictor, is based on 1D protein sequence and molecular SMILES. As a proof of concept, the RL model was utilized to design molecules for four targets. The generated compounds showed bioactivities by the validation of both QSAR and molecular docking with experimental 3D binding pockets. We also found that the performance of generated molecules depends on the selection of data source training for the binding predictor. Furthermore, drug design for a kinase without any experimental structure, CDK20, was studied by our model. With only 1D protein sequence as input, the generated novel compounds showed favorable binding affinity based on the AlphaFold predicted structure. △ Less

Submitted 14 August, 2022; originally announced September 2022.

arXiv:2208.11517 [pdf, other]

EpiGNN: Exploring Spatial Transmission with Graph Neural Network for Regional Epidemic Forecasting

Authors: Feng Xie, Zhong Zhang, Liang Li, Bin Zhou, Yusong Tan

Abstract: Epidemic forecasting is the key to effective control of epidemic transmission and helps the world mitigate the crisis that threatens public health. To better understand the transmission and evolution of epidemics, we propose EpiGNN, a graph neural network-based model for epidemic forecasting. Specifically, we design a transmission risk encoding module to characterize local and global spatial effec… ▽ More Epidemic forecasting is the key to effective control of epidemic transmission and helps the world mitigate the crisis that threatens public health. To better understand the transmission and evolution of epidemics, we propose EpiGNN, a graph neural network-based model for epidemic forecasting. Specifically, we design a transmission risk encoding module to characterize local and global spatial effects of regions in epidemic processes and incorporate them into the model. Meanwhile, we develop a Region-Aware Graph Learner (RAGL) that takes transmission risk, geographical dependencies, and temporal information into account to better explore spatial-temporal dependencies and makes regions aware of related regions' epidemic situations. The RAGL can also combine with external resources, such as human mobility, to further improve prediction performance. Comprehensive experiments on five real-world epidemic-related datasets (including influenza and COVID-19) demonstrate the effectiveness of our proposed method and show that EpiGNN outperforms state-of-the-art baselines by 9.48% in RMSE. △ Less

Submitted 23 August, 2022; originally announced August 2022.

Comments: 16 pages, 6 figures, ECML-PKDD2022

arXiv:2205.03583 [pdf]

Scanning Electron Microscopy and Metabolite Measurement Revealed the Stress Mechanism of PS-COOH Microplastics on Rhodotorula mucilaginosa AN5

Authors: Jiahao Ma, Xiangfei Meng, Zixin Li, Lexian Li, Jiwen Xu, Guangfeng Kan

Abstract: Microplastics in the marine environment have been paid more and more attention by researchers, and the impact of these substances on marine microorganisms can not be ignored. Studies have shown that PS-COOH Microplastics are harmful to marine molluscs, algae and monads. This study explore the effect and mechanism of microplastics (80 nm PS-COOH) on Antarctic marine yeast, Rhodotorula mucilaginosa… ▽ More Microplastics in the marine environment have been paid more and more attention by researchers, and the impact of these substances on marine microorganisms can not be ignored. Studies have shown that PS-COOH Microplastics are harmful to marine molluscs, algae and monads. This study explore the effect and mechanism of microplastics (80 nm PS-COOH) on Antarctic marine yeast, Rhodotorula mucilaginosa AN5 by bacterial count, Scanning Electron Microscopy (SEM) and metabolite analysis. The results illustrates that a 50 mg/L concentration of PS-COOH could inhibit 36.15% growth of yeast cells and 10 mg/L inhibit 80.20%. Microplastics stress causes changes in the content of some oxidative stress substances, including reactive oxygen species (ROS) 42.86% , malondialdehyde (MDA) 54.06% content and the activities of antioxidant enzymes such as catalase (CAT) 36.00% , peroxidase (POD) 66.67% and superoxide dismutase (SOD) 25.40%. These results revealed the possible stress effect of microplastic pollution on marine yeast and may affect bottom layer of marine ecosystem. △ Less

Submitted 13 September, 2022; v1 submitted 7 May, 2022; originally announced May 2022.

arXiv:2204.06714 [pdf, other]

graph-GPA 2.0: A Graphical Model for Multi-disease Analysis of GWAS Results with Integration of Functional Annotation Data

Authors: Qiaolan Deng, ** Hyun Nam, Ayse Selen Yilmaz, Won Chang, Maciej Pietrzak, Lang Li, Hang J. Kim, Dongjun Chung

Abstract: Genome-wide association studies (GWAS) have successfully identified a large number of genetic variants associated with traits and diseases. However, it still remains challenging to fully understand functional mechanisms underlying many associated variants. This is especially the case when we are interested in variants shared across multiple phenotypes. To address this challenge, we propose graph-G… ▽ More Genome-wide association studies (GWAS) have successfully identified a large number of genetic variants associated with traits and diseases. However, it still remains challenging to fully understand functional mechanisms underlying many associated variants. This is especially the case when we are interested in variants shared across multiple phenotypes. To address this challenge, we propose graph-GPA 2.0 (GGPA 2.0), a novel statistical framework to integrate GWAS datasets for multiple phenotypes and incorporate functional annotations within a unified framework. We conducted simulation studies to evaluate GGPA 2.0. The results indicate that incorporating functional annotation data using GGPA 2.0 does not only improve detection of disease-associated variants, but also allows to identify more accurate relationships among diseases. We analyzed five autoimmune diseases and five psychiatric disorders with the functional annotations derived from GenoSkyline and GenoSkyline-Plus and the prior disease graph generated by biomedical literature mining. For autoimmune diseases, GGPA 2.0 identified enrichment for blood, especially B cells and regulatory T cells across multiple diseases. Psychiatric disorders were enriched for brain, especially prefrontal cortex and inferior temporal lobe for bipolar disorder (BIP) and schizophrenia (SCZ), respectively. Finally, GGPA 2.0 successfully identified the pleiotropy between BIP and SCZ. These results demonstrate that GGPA 2.0 can be a powerful tool to identify associated variants associated with each phenotype or those shared across multiple phenotypes, while also promoting understanding of functional mechanisms underlying the associated variants. △ Less

Submitted 13 April, 2022; originally announced April 2022.

arXiv:2201.09637 [pdf, other]

DrugOOD: Out-of-Distribution (OOD) Dataset Curator and Benchmark for AI-aided Drug Discovery -- A Focus on Affinity Prediction Problems with Noise Annotations

Authors: Yuanfeng Ji, Lu Zhang, Jiaxiang Wu, Bingzhe Wu, Long-Kai Huang, Tingyang Xu, Yu Rong, Lanqing Li, Jie Ren, Ding Xue, Houtim Lai, Shaoyong Xu, **g Feng, Wei Liu, ** Luo, Shuigeng Zhou, Junzhou Huang, Peilin Zhao, Yatao Bian

Abstract: AI-aided drug discovery (AIDD) is gaining increasing popularity due to its promise of making the search for new pharmaceuticals quicker, cheaper and more efficient. In spite of its extensive use in many fields, such as ADMET prediction, virtual screening, protein folding and generative chemistry, little has been explored in terms of the out-of-distribution (OOD) learning problem with \emph{noise},… ▽ More AI-aided drug discovery (AIDD) is gaining increasing popularity due to its promise of making the search for new pharmaceuticals quicker, cheaper and more efficient. In spite of its extensive use in many fields, such as ADMET prediction, virtual screening, protein folding and generative chemistry, little has been explored in terms of the out-of-distribution (OOD) learning problem with \emph{noise}, which is inevitable in real world AIDD applications. In this work, we present DrugOOD, a systematic OOD dataset curator and benchmark for AI-aided drug discovery, which comes with an open-source Python package that fully automates the data curation and OOD benchmarking processes. We focus on one of the most crucial problems in AIDD: drug target binding affinity prediction, which involves both macromolecule (protein target) and small-molecule (drug compound). In contrast to only providing fixed datasets, DrugOOD offers automated dataset curator with user-friendly customization scripts, rich domain annotations aligned with biochemistry knowledge, realistic noise annotations and rigorous benchmarking of state-of-the-art OOD algorithms. Since the molecular data is often modeled as irregular graphs using graph neural network (GNN) backbones, DrugOOD also serves as a valuable testbed for \emph{graph OOD learning} problems. Extensive empirical studies have shown a significant performance gap between in-distribution and out-of-distribution experiments, which highlights the need to develop better schemes that can allow for OOD generalization under noise for AIDD. △ Less

Submitted 24 January, 2022; originally announced January 2022.

Comments: 54 pages, 11 figures

arXiv:2105.13582 [pdf, other]

Cysteine post-translational modifications: ten years from chemical proteomics to bioinformatics

Authors: Yanzheng Meng, Lei Li

Abstract: As the only thiol-bearing amino acid, cysteine (Cys) residues in proteins have the reactive thiol side chain, which is susceptible to a series of post-translational modifications (PTMs). These PTMs participate in a wide range of biological activities including the alteration of enzymatic reactions, protein-protein interactions and protein stability. Here we summarize the advance of cysteine PTM id… ▽ More As the only thiol-bearing amino acid, cysteine (Cys) residues in proteins have the reactive thiol side chain, which is susceptible to a series of post-translational modifications (PTMs). These PTMs participate in a wide range of biological activities including the alteration of enzymatic reactions, protein-protein interactions and protein stability. Here we summarize the advance of cysteine PTM identification technologies and the features of the various kinds of the PTMs. We also discuss in silico approaches for the prediction of the different types of cysteine modified sites, giving directions for future study. △ Less

Submitted 28 May, 2021; originally announced May 2021.

arXiv:2103.15182 [pdf]

Verifying Design through Generative Visualization of Neural Activities

Authors: Pan Wang, Danlin Peng, Simiao Yu, Chao Wu, Peter Childs, Yike Guo, Ling Li

Abstract: Current neuroscience focused approaches for evaluating the effectiveness of a design do not use direct visualisation of mental activity. A recurrent neural network is used as the encoder to learn latent representation from electroencephalogram (EEG) signals, recorded while subjects looked at 50 categories of images. A generative adversarial network (GAN) conditioned on the EEG latent representatio… ▽ More Current neuroscience focused approaches for evaluating the effectiveness of a design do not use direct visualisation of mental activity. A recurrent neural network is used as the encoder to learn latent representation from electroencephalogram (EEG) signals, recorded while subjects looked at 50 categories of images. A generative adversarial network (GAN) conditioned on the EEG latent representation is trained for reconstructing these images. After training, the neural network is able to reconstruct images from brain activity recordings. To demonstrate the proposed method in the context of the mental association with a design, we performed a study that indicates an iconic design image could inspire the subject to create cognitive associations with branding and valued products. The proposed method could have the potential in verifying designs by visualizing the cognitive understanding of underlying brain activity. △ Less

Submitted 28 March, 2021; originally announced March 2021.

arXiv:2103.10432 [pdf, other]

MARS: Markov Molecular Sampling for Multi-objective Drug Discovery

Authors: Yutong Xie, Chence Shi, Hao Zhou, Yuwei Yang, Weinan Zhang, Yong Yu, Lei Li

Abstract: Searching for novel molecules with desired chemical properties is crucial in drug discovery. Existing work focuses on develo** neural models to generate either molecular sequences or chemical graphs. However, it remains a big challenge to find novel and diverse compounds satisfying several properties. In this paper, we propose MARS, a method for multi-objective drug molecule discovery. MARS is b… ▽ More Searching for novel molecules with desired chemical properties is crucial in drug discovery. Existing work focuses on develo** neural models to generate either molecular sequences or chemical graphs. However, it remains a big challenge to find novel and diverse compounds satisfying several properties. In this paper, we propose MARS, a method for multi-objective drug molecule discovery. MARS is based on the idea of generating the chemical candidates by iteratively editing fragments of molecular graphs. To search for high-quality candidates, it employs Markov chain Monte Carlo sampling (MCMC) on molecules with an annealing scheme and an adaptive proposal. To further improve sample efficiency, MARS uses a graph neural network (GNN) to represent and select candidate edits, where the GNN is trained on-the-fly with samples from MCMC. Experiments show that MARS achieves state-of-the-art performance in various multi-objective settings where molecular bio-activity, drug-likeness, and synthesizability are considered. Remarkably, in the most challenging setting where all four objectives are simultaneously optimized, our approach outperforms previous methods significantly in comprehensive evaluations. The code is available at https://github.com/yutxie/mars. △ Less

Submitted 18 March, 2021; originally announced March 2021.

Comments: ICLR 2021

arXiv:2102.07309 [pdf, other]

doi 10.1002/nav.22007

Where to locate COVID-19 mass vaccination facilities?

Authors: Dimitris Bertsimas, Vassilis Digalakis Jr., Alexander Jacquillat, Michael Lingzhi Li, Alessandro Previero

Abstract: The outbreak of COVID-19 led to a record-breaking race to develop a vaccine. However, the limited vaccine capacity creates another massive challenge: how to distribute vaccines to mitigate the near-end impact of the pandemic? In the United States in particular, the new Biden administration is launching mass vaccination sites across the country, raising the obvious question of where to locate these… ▽ More The outbreak of COVID-19 led to a record-breaking race to develop a vaccine. However, the limited vaccine capacity creates another massive challenge: how to distribute vaccines to mitigate the near-end impact of the pandemic? In the United States in particular, the new Biden administration is launching mass vaccination sites across the country, raising the obvious question of where to locate these clinics to maximize the effectiveness of the vaccination campaign. This paper tackles this question with a novel data-driven approach to optimize COVID-19 vaccine distribution. We first augment a state-of-the-art epidemiological model, called DELPHI, to capture the effects of vaccinations and the variability in mortality rates across age groups. We then integrate this predictive model into a prescriptive model to optimize the location of vaccination sites and subsequent vaccine allocation. The model is formulated as a bilinear, non-convex optimization model. To solve it, we propose a coordinate descent algorithm that iterates between optimizing vaccine distribution and simulating the dynamics of the pandemic. As compared to benchmarks based on demographic and epidemiological information, the proposed optimization approach increases the effectiveness of the vaccination campaign by an estimated $20\%$, saving an extra $4000$ extra lives in the United States over a three-month period. The proposed solution achieves critical fairness objectives -- by reducing the death toll of the pandemic in several states without hurting others -- and is highly robust to uncertainties and forecast errors -- by achieving similar benefits under a vast range of perturbations. △ Less

Submitted 18 July, 2021; v1 submitted 14 February, 2021; originally announced February 2021.

arXiv:2102.05236 [pdf, other]

A General Framework for Revealing Human Mind with auto-encoding GANs

Authors: Pan Wang, Rui Zhou, Shuo Wang, Ling Li, Wenjia Bai, Jialu Fan, Chunlin Li, Peter Childs, Yike Guo

Abstract: Addressing the question of visualising human mind could help us to find regions that are associated with observed cognition and responsible for expressing the elusive mental image, leading to a better understanding of cognitive function. The traditional approach treats brain decoding as a classification problem, reading the mind through statistical analysis of brain activity. However, human though… ▽ More Addressing the question of visualising human mind could help us to find regions that are associated with observed cognition and responsible for expressing the elusive mental image, leading to a better understanding of cognitive function. The traditional approach treats brain decoding as a classification problem, reading the mind through statistical analysis of brain activity. However, human thought is rich and varied, that it is often influenced by more of a combination of object features than a specific type of category. For this reason, we propose an end-to-end brain decoding framework which translates brain activity into an image by latent space alignment. To find the correspondence from brain signal features to image features, we embedded them into two latent spaces with modality-specific encoders and then aligned the two spaces by minimising the distance between paired latent representations. The proposed framework was trained by simultaneous electroencephalogram and functional MRI data, which were recorded when the subjects were viewing or imagining a set of image stimuli. In this paper, we focused on implementing the fMRI experiment. Our experimental results demonstrated the feasibility of translating brain activity to an image. The reconstructed image matches image stimuli approximate in both shape and colour. Our framework provides a promising direction for building a direct visualisation to reveal human mind. △ Less

Submitted 9 February, 2021; originally announced February 2021.

arXiv:2101.01532 [pdf]

Bayesian data assimilation for estimating epidemic evolution: a COVID-19 study

Authors: Xian Yang, Shuo Wang, Yuting Xing, Ling Li, Richard Yi Da Xu, Karl J. Friston, Yike Guo

Abstract: The evolution of epidemiological parameters, such as instantaneous reproduction number Rt, is important for understanding the transmission dynamics of infectious diseases. Current estimates of time-varying epidemiological parameters often face problems such as lagging observations, averaging inference, and improper quantification of uncertainties. To address these problems, we propose a Bayesian d… ▽ More The evolution of epidemiological parameters, such as instantaneous reproduction number Rt, is important for understanding the transmission dynamics of infectious diseases. Current estimates of time-varying epidemiological parameters often face problems such as lagging observations, averaging inference, and improper quantification of uncertainties. To address these problems, we propose a Bayesian data assimilation framework for time-varying parameter estimation. Specifically, this framework is applied to Rt estimation, resulting in the state-of-the-art DARt system. With DARt, time misalignment caused by lagging observations is tackled by incorporating observation delays into the joint inference of infections and Rt; the drawback of averaging is overcome by instantaneously updating upon new observations and develo** a model selection mechanism that captures abrupt changes; the uncertainty is quantified and reduced by employing Bayesian smoothing. We validate the performance of DARt and demonstrate its power in revealing the transmission dynamics of COVID-19. The proposed approach provides a promising solution for accurate and timely estimating transmission dynamics from reported data. △ Less

Submitted 24 October, 2021; v1 submitted 22 December, 2020; originally announced January 2021.

Comments: Xian Yang, Shuo Wang and Yuting Xing contribute equally

arXiv:2007.13437 [pdf, other]

Energy-based View of Retrosynthesis

Authors: Ruoxi Sun, Hanjun Dai, Li Li, Steven Kearnes, Bo Dai

Abstract: Retrosynthesis -- the process of identifying a set of reactants to synthesize a target molecule -- is of vital importance to material design and drug discovery. Existing machine learning approaches based on language models and graph neural networks have achieved encouraging results. In this paper, we propose a framework that unifies sequence- and graph-based methods as energy-based models (EBMs) w… ▽ More Retrosynthesis -- the process of identifying a set of reactants to synthesize a target molecule -- is of vital importance to material design and drug discovery. Existing machine learning approaches based on language models and graph neural networks have achieved encouraging results. In this paper, we propose a framework that unifies sequence- and graph-based methods as energy-based models (EBMs) with different energy functions. This unified perspective provides critical insights about EBM variants through a comprehensive assessment of performance. Additionally, we present a novel dual variant within the framework that performs consistent training over Bayesian forward- and backward-prediction by constraining the agreement between the two directions. This model improves state-of-the-art performance by 9.6% for template-free approaches where the reaction type is unknown. △ Less

Submitted 8 December, 2021; v1 submitted 14 July, 2020; originally announced July 2020.

arXiv:2007.07202 [pdf]

Evaluating Incidence and Impact Estimates of the COVID-19 Outbreak from Wuhan before Lockdown

Authors: Mai He, Li Li, Louis P. Dehner

Abstract: Background: Wuhan, China was the epicenter of COVID-19 pandemic. The goal of current study is to understand the infection transmission dynamics before intervention measures were taken. Methods: Data and key events were searched through pubmed and internet. Epidemiological data were calculated using data extracted from a variety of data sources. Results: We established a timeline showing by January… ▽ More Background: Wuhan, China was the epicenter of COVID-19 pandemic. The goal of current study is to understand the infection transmission dynamics before intervention measures were taken. Methods: Data and key events were searched through pubmed and internet. Epidemiological data were calculated using data extracted from a variety of data sources. Results: We established a timeline showing by January 1, 2020, Chinese authorities had been presented convincing evidence of human-to-human transmission; however, it was not until January 20, 2020 that this information was shared with the public. Our study estimated that there would have been 10989 total infected cases if interventions were taken on January 2, 2020, versus 239875 cases when lockdown was put in place on January 23, 2020. Conclusions: China's withholding of key information about the 2020 COVID-19 outbreak and its delayed response ultimately led to the largest public health crisis of this century and could have been avoided with earlier countermeasures. △ Less

Submitted 10 July, 2020; originally announced July 2020.

Comments: Three tables and three figures

arXiv:2006.16509 [pdf, other]

From predictions to prescriptions: A data-driven response to COVID-19

Authors: Dimitris Bertsimas, Léonard Boussioux, Ryan Cory Wright, Arthur Delarue, Vassilis Digalakis Jr., Alexandre Jacquillat, Driss Lahlou Kitane, Galit Lukin, Michael Lingzhi Li, Luca Mingardi, Omid Nohadani, Agni Orfanoudaki, Theodore Papalexopoulos, Ivan Paskov, Jean Pauphilet, Omar Skali Lami, Bartolomeo Stellato, Hamza Tazi Bouardi, Kimberly Villalobos Carballo, Holly Wiberg, Cynthia Zeng

Abstract: The COVID-19 pandemic has created unprecedented challenges worldwide. Strained healthcare providers make difficult decisions on patient triage, treatment and care management on a daily basis. Policy makers have imposed social distancing measures to slow the disease, at a steep economic price. We design analytical tools to support these decisions and combat the pandemic. Specifically, we propose a… ▽ More The COVID-19 pandemic has created unprecedented challenges worldwide. Strained healthcare providers make difficult decisions on patient triage, treatment and care management on a daily basis. Policy makers have imposed social distancing measures to slow the disease, at a steep economic price. We design analytical tools to support these decisions and combat the pandemic. Specifically, we propose a comprehensive data-driven approach to understand the clinical characteristics of COVID-19, predict its mortality, forecast its evolution, and ultimately alleviate its impact. By leveraging cohort-level clinical data, patient-level hospital data, and census-level epidemiological data, we develop an integrated four-step approach, combining descriptive, predictive and prescriptive analytics. First, we aggregate hundreds of clinical studies into the most comprehensive database on COVID-19 to paint a new macroscopic picture of the disease. Second, we build personalized calculators to predict the risk of infection and mortality as a function of demographics, symptoms, comorbidities, and lab values. Third, we develop a novel epidemiological model to project the pandemic's spread and inform social distancing policies. Fourth, we propose an optimization model to re-allocate ventilators and alleviate shortages. Our results have been used at the clinical level by several hospitals to triage patients, guide care management, plan ICU capacity, and re-distribute ventilators. At the policy level, they are currently supporting safe back-to-work policies at a major institution and equitable vaccine distribution planning at a major pharmaceutical company, and have been integrated into the US Center for Disease Control's pandemic forecast. △ Less

Submitted 29 June, 2020; originally announced June 2020.

Comments: Submitted to PNAS

arXiv:2006.12177 [pdf]

doi 10.1109/MCI.2020.3019874

A Bayesian Updating Scheme for Pandemics: Estimating the Infection Dynamics of COVID-19

Authors: Shuo Wang, Xian Yang, Ling Li, Philip Nadler, Rossella Arcucci, Yuan Huang, Zhongzhao Teng, Yike Guo

Abstract: Epidemic models play a key role in understanding and responding to the emerging COVID-19 pandemic. Widely used compartmental models are static and are of limited use to evaluate intervention strategies with the emerging pandemic. Applying the technology of data assimilation, we propose a Bayesian updating approach for estimating epidemiological parameters using observable information for the purpo… ▽ More Epidemic models play a key role in understanding and responding to the emerging COVID-19 pandemic. Widely used compartmental models are static and are of limited use to evaluate intervention strategies with the emerging pandemic. Applying the technology of data assimilation, we propose a Bayesian updating approach for estimating epidemiological parameters using observable information for the purpose of assessing the impacts of different intervention strategies. We adopt a concise renewal model and propose new parameters by disentangling the reduction of instantaneous reproduction number Rt into mitigation and suppression factors for quantifying intervention impacts at a finer granularity. Then we developed a data assimilation framework for estimating these parameters including constructing an observation function and develo** a Bayesian updating scheme. A statistical analysis framework is then built to quantify the impact of intervention strategies by monitoring the evolution of these estimated parameters. By Investigating the impacts of intervention measures of European countries, the United States and Wuhan with the framework, we reveal the effects of interventions in these countries and the resurgence risk in the USA. △ Less

Submitted 6 August, 2020; v1 submitted 19 June, 2020; originally announced June 2020.

Comments: This work is submitted to IEEE Computational Intelligence Magzine Special Issues on Computational Intelligence for Combating COVID-19

arXiv:2006.00639 [pdf]

Ontology-based systematic classification and analysis of coronaviruses, hosts, and host-coronavirus interactions towards deep understanding of COVID-19

Authors: Hong Yu, Li Li, Hsin-hui Huang, Yang Wang, Yingtong Liu, Edison Ong, Anthony Huffman, Tao Zeng, **gsong Zhang, Pengpai Li, Zhi** Liu, Xiangyan Zhang, Xianwei Ye, Samuel K. Handelman, Gerry Higgins, Gilbert S. Omenn, Brian Athey, Junguk Hur, Luonan Chen, Yongqun He

Abstract: Given the existing COVID-19 pandemic worldwide, it is critical to systematically study the interactions between hosts and coronaviruses including SARS-Cov, MERS-Cov, and SARS-CoV-2 (cause of COVID-19). We first created four host-pathogen interaction (HPI)-Outcome postulates, and generated a HPI-Outcome model as the basis for understanding host-coronavirus interactions (HCI) and their relations wit… ▽ More Given the existing COVID-19 pandemic worldwide, it is critical to systematically study the interactions between hosts and coronaviruses including SARS-Cov, MERS-Cov, and SARS-CoV-2 (cause of COVID-19). We first created four host-pathogen interaction (HPI)-Outcome postulates, and generated a HPI-Outcome model as the basis for understanding host-coronavirus interactions (HCI) and their relations with the disease outcomes. We hypothesized that ontology can be used as an integrative platform to classify and analyze HCI and disease outcomes. Accordingly, we annotated and categorized different coronaviruses, hosts, and phenotypes using ontologies and identified their relations. Various COVID-19 phenotypes are hypothesized to be caused by the backend HCI mechanisms. To further identify the causal HCI-outcome relations, we collected 35 experimentally-verified HCI protein-protein interactions (PPIs), and applied literature mining to identify additional host PPIs in response to coronavirus infections. The results were formulated in a logical ontology representation for integrative HCI-outcome understanding. Using known PPIs as baits, we also developed and applied a domain-inferred prediction method to predict new PPIs and identified their pathological targets on multiple organs. Overall, our proposed ontology-based integrative framework combined with computational predictions can be used to support fundamental understanding of the intricate interactions between human patients and coronaviruses (including SARS-CoV-2) and their association with various disease outcomes. △ Less

Submitted 31 May, 2020; originally announced June 2020.

Comments: 32 pages, 1 table, 6 figures

arXiv:2005.13112 [pdf]

Organ size increases with obesity and correlates with cancer risk

Authors: Haley Grant Yifan Zhang, Lu Li, Yan Wang, Satomi Kawamoto, Sophie Pénisson, Daniel F. Fouladi, Shahab Shayesteh, Alejandra Blanco, Saeed Ghandili, Eva Zinreich, Jefferson S. Graves, Seyoun Park, Scott Kern, Jody Hooper, Alan L. Yuille, Elliot K Fishman, Linda Chu, Cristian Tomasetti

Abstract: Obesity increases significantly cancer risk in various organs. Although this has been recognized for decades, the mechanism through which this happens has never been explained. Here, we show that the volumes of kidneys, pancreas, and liver are strongly correlated (median correlation = 0.625; P-value<10-47) with the body mass index (BMI) of an individual. We also find a significant relationship bet… ▽ More Obesity increases significantly cancer risk in various organs. Although this has been recognized for decades, the mechanism through which this happens has never been explained. Here, we show that the volumes of kidneys, pancreas, and liver are strongly correlated (median correlation = 0.625; P-value<10-47) with the body mass index (BMI) of an individual. We also find a significant relationship between the increase in organ volume and the increase in cancer risk (P-value<10-12). These results provide a mechanism explaining why obese individuals have higher cancer risk in several organs: the larger the organ volume the more cells at risk of becoming cancerous. These findings are important for a better understanding of the effects obesity has on cancer risk and, more generally, for the development of better preventive strategies to limit the mortality caused by obesity. △ Less

Submitted 26 May, 2020; originally announced May 2020.

arXiv:2005.04224 [pdf]

doi 10.1080/17538947.2020.1809723

Taking the pulse of COVID-19: A spatiotemporal perspective

Authors: Chaowei Yang, Dexuan Sha, Qian Liu, Yun Li, Hai Lan, Weihe Wendy Guan, Tao Hu, Zhenlong Li, Zhiran Zhang, John Hoot Thompson, Zifu Wang, David Wong, Shiyang Ruan, Manzhu Yu, Douglas Richardson, Luyao Zhang, Ruizhi Hou, You Zhou, Cheng Zhong, Yifei Tian, Fayez Beaini, Kyla Carte, Colin Flynn, Wei Liu, Dieter Pfoser , et al. (10 additional authors not shown)

Abstract: The sudden outbreak of the Coronavirus disease (COVID-19) swept across the world in early 2020, triggering the lockdowns of several billion people across many countries, including China, Spain, India, the U.K., Italy, France, Germany, and most states of the U.S. The transmission of the virus accelerated rapidly with the most confirmed cases in the U.S., and New York City became an epicenter of the… ▽ More The sudden outbreak of the Coronavirus disease (COVID-19) swept across the world in early 2020, triggering the lockdowns of several billion people across many countries, including China, Spain, India, the U.K., Italy, France, Germany, and most states of the U.S. The transmission of the virus accelerated rapidly with the most confirmed cases in the U.S., and New York City became an epicenter of the pandemic by the end of March. In response to this national and global emergency, the NSF Spatiotemporal Innovation Center brought together a taskforce of international researchers and assembled implemented strategies to rapidly respond to this crisis, for supporting research, saving lives, and protecting the health of global citizens. This perspective paper presents our collective view on the global health emergency and our effort in collecting, analyzing, and sharing relevant data on global policy and government responses, geospatial indicators of the outbreak and evolving forecasts; in develo** research capabilities and mitigation measures with global scientists, promoting collaborative research on outbreak dynamics, and reflecting on the dynamic responses from human societies. △ Less

Submitted 8 May, 2020; originally announced May 2020.

Comments: 27 pages, 18 figures. International Journal of Digital Earth (2020)

arXiv:2003.06846 [pdf]

Propagation analysis and prediction of the COVID-19

Authors: Lixiang Li, Zihang Yang, Zhongkai Dang, Cui Meng, **gze Huang, Hao Tian Meng, Deyu Wang, Guanhua Chen, Jiaxuan Zhang, Haipeng Peng

Abstract: Based on the official data modeling, this paper studies the transmission process of the Corona Virus Disease 2019 (COVID-19). The error between the model and the official data curve is within 3%. At the same time, it realized forward prediction and backward inference of the epidemic situation, and the relevant analysis help relevant countries to make decisions. Based on the official data modeling, this paper studies the transmission process of the Corona Virus Disease 2019 (COVID-19). The error between the model and the official data curve is within 3%. At the same time, it realized forward prediction and backward inference of the epidemic situation, and the relevant analysis help relevant countries to make decisions. △ Less

Submitted 15 March, 2020; originally announced March 2020.

arXiv:1912.01505 [pdf, ps, other]

An integrated heterogeneous Poisson model for neuron functions in hand movement during reaching and grasp

Authors: Shu-Chuan Chen, Lung-An Li, Ji** He

Abstract: To understand potential encoding mechanism of motor cortical neurons for control commands during reach-to-grasp movements, experiments to record neuronal activities from primary motor cortical regions have been conducted in many research laboratories (for example, (7), (17)). The most popular approach in neuroscience community is to fit the Analysis of Variance (ANOVA) model using the firing rates… ▽ More To understand potential encoding mechanism of motor cortical neurons for control commands during reach-to-grasp movements, experiments to record neuronal activities from primary motor cortical regions have been conducted in many research laboratories (for example, (7), (17)). The most popular approach in neuroscience community is to fit the Analysis of Variance (ANOVA) model using the firing rates of individual neurons. In addition to consider neural firing counts but also temporal intervals, (5) proposed to apply Analysis of Covariance (ANCOVA) model. Due to the nature of the data, in this paper we propose to apply an integrated method, called heterogeneous Poisson regression model, to categorize different neural activities. Three scenarios are discussed to show that the proposed heterogeneous Poisson regression model can overcome some disadvantages of the traditional Poisson regression model. △ Less

Submitted 27 November, 2019; originally announced December 2019.

arXiv:1911.09309 [pdf, other]

Decoding Spiking Mechanism with Dynamic Learning on Neuron Population

Authors: Zhijie Chen, Junchi Yan, Longyuan Li, Xiaokang Yang

Abstract: A main concern in cognitive neuroscience is to decode the overt neural spike train observations and infer latent representations under neural circuits. However, traditional methods entail strong prior on network structure and hardly meet the demand for real spike data. Here we propose a novel neural network approach called Neuron Activation Network that extracts neural information explicitly from… ▽ More A main concern in cognitive neuroscience is to decode the overt neural spike train observations and infer latent representations under neural circuits. However, traditional methods entail strong prior on network structure and hardly meet the demand for real spike data. Here we propose a novel neural network approach called Neuron Activation Network that extracts neural information explicitly from single trial neuron population spike trains. Our proposed method consists of a spatiotemporal learning procedure on sensory environment and a message passing mechanism on population graph, followed by a neuron activation process in a recursive fashion. Our model is aimed to reconstruct neuron information while inferring representations of neuron spiking states. We apply our model to retinal ganglion cells and the experimental results suggest that our model holds a more potent capability in generating neural spike sequences with high fidelity than the state-of-the-art methods, as well as being more expressive and having potential to disclose latent spiking mechanism. The source code will be released with the final paper. △ Less

Submitted 21 November, 2019; originally announced November 2019.

arXiv:1908.08095 [pdf, other]

Paired Test of Matrix Graphs and Brain Connectivity Analysis

Authors: Yuting Ye, Yin Xia, Lexin Li

Abstract: Inferring brain connectivity network and quantifying the significance of interactions between brain regions are of paramount importance in neuroscience. Although there have recently emerged some tests for graph inference based on independent samples, there is no readily available solution to test the change of brain network for paired and correlated samples. In this article, we develop a paired te… ▽ More Inferring brain connectivity network and quantifying the significance of interactions between brain regions are of paramount importance in neuroscience. Although there have recently emerged some tests for graph inference based on independent samples, there is no readily available solution to test the change of brain network for paired and correlated samples. In this article, we develop a paired test of matrix graphs to infer brain connectivity network when the groups of samples are correlated. The proposed test statistic is both bias corrected and variance corrected, and achieves a small estimation error rate. The subsequent multiple testing procedure built on this test statistic is guaranteed to asymptotically control the false discovery rate at the pre-specified level. Both the methodology and theory of the new test are considerably different from the two independent samples framework, owing to the strong correlations of measurements on the same subjects before and after the stimulus activity. We illustrate the efficacy of our proposal through simulations and an analysis of an Alzheimer's Disease Neuroimaing Initiative dataset. △ Less

Submitted 21 August, 2019; originally announced August 2019.

Comments: 30 pages, 1 figure, 5 tables

arXiv:1906.05647 [pdf, other]

Dynamic Prediction of Competing Risk Events using Landmark Sub-distribution Hazard Model with Multiple Longitudinal Biomarker

Authors: Cai Wu, Liang Li, Ruosha Li

Abstract: The cause-specific cumulative incidence function (CIF) quantifies the subject-specific disease risk with competing risk outcome. With longitudinally collected biomarker data, it is of interest to dynamically update the predicted CIF by incorporating the most recent biomarker as well as the cumulating longitudinal history. Motivated by a longitudinal cohort study of chronic kidney disease, we propo… ▽ More The cause-specific cumulative incidence function (CIF) quantifies the subject-specific disease risk with competing risk outcome. With longitudinally collected biomarker data, it is of interest to dynamically update the predicted CIF by incorporating the most recent biomarker as well as the cumulating longitudinal history. Motivated by a longitudinal cohort study of chronic kidney disease, we propose a framework for dynamic prediction of end stage renal disease using multivariate longitudinal biomarkers, accounting for the competing risk of death. The proposed framework extends the landmark survival modeling to competing risks data, and implies that a distinct sub-distribution hazard regression model is defined at each landmark time. The model parameters, prediction horizon, longitudinal history and at-risk population are allowed to vary over the landmark time. When the measurement times of biomarkers are irregularly spaced, the predictor variable may not be observed at the time of prediction. Local polynomial is used to estimate the model parameters without explicitly imputing the predictor or modeling its longitudinal trajectory. The proposed model leads to simple interpretation of the regression coefficients and closed-form calculation of the predicted CIF. The estimation and prediction can be implemented through standard statistical software with tractable computation. We conducted simulations to evaluate the performance of the estimation procedure and predictive accuracy. The methodology is illustrated with data from the African American Study of Kidney Disease and Hypertension. △ Less

Submitted 21 May, 2019; originally announced June 2019.

arXiv:1905.10023 [pdf]

Separation Effect of Early Visual Cortex V1 Under Different Crowding Conditions A TMS Study

Authors: Xieyi Liu, Junjun Zhang, Ling Li

Abstract: The visual crowding makes it difficult to identify the patterns in peripheral vision, but the neural mechanism for this phenomenon is still unclear because of different opinions. In order to study the separation effect of V1 under different crowding conditions, single-pulse transcranial magnetic stimulation is applied within the right V1. The experimental design includes two factors: TMS intensity… ▽ More The visual crowding makes it difficult to identify the patterns in peripheral vision, but the neural mechanism for this phenomenon is still unclear because of different opinions. In order to study the separation effect of V1 under different crowding conditions, single-pulse transcranial magnetic stimulation is applied within the right V1. The experimental design includes two factors: TMS intensity (10%, 65%, and 90% of the phosphene threshold) and crowding (high and low) conditions. The accuracy results show that there is a strong interaction between crowding condition and TMS condition. When the TMS stimulation intensity is lower than the phosphene threshold, more crowding will be perceived under the high crowding condition, and less crowding will be perceived under the low crowding condition. The above results conclude that the high and low crowding condition separate by TMS stimulation. The results support the assumption that the crowding is related to V1 and occurs in the visual coding phase. △ Less

Submitted 24 May, 2019; originally announced May 2019.

arXiv:1903.04512 [pdf]

Individual-Level SNP Diversity and Similarity Profiles

Authors: Zhanshan, Ma, Lianwei Li, Ya-** Zhang

Abstract: Classic concepts of genetic (gene) diversity (heterozygosity) such as Nei (1973: PNAS) and Nei and Li (1979: PNAS) nucleotide diversity were defined within the context of populations. Although variations are often measured in population context, the basic carriers of variation are individuals. Hence, measuring variations such as SNP of individual against a reference genome, which has been ignored… ▽ More Classic concepts of genetic (gene) diversity (heterozygosity) such as Nei (1973: PNAS) and Nei and Li (1979: PNAS) nucleotide diversity were defined within the context of populations. Although variations are often measured in population context, the basic carriers of variation are individuals. Hence, measuring variations such as SNP of individual against a reference genome, which has been ignored currently, is certainly of its own right. Indeed, similar practice has been a tradition in ecology, where the basic framework of diversity measure is individual community sample. We propose to use Renyi-entropy-derived Hill numbers to define SNP (single nucleotide polymorphism) diversity (including alpha-, beta-, and gamma-diversities) and similarity profiles. Hill numbers are derived from Renyi entropy, of which Shannon entropy is a special case and which have found widely applications including measuring the quantum information entanglement, wealth distribution in economics and ecological diversity. The newly proposed SNP diversity not only complements the existing genetic diversity concepts by offering individual-level metrics, but also offers building blocks for comparative genetic analysis at higher levels. The profile concept also helps to resolve a dilemma in measuring diversity: the choice from various diversity indexes, because diversity profile unifies some of the most commonly used indexes (as special cases) with different diversity orders (along the rareness-commonness spectrum of gene mutations). Finally, the profiles can be estimated with rarefaction approach, which may help to relieve some effect of insufficient sequencing coverage. △ Less

Submitted 11 March, 2019; originally announced March 2019.

Showing 1–50 of 68 results for author: Li, L