Search | arXiv e-print repository

Deciphering interventional dynamical causality from non-intervention systems

Authors: Jifan Shi, Yang Li, Juan Zhao, Siyang Leng, Kazuyuki Aihara, Luonan Chen, Wei Lin

Abstract: Detecting and quantifying causality is a focal topic in the fields of science, engineering, and interdisciplinary studies. However, causal studies on non-intervention systems attract much attention but remain extremely challenging. To address this challenge, we propose a framework named Interventional Dynamical Causality (IntDC) for such non-intervention systems, along with its computational crite… ▽ More Detecting and quantifying causality is a focal topic in the fields of science, engineering, and interdisciplinary studies. However, causal studies on non-intervention systems attract much attention but remain extremely challenging. To address this challenge, we propose a framework named Interventional Dynamical Causality (IntDC) for such non-intervention systems, along with its computational criterion, Interventional Embedding Entropy (IEE), to quantify causality. The IEE criterion theoretically and numerically enables the deciphering of IntDC solely from observational (non-interventional) time-series data, without requiring any knowledge of dynamical models or real interventions in the considered system. Demonstrations of performance showed the accuracy and robustness of IEE on benchmark simulated systems as well as real-world systems, including the neural connectomes of C. elegans, COVID-19 transmission networks in Japan, and regulatory networks surrounding key circadian genes. △ Less

Submitted 28 June, 2024; originally announced July 2024.

arXiv:2405.00753 [pdf, other]

HMAMP: Hypervolume-Driven Multi-Objective Antimicrobial Peptides Design

Authors: Li Wang, Yi** Li, Xiangzheng Fu, Xiucai Ye, Junfeng Shi, Gary G. Yen, Xiangxiang Zeng

Abstract: Antimicrobial peptides (AMPs) have exhibited unprecedented potential as biomaterials in combating multidrug-resistant bacteria. Despite the increasing adoption of artificial intelligence for novel AMP design, challenges pertaining to conflicting attributes such as activity, hemolysis, and toxicity have significantly impeded the progress of researchers. This paper introduces a paradigm shift by con… ▽ More Antimicrobial peptides (AMPs) have exhibited unprecedented potential as biomaterials in combating multidrug-resistant bacteria. Despite the increasing adoption of artificial intelligence for novel AMP design, challenges pertaining to conflicting attributes such as activity, hemolysis, and toxicity have significantly impeded the progress of researchers. This paper introduces a paradigm shift by considering multiple attributes in AMP design. Presented herein is a novel approach termed Hypervolume-driven Multi-objective Antimicrobial Peptide Design (HMAMP), which prioritizes the simultaneous optimization of multiple attributes of AMPs. By synergizing reinforcement learning and a gradient descent algorithm rooted in the hypervolume maximization concept, HMAMP effectively expands exploration space and mitigates the issue of pattern collapse. This method generates a wide array of prospective AMP candidates that strike a balance among diverse attributes. Furthermore, we pinpoint knee points along the Pareto front of these candidate AMPs. Empirical results across five benchmark models substantiate that HMAMP-designed AMPs exhibit competitive performance and heightened diversity. A detailed analysis of the helical structures and molecular dynamics simulations for ten potential candidate AMPs validates the superiority of HMAMP in the realm of multi-objective AMP design. The ability of HMAMP to systematically craft AMPs considering multiple attributes marks a pioneering milestone, establishing a universal computational framework for the multi-objective design of AMPs. △ Less

Submitted 1 May, 2024; originally announced May 2024.

arXiv:2306.05257 [pdf, other]

doi 10.1093/bib/bbad235

Comprehensive evaluation of deep and graph learning on drug-drug interactions prediction

Authors: Xuan Lin, Lichang Dai, Yafang Zhou, Zu-Guo Yu, Wen Zhang, Jian-Yu Shi, Dong-Sheng Cao, Li Zeng, Haowen Chen, Bosheng Song, Philip S. Yu, Xiangxiang Zeng

Abstract: Recent advances and achievements of artificial intelligence (AI) as well as deep and graph learning models have established their usefulness in biomedical applications, especially in drug-drug interactions (DDIs). DDIs refer to a change in the effect of one drug to the presence of another drug in the human body, which plays an essential role in drug discovery and clinical research. DDIs prediction… ▽ More Recent advances and achievements of artificial intelligence (AI) as well as deep and graph learning models have established their usefulness in biomedical applications, especially in drug-drug interactions (DDIs). DDIs refer to a change in the effect of one drug to the presence of another drug in the human body, which plays an essential role in drug discovery and clinical research. DDIs prediction through traditional clinical trials and experiments is an expensive and time-consuming process. To correctly apply the advanced AI and deep learning, the developer and user meet various challenges such as the availability and encoding of data resources, and the design of computational methods. This review summarizes chemical structure based, network based, NLP based and hybrid methods, providing an updated and accessible guide to the broad researchers and development community with different domain knowledge. We introduce widely-used molecular representation and describe the theoretical frameworks of graph neural network models for representing molecular structures. We present the advantages and disadvantages of deep and graph learning methods by performing comparative experiments. We discuss the potential technical challenges and highlight future directions of deep and graph learning models for accelerating DDIs prediction. △ Less

Submitted 8 June, 2023; originally announced June 2023.

Comments: Accepted by Briefings in Bioinformatics

arXiv:2304.14300 [pdf, other]

Learning Absorption Rates in Glucose-Insulin Dynamics from Meal Covariates

Authors: Ke Alexander Wang, Matthew E. Levine, Jiaxin Shi, Emily B. Fox

Abstract: Traditional models of glucose-insulin dynamics rely on heuristic parameterizations chosen to fit observations within a laboratory setting. However, these models cannot describe glucose dynamics in daily life. One source of failure is in their descriptions of glucose absorption rates after meal events. A meal's macronutritional content has nuanced effects on the absorption profile, which is difficu… ▽ More Traditional models of glucose-insulin dynamics rely on heuristic parameterizations chosen to fit observations within a laboratory setting. However, these models cannot describe glucose dynamics in daily life. One source of failure is in their descriptions of glucose absorption rates after meal events. A meal's macronutritional content has nuanced effects on the absorption profile, which is difficult to model mechanistically. In this paper, we propose to learn the effects of macronutrition content from glucose-insulin data and meal covariates. Given macronutrition information and meal times, we use a neural network to predict an individual's glucose absorption rate. We use this neural rate function as the control function in a differential equation of glucose dynamics, enabling end-to-end training. On simulated data, our approach is able to closely approximate true absorption rates, resulting in better forecast than heuristic parameterizations, despite only observing glucose, insulin, and macronutritional information. Our work readily generalizes to meal events with higher-dimensional covariates, such as images, setting the stage for glucose dynamics models that are personalized to each individual's daily life. △ Less

Submitted 27 April, 2023; originally announced April 2023.

Comments: Work presented at NeurIPS 2022 Workshop on Learning from Time Series for Health (TS4H). arXiv admin note: substantial text overlap with arXiv:2302.11939

arXiv:2211.00750 [pdf]

Automatic Quantitative Analysis of Brain Organoids via Deep Learning

Authors: **gli Shi

Abstract: Recent advances in brain organoid technology are exciting new ways, which have the potential to change the way how doctors and researchers understand and treat cerebral diseases. Despite the remarkable use of brain organoids derived from human stem cells in new drug testing, disease modeling, and scientific research, it is still heavily time-consuming work to observe and analyze the internal struc… ▽ More Recent advances in brain organoid technology are exciting new ways, which have the potential to change the way how doctors and researchers understand and treat cerebral diseases. Despite the remarkable use of brain organoids derived from human stem cells in new drug testing, disease modeling, and scientific research, it is still heavily time-consuming work to observe and analyze the internal structure, cells, and neural inside the organoid by humans, specifically no standard quantitative analysis method combined growing AI technology for brain organoid. In this paper, an automated computer-assisted analysis method is proposed for brain organoid slice channels tagged with different fluorescent. We applied the method on two channels of two group microscopy images and the experiment result shows an obvious difference between Wild Type and Mutant Type cerebral organoids. △ Less

Submitted 1 November, 2022; originally announced November 2022.

arXiv:2011.08308 [pdf, other]

Impacts of density-dependent predation, cannibalism and fishing in a stage-structured population model of the blue crab in Chesapeake Bay

Authors: Fangming Xu, Leah B. Shaw, Jun** Shi, Romuald N. Lipcius

Abstract: The blue crab (Callinectes sapidus) is a dominant ecological species of high commercial value. Spawning stock and recruitment of the Chesapeake Bay population declined by 80% in the 1990s. After severe management actions were implemented in 2008, female abundance rebounded to pre-1994 levels and stabilized. The stepwise decline in the early 1990s, followed by a consistently low level of abundance… ▽ More The blue crab (Callinectes sapidus) is a dominant ecological species of high commercial value. Spawning stock and recruitment of the Chesapeake Bay population declined by 80% in the 1990s. After severe management actions were implemented in 2008, female abundance rebounded to pre-1994 levels and stabilized. The stepwise decline in the early 1990s, followed by a consistently low level of abundance for 15 y and a jump to high abundance after 2008, suggested the existence of alternative stable states. Alternatively, high fishing pressure combined with low recruitment in 1992 could have triggered a proportional decline in the population, followed by a population increase in 2008 due to rigorous management actions that reduced fishing. We evaluated these alternatives with a stage-structured dynamic population model using ordinary differential equations. In addition, stock assessment models assume that fishing and mortality are independent of density. Hence, we also investigated the role of density-dependent predation, cannibalism and fishing in blue crab population dynamics. We conclude that for the blue crab population in Chesapeake Bay: (1) bistable positive states are not likely with biologically realistic parameter values; (2) hyperbolic (depensatory) fishing will not produce extinction at the range of population densities observed in the bay; and (3) crabs can survive a higher fishing rate under the more realistic assumption of sigmoidal (density-dependent) predation and cannibalism than under constant (density-independent) predation and cannibalism. These collectively indicate that the blue crab population in Chesapeake Bay is resilient to a range of biotic and abiotic disturbances. △ Less

Submitted 16 November, 2020; originally announced November 2020.

arXiv:2008.05332 [pdf, other]

Renal Cell Carcinoma Detection and Subty** with Minimal Point-Based Annotation in Whole-Slide Images

Authors: Zeyu Gao, Pargorn Puttapirat, Jiangbo Shi, Chen Li

Abstract: Obtaining a large amount of labeled data in medical imaging is laborious and time-consuming, especially for histopathology. However, it is much easier and cheaper to get unlabeled data from whole-slide images (WSIs). Semi-supervised learning (SSL) is an effective way to utilize unlabeled data and alleviate the need for labeled data. For this reason, we proposed a framework that employs an SSL meth… ▽ More Obtaining a large amount of labeled data in medical imaging is laborious and time-consuming, especially for histopathology. However, it is much easier and cheaper to get unlabeled data from whole-slide images (WSIs). Semi-supervised learning (SSL) is an effective way to utilize unlabeled data and alleviate the need for labeled data. For this reason, we proposed a framework that employs an SSL method to accurately detect cancerous regions with a novel annotation method called Minimal Point-Based annotation, and then utilize the predicted results with an innovative hybrid loss to train a classification model for subty**. The annotator only needs to mark a few points and label them are cancer or not in each WSI. Experiments on three significant subtypes of renal cell carcinoma (RCC) proved that the performance of the classifier trained with the Min-Point annotated dataset is comparable to a classifier trained with the segmentation annotated dataset for cancer region detection. And the subty** model outperforms a model trained with only diagnostic labels by 12% in terms of f1-score for testing WSIs. △ Less

Submitted 12 August, 2020; originally announced August 2020.

Comments: 10 pages, 5 figure, 3 tables, accepted at MICCAI 2020

arXiv:2005.12792 [pdf]

doi 10.1002/wcms.1481

The prospects of quantum computing in computational molecular biology

Authors: Carlos Outeiral, Martin Strahm, Jiye Shi, Garrett M. Morris, Simon C. Benjamin, Charlotte M. Deane

Abstract: Quantum computers can in principle solve certain problems exponentially more quickly than their classical counterparts. We have not yet reached the advent of useful quantum computation, but when we do, it will affect nearly all scientific disciplines. In this review, we examine how current quantum algorithms could revolutionize computational biology and bioinformatics. There are potential benefits… ▽ More Quantum computers can in principle solve certain problems exponentially more quickly than their classical counterparts. We have not yet reached the advent of useful quantum computation, but when we do, it will affect nearly all scientific disciplines. In this review, we examine how current quantum algorithms could revolutionize computational biology and bioinformatics. There are potential benefits across the entire field, from the ability to process vast amounts of information and run machine learning algorithms far more efficiently, to algorithms for quantum simulation that are poised to improve computational calculations in drug discovery, to quantum algorithms for optimization that may advance fields from protein structure prediction to network analysis. However, these exciting prospects are susceptible to "hype", and it is also important to recognize the caveats and challenges in this new technology. Our aim is to introduce the promise and limitations of emerging quantum computing technologies in the areas of computational molecular biology and bioinformatics. △ Less

Submitted 26 May, 2020; originally announced May 2020.

Comments: 23 pages, 3 figures

Journal ref: WIREs Computational Molecular Science, 2020

arXiv:2004.02731 [pdf]

doi 10.1109/RBME.2020.2987975

Review of Artificial Intelligence Techniques in Imaging Data Acquisition, Segmentation and Diagnosis for COVID-19

Authors: Feng Shi, Jun Wang, Jun Shi, Ziyan Wu, Qian Wang, Zhenyu Tang, Kelei He, Yinghuan Shi, Dinggang Shen

Abstract: (This paper was submitted as an invited paper to IEEE Reviews in Biomedical Engineering on April 6, 2020.) The pandemic of coronavirus disease 2019 (COVID-19) is spreading all over the world. Medical imaging such as X-ray and computed tomography (CT) plays an essential role in the global fight against COVID-19, whereas the recently emerging artificial intelligence (AI) technologies further strengt… ▽ More (This paper was submitted as an invited paper to IEEE Reviews in Biomedical Engineering on April 6, 2020.) The pandemic of coronavirus disease 2019 (COVID-19) is spreading all over the world. Medical imaging such as X-ray and computed tomography (CT) plays an essential role in the global fight against COVID-19, whereas the recently emerging artificial intelligence (AI) technologies further strengthen the power of the imaging tools and help medical specialists. We hereby review the rapid responses in the community of medical imaging (empowered by AI) toward COVID-19. For example, AI-empowered image acquisition can significantly help automate the scanning procedure and also reshape the workflow with minimal contact to patients, providing the best protection to the imaging technicians. Also, AI can improve work efficiency by accurate delination of infections in X-ray and CT images, facilitating subsequent quantification. Moreover, the computer-aided platforms help radiologists make clinical decisions, i.e., for disease diagnosis, tracking, and prognosis. In this review paper, we thus cover the entire pipeline of medical imaging and analysis techniques involved with COVID-19, including image acquisition, segmentation, diagnosis, and follow-up. We particularly focus on the integration of AI with X-ray and CT, both of which are widely used in the frontline hospitals, in order to depict the latest progress of medical imaging and radiology fighting against COVID-19. △ Less

Submitted 7 April, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

Comments: Added journal submission info

Journal ref: IEEE Reviews in Biomedical Engineering (2020)

arXiv:2004.01118 [pdf, other]

doi 10.1088/1367-2630/ac29ff

Investigating the potential for a limited quantum speedup on protein lattice problems

Authors: Carlos Outeiral, Garrett M. Morris, Jiye Shi, Martin Strahm, Simon C. Benjamin, Charlotte M. Deane

Abstract: Protein folding is a central challenge in computational biology, with important applications in molecular biology, drug discovery and catalyst design. As a hard combinatorial optimisation problem, it has been studied as a potential target problem for quantum annealing. Although several experimental implementations have been discussed in the literature, the computational scaling of these approaches… ▽ More Protein folding is a central challenge in computational biology, with important applications in molecular biology, drug discovery and catalyst design. As a hard combinatorial optimisation problem, it has been studied as a potential target problem for quantum annealing. Although several experimental implementations have been discussed in the literature, the computational scaling of these approaches has not been elucidated. In this article, we present a numerical study of quantum annealing applied to a large number of small peptide folding problems, aiming to infer useful insights for near-term applications. We present two conclusions: that even naive quantum annealing, when applied to protein lattice folding, has the potential to outperform classical approaches, and that careful engineering of the Hamiltonians and schedules involved can deliver notable relative improvements for this problem. Overall, our results suggest that quantum algorithms may well offer improvements for problems in the protein folding and structure prediction realm. △ Less

Submitted 18 May, 2021; v1 submitted 2 April, 2020; originally announced April 2020.

Comments: 45 pages, 18 figures

arXiv:2002.02803 [pdf, other]

Model of Algal Growth Depending on Nutrients and Inorganic Carbon in a Poorly Mixed Water Column

Authors: Jimin Zhang, Jun** Shi, Xiaoyuan Chang

Abstract: In this paper, we establish a reaction-diffusion-advection partial differential equation model to describe the growth of algae depending on both nutrients and inorganic carbon in a poorly mixed water column. Nutrients from the water bottom and inorganic carbon from the water surface form an asymmetric resource supply mechanism on the algal growth. The existence and stability of semi-trivial steady… ▽ More In this paper, we establish a reaction-diffusion-advection partial differential equation model to describe the growth of algae depending on both nutrients and inorganic carbon in a poorly mixed water column. Nutrients from the water bottom and inorganic carbon from the water surface form an asymmetric resource supply mechanism on the algal growth. The existence and stability of semi-trivial steady state and coexistence steady state of the model are proved, and a threshold condition for the regime shift from extinction to survival of algae is established. The influence of environmental parameters on the vertical distribution of algae is investigated in the water column. It is shown that the vertical distribution of algae can exhibit many different profiles under the joint limitation of nutrients and inorganic carbon. △ Less

Submitted 31 January, 2020; originally announced February 2020.

Comments: 27 pages, 7 figures

arXiv:2001.05158 [pdf]

OpenHI2 -- Open source histopathological image platform

Authors: Pargorn Puttapirat, Haichuan Zhang, **gyi Deng, Yuxin Dong, Jiangbo Shi, Hongyu He, Zeyu Gao, Chunbao Wang, Xiangrong Zhang, Chen Li

Abstract: Transition from conventional to digital pathology requires a new category of biomedical informatic infrastructure which could facilitate delicate pathological routine. Pathological diagnoses are sensitive to many external factors and is known to be subjective. Only systems that can meet strict requirements in pathology would be able to run along pathological routines and eventually digitized the s… ▽ More Transition from conventional to digital pathology requires a new category of biomedical informatic infrastructure which could facilitate delicate pathological routine. Pathological diagnoses are sensitive to many external factors and is known to be subjective. Only systems that can meet strict requirements in pathology would be able to run along pathological routines and eventually digitized the study area, and the developed platform should comply with existing pathological routines and international standards. Currently, there are a number of available software tools which can perform histopathological tasks including virtual slide viewing, annotating, and basic image analysis, however, none of them can serve as a digital platform for pathology. Here we describe OpenHI2, an enhanced version Open Histopathological Image platform which is capable of supporting all basic pathological tasks and file formats; ready to be deployed in medical institutions on a standard server environment or cloud computing infrastructure. In this paper, we also describe the development decisions for the platform and propose solutions to overcome technical challenges so that OpenHI2 could be used as a platform for histopathological images. Further addition can be made to the platform since each component is modularized and fully documented. OpenHI2 is free, open-source, and available at https://gitlab.com/BioAI/OpenHI. △ Less

Submitted 15 January, 2020; originally announced January 2020.

Comments: Preprint version accepted to AIPath2019 workshop at BIBM2019. 6 pages, 3 figures, 2 tables

arXiv:1911.07986 [pdf, other]

Comparison Against Task Driven Artificial Neural Networks Reveals Functional Organization of Mouse Visual Cortex

Authors: Jianghong Shi, Eric Shea-Brown, Michael A. Buice

Abstract: Partially inspired by features of computation in visual cortex, deep neural networks compute hierarchical representations of their inputs. While these networks have been highly successful in machine learning, it remains unclear to what extent they can aid our understanding of cortical function. Several groups have developed metrics that provide a quantitative comparison between representations com… ▽ More Partially inspired by features of computation in visual cortex, deep neural networks compute hierarchical representations of their inputs. While these networks have been highly successful in machine learning, it remains unclear to what extent they can aid our understanding of cortical function. Several groups have developed metrics that provide a quantitative comparison between representations computed by networks and representations measured in cortex. At the same time, neuroscience is well into an unprecedented phase of large-scale data collection, as evidenced by projects such as the Allen Brain Observatory. Despite the magnitude of these efforts, in a given experiment only a fraction of units are recorded, limiting the information available about the cortical representation. Moreover, only a finite number of stimuli can be shown to an animal over the course of a realistic experiment. These limitations raise the question of how and whether metrics that compare representations of deep networks are meaningful on these datasets. Here, we empirically quantify the capabilities and limitations of these metrics due to limited image presentations and neuron samples. We find that the comparison procedure is robust to different choices of stimuli set and the level of subsampling that one might expect in a large-scale brain survey with thousands of neurons. Using these results, we compare the representations measured in the Allen Brain Observatory in response to natural image presentations to deep neural network. We show that the visual cortical areas are relatively high order representations (in that they map to deeper layers of convolutional neural networks). Furthermore, we see evidence of a broad, more parallel organization rather than a sequential hierarchy, with the primary area VISp(V1) being lower order relative to the other areas. △ Less

Submitted 18 November, 2019; originally announced November 2019.

Comments: Neural Information Processing Systems (NeurIPS), 2019

arXiv:1907.01983 [pdf, other]

Model of pattern formation in marsh ecosystems with nonlocal interactions

Authors: Sofya Zaytseva, Jun** Shi, Leah B Shaw

Abstract: Smooth cordgrass Spartina alterniflora is a grass species commonly found in tidal marshes. It is an ecosystem engineer, capable of modifying the structure of its surrounding environment through various feedbacks. The scale-dependent feedback between marsh grass and sediment volume is particularly of interest. Locally, the marsh vegetation attenuates hydrodynamic energy, enhancing sediment accretio… ▽ More Smooth cordgrass Spartina alterniflora is a grass species commonly found in tidal marshes. It is an ecosystem engineer, capable of modifying the structure of its surrounding environment through various feedbacks. The scale-dependent feedback between marsh grass and sediment volume is particularly of interest. Locally, the marsh vegetation attenuates hydrodynamic energy, enhancing sediment accretion and promoting further vegetation growth. In turn, the diverted water flow promotes the formation of erosion troughs over longer distances. This scale-dependent feedback may explain the characteristic spatially varying marsh shoreline, commonly observed in nature. We propose a mathematical framework to model grass-sediment dynamics as a system of reaction-diffusion equations with an additional nonlocal term quantifying the short-range positive and long-range negative grass-sediment interactions. We use a Mexican-hat kernel function to model this scale-dependent feedback. We perform a steady state biharmonic approximation of our system and derive conditions for the emergence of spatial patterns, corresponding to a spatially varying marsh shoreline. We find that the emergence of such patterns depends on the spatial scale and strength of the scale-dependent feedback, specified by the width and amplitude of the Mexican-hat kernel function. △ Less

Submitted 3 July, 2019; originally announced July 2019.

arXiv:1901.00785 [pdf, other]

A^2-Net: Molecular Structure Estimation from Cryo-EM Density Volumes

Authors: Kui Xu, Zhe Wang, Jiang** Shi, Hongsheng Li, Qiangfeng Cliff Zhang

Abstract: Constructing of molecular structural models from Cryo-Electron Microscopy (Cryo-EM) density volumes is the critical last step of structure determination by Cryo-EM technologies. Methods have evolved from manual construction by structural biologists to perform 6D translation-rotation searching, which is extremely compute-intensive. In this paper, we propose a learning-based method and formulate thi… ▽ More Constructing of molecular structural models from Cryo-Electron Microscopy (Cryo-EM) density volumes is the critical last step of structure determination by Cryo-EM technologies. Methods have evolved from manual construction by structural biologists to perform 6D translation-rotation searching, which is extremely compute-intensive. In this paper, we propose a learning-based method and formulate this problem as a vision-inspired 3D detection and pose estimation task. We develop a deep learning framework for amino acid determination in a 3D Cryo-EM density volume. We also design a sequence-guided Monte Carlo Tree Search (MCTS) to thread over the candidate amino acids to form the molecular structure. This framework achieves 91% coverage on our newly proposed dataset and takes only a few minutes for a typical structure with a thousand amino acids. Our method is hundreds of times faster and several times more accurate than existing automated solutions without any human intervention. △ Less

Submitted 12 February, 2019; v1 submitted 3 January, 2019; originally announced January 2019.

Comments: 8 pages, 5 figures, 4 tables

Journal ref: published on AAAI2019

arXiv:1803.03146 [pdf]

SentRNA: Improving computational RNA design by incorporating a prior of human design strategies

Authors: Jade Shi, Rhiju Das, Vijay S. Pande

Abstract: Solving the RNA inverse folding problem is a critical prerequisite to RNA design, an emerging field in bioengineering with a broad range of applications from reaction catalysis to cancer therapy. Although significant progress has been made in develo** machine-based inverse RNA folding algorithms, current approaches still have difficulty designing sequences for large or complex targets. On the ot… ▽ More Solving the RNA inverse folding problem is a critical prerequisite to RNA design, an emerging field in bioengineering with a broad range of applications from reaction catalysis to cancer therapy. Although significant progress has been made in develo** machine-based inverse RNA folding algorithms, current approaches still have difficulty designing sequences for large or complex targets. On the other hand, human players of the online RNA design game EteRNA have consistently shown superior performance in this regard, being able to readily design sequences for targets that are challenging for machine algorithms. Here we present a novel approach to the RNA design problem, SentRNA, a design agent consisting of a fully-connected neural network trained end-to-end using human-designed RNA sequences. We show that through this approach, SentRNA can solve complex targets previously unsolvable by any machine-based approach and achieve state-of-the-art performance on two separate challenging test sets. Our results demonstrate that incorporating human design strategies into a design algorithm can significantly boost machine performance and suggests a new paradigm for machine-based RNA design. △ Less

Submitted 5 March, 2019; v1 submitted 8 March, 2018; originally announced March 2018.

Comments: 27 pages (not including Supplementary Information), 9 figures, 7 tables

arXiv:1706.01643 [pdf]

Retrosynthetic reaction prediction using neural sequence-to-sequence models

Authors: Bowen Liu, Bharath Ramsundar, Prasad Kawthekar, Jade Shi, Joseph Gomes, Quang Luu Nguyen, Stephen Ho, Jack Sloane, Paul Wender, Vijay Pande

Abstract: We describe a fully data driven model that learns to perform a retrosynthetic reaction prediction task, which is treated as a sequence-to-sequence map** problem. The end-to-end trained model has an encoder-decoder architecture that consists of two recurrent neural networks, which has previously shown great success in solving other sequence-to-sequence prediction tasks such as machine translation… ▽ More We describe a fully data driven model that learns to perform a retrosynthetic reaction prediction task, which is treated as a sequence-to-sequence map** problem. The end-to-end trained model has an encoder-decoder architecture that consists of two recurrent neural networks, which has previously shown great success in solving other sequence-to-sequence prediction tasks such as machine translation. The model is trained on 50,000 experimental reaction examples from the United States patent literature, which span 10 broad reaction types that are commonly used by medicinal chemists. We find that our model performs comparably with a rule-based expert system baseline model, and also overcomes certain limitations associated with rule-based expert systems and with any machine learning approach that contains a rule-based expert system component. Our model provides an important first step towards solving the challenging problem of computational retrosynthetic analysis. △ Less

Submitted 6 June, 2017; originally announced June 2017.

arXiv:1608.03425 [pdf]

doi 10.1093/cercor/bhx268

Neural Encoding and Decoding with Deep Learning for Dynamic Natural Vision

Authors: Haiguang Wen, Junxing Shi, Yizhen Zhang, Kun-Han Lu, Jiayue Cao, Zhongming Liu

Abstract: Convolutional neural network (CNN) driven by image recognition has been shown to be able to explain cortical responses to static pictures at ventral-stream areas. Here, we further showed that such CNN could reliably predict and decode functional magnetic resonance imaging data from humans watching natural movies, despite its lack of any mechanism to account for temporal dynamics or feedback proces… ▽ More Convolutional neural network (CNN) driven by image recognition has been shown to be able to explain cortical responses to static pictures at ventral-stream areas. Here, we further showed that such CNN could reliably predict and decode functional magnetic resonance imaging data from humans watching natural movies, despite its lack of any mechanism to account for temporal dynamics or feedback processing. Using separate data, encoding and decoding models were developed and evaluated for describing the bi-directional relationships be-tween the CNN and the brain. Through the encoding models, the CNN-predicted areas covered not only the ventral stream, but also the dorsal stream, albe-it to a lesser degree; single-voxel response was visualized as the specific pixel pattern that drove the response, revealing the distinct representation of individual cortical location; cortical activation was synthesized from natural images with high-throughput to map category representation, con-trast, and selectivity. Through the decoding models, fMRI signals were directly decoded to estimate the feature representations in both visual and semantic spaces, for direct visual reconstruction and seman-tic categorization, respectively. These results cor-roborate, generalize, and extend previous findings, and highlight the value of using deep learning, as an all-in-one model of the visual cortex, to understand and decode natural vision. △ Less

Submitted 14 November, 2017; v1 submitted 11 August, 2016; originally announced August 2016.

Comments: 27 pages, 10 figures, 1 table

Journal ref: Cerebral Cortex. 2017 pp.1-25

arXiv:1411.4179 [pdf]

doi 10.1103/PhysRevApplied.2.064003

Neuromimetic Circuits with Synaptic Devices based on Strongly Correlated Electron Systems

Authors: Sieu D. Ha, Jian Shi, Yasmine Meroz, L. Mahadevan, Shriram Ramanathan

Abstract: Strongly correlated electron systems such as the rare-earth nickelates (RNiO3, R = rare-earth element) can exhibit synapse-like continuous long term potentiation and depression when gated with ionic liquids; exploiting the extreme sensitivity of coupled charge, spin, orbital, and lattice degrees of freedom to stoichiometry. We present experimental real-time, device-level classical conditioning and… ▽ More Strongly correlated electron systems such as the rare-earth nickelates (RNiO3, R = rare-earth element) can exhibit synapse-like continuous long term potentiation and depression when gated with ionic liquids; exploiting the extreme sensitivity of coupled charge, spin, orbital, and lattice degrees of freedom to stoichiometry. We present experimental real-time, device-level classical conditioning and unlearning using nickelate-based synaptic devices in an electronic circuit compatible with both excitatory and inhibitory neurons. We establish a physical model for the device behavior based on electric-field driven coupled ionic-electronic diffusion that can be utilized for design of more complex systems. We use the model to simulate a variety of associate and non-associative learning mechanisms, as well as a feedforward recurrent network for storing memory. Our circuit intuitively parallels biological neural architectures, and it can be readily generalized to other forms of cellular learning and extinction. The simulation of neural function with electronic device analogues may provide insight into biological processes such as decision making, learning and adaptation, while facilitating advanced parallel information processing in hardware. △ Less

Submitted 15 November, 2014; originally announced November 2014.

Journal ref: Phys. Rev. Applied 2, 064003 (2014)

arXiv:1410.2466 [pdf, other]

Quantification and visualization of variation in anatomical trees

Authors: Nina Amenta, Manasi Datar, Asger Dirksen, Marleen de Bruijne, Aasa Feragen, Xiaoyin Ge, Jesper Holst Pedersen, Marylesa Howard, Megan Owen, Jens Petersen, Jie Shi, Qiu** Xu

Abstract: This paper presents two approaches to quantifying and visualizing variation in datasets of trees. The first approach localizes subtrees in which significant population differences are found through hypothesis testing and sparse classifiers on subtree features. The second approach visualizes the global metric structure of datasets through low-distortion embedding into hyperbolic planes in the style… ▽ More This paper presents two approaches to quantifying and visualizing variation in datasets of trees. The first approach localizes subtrees in which significant population differences are found through hypothesis testing and sparse classifiers on subtree features. The second approach visualizes the global metric structure of datasets through low-distortion embedding into hyperbolic planes in the style of multidimensional scaling. A case study is made on a dataset of airway trees in relation to Chronic Obstructive Pulmonary Disease. △ Less

Submitted 9 October, 2014; originally announced October 2014.

Comments: 22 pages

MSC Class: 62H25; 62H35

arXiv:1407.8398 [pdf, ps, other]

doi 10.1214/14-AOAS715

Leveraging local identity-by-descent increases the power of case/control GWAS with related individuals

Authors: Joshua N. Sampson, Bill Wheeler, Peng Li, Jianxin Shi

Abstract: Large case/control Genome-Wide Association Studies (GWAS) often include groups of related individuals with known relationships. When testing for associations at a given locus, current methods incorporate only the familial relationships between individuals. Here, we introduce the chromosome-based Quasi Likelihood Score (cQLS) statistic that incorporates local Identity-By-Descent (IBD) to increase t… ▽ More Large case/control Genome-Wide Association Studies (GWAS) often include groups of related individuals with known relationships. When testing for associations at a given locus, current methods incorporate only the familial relationships between individuals. Here, we introduce the chromosome-based Quasi Likelihood Score (cQLS) statistic that incorporates local Identity-By-Descent (IBD) to increase the power to detect associations. In studies robust to population stratification, such as those with case/control sibling pairs, simulations show that the study power can be increased by over 50%. In our example, a GWAS examining late-onset Alzheimer's disease, the $p$-values among the most strongly associated SNPs in the APOE gene tend to decrease, with the smallest $p$-value decreasing from $1.23\times10^{-8}$ to $7.70\times 10^{-9}$. Furthermore, as a part of our simulations, we reevaluate our expectations about the use of families in GWAS. We show that, although adding only half as many unique chromosomes, genoty** affected siblings is more efficient than genoty** randomly ascertained cases. We also show that genoty** cases with a family history of disease will be less beneficial when searching for SNPs with smaller effect sizes. △ Less

Submitted 31 July, 2014; originally announced July 2014.

Comments: Published in at http://dx.doi.org/10.1214/14-AOAS715 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-AOAS-AOAS715

Journal ref: Annals of Applied Statistics 2014, Vol. 8, No. 2, 974-998

arXiv:1102.2666 [pdf, ps, other]

Bistability in a differential equation model of oyster reef height and sediment accumulation

Authors: William C. Jordan-Cooley, Romuald N. Lipcius, Leah B. Shaw, Jian Shen, Jun** Shi

Abstract: Native oyster populations in Chesapeake Bay have been the focus of three decades of restoration attempts, which have generally failed to rebuild the populations and oyster reef structure. Recent restoration successes and field experiments suggest that high-relief reefs offset heavy sedimentation and promote oyster survival, disease resistance and growth, in contrast to low-relief reefs which degra… ▽ More Native oyster populations in Chesapeake Bay have been the focus of three decades of restoration attempts, which have generally failed to rebuild the populations and oyster reef structure. Recent restoration successes and field experiments suggest that high-relief reefs offset heavy sedimentation and promote oyster survival, disease resistance and growth, in contrast to low-relief reefs which degrade in just a few years. These findings suggest the existence of alternative stable states in oyster reef populations. We developed a mathematical model consisting of three differential equations that represent volumes of live oysters, dead oyster shells (= accreting reef), and sediment. Bifurcation analysis and numerical simulations demonstrated that multiple nonnegative equilibria can exist for live oyster, accreting reef and sediment volume at an ecologically reasonable range of parameter values; the initial height of oyster reefs determined which equilibrium was reached. This investigation thus provides a conceptual framework for alternative stable states in native oyster populations, and can be used as a tool to improve the likelihood of success in restoration efforts. △ Less

Submitted 13 February, 2011; originally announced February 2011.

Showing 1–22 of 22 results for author: Shi, J