-
Latent Chemical Space Searching for Plug-in Multi-objective Molecule Generation
Authors:
Ningfeng Liu,
Jie Yu,
Siyu Xiu,
Xinfang Zhao,
Siyu Lin,
Bo Qiang,
Ruqiu Zheng,
Hongwei **,
Liangren Zhang,
Zhenming Liu
Abstract:
Molecular generation, an essential method for identifying new drug structures, has been supported by advancements in machine learning and computational technology. However, challenges remain in multi-objective generation, model adaptability, and practical application in drug discovery. In this study, we developed a versatile 'plug-in' molecular generation model that incorporates multiple objective…
▽ More
Molecular generation, an essential method for identifying new drug structures, has been supported by advancements in machine learning and computational technology. However, challenges remain in multi-objective generation, model adaptability, and practical application in drug discovery. In this study, we developed a versatile 'plug-in' molecular generation model that incorporates multiple objectives related to target affinity, drug-likeness, and synthesizability, facilitating its application in various drug development contexts. We improved the Particle Swarm Optimization (PSO) in the context of drug discoveries, and identified PSO-ENP as the optimal variant for multi-objective molecular generation and optimization through comparative experiments. The model also incorporates a novel target-ligand affinity predictor, enhancing the model's utility by supporting three-dimensional information and improving synthetic feasibility. Case studies focused on generating and optimizing drug-like big marine natural products were performed, underscoring PSO-ENP's effectiveness and demonstrating its considerable potential for practical drug discovery applications.
△ Less
Submitted 9 April, 2024;
originally announced April 2024.
-
Bridging the Gap between Chemical Reaction Pretraining and Conditional Molecule Generation with a Unified Model
Authors:
Bo Qiang,
Yiran Zhou,
Yuheng Ding,
Ningfeng Liu,
Song Song,
Liangren Zhang,
Bo Huang,
Zhenming Liu
Abstract:
Chemical reactions are the fundamental building blocks of drug design and organic chemistry research. In recent years, there has been a growing need for a large-scale deep-learning framework that can efficiently capture the basic rules of chemical reactions. In this paper, we have proposed a unified framework that addresses both the reaction representation learning and molecule generation tasks, w…
▽ More
Chemical reactions are the fundamental building blocks of drug design and organic chemistry research. In recent years, there has been a growing need for a large-scale deep-learning framework that can efficiently capture the basic rules of chemical reactions. In this paper, we have proposed a unified framework that addresses both the reaction representation learning and molecule generation tasks, which allows for a more holistic approach. Inspired by the organic chemistry mechanism, we develop a novel pretraining framework that enables us to incorporate inductive biases into the model. Our framework achieves state-of-the-art results on challenging downstream tasks. By possessing chemical knowledge, our generative framework overcome the limitations of current molecule generation models that rely on a small number of reaction templates. In the extensive experiments, our model generates synthesizable drug-like structures of high quality. Overall, our work presents a significant step toward a large-scale deep-learning framework for a variety of reaction-based applications.
△ Less
Submitted 7 March, 2024; v1 submitted 13 March, 2023;
originally announced March 2023.
-
An ensemble neural network approach to forecast Dengue outbreak based on climatic condition
Authors:
Madhurima Panja,
Tanujit Chakraborty,
Sk Shahid Nadim,
Indrajit Ghosh,
Uttam Kumar,
Nan Liu
Abstract:
Dengue fever is a virulent disease spreading over 100 tropical and subtropical countries in Africa, the Americas, and Asia. This arboviral disease affects around 400 million people globally, severely distressing the healthcare systems. The unavailability of a specific drug and ready-to-use vaccine makes the situation worse. Hence, policymakers must rely on early warning systems to control interven…
▽ More
Dengue fever is a virulent disease spreading over 100 tropical and subtropical countries in Africa, the Americas, and Asia. This arboviral disease affects around 400 million people globally, severely distressing the healthcare systems. The unavailability of a specific drug and ready-to-use vaccine makes the situation worse. Hence, policymakers must rely on early warning systems to control intervention-related decisions. Forecasts routinely provide critical information for dangerous epidemic events. However, the available forecasting models (e.g., weather-driven mechanistic, statistical time series, and machine learning models) lack a clear understanding of different components to improve prediction accuracy and often provide unstable and unreliable forecasts. This study proposes an ensemble wavelet neural network with exogenous factor(s) (XEWNet) model that can produce reliable estimates for dengue outbreak prediction for three geographical regions, namely San Juan, Iquitos, and Ahmedabad. The proposed XEWNet model is flexible and can easily incorporate exogenous climate variable(s) confirmed by statistical causality tests in its scalable framework. The proposed model is an integrated approach that uses wavelet transformation into an ensemble neural network framework that helps in generating more reliable long-term forecasts. The proposed XEWNet allows complex non-linear relationships between the dengue incidence cases and rainfall; however, mathematically interpretable, fast in execution, and easily comprehensible. The proposal's competitiveness is measured using computational experiments based on various statistical metrics and several statistical comparison tests. In comparison with statistical, machine learning, and deep learning methods, our proposed XEWNet performs better in 75% of the cases for short-term and long-term forecasting of dengue incidence.
△ Less
Submitted 19 December, 2022; v1 submitted 16 December, 2022;
originally announced December 2022.
-
Epicasting: An Ensemble Wavelet Neural Network (EWNet) for Forecasting Epidemics
Authors:
Madhurima Panja,
Tanujit Chakraborty,
Uttam Kumar,
Nan Liu
Abstract:
Infectious diseases remain among the top contributors to human illness and death worldwide, among which many diseases produce epidemic waves of infection. The unavailability of specific drugs and ready-to-use vaccines to prevent most of these epidemics makes the situation worse. These force public health officials and policymakers to rely on early warning systems generated by reliable and accurate…
▽ More
Infectious diseases remain among the top contributors to human illness and death worldwide, among which many diseases produce epidemic waves of infection. The unavailability of specific drugs and ready-to-use vaccines to prevent most of these epidemics makes the situation worse. These force public health officials and policymakers to rely on early warning systems generated by reliable and accurate forecasts of epidemics. Accurate forecasts of epidemics can assist stakeholders in tailoring countermeasures, such as vaccination campaigns, staff scheduling, and resource allocation, to the situation at hand, which could translate to reductions in the impact of a disease. Unfortunately, most of these past epidemics exhibit nonlinear and non-stationary characteristics due to their spreading fluctuations based on seasonal-dependent variability and the nature of these epidemics. We analyse a wide variety of epidemic time series datasets using a maximal overlap discrete wavelet transform (MODWT) based autoregressive neural network and call it EWNet model. MODWT techniques effectively characterize non-stationary behavior and seasonal dependencies in the epidemic time series and improve the nonlinear forecasting scheme of the autoregressive neural network in the proposed ensemble wavelet network framework. From a nonlinear time series viewpoint, we explore the asymptotic stationarity of the proposed EWNet model to show the asymptotic behavior of the associated Markov Chain. We also theoretically investigate the effect of learning stability and the choice of hidden neurons in the proposal. From a practical perspective, we compare our proposed EWNet framework with several statistical, machine learning, and deep learning models. Experimental results show that the proposed EWNet is highly competitive compared to the state-of-the-art epidemic forecasting methods.
△ Less
Submitted 14 March, 2023; v1 submitted 21 June, 2022;
originally announced June 2022.
-
Fast Projection onto the Capped Simplex with Applications to Sparse Regression in Bioinformatics
Authors:
Andersen Ang,
Jianzhu Ma,
Nianjun Liu,
Kun Huang,
Yijie Wang
Abstract:
We consider the problem of projecting a vector onto the so-called k-capped simplex, which is a hyper-cube cut by a hyperplane. For an n-dimensional input vector with bounded elements, we found that a simple algorithm based on Newton's method is able to solve the projection problem to high precision with a complexity roughly about O(n), which has a much lower computational cost compared with the ex…
▽ More
We consider the problem of projecting a vector onto the so-called k-capped simplex, which is a hyper-cube cut by a hyperplane. For an n-dimensional input vector with bounded elements, we found that a simple algorithm based on Newton's method is able to solve the projection problem to high precision with a complexity roughly about O(n), which has a much lower computational cost compared with the existing sorting-based methods proposed in the literature. We provide a theory for partial explanation and justification of the method.
We demonstrate that the proposed algorithm can produce a solution of the projection problem with high precision on large scale datasets, and the algorithm is able to significantly outperform the state-of-the-art methods in terms of runtime (about 6-8 times faster than a commercial software with respect to CPU time for input vector with 1 million variables or more).
We further illustrate the effectiveness of the proposed algorithm on solving sparse regression in a bioinformatics problem. Empirical results on the GWAS dataset (with 1,500,000 single-nucleotide polymorphisms) show that, when using the proposed method to accelerate the Projected Quasi-Newton (PQN) method, the accelerated PQN algorithm is able to handle huge-scale regression problem and it is more efficient (about 3-6 times faster) than the current state-of-the-art methods.
△ Less
Submitted 25 October, 2021; v1 submitted 16 October, 2021;
originally announced October 2021.
-
DHX36-mediated G-quadruplex unfolding is ATP-independent?
Authors:
Hai-Lei Guo,
Wei-Fei Chen,
Stephane Rety,
Na-Nv Liu,
Ze-Yu Song,
Yan-Xue Dai,
Xi-Miao Hou,
Shuo-Xing Dou,
Xu-Guang Xi
Abstract:
Chen et al. solved the crystal structure of bovine DHX36 bound to a DNA with a G-quadruplex (G4) and a single-stranded DNA segment. They believed that the mechanism they proposed may represent a general model for describing how a G4-unfolding helicase recognizes and unfolds G4 DNA. Their conclusion is interesting, however, we noticed that their linear DNA substrate (DNAMyc) that harbors a Myc-prom…
▽ More
Chen et al. solved the crystal structure of bovine DHX36 bound to a DNA with a G-quadruplex (G4) and a single-stranded DNA segment. They believed that the mechanism they proposed may represent a general model for describing how a G4-unfolding helicase recognizes and unfolds G4 DNA. Their conclusion is interesting, however, we noticed that their linear DNA substrate (DNAMyc) that harbors a Myc-promoter-derived G4-forming sequence was directly used without pre-folding. This raises the question whether the structure they obtained really reflects DHX36-mediated G4 recognition and unfolding, or just only represents a DHX36-binding-induced quasi-folded G4 structure. By a combination of polymerase extension, DMS footprinting, stopped-flow, and smFRET assays, we obtained clear evidences that do not support their ATP-independent one-base translocation structural model. We further revealed that the oscillation of FRET signal they observed should correspond to a repetitive G4 binding, but not unfolding, by DHX36.
△ Less
Submitted 22 September, 2019;
originally announced September 2019.
-
Microfluidic study of effects of flow velocity and nutrient concentration on biofilm accumulation and adhesive strength in a microchannel
Authors:
Na Liu,
Tormod Skauge,
David Landa-Marban,
Beate Hovland,
Bente Thorbjornsen,
Florin Adrain Radu,
Bartek Florczyk Vik,
Thomas Baumann,
Gunhild Bodtker
Abstract:
Biofilm accumulation in the porous media can cause plugging and change many physical properties of porous media. Targeted bioplugging may have significant applications for industrial processes. A deeper understanding of the relative influences of hydrodynamic conditions including flow velocity and nutrient concentration, on biofilm growth and detachment is necessary to plan and analyze bioplugging…
▽ More
Biofilm accumulation in the porous media can cause plugging and change many physical properties of porous media. Targeted bioplugging may have significant applications for industrial processes. A deeper understanding of the relative influences of hydrodynamic conditions including flow velocity and nutrient concentration, on biofilm growth and detachment is necessary to plan and analyze bioplugging experiments and field trials. The experimental results by means of microscopic imaging over a T-shape microchannel show that increase in fluid velocity could facilitate biofilm growth, but that above a velocity threshold, biofilm detachment and inhibition of biofilm formation due to high shear stress were observed. High nutrient concentration prompts the biofilm growth, but was accompanied by a relatively weak adhesive strength. This letter provides an overview of biofilm development in a hydrodynamic environment for better predicting and modelling the bioplugging associated with porous system in petroleum industry, hydrogeology, and water purification.
△ Less
Submitted 9 July, 2018;
originally announced July 2018.
-
Determination of Effective Synaptic Conductances Using Somatic Voltage Clamp
Authors:
Songting Li,
Nan Liu,
Xiaohui Zhang,
Douglas Zhou,
David Cai
Abstract:
The interplay between excitatory and inhibitory neurons imparts rich functions of the brain. To understand the underlying synaptic mechanisms, a fundamental approach is to study the dynamics of excitatory and inhibitory conductances of each neuron. The traditional method of determining conductance employs the synaptic current-voltage (I-V) relation obtained via voltage clamp. Using theoretical ana…
▽ More
The interplay between excitatory and inhibitory neurons imparts rich functions of the brain. To understand the underlying synaptic mechanisms, a fundamental approach is to study the dynamics of excitatory and inhibitory conductances of each neuron. The traditional method of determining conductance employs the synaptic current-voltage (I-V) relation obtained via voltage clamp. Using theoretical analysis, electrophysiological experiments, and realistic simulations, here we demonstrate that the traditional method conceptually fails to measure the conductance due to the neglect of a nonlinear interaction between the clamp current and the synaptic current. Consequently, it incurs substantial measurement error, even giving rise to unphysically negative conductance as observed in experiments. To elucidate synaptic impact on neuronal information processing, we introduce the concept of effective conductance and propose a framework to determine it accurately. Our work suggests re-examination of previous studies involving conductance measurement and provides a reliable approach to assess synaptic influence on neuronal computation.
△ Less
Submitted 13 October, 2017;
originally announced October 2017.
-
Dendritic Integration Regulation and Neuronal Arithmetic Implemented in a Proton-Coupled Neuron Transistor
Authors:
Chang** Wan,
Ning Liu,
** Feng,
Liqiang Zhu,
Yi Shi,
Qing Wan
Abstract:
Neuron is the most important building block in our brain, and information processing in individual neuron involves the transformation of input synaptic spike trains into an appropriate output spike train. Hardware implementation of neuron by individual ionic/electronic coupled device is of great importance for enhancing our understanding of the brain and solving sensory processing and complex reco…
▽ More
Neuron is the most important building block in our brain, and information processing in individual neuron involves the transformation of input synaptic spike trains into an appropriate output spike train. Hardware implementation of neuron by individual ionic/electronic coupled device is of great importance for enhancing our understanding of the brain and solving sensory processing and complex recognition tasks. Here, we provide a proof-of-principle artificial neuron with multiple presynaptic inputs and one modulatory terminal based on a proton-coupled oxide-based electric-double-layer transistor. Regulation of dendritic integration was realized by tuning the voltage applied on the modulatory terminal. Additionally, neuronal gain control (arithmetic) in the scheme of temporal-correlated coding and rate coding are also mimicked. Our results provide a new-concept approach for building brain-inspired neuromorphic systems.
△ Less
Submitted 11 June, 2015;
originally announced June 2015.