-
Protein Representation Learning with Sequence Information Embedding: Does it Always Lead to a Better Performance?
Authors:
Yang Tan,
Lirong Zheng,
Bozitao Zhong,
Liang Hong,
Bingxin Zhou
Abstract:
Deep learning has become a crucial tool in studying proteins. While the significance of modeling protein structure has been discussed extensively in the literature, amino acid types are typically included in the input as a default operation for many inference tasks. This study demonstrates with structure alignment task that embedding amino acid types in some cases may not help a deep learning mode…
▽ More
Deep learning has become a crucial tool in studying proteins. While the significance of modeling protein structure has been discussed extensively in the literature, amino acid types are typically included in the input as a default operation for many inference tasks. This study demonstrates with structure alignment task that embedding amino acid types in some cases may not help a deep learning model learn better representation. To this end, we propose ProtLOCA, a local geometry alignment method based solely on amino acid structure representation. The effectiveness of ProtLOCA is examined by a global structure-matching task on protein pairs with an independent test dataset based on CATH labels. Our method outperforms existing sequence- and structure-based representation learning methods by more quickly and accurately matching structurally consistent protein domains. Furthermore, in local structure pairing tasks, ProtLOCA for the first time provides a valid solution to highlight common local structures among proteins with different overall structures but the same function. This suggests a new possibility for using deep learning methods to analyze protein structure to infer function.
△ Less
Submitted 28 June, 2024;
originally announced June 2024.
-
ProtSolM: Protein Solubility Prediction with Multi-modal Features
Authors:
Yang Tan,
Jia Zheng,
Liang Hong,
Bingxin Zhou
Abstract:
Understanding protein solubility is essential for their functional applications. Computational methods for predicting protein solubility are crucial for reducing experimental costs and enhancing the efficiency and success rates of protein engineering. Existing methods either construct a supervised learning scheme on small-scale datasets with manually processed physicochemical properties, or blindl…
▽ More
Understanding protein solubility is essential for their functional applications. Computational methods for predicting protein solubility are crucial for reducing experimental costs and enhancing the efficiency and success rates of protein engineering. Existing methods either construct a supervised learning scheme on small-scale datasets with manually processed physicochemical properties, or blindly apply pre-trained protein language models to extract amino acid interaction information. The scale and quality of available training datasets leave significant room for improvement in terms of accuracy and generalization. To address these research gaps, we propose \sol, a novel deep learning method that combines pre-training and fine-tuning schemes for protein solubility prediction. ProtSolM integrates information from multiple dimensions, including physicochemical properties, amino acid sequences, and protein backbone structures. Our model is trained using \data, the largest solubility dataset that we have constructed. PDBSol includes over $60,000$ protein sequences and structures. We provide a comprehensive leaderboard of existing statistical learning and deep learning methods on independent datasets with computational and experimental labels. ProtSolM achieved state-of-the-art performance across various evaluation metrics, demonstrating its potential to significantly advance the accuracy of protein solubility prediction.
△ Less
Submitted 28 June, 2024;
originally announced June 2024.
-
Simple, Efficient and Scalable Structure-aware Adapter Boosts Protein Language Models
Authors:
Yang Tan,
Mingchen Li,
Bingxin Zhou,
Bozitao Zhong,
Lirong Zheng,
Pan Tan,
Ziyi Zhou,
Huiqun Yu,
Guisheng Fan,
Liang Hong
Abstract:
Fine-tuning Pre-trained protein language models (PLMs) has emerged as a prominent strategy for enhancing downstream prediction tasks, often outperforming traditional supervised learning approaches. As a widely applied powerful technique in natural language processing, employing Parameter-Efficient Fine-Tuning techniques could potentially enhance the performance of PLMs. However, the direct transfe…
▽ More
Fine-tuning Pre-trained protein language models (PLMs) has emerged as a prominent strategy for enhancing downstream prediction tasks, often outperforming traditional supervised learning approaches. As a widely applied powerful technique in natural language processing, employing Parameter-Efficient Fine-Tuning techniques could potentially enhance the performance of PLMs. However, the direct transfer to life science tasks is non-trivial due to the different training strategies and data forms. To address this gap, we introduce SES-Adapter, a simple, efficient, and scalable adapter method for enhancing the representation learning of PLMs. SES-Adapter incorporates PLM embeddings with structural sequence embeddings to create structure-aware representations. We show that the proposed method is compatible with different PLM architectures and across diverse tasks. Extensive evaluations are conducted on 2 types of folding structures with notable quality differences, 9 state-of-the-art baselines, and 9 benchmark datasets across distinct downstream tasks. Results show that compared to vanilla PLMs, SES-Adapter improves downstream task performance by a maximum of 11% and an average of 3%, with significantly accelerated training speed by a maximum of 1034% and an average of 362%, the convergence rate is also improved by approximately 2 times. Moreover, positive optimization is observed even with low-quality predicted structures. The source code for SES-Adapter is available at https://github.com/tyang816/SES-Adapter.
△ Less
Submitted 23 April, 2024;
originally announced April 2024.
-
Unsupervised Discovery of Steerable Factors When Graph Deep Generative Models Are Entangled
Authors:
Shengchao Liu,
Chengpeng Wang,
Jiarui Lu,
Weili Nie,
Hanchen Wang,
Zhuoxinran Li,
Bolei Zhou,
Jian Tang
Abstract:
Deep generative models (DGMs) have been widely developed for graph data. However, much less investigation has been carried out on understanding the latent space of such pretrained graph DGMs. These understandings possess the potential to provide constructive guidelines for crucial tasks, such as graph controllable generation. Thus in this work, we are interested in studying this problem and propos…
▽ More
Deep generative models (DGMs) have been widely developed for graph data. However, much less investigation has been carried out on understanding the latent space of such pretrained graph DGMs. These understandings possess the potential to provide constructive guidelines for crucial tasks, such as graph controllable generation. Thus in this work, we are interested in studying this problem and propose GraphCG, a method for the unsupervised discovery of steerable factors in the latent space of pretrained graph DGMs. We first examine the representation space of three pretrained graph DGMs with six disentanglement metrics, and we observe that the pretrained representation space is entangled. Motivated by this observation, GraphCG learns the steerable factors via maximizing the mutual information between semantic-rich directions, where the controlled graph moving along the same direction will share the same steerable factors. We quantitatively verify that GraphCG outperforms four competitive baselines on two graph DGMs pretrained on two molecule datasets. Additionally, we qualitatively illustrate seven steerable factors learned by GraphCG on five pretrained DGMs over five graph datasets, including two for molecules and three for point clouds.
△ Less
Submitted 29 January, 2024;
originally announced January 2024.
-
ANN vs SNN: A case study for Neural Decoding in Implantable Brain-Machine Interfaces
Authors:
Biyan Zhou,
Pao-Sheng Vincent Sun,
Arindam Basu
Abstract:
While it is important to make implantable brain-machine interfaces (iBMI) wireless to increase patient comfort and safety, the trend of increased channel count in recent neural probes poses a challenge due to the concomitant increase in the data rate. Extracting information from raw data at the source by using edge computing is a promising solution to this problem, with integrated intention decode…
▽ More
While it is important to make implantable brain-machine interfaces (iBMI) wireless to increase patient comfort and safety, the trend of increased channel count in recent neural probes poses a challenge due to the concomitant increase in the data rate. Extracting information from raw data at the source by using edge computing is a promising solution to this problem, with integrated intention decoders providing the best compression ratio. In this work, we compare different neural networks (NN) for motor decoding in terms of accuracy and implementation cost. We further show that combining traditional signal processing techniques with machine learning ones deliver surprisingly good performance even with simple NNs. Adding a block Bidirectional Bessel filter provided maximum gains of $\approx 0.05$, $0.04$ and $0.03$ in $R^2$ for ANN\_3d, SNN\_3D and ANN models, while the gains were lower ($\approx 0.02$ or less) for LSTM and SNN\_streaming models. Increasing training data helped improve the $R^2$ of all models by $0.03-0.04$ indicating they have more capacity for future improvement. In general, LSTM and SNN\_streaming models occupy the high and low ends of the pareto curves (for accuracy vs. memory/operations) respectively while SNN\_3D and ANN\_3D occupy intermediate positions. Our work presents state of the art results for this dataset and paves the way for decoder-integrated-implants of the future.
△ Less
Submitted 26 December, 2023;
originally announced December 2023.
-
A Unified View on Neural Message Passing with Opinion Dynamics for Social Networks
Authors:
Outongyi Lv,
Bingxin Zhou,
**g Wang,
Xiang Xiao,
Weishu Zhao,
Lirong Zheng
Abstract:
Social networks represent a common form of interconnected data frequently depicted as graphs within the domain of deep learning-based inference. These communities inherently form dynamic systems, achieving stability through continuous internal communications and opinion exchanges among social actors along their social ties. In contrast, neural message passing in deep learning provides a clear and…
▽ More
Social networks represent a common form of interconnected data frequently depicted as graphs within the domain of deep learning-based inference. These communities inherently form dynamic systems, achieving stability through continuous internal communications and opinion exchanges among social actors along their social ties. In contrast, neural message passing in deep learning provides a clear and intuitive mathematical framework for understanding information propagation and aggregation among connected nodes in graphs. Node representations are dynamically updated by considering both the connectivity and status of neighboring nodes. This research harmonizes concepts from sociometry and neural message passing to analyze and infer the behavior of dynamic systems. Drawing inspiration from opinion dynamics in sociology, we propose ODNet, a novel message passing scheme incorporating bounded confidence, to refine the influence weight of local nodes for message propagation. We adjust the similarity cutoffs of bounded confidence and influence weights of ODNet and define opinion exchange rules that align with the characteristics of social network graphs. We show that ODNet enhances prediction performance across various graph types and alleviates oversmoothing issues. Furthermore, our approach surpasses conventional baselines in graph representation learning and proves its practical significance in analyzing real-world co-occurrence networks of metabolic genes. Remarkably, our method simplifies complex social network graphs solely by leveraging knowledge of interaction frequencies among entities within the system. It accurately identifies internal communities and the roles of genes in different metabolic pathways, including opinion leaders, bridge communicators, and isolators.
△ Less
Submitted 3 October, 2023; v1 submitted 2 October, 2023;
originally announced October 2023.
-
Graph Denoising Diffusion for Inverse Protein Folding
Authors:
Kai Yi,
Bingxin Zhou,
Yiqing Shen,
Pietro Liò,
Yu Guang Wang
Abstract:
Inverse protein folding is challenging due to its inherent one-to-many map** characteristic, where numerous possible amino acid sequences can fold into a single, identical protein backbone. This task involves not only identifying viable sequences but also representing the sheer diversity of potential solutions. However, existing discriminative models, such as transformer-based auto-regressive mo…
▽ More
Inverse protein folding is challenging due to its inherent one-to-many map** characteristic, where numerous possible amino acid sequences can fold into a single, identical protein backbone. This task involves not only identifying viable sequences but also representing the sheer diversity of potential solutions. However, existing discriminative models, such as transformer-based auto-regressive models, struggle to encapsulate the diverse range of plausible solutions. In contrast, diffusion probabilistic models, as an emerging genre of generative approaches, offer the potential to generate a diverse set of sequence candidates for determined protein backbones. We propose a novel graph denoising diffusion model for inverse protein folding, where a given protein backbone guides the diffusion process on the corresponding amino acid residue types. The model infers the joint distribution of amino acids conditioned on the nodes' physiochemical properties and local environment. Moreover, we utilize amino acid replacement matrices for the diffusion forward process, encoding the biologically-meaningful prior knowledge of amino acids from their spatial and sequential neighbors as well as themselves, which reduces the sampling space of the generative process. Our model achieves state-of-the-art performance over a set of popular baseline methods in sequence recovery and exhibits great potential in generating diverse protein sequences for a determined protein backbone structure.
△ Less
Submitted 7 November, 2023; v1 submitted 29 June, 2023;
originally announced June 2023.
-
Multi-level Protein Representation Learning for Blind Mutational Effect Prediction
Authors:
Yang Tan,
Bingxin Zhou,
Yuanhong Jiang,
Yu Guang Wang,
Liang Hong
Abstract:
Directed evolution plays an indispensable role in protein engineering that revises existing protein sequences to attain new or enhanced functions. Accurately predicting the effects of protein variants necessitates an in-depth understanding of protein structure and function. Although large self-supervised language models have demonstrated remarkable performance in zero-shot inference using only pro…
▽ More
Directed evolution plays an indispensable role in protein engineering that revises existing protein sequences to attain new or enhanced functions. Accurately predicting the effects of protein variants necessitates an in-depth understanding of protein structure and function. Although large self-supervised language models have demonstrated remarkable performance in zero-shot inference using only protein sequences, these models inherently do not interpret the spatial characteristics of protein structures, which are crucial for comprehending protein folding stability and internal molecular interactions. This paper introduces a novel pre-training framework that cascades sequential and geometric analyzers for protein primary and tertiary structures. It guides mutational directions toward desired traits by simulating natural selection on wild-type proteins and evaluates the effects of variants based on their fitness to perform the function. We assess the proposed approach using a public database and two new databases for a variety of variant effect prediction tasks, which encompass a diverse set of proteins and assays from different taxa. The prediction results achieve state-of-the-art performance over other zero-shot learning methods for both single-site mutations and deep mutations.
△ Less
Submitted 7 June, 2023;
originally announced June 2023.
-
Accurate and Definite Mutational Effect Prediction with Lightweight Equivariant Graph Neural Networks
Authors:
Bingxin Zhou,
Outongyi Lv,
Kai Yi,
Xinye Xiong,
Pan Tan,
Liang Hong,
Yu Guang Wang
Abstract:
Directed evolution as a widely-used engineering strategy faces obstacles in finding desired mutants from the massive size of candidate modifications. While deep learning methods learn protein contexts to establish feasible searching space, many existing models are computationally demanding and fail to predict how specific mutational tests will affect a protein's sequence or function. This research…
▽ More
Directed evolution as a widely-used engineering strategy faces obstacles in finding desired mutants from the massive size of candidate modifications. While deep learning methods learn protein contexts to establish feasible searching space, many existing models are computationally demanding and fail to predict how specific mutational tests will affect a protein's sequence or function. This research introduces a lightweight graph representation learning scheme that efficiently analyzes the microenvironment of wild-type proteins and recommends practical higher-order mutations exclusive to the user-specified protein and function of interest. Our method enables continuous improvement of the inference model by limited computational resources and a few hundred mutational training samples, resulting in accurate prediction of variant effects that exhibit near-perfect correlation with the ground truth across deep mutational scanning assays of 19 proteins. With its affordability and applicability to both computer scientists and biochemical laboratories, our solution offers a wide range of benefits that make it an ideal choice for the community.
△ Less
Submitted 13 April, 2023;
originally announced April 2023.
-
Graph Representation Learning for Interactive Biomolecule Systems
Authors:
Xinye Xiong,
Bingxin Zhou,
Yu Guang Wang
Abstract:
Advances in deep learning models have revolutionized the study of biomolecule systems and their mechanisms. Graph representation learning, in particular, is important for accurately capturing the geometric information of biomolecules at different levels. This paper presents a comprehensive review of the methodologies used to represent biological molecules and systems as computer-recognizable objec…
▽ More
Advances in deep learning models have revolutionized the study of biomolecule systems and their mechanisms. Graph representation learning, in particular, is important for accurately capturing the geometric information of biomolecules at different levels. This paper presents a comprehensive review of the methodologies used to represent biological molecules and systems as computer-recognizable objects, such as sequences, graphs, and surfaces. Moreover, it examines how geometric deep learning models, with an emphasis on graph-based techniques, can analyze biomolecule data to enable drug discovery, protein characterization, and biological system analysis. The study concludes with an overview of the current state of the field, highlighting the challenges that exist and the potential future research directions.
△ Less
Submitted 5 April, 2023;
originally announced April 2023.
-
EpiGNN: Exploring Spatial Transmission with Graph Neural Network for Regional Epidemic Forecasting
Authors:
Feng Xie,
Zhong Zhang,
Liang Li,
Bin Zhou,
Yusong Tan
Abstract:
Epidemic forecasting is the key to effective control of epidemic transmission and helps the world mitigate the crisis that threatens public health. To better understand the transmission and evolution of epidemics, we propose EpiGNN, a graph neural network-based model for epidemic forecasting. Specifically, we design a transmission risk encoding module to characterize local and global spatial effec…
▽ More
Epidemic forecasting is the key to effective control of epidemic transmission and helps the world mitigate the crisis that threatens public health. To better understand the transmission and evolution of epidemics, we propose EpiGNN, a graph neural network-based model for epidemic forecasting. Specifically, we design a transmission risk encoding module to characterize local and global spatial effects of regions in epidemic processes and incorporate them into the model. Meanwhile, we develop a Region-Aware Graph Learner (RAGL) that takes transmission risk, geographical dependencies, and temporal information into account to better explore spatial-temporal dependencies and makes regions aware of related regions' epidemic situations. The RAGL can also combine with external resources, such as human mobility, to further improve prediction performance. Comprehensive experiments on five real-world epidemic-related datasets (including influenza and COVID-19) demonstrate the effectiveness of our proposed method and show that EpiGNN outperforms state-of-the-art baselines by 9.48% in RMSE.
△ Less
Submitted 23 August, 2022;
originally announced August 2022.
-
Inter- and Intra-Series Embeddings Fusion Network for Epidemiological Forecasting
Authors:
Feng Xie,
Zhong Zhang,
Xuechen Zhao,
Bin Zhou,
Yusong Tan
Abstract:
The accurate forecasting of infectious epidemic diseases is the key to effective control of the epidemic situation in a region. Most existing methods ignore potential dynamic dependencies between regions or the importance of temporal dependencies and inter-dependencies between regions for prediction. In this paper, we propose an Inter- and Intra-Series Embeddings Fusion Network (SEFNet) to improve…
▽ More
The accurate forecasting of infectious epidemic diseases is the key to effective control of the epidemic situation in a region. Most existing methods ignore potential dynamic dependencies between regions or the importance of temporal dependencies and inter-dependencies between regions for prediction. In this paper, we propose an Inter- and Intra-Series Embeddings Fusion Network (SEFNet) to improve epidemic prediction performance. SEFNet consists of two parallel modules, named Inter-Series Embedding Module and Intra-Series Embedding Module. In Inter-Series Embedding Module, a multi-scale unified convolution component called Region-Aware Convolution is proposed, which cooperates with self-attention to capture dynamic dependencies between time series obtained from multiple regions. The Intra-Series Embedding Module uses Long Short-Term Memory to capture temporal relationships within each time series. Subsequently, we learn the influence degree of two embeddings and fuse them with the parametric-matrix fusion method. To further improve the robustness, SEFNet also integrates a traditional autoregressive component in parallel with nonlinear neural networks. Experiments on four real-world epidemic-related datasets show SEFNet is effective and outperforms state-of-the-art baselines.
△ Less
Submitted 23 August, 2022;
originally announced August 2022.
-
Agent-Based Campus Novel Coronavirus Infection and Control Simulation
Authors:
Pei Lv,
Quan Zhang,
Boya Xu,
Ran Feng,
Chaochao Li,
Junxiao Xue,
Bing Zhou,
Mingliang Xu
Abstract:
Corona Virus Disease 2019 (COVID-19), due to its extremely high infectivity, has been spreading rapidly around the world and bringing huge influence to socioeconomic development as well as people's daily life. Taking for example the virus transmission that may occur after college students return to school, we analyze the quantitative influence of the key factors on the virus spread, including crow…
▽ More
Corona Virus Disease 2019 (COVID-19), due to its extremely high infectivity, has been spreading rapidly around the world and bringing huge influence to socioeconomic development as well as people's daily life. Taking for example the virus transmission that may occur after college students return to school, we analyze the quantitative influence of the key factors on the virus spread, including crowd density and self-protection. One Campus Virus Infection and Control Simulation model (CVICS) of the novel coronavirus is proposed in this paper, fully considering the characteristics of repeated contact and strong mobility of crowd in the closed environment. Specifically, we build an agent-based infection model, introduce the mean field theory to calculate the probability of virus transmission, and micro-simulate the daily prevalence of infection among individuals. The experimental results show that the proposed model in this paper efficiently simulate how the virus spread in the dense crowd in frequent contact under closed environment. Furthermore, preventive and control measures such as self-protection, crowd decentralization and isolation during the epidemic can effectively delay the arrival of infection peak and reduce the prevalence, and finally lower the risk of COVID-19 transmission after the students return to school.
△ Less
Submitted 1 September, 2021; v1 submitted 22 February, 2021;
originally announced February 2021.
-
A framework for studying behavioral evolution by reconstructing ancestral repertoires
Authors:
Damián G. Hernández,
Catalina Rivera,
Jessica Cande,
Baohua Zhou,
David L. Stern,
Gordon J. Berman
Abstract:
Although extensive behavioral changes often exist between closely related animal species, our understanding of the genetic basis underlying the evolution of behavior has remained limited. Here, we propose a new framework to study behavioral evolution by computational estimation of ancestral behavioral repertoires. We measured the behaviors of individuals from six species of fruit flies using unsup…
▽ More
Although extensive behavioral changes often exist between closely related animal species, our understanding of the genetic basis underlying the evolution of behavior has remained limited. Here, we propose a new framework to study behavioral evolution by computational estimation of ancestral behavioral repertoires. We measured the behaviors of individuals from six species of fruit flies using unsupervised techniques and identified suites of stereotyped movements exhibited by each species. We then fit a Generalized Linear Mixed Model to estimate the suites of behaviors exhibited by ancestral species, as well as the intra- and inter-species behavioral covariances. We found that much of intraspecific behavioral variation is explained by differences between individuals in the status of their behavioral hidden states, what might be called their "mood." Lastly, we propose a method to identify groups of behaviors that appear to have evolved together, illustrating how sets of behaviors, rather than individual behaviors, likely evolved. Our approach provides a new framework for identifying co-evolving behaviors and may provide new opportunities to study the genetic basis of behavioral evolution.
△ Less
Submitted 19 July, 2020;
originally announced July 2020.
-
Lung ultrasound surface wave elastography for assessing instititial lung disease
Authors:
Xiaoming Zhang,
Boran Zhou,
Thomas Osborn,
Brian Bartholmai,
Sanjay Kalra
Abstract:
Lung ultrasound surface wave elastography (LUSWE) is a novel noninvasive technique for measuring superficial lung tissue stiffness.The purpose of this study was to translate LUSWE for assessing patients with interstitial lung disease (ILD) and various connective diseases including systemic sclerosis (SSc).In this study, LUSWE was used to measure the surface wave speed of lung at 100 Hz, 150 Hz and…
▽ More
Lung ultrasound surface wave elastography (LUSWE) is a novel noninvasive technique for measuring superficial lung tissue stiffness.The purpose of this study was to translate LUSWE for assessing patients with interstitial lung disease (ILD) and various connective diseases including systemic sclerosis (SSc).In this study, LUSWE was used to measure the surface wave speed of lung at 100 Hz, 150 Hz and 200 Hz through six intercostal lung spaces for 91 patients with ILD and 30 healthy control subjects. In addition, skin viscoelasticity was measured at both forearms and upper arms for patients and controls. The surface wave speeds of patients' lungs were significantly higher than those of control subjects for the six intercostal spaces and the three excitation frequencies. Patient skin elasticity and viscosity were significantly higher than those of control subjects for the four locations on the arm. In dividing ILD patients into two groups, ILD patients with SSc and ILD patients without SSc, significant differences between each patient group with the control group were found for both the lung and skin.No significant differences were found between the two patients group, although there were some differences at a few locations and at 100 Hz. LUSWE may be useful for assessing ILD and SSc and screening early stage patients.
△ Less
Submitted 9 June, 2018;
originally announced June 2018.
-
A Numerical Study of the Relationship Between Erectile Pressure and Shear Wave Speed of Corpus Cavernosa in Ultrasound Vibro-elastography
Authors:
Boran Zhou,
Landon W. Trost,
Xiaoming Zhang
Abstract:
The objective of this study was to investigate the relationship between erectile pressure (EP) and shear wave speed of the corpus cavernosa obtained via a specific ultrasound vibro-elastography (UVE) technique. This study builds upon our prior investigation, in which UVE was used to evaluate the viscoelastic properties of the corpus cavernosa in the flaccid and erect states. A two-dimensional poro…
▽ More
The objective of this study was to investigate the relationship between erectile pressure (EP) and shear wave speed of the corpus cavernosa obtained via a specific ultrasound vibro-elastography (UVE) technique. This study builds upon our prior investigation, in which UVE was used to evaluate the viscoelastic properties of the corpus cavernosa in the flaccid and erect states. A two-dimensional poroviscoelastic finite element model (FEM) was developed to simulate wave propagation in the penile tissue according to our experimental setup. Various levels of EP were applied to the corpus cavernosa, and the relationship between shear wave speed in the corpus cavernosa and EP was investigated. Results demonstrated non-linear, positive correlations between shear wave speeds in the corpus cavernosa and increasing EP at different vibration frequencies (100-200 Hz). These findings represent the first report of the impact of EP on shear wave speed and validates the use of UVE in the evaluation of men with erectile dysfunction. Further evaluations are warranted to determine the clinical utility of this instrument in the diagnosis and treatment of men with erectile dysfunction.
△ Less
Submitted 21 June, 2018; v1 submitted 1 June, 2018;
originally announced June 2018.
-
Model compression for faster structural separation of macromolecules captured by Cellular Electron Cryo-Tomography
Authors:
Jialiang Guo,
Bo Zhou,
Xiangrui Zeng,
Zachary Freyberg,
Min Xu
Abstract:
Electron Cryo-Tomography (ECT) enables 3D visualization of macromolecule structure inside single cells. Macromolecule classification approaches based on convolutional neural networks (CNN) were developed to separate millions of macromolecules captured from ECT systematically. However, given the fast accumulation of ECT data, it will soon become necessary to use CNN models to efficiently and accura…
▽ More
Electron Cryo-Tomography (ECT) enables 3D visualization of macromolecule structure inside single cells. Macromolecule classification approaches based on convolutional neural networks (CNN) were developed to separate millions of macromolecules captured from ECT systematically. However, given the fast accumulation of ECT data, it will soon become necessary to use CNN models to efficiently and accurately separate substantially more macromolecules at the prediction stage, which requires additional computational costs. To speed up the prediction, we compress classification models into compact neural networks with little in accuracy for deployment. Specifically, we propose to perform model compression through knowledge distillation. Firstly, a complex teacher network is trained to generate soft labels with better classification feasibility followed by training of customized student networks with simple architectures using the soft label to compress model complexity. Our tests demonstrate that our compressed models significantly reduce the number of parameters and time cost while maintaining similar classification accuracy.
△ Less
Submitted 31 January, 2018;
originally announced January 2018.
-
Feature Decomposition Based Saliency Detection in Electron Cryo-Tomograms
Authors:
Bo Zhou,
Qiang Guo,
Xiangrui Zeng,
Min Xu
Abstract:
Electron Cryo-Tomography (ECT) allows 3D visualization of subcellular structures at the submolecular resolution in close to the native state. However, due to the high degree of structural complexity and imaging limits, the automatic segmentation of cellular components from ECT images is very difficult. To complement and speed up existing segmentation methods, it is desirable to develop a generic c…
▽ More
Electron Cryo-Tomography (ECT) allows 3D visualization of subcellular structures at the submolecular resolution in close to the native state. However, due to the high degree of structural complexity and imaging limits, the automatic segmentation of cellular components from ECT images is very difficult. To complement and speed up existing segmentation methods, it is desirable to develop a generic cell component segmentation method that is 1) not specific to particular types of cellular components, 2) able to segment unknown cellular components, 3) fully unsupervised and does not rely on the availability of training data. As an important step towards this goal, in this paper, we propose a saliency detection method that computes the likelihood that a subregion in a tomogram stands out from the background. Our method consists of four steps: supervoxel over-segmentation, feature extraction, feature matrix decomposition, and computation of saliency. The method produces a distribution map that represents the regions' saliency in tomograms. Our experiments show that our method can successfully label most salient regions detected by a human observer, and able to filter out regions not containing cellular components. Therefore, our method can remove the majority of the background region, and significantly speed up the subsequent processing of segmentation and recognition of cellular components captured by ECT.
△ Less
Submitted 31 January, 2018;
originally announced January 2018.
-
Chance, long tails, and inference: a non-Gaussian, Bayesian theory of vocal learning in songbirds
Authors:
Baohua Zhou,
David Hofmann,
Itai Pinkoviezky,
Samuel J. Sober,
Ilya Nemenman
Abstract:
Traditional theories of sensorimotor learning posit that animals use sensory error signals to find the optimal motor command in the face of Gaussian sensory and motor noise. However, most such theories cannot explain common behavioral observations, for example that smaller sensory errors are more readily corrected than larger errors and that large abrupt (but not gradually introduced) errors lead…
▽ More
Traditional theories of sensorimotor learning posit that animals use sensory error signals to find the optimal motor command in the face of Gaussian sensory and motor noise. However, most such theories cannot explain common behavioral observations, for example that smaller sensory errors are more readily corrected than larger errors and that large abrupt (but not gradually introduced) errors lead to weak learning. Here we propose a new theory of sensorimotor learning that explains these observations. The theory posits that the animal learns an entire probability distribution of motor commands rather than trying to arrive at a single optimal command, and that learning arises via Bayesian inference when new sensory information becomes available. We test this theory using data from a songbird, the Bengalese finch, that is adapting the pitch (fundamental frequency) of its song following perturbations of auditory feedback using miniature headphones. We observe the distribution of the sung pitches to have long, non-Gaussian tails, which, within our theory, explains the observed dynamics of learning. Further, the theory makes surprising predictions about the dynamics of the shape of the pitch distribution, which we confirm experimentally.
△ Less
Submitted 23 July, 2017;
originally announced July 2017.
-
A Semiparametric Bayesian Model for Detecting Synchrony Among Multiple Neurons
Authors:
Babak Shahbaba,
Bo Zhou,
Shiwei Lan,
Hernando Ombao,
David Moorman,
Sam Behseta
Abstract:
We propose a scalable semiparametric Bayesian model to capture dependencies among multiple neurons by detecting their co-firing (possibly with some lag time) patterns over time. After discretizing time so there is at most one spike at each interval, the resulting sequence of 1's (spike) and 0's (silence) for each neuron is modeled using the logistic function of a continuous latent variable with a…
▽ More
We propose a scalable semiparametric Bayesian model to capture dependencies among multiple neurons by detecting their co-firing (possibly with some lag time) patterns over time. After discretizing time so there is at most one spike at each interval, the resulting sequence of 1's (spike) and 0's (silence) for each neuron is modeled using the logistic function of a continuous latent variable with a Gaussian process prior. For multiple neurons, the corresponding marginal distributions are coupled to their joint probability distribution using a parametric copula model. The advantages of our approach are as follows: the nonparametric component (i.e., the Gaussian process model) provides a flexible framework for modeling the underlying firing rates; the parametric component (i.e., the copula model) allows us to make inference regarding both contemporaneous and lagged relationships among neurons; using the copula model, we construct multivariate probabilistic models by separating the modeling of univariate marginal distributions from the modeling of dependence structure among variables; our method is easy to implement using a computationally efficient sampling algorithm that can be easily extended to high dimensional problems. Using simulated data, we show that our approach could correctly capture temporal dependencies in firing rates and identify synchronous neurons. We also apply our model to spike train data obtained from prefrontal cortical areas in rat's brain.
△ Less
Submitted 3 March, 2014; v1 submitted 25 June, 2013;
originally announced June 2013.
-
Improving sequence-based genotype calls with linkage disequilibrium and pedigree information
Authors:
Baiyu Zhou,
Alice S. Whittemore
Abstract:
Whole and targeted sequencing of human genomes is a promising, increasingly feasible tool for discovering genetic contributions to risk of complex diseases. A key step is calling an individual's genotype from the multiple aligned short read sequences of his DNA, each of which is subject to nucleotide read error. Current methods are designed to call genotypes separately at each locus from the seque…
▽ More
Whole and targeted sequencing of human genomes is a promising, increasingly feasible tool for discovering genetic contributions to risk of complex diseases. A key step is calling an individual's genotype from the multiple aligned short read sequences of his DNA, each of which is subject to nucleotide read error. Current methods are designed to call genotypes separately at each locus from the sequence data of unrelated individuals. Here we propose likelihood-based methods that improve calling accuracy by exploiting two features of sequence data. The first is the linkage disequilibrium (LD) between nearby SNPs. The second is the Mendelian pedigree information available when related individuals are sequenced. In both cases the likelihood involves the probabilities of read variant counts given genotypes, summed over the unobserved genotypes. Parameters governing the prior genotype distribution and the read error rates can be estimated either from the sequence data itself or from external reference data. We use simulations and synthetic read data based on the 1000 Genomes Project to evaluate the performance of the proposed methods. An R-program to apply the methods to small families is freely available at http://med.stanford.edu/epidemiology/PHGC/.
△ Less
Submitted 28 June, 2012;
originally announced June 2012.