-
General Binding Affinity Guidance for Diffusion Models in Structure-Based Drug Design
Authors:
Yue Jian,
Curtis Wu,
Danny Reidenbach,
Aditi S. Krishnapriyan
Abstract:
Structure-Based Drug Design (SBDD) focuses on generating valid ligands that strongly and specifically bind to a designated protein pocket. Several methods use machine learning for SBDD to generate these ligands in 3D space, conditioned on the structure of a desired protein pocket. Recently, diffusion models have shown success here by modeling the underlying distributions of atomic positions and types. While these methods are effective in considering the structural details of the protein pocket, they often fail to explicitly consider the binding affinity. Binding affinity characterizes how tightly the ligand binds to the protein pocket, and is measured by the change in free energy associated with the binding process. It is one of the most crucial metrics for benchmarking the effectiveness of the interaction between a ligand and protein pocket. To address this, we propose BADGER: Binding Affinity Diffusion Guidance with Enhanced Refinement. BADGER is a general guidance method to steer the diffusion sampling process towards improved protein-ligand binding, allowing us to adjust the distribution of the binding affinity between ligands and proteins. Our method is enabled by using a neural network (NN) to model the energy function, which is commonly approximated by AutoDock Vina (ADV). ADV's energy function is non-differentiable, and estimates the affinity based on the interactions between a ligand and target protein receptor. By using an NN as a differentiable energy function proxy, we utilize the gradient of our learned energy function as a guidance method on top of any trained diffusion model. We show that our method improves the binding affinity of generated ligands to their protein receptors by up to 60%, significantly surpassing previous machine learning methods. We also show that our guidance method is flexible and can be easily applied to other diffusion-based SBDD frameworks.
Submitted 24 June, 2024;
originally announced June 2024.
-
Launching Your VR Neuroscience Laboratory
Authors:
Ying Choon Wu,
Christopher Maymon,
Jonathon Paden,
Weichen Liu
Abstract:
The proliferation and refinement of affordable virtual reality (VR) technologies and wearable sensors have opened new frontiers in cognitive and behavioral neuroscience. This chapter offers a broad overview of VR for anyone interested in leveraging it as a research tool. In the first section, it examines the fundamental functionalities of VR and outlines important considerations that inform the development of immersive content that stimulates the senses. In the second section, the focus of the discussion shifts to the implementation of VR in the context of the neuroscience lab. Practical advice is offered on adapting commercial, off-the-shelf devices to specific research purposes. Further, methods are explored for recording, synchronizing, and fusing heterogeneous forms of data obtained through the VR system or add-on sensors, as well as for labeling events and capturing game play.
Submitted 21 May, 2024;
originally announced May 2024.
-
Online Mental Stress Detection Using Frontal-channel EEG Recordings in a Classroom Scenario
Authors:
Chi-Yuan Chang,
Chieh Hsu,
Ying Choon Wu,
Siwen Wang,
Darin Tsui,
Tzyy-Ping Jung
Abstract:
Objective: To investigate the effects of different approaches to EEG preprocessing, channel montage selection, and model architecture on the performance of an online-capable stress detection algorithm in a classroom scenario. Methods: This analysis used EEG data from a longitudinal stress and fatigue study conducted among university students. Their self-reported stress ratings during each class session were the basis for classifying EEG recordings into either normal or elevated stress states. We used a data-processing pipeline that combined Artifact Subspace Reconstruction (ASR) and an Independent Component Analysis (ICA)-based method to achieve online artifact removal. We compared the performance of a Linear Discriminant Analysis (LDA) and a 4-layer neural network as classifiers. We opted for accuracy, balanced accuracy, and F1 score as the metrics for assessing performance. We examined the impact of varying numbers of input channels using different channel montages. Additionally, we explored different window lengths and step sizes during online evaluation. Results: Our online artifact removal method achieved performance comparable to the offline ICA method in both offline and online evaluations. Balanced accuracies of 77% and 78% in an imbalanced binary classification were observed when using the 11-frontal-channel LDA model with the proposed artifact removal method. Moreover, the model performance remained intact when changing the channel montage from 30 full-scalp channels to just 11 frontal channels. During the online evaluation, we achieved the highest balanced accuracy (78%) with a window length of 20 seconds and a step size of 1 second. Significance: This study comprehensively investigates the deployment of stress detection in real-world scenarios. The findings of this study provide insight into the development of daily mental stress monitoring.
Submitted 18 May, 2024;
originally announced May 2024.
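As a rough illustration of two quantities reported in the stress-detection abstract above, the sketch below computes balanced accuracy for a binary problem and lists the start times of sliding windows for a given window length and step size. The function names, the label encoding (1 = elevated stress, 0 = normal), and the use of whole seconds are assumptions made for illustration; none of this is taken from the paper's code.

```python
def balanced_accuracy(y_true, y_pred):
    """Mean of sensitivity and specificity for binary labels (0 or 1)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    pos = sum(1 for t in y_true if t == 1)
    neg = sum(1 for t in y_true if t == 0)
    if pos == 0 or neg == 0:
        raise ValueError("both classes must be present")
    return 0.5 * (tp / pos + tn / neg)


def window_starts(total_s, window_s, step_s):
    """Start times (whole seconds) of all full windows that fit in a recording."""
    if window_s > total_s:
        return []
    n = (total_s - window_s) // step_s + 1
    return [i * step_s for i in range(n)]
```

With eight normal and two elevated labels, a classifier that misses one elevated sample scores 0.9 in plain accuracy but only 0.75 in balanced accuracy, which is why the latter is the more informative figure for imbalanced classes. A hypothetical 60-second recording with a 20-second window and a 1-second step yields 41 windows.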
-
Drug-target interaction prediction by integrating heterogeneous information with mutual attention network
Authors:
Yuanyuan Zhang,
Yingdong Wang,
Chaoyong Wu,
Lingmin Zhana,
Aoyi Wang,
Cai** Cheng,
**zhong Zhao,
Wuxia Zhang,
Jianxin Chen,
Peng Li
Abstract:
Identification of drug-target interactions is an indispensable part of drug discovery. While conventional shallow machine learning and recent deep learning methods based on chemogenomic properties of drugs and target proteins have pushed this prediction performance improvement to a new level, these methods are still difficult to adapt to novel structures. Alternatively, large-scale biological and pharmacological data provide new ways to accelerate drug-target interaction prediction. Here, we propose DrugMAN, a deep learning model for predicting drug-target interaction by integrating multiplex heterogeneous functional networks with a mutual attention network (MAN). DrugMAN uses a graph attention network-based integration algorithm to learn network-specific low-dimensional features for drugs and target proteins by integrating four drug networks and seven gene/protein networks, respectively. DrugMAN then captures interaction information between drug and target representations by a mutual attention network to improve drug-target prediction. DrugMAN achieves the best prediction performance under four different scenarios, especially in real-world scenarios. DrugMAN spotlights heterogeneous information to mine drug-target interactions and can be a powerful tool for drug discovery and drug repurposing.
Submitted 2 April, 2024;
originally announced April 2024.
-
MolTC: Towards Molecular Relational Modeling In Language Models
Authors:
Junfeng Fang,
Shuai Zhang,
Chang Wu,
Zhengyi Yang,
Zhiyuan Liu,
Sihang Li,
Kun Wang,
Wenjie Du,
Xiang Wang
Abstract:
Molecular Relational Learning (MRL), aiming to understand interactions between molecular pairs, plays a pivotal role in advancing biochemical research. Recently, the adoption of large language models (LLMs), known for their vast knowledge repositories and advanced logical inference capabilities, has emerged as a promising way for efficient and effective MRL. Despite their potential, these methods predominantly rely on textual data, thus not fully harnessing the wealth of structural information inherent in molecular graphs. Moreover, the absence of a unified framework exacerbates the issue of information underutilization, as it hinders the sharing of interaction mechanisms learned across diverse datasets. To address these challenges, this work proposes a novel LLM-based multi-modal framework for Molecular inTeraction prediction following Chain-of-Thought (CoT) theory, termed MolTC, which effectively integrates graphical information of two molecules in a pair. To train MolTC efficiently, we introduce a Multi-hierarchical CoT concept to refine its training paradigm, and construct a comprehensive Molecular Interactive Instructions dataset for the development of biochemical LLMs involving MRL. Our experiments, conducted across various datasets involving over 4,000,000 molecular pairs, exhibit the superiority of our method over current GNN and LLM-based baselines. Code is available at https://github.com/MangoKiller/MolTC.
Submitted 10 June, 2024; v1 submitted 6 February, 2024;
originally announced February 2024.
-
Enhancing CT Image synthesis from multi-modal MRI data based on a multi-task neural network framework
Authors:
Zhuoyao Xin,
Christopher Wu,
Dong Liu,
Chunming Gu,
Jia Guo,
Jun Hua
Abstract:
Image segmentation, real-value prediction, and cross-modal translation are critical challenges in medical imaging. In this study, we propose a versatile multi-task neural network framework, based on an enhanced Transformer U-Net architecture, capable of simultaneously, selectively, and adaptively addressing these medical image tasks. Validation is performed on a public repository of human brain MR and CT images. We decompose the traditional problem of synthesizing CT images into distinct subtasks, which include skull segmentation, Hounsfield unit (HU) value prediction, and image sequential reconstruction. To enhance the framework's versatility in handling multi-modal data, we expand the model with multiple image channels. Comparisons between synthesized CT images derived from T1-weighted and T2-Flair images were conducted, evaluating the model's capability to integrate multi-modal information from both morphological and pixel value perspectives.
Submitted 17 December, 2023; v1 submitted 13 December, 2023;
originally announced December 2023.
-
Multi-View Variational Autoencoder for Missing Value Imputation in Untargeted Metabolomics
Authors:
Chen Zhao,
Kuan-Jui Su,
Chong Wu,
Xuewei Cao,
Qiuying Sha,
Wu Li,
Zhe Luo,
Tian Qin,
Chuan Qiu,
Lan Juan Zhao,
Anqi Liu,
Lindong Jiang,
Xiao Zhang,
Hui Shen,
Weihua Zhou,
Hong-Wen Deng
Abstract:
Background: Missing data is a common challenge in mass spectrometry-based metabolomics, which can lead to biased and incomplete analyses. The integration of whole-genome sequencing (WGS) data with metabolomics data has emerged as a promising approach to enhance the accuracy of data imputation in metabolomics studies. Method: In this study, we propose a novel method that leverages the information from WGS data and reference metabolites to impute unknown metabolites. Our approach utilizes a multi-view variational autoencoder to jointly model the burden score, polygenic risk score (PGS), and linkage disequilibrium (LD) pruned single nucleotide polymorphisms (SNPs) for feature extraction and missing metabolomics data imputation. By learning the latent representations of both omics data, our method can effectively impute missing metabolomics values based on genomic information. Results: We evaluate the performance of our method on empirical metabolomics datasets with missing values and demonstrate its superiority compared to conventional imputation techniques. Using burden scores, PGS, and LD-pruned SNPs derived from 35 template metabolites, the proposed method achieved R^2-scores > 0.01 for 71.55% of metabolites. Conclusion: The integration of WGS data in metabolomics imputation not only improves data completeness but also enhances downstream analyses, paving the way for more comprehensive and accurate investigations of metabolic pathways and disease associations. Our findings offer valuable insights into the potential benefits of utilizing WGS data for metabolomics data imputation and underscore the importance of leveraging multi-modal data integration in precision medicine research.
Submitted 12 March, 2024; v1 submitted 11 October, 2023;
originally announced October 2023.
-
SARS-CoV-2 Wastewater Genomic Surveillance: Approaches, Challenges, and Opportunities
Authors:
Viorel Munteanu,
Michael Saldana,
Dumitru Ciorba,
Viorel Bostan,
Justin Maine Su,
Nadiia Kasianchuk,
Nitesh Kumar Sharma,
Sergey Knyazev,
Victor Gordeev,
Eva Aßmann,
Andrei Lobiuc,
Mihai Covasa,
Keith A. Crandall,
Wenhao O. Ouyang,
Nicholas C. Wu,
Christopher Mason,
Braden T Tierney,
Alexander G Lucaci,
Alex Zelikovsky,
Fatemeh Mohebbi,
Pavel Skums,
Cynthia Gibas,
Jessica Schlueter,
Piotr Rzymski,
Helena Solo-Gabriele
, et al. (3 additional authors not shown)
Abstract:
During the SARS-CoV-2 pandemic, wastewater-based genomic surveillance (WWGS) emerged as an efficient viral surveillance tool that takes into account asymptomatic cases, can identify known and novel mutations, and offers the opportunity to assign known virus lineages based on the detected mutation profiles. WWGS can also hint towards novel or cryptic lineages, but it is difficult to clearly identify and define novel lineages from wastewater (WW) alone. While WWGS has significant advantages in monitoring SARS-CoV-2 viral spread, technical challenges remain, including poor sequencing coverage and quality due to viral RNA degradation. As a result, the viral RNAs in wastewater have low concentrations and are often fragmented, making sequencing difficult. WWGS analysis requires advanced computational tools that are yet to be developed and benchmarked. The existing bioinformatics tools used to analyze wastewater sequencing data are often based on previously developed methods for quantifying the expression of transcripts or viral diversity. Those methods were not developed for wastewater sequencing data specifically, and are not optimized to address unique challenges associated with wastewater. While specialized tools for analysis of wastewater sequencing data have also been developed recently, it remains to be seen how they will perform given the ongoing evolution of SARS-CoV-2 and the decline in testing and patient-based genomic surveillance. Here, we discuss opportunities and challenges associated with WWGS, including sample preparation, sequencing technology, and bioinformatics methods.
Submitted 30 January, 2024; v1 submitted 23 September, 2023;
originally announced September 2023.
-
BioThings Explorer: a query engine for a federated knowledge graph of biomedical APIs
Authors:
Jackson Callaghan,
Colleen H. Xu,
Jiwen Xin,
Marco Alvarado Cano,
Anders Riutta,
Eric Zhou,
Rohan Juneja,
Yao Yao,
Madhumita Narayan,
Kristina Hanspers,
Ayushi Agrawal,
Alexander R. Pico,
Chunlei Wu,
Andrew I. Su
Abstract:
Knowledge graphs are an increasingly common data structure for representing biomedical information. These knowledge graphs can easily represent heterogeneous types of information, and many algorithms and tools exist for querying and analyzing graphs. Biomedical knowledge graphs have been used in a variety of applications, including drug repurposing, identification of drug targets, prediction of drug side effects, and clinical decision support. Typically, knowledge graphs are constructed by centralization and integration of data from multiple disparate sources. Here, we describe BioThings Explorer, an application that can query a virtual, federated knowledge graph derived from the aggregated information in a network of biomedical web services. BioThings Explorer leverages semantically precise annotations of the inputs and outputs for each resource, and automates the chaining of web service calls to execute multi-step graph queries. Because there is no large, centralized knowledge graph to maintain, BioThings Explorer is distributed as a lightweight application that dynamically retrieves information at query time. More information can be found at https://explorer.biothings.io, and code is available at https://github.com/biothings/biothings_explorer.
Submitted 18 April, 2023;
originally announced April 2023.
-
Resistance Management for Cancer: Lessons from Farmers
Authors:
Sareh Seyedi,
Valerie K. Harris,
Stefania E. Kapsetaki,
Daniel Saha,
Zachary Compton,
Rezvan Yousefi,
Alexander May,
Efe Fakir,
Amy M. Boddy,
Marco Gerlinger,
Christina Wu,
Lida Mina,
Silvie Huijben,
Dawn H. Gouge,
Luis Cisneros,
Peter C. Ellsworth,
Carlo C. Maley
Abstract:
One of the main reasons we have not been able to cure cancers is that drugs select for drug-resistant cancer cells. Pest managers face similar challenges with pesticides selecting for pesticide-resistant organisms. Lessons in pest management have led to four heuristics that can be translated to controlling cancers: 1. limit use (of chemical controls or modes of action to the lowest practical levels); 2. diversify use (of modes of action largely through rotations of chemical controls); 3. partition chemistry (modes of action through space and time, which in effect is a refuge management strategy); and 4. include non-chemical methods. These principles are general to all cancers and cancer drugs, and thus should be employed to improve oncology. We review the parallel difficulties in controlling the evolution of drug resistance in pests and cancer cells, and describe the results of single- and multi-drug strategies in agriculture and oncology. We dissect the methods that pest managers use to prevent the evolution of pesticide resistance, showing how integrated pest management inspired the development of adaptive therapy in oncology to stabilize tumor size, and increase progression-free survival and quality of life of patients. Finally, we demonstrate these principles in a proposal for clinical trials in colorectal cancer.
Submitted 1 April, 2023;
originally announced April 2023.
-
Using birth-death processes to infer tumor subpopulation structure from live-cell imaging drug screening data
Authors:
C. Wu,
E. B. Gunnarsson,
E. M. Myklebust,
A. Köhn-Luque,
D. S. Tadele,
J. M. Enserink,
A. Frigessi,
J. Foo,
K. Leder
Abstract:
Tumor heterogeneity is a complex and widely recognized trait that poses significant challenges in developing effective cancer therapies. In particular, many tumors harbor a variety of subpopulations with distinct therapeutic response characteristics. Characterizing this heterogeneity by determining the subpopulation structure within a tumor enables more precise and successful treatment strategies. In our prior work, we developed PhenoPop, a computational framework for unravelling the drug-response subpopulation structure within a tumor from bulk high-throughput drug screening data. However, the deterministic nature of the underlying models driving PhenoPop restricts the model fit and the information it can extract from the data. As an advancement, we propose a stochastic model based on the linear birth-death process to address this limitation. Our model can formulate a dynamic variance along the horizon of the experiment so that the model uses more information from the data to provide a more robust estimation. In addition, the newly proposed model can be readily adapted to situations where the experimental data exhibits a positive time correlation. We test our model on simulated data (in silico) and experimental data (in vitro), which supports our argument about its advantages.
Submitted 13 June, 2023; v1 submitted 14 March, 2023;
originally announced March 2023.
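To make the "dynamic variance" idea in the abstract above concrete, the following sketch evaluates the standard closed-form mean and variance of a single-population linear birth-death process. It is a textbook result, not the paper's model: the function name, the parameter names, and the assumption of constant per-cell birth and death rates are all illustrative choices.

```python
import math


def birth_death_moments(n0, birth, death, t):
    """Mean and variance of a linear birth-death process at time t.

    n0: initial cell count; birth, death: per-cell rates (same time unit as t).
    """
    r = birth - death  # net growth rate
    mean = n0 * math.exp(r * t)
    if math.isclose(birth, death):
        var = 2.0 * n0 * birth * t  # critical case: the limit as r -> 0
    else:
        growth = math.exp(r * t)
        var = n0 * (birth + death) / r * growth * (growth - 1.0)
    return mean, var
```

A deterministic growth model would reproduce only the mean; the variance term starts at zero and changes over the course of the experiment, which is the extra information a stochastic model can draw on when fitting cell-count data.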
-
Deep Learning Enables Reduced Gadolinium Dose for Contrast-Enhanced Blood-Brain Barrier Opening
Authors:
P. Lee,
H. Wei,
A. N. Pouliopoulos,
B. T. Forsyth,
Y. Yang,
C. Zhang,
A. F. Laine,
E. E. Konofagou,
C. Wu,
J. Guo
Abstract:
Focused ultrasound (FUS) can be used to open the blood-brain barrier (BBB), and MRI with contrast agents can detect that opening. However, repeated use of gadolinium-based contrast agents (GBCAs) presents safety concerns to patients. This study is the first to propose the idea of modeling a volume transfer constant (Ktrans) through deep learning to reduce the dosage of contrast agents. The goal of the study is not only to reconstruct artificial intelligence (AI) derived Ktrans images but also to enhance the intensity with low-dosage contrast agent T1-weighted MRI scans. We successfully validated this idea through a previous state-of-the-art temporal network algorithm, which focused on extracting time domain features at the voxel level. Then we used a Spatiotemporal Network (ST-Net), composed of a spatiotemporal convolutional neural network (CNN)-based deep learning architecture with the addition of a three-dimensional CNN encoder, to improve the model performance. We tested the ST-Net model on ten datasets of FUS-induced BBB-openings acquired from different sides of the mouse brain. ST-Net successfully detected and enhanced BBB-opening signals without sacrificing spatial domain information. ST-Net was shown to be a promising method of reducing the need for contrast agents for modeling BBB-opening Ktrans maps from time-series Dynamic Contrast-Enhanced Magnetic Resonance Imaging (DCE-MRI) scans.
Submitted 17 January, 2023;
originally announced January 2023.
-
Interpretable estimation of the risk of heart failure hospitalization from a 30-second electrocardiogram
Authors:
Sergio González,
Wan-Ting Hsieh,
Davide Burba,
Trista Pei-Chun Chen,
Chun-Li Wang,
Victor Chien-Chia Wu,
Shang-Hung Chang
Abstract:
Survival modeling in healthcare relies on explainable statistical models; yet, their underlying assumptions are often simplistic and, thus, unrealistic. Machine learning models can estimate more complex relationships and lead to more accurate predictions, but are non-interpretable. This study shows it is possible to estimate hospitalization for congestive heart failure from a 30-second single-lead electrocardiogram signal. Using a machine learning approach not only results in greater predictive power but also provides clinically meaningful interpretations. We train an eXtreme Gradient Boosting accelerated failure time model and exploit SHapley Additive exPlanations values to explain the effect of each feature on predictions. Our model achieved a concordance index of 0.828 and an area under the curve of 0.853 at one year and 0.858 at two years on a held-out test set of 6,573 patients. These results show that a rapid test based on an electrocardiogram could be crucial in targeting and treating high-risk individuals.
Submitted 4 November, 2022; v1 submitted 1 November, 2022;
originally announced November 2022.
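The concordance index quoted in the abstract above measures how often a model orders pairs of patients correctly. The sketch below is a simplified version of Harrell's C for a model that outputs predicted survival times, as an accelerated failure time model does. The function and argument names are hypothetical, pairs with tied observed times are ignored for brevity, and the paper's exact evaluation procedure is not specified in the abstract.

```python
def concordance_index(times, events, predicted_times):
    """Harrell's C for predictions where a longer predicted time means lower risk.

    times: observed follow-up times; events: 1 if the event was observed,
    0 if censored; predicted_times: model output, one per subject.
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if not events[i]:
            continue  # a pair is anchored on a subject with an observed event
        for j in range(n):
            if times[i] < times[j]:  # i had the event before j was last observed
                comparable += 1
                if predicted_times[i] < predicted_times[j]:
                    concordant += 1.0
                elif predicted_times[i] == predicted_times[j]:
                    concordant += 0.5  # ties in prediction count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable
```

A value of 0.5 corresponds to random ordering and 1.0 to perfect ordering. For four made-up subjects with times [2, 4, 6, 8], events [1, 1, 0, 1], and predictions [1, 3, 2, 4], five pairs are comparable and four are ordered correctly, giving 0.8.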
-
A knowledge graph representation learning approach to predict novel kinase-substrate interactions
Authors:
Sachin Gavali,
Karen Ross,
Chuming Chen,
Julie Cowart,
Cathy H. Wu
Abstract:
The human proteome contains a vast network of interacting kinases and substrates. Even though some kinases have proven to be immensely useful as therapeutic targets, a majority are still understudied. In this work, we present a novel knowledge graph representation learning approach to predict novel interaction partners for understudied kinases. Our approach uses a phosphoproteomic knowledge graph constructed by integrating data from iPTMnet, Protein Ontology, Gene Ontology and BioKG. The representations of kinases and substrates in this knowledge graph are learned by performing directed random walks on triples coupled with a modified SkipGram or CBOW model. These representations are then used as an input to a supervised classification model to predict novel interactions for understudied kinases. We also present a post-predictive analysis of the predicted interactions and an ablation study of the phosphoproteomic knowledge graph to gain an insight into the biology of the understudied kinases.
Submitted 9 June, 2022; v1 submitted 5 June, 2022;
originally announced June 2022.
-
A 1D-0D-3D coupled model for simulating blood flow and transport processes in breast tissue
Authors:
Marvin Fritz,
Tobias Köppl,
J. Tinsley Oden,
Andreas Wagner,
Barbara Wohlmuth,
Chengyue Wu
Abstract:
In this work, we present mixed dimensional models for simulating blood flow and transport processes in breast tissue and the vascular tree supplying it. These processes are considered from the aortic inlet to the capillaries and tissue of the breast. Large variations in biophysical properties and flow conditions exist in this system, necessitating the use of different flow models for different geometries and flow regimes. In total, we consider four different model types. First, a system of 1D nonlinear hyperbolic PDEs is considered to simulate blood flow in larger arteries with highly elastic vessel walls. Second, we assign 1D linearized hyperbolic PDEs to model the smaller arteries with stiffer vessel walls. The third model type consists of ODE systems (0D models). It is used to model the arterioles and peripheral circulation. Finally, homogenized 3D porous media models are considered to simulate flow and transport in capillaries and tissue within the breast volume. Sink terms are used to account for the influence of the venous and lymphatic systems. Combining the four model types, we obtain two different 1D-0D-3D coupled models for simulating blood flow and transport processes: The first 1D-0D-3D model covers the whole path from the aorta to the breast, while the second model is a sub-model obtained by restriction to breast vasculature and tissue, making possible a significant reduction in computational cost. Several numerical experiments are conducted that demonstrate realistic flow simulations compared to existing data on blood flow in human breast and vascular system.
Submitted 14 January, 2022;
originally announced January 2022.
-
Invariant in variants
Authors:
Cong Liu,
Chen-Wu Wu
Abstract:
The coronavirus Covid-19 mutates quickly during the pandemic, leaving people struggling to verify and improve the effectiveness of vaccines based on biochemistry. Is there any physical invariant in the variants of this kind of pathogen that could be taken advantage of to ease the tensions? To this end, extensive numerical experiments based on continuity mechanics were carried out to discover the vibration modes and the range of natural frequencies of coronavirus Covid-19. Such an invariant could help us in developing flexible techniques to deactivate the coronavirus, such as resonantly breaking the viral spike with ultrasound waves. The fundamental mechanisms governing this process are demonstrated by solving the coupled equations of acoustics and dynamics, and technical strategies are then proposed to realize the concept efficiently.
Submitted 25 November, 2021;
originally announced November 2021.
-
Visual Search Asymmetry: Deep Nets and Humans Share Similar Inherent Biases
Authors:
Shashi Kant Gupta,
Mengmi Zhang,
Chia-Chien Wu,
Jeremy M. Wolfe,
Gabriel Kreiman
Abstract:
Visual search is a ubiquitous and often challenging daily task, exemplified by looking for the car keys at home or a friend in a crowd. An intriguing property of some classical search tasks is an asymmetry such that finding a target A among distractors B can be easier than finding B among A. To elucidate the mechanisms responsible for asymmetry in visual search, we propose a computational model that takes a target and a search image as inputs and produces a sequence of eye movements until the target is found. The model integrates eccentricity-dependent visual recognition with target-dependent top-down cues. We compared the model against human behavior in six paradigmatic search tasks that show asymmetry in humans. Without prior exposure to the stimuli or task-specific training, the model provides a plausible mechanism for search asymmetry. We hypothesized that the polarity of search asymmetry arises from experience with the natural environment. We tested this hypothesis by training the model on augmented versions of ImageNet where the biases of natural images were either removed or reversed. The polarity of search asymmetry disappeared or was altered depending on the training protocol. This study highlights how classical perceptual properties can emerge in neural network models, without the need for task-specific training, but rather as a consequence of the statistical properties of the developmental diet fed to the model. All source code and data are publicly available at https://github.com/kreimanlab/VisualSearchAsymmetry.
Submitted 6 November, 2021; v1 submitted 5 June, 2021;
originally announced June 2021.
-
Verifying Design through Generative Visualization of Neural Activities
Authors:
Pan Wang,
Danlin Peng,
Simiao Yu,
Chao Wu,
Peter Childs,
Yike Guo,
Ling Li
Abstract:
Current neuroscience-focused approaches for evaluating the effectiveness of a design do not use direct visualisation of mental activity. A recurrent neural network is used as the encoder to learn a latent representation from electroencephalogram (EEG) signals, recorded while subjects looked at 50 categories of images. A generative adversarial network (GAN) conditioned on the EEG latent representation is trained to reconstruct these images. After training, the neural network is able to reconstruct images from brain activity recordings. To demonstrate the proposed method in the context of the mental association with a design, we performed a study that indicates an iconic design image could inspire the subject to create cognitive associations with branding and valued products. The proposed method has potential for verifying designs by visualizing the cognitive understanding of underlying brain activity.
Submitted 28 March, 2021;
originally announced March 2021.
-
Deep Learning-based Automated Aortic Area and Distensibility Assessment: The Multi-Ethnic Study of Atherosclerosis (MESA)
Authors:
Vivek P. Jani,
Nadjia Kachenoura,
Alban Redheuil,
Gisela Teixido-Tura,
Kevin Bouaou,
Emilie Bollache,
Elie Mousseaux,
Alain De Cesare,
Shelby Kutty,
Colin O. Wu,
David A. Bluemke,
Joao A. C. Lima,
Bharath Ambale-Venkatesh
Abstract:
This study applies convolutional neural network (CNN)-based automatic segmentation and distensibility measurement of the ascending and descending aorta from 2D phase-contrast cine magnetic resonance imaging (PC-cine MRI) within the large MESA cohort with subsequent assessment on an external cohort of thoracic aortic aneurysm (TAA) patients. 2D PC-cine MRI images of the ascending and descending aorta at the pulmonary artery bifurcation from the MESA study were included. Train, validation, and internal test sets consisted of 1123 studies (24282 images), 374 studies (8067 images), and 375 studies (8069 images), respectively. An external test set of TAAs consisted of 37 studies (3224 images). A U-Net based CNN was constructed, and performance was evaluated utilizing dice coefficient (for segmentation) and concordance correlation coefficients (CCC) of aortic geometric parameters by comparing to manual segmentation and parameter estimation. Dice coefficients for aorta segmentation were 97.6% (CI: 97.5%-97.6%) and 93.6% (84.6%-96.7%) on the internal and external test of TAAs, respectively. CCC for comparison of manual and CNN maximum and minimum ascending aortic areas were 0.97 and 0.95, respectively, on the internal test set and 0.997 and 0.995, respectively, for the external test. CCCs for maximum and minimum descending aortic areas were 0.96 and 0.98, respectively, on the internal test set and 0.93 and 0.93, respectively, on the external test set. We successfully developed and validated a U-Net based ascending and descending aortic segmentation and distensibility quantification model in a large multi-ethnic database and in an external cohort of TAA patients.
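For readers unfamiliar with the segmentation metric named in this abstract, the following is a minimal sketch of the standard Dice coefficient, 2|A ∩ B| / (|A| + |B|), for two binary masks. It is not the authors' evaluation code: the function name `dice_coefficient`, the flat 0/1 list representation, and the toy masks are all assumptions made for illustration.

```python
def dice_coefficient(pred, truth):
    """Dice = 2|A n B| / (|A| + |B|) for two binary masks given as flat 0/1 sequences."""
    if len(pred) != len(truth):
        raise ValueError("masks must have the same number of pixels")
    # Pixels labelled foreground in both masks.
    overlap = sum(1 for p, t in zip(pred, truth) if p and t)
    # Total foreground pixels across the two masks.
    total = sum(1 for p in pred if p) + sum(1 for t in truth if t)
    # Convention assumed here: two empty masks count as perfect agreement.
    return 1.0 if total == 0 else 2.0 * overlap / total


# Hypothetical toy masks (not data from the study).
manual = [0, 1, 1, 1, 0, 0]
cnn = [0, 1, 1, 0, 0, 0]
print(dice_coefficient(cnn, manual))  # 2*2 / (2+3) = 0.8
```

A value of 1 means the two masks coincide and 0 means they share no foreground pixels, so the reported 97.6% corresponds to near-complete overlap with the manual segmentation.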
Submitted 3 March, 2021;
originally announced March 2021.
-
Dynamics of B-cell repertoires and emergence of cross-reactive responses in COVID-19 patients with different disease severity
Authors:
Zachary Montague,
Huibin Lv,
Jakub Otwinowski,
William S. DeWitt,
Giulio Isacchini,
Garrick K. Yip,
Wilson W. Ng,
Owen Tak-Yin Tsang,
Meng Yuan,
Hejun Liu,
Ian A. Wilson,
J. S. Malik Peiris,
Nicholas C. Wu,
Armita Nourmohammad,
Chris Ka Pun Mok
Abstract:
COVID-19 patients show varying severity of the disease ranging from asymptomatic to requiring intensive care. Although a number of SARS-CoV-2 specific monoclonal antibodies have been identified, we still lack an understanding of the overall landscape of B-cell receptor (BCR) repertoires in COVID-19 patients. Here, we used high-throughput sequencing of bulk and plasma B-cells collected over multiple time points during infection to characterize signatures of B-cell response to SARS-CoV-2 in 19 patients. Using principled statistical approaches, we determined differential features of BCRs associated with different disease severity. We identified 38 significantly expanded clonal lineages shared among patients as candidates for specific responses to SARS-CoV-2. Using single-cell sequencing, we verified reactivity of BCRs shared among individuals to SARS-CoV-2 epitopes. Moreover, we identified natural emergence of a BCR with cross-reactivity to SARS-CoV-1 and SARS-CoV-2 in a number of patients. Our results provide important insights for development of rational therapies and vaccines against COVID-19.
Submitted 5 April, 2021; v1 submitted 13 July, 2020;
originally announced July 2020.
-
Molcontroller: a VMD Graphical User Interface for Manipulating Molecules
Authors:
ChenChen Wu,
Shengtang Liu,
Shitong Zhang,
Zaixing Yang
Abstract:
Visual Molecular Dynamics (VMD) is one of the most widely used molecular graphics software packages in the theoretical simulation community. So far, however, it still lacks a graphical user interface (GUI) for molecular manipulations in modeling tasks. For instance, translation or rotation of selected molecule(s) or part(s) of a molecule can currently be achieved only with Tcl scripts. Here, we use Tcl scripts to develop a user-friendly GUI for VMD, named Molcontroller, which allows users to quickly and conveniently perform various molecular manipulations. This GUI might be helpful for improving the modeling efficiency of VMD users.
Submitted 2 July, 2020;
originally announced July 2020.
-
The architecture of co-culture spheroids regulates tumor invasion within a 3D extracellular matrix
Authors:
Yu Ling Huang,
Carina Shiau,
Cindy Wu,
Jeffrey E. Segall,
Mingming Wu
Abstract:
Tumor invasion, the process by which tumor cells break away from their primary tumor and gain access to vascular systems, is an important step in cancer metastasis. Most current 3D tumor invasion assays consisted of single tumor cells embedded within an extracellular matrix (ECM). These assays taught us much of what we know today on how key biophysical (e.g. ECM stiffness) and biochemical (e.g. cytokine gradients) parameters within the tumor microenvironment guided and regulated tumor invasion. One limitation of the single tumor cell invasion assay was that it did not account for cell-cell adhesion within the tumor. In this article, we developed a micrometer scale 3D co-culture spheroid invasion assay that was compatible with microscopic imaging. Micrometer scale co-culture spheroids (1:1 ratio of metastatic breast cancer MDA-MB-231 and non-tumorigenic epithelial MCF-10A cells) were made using an array of microwells, and then were embedded within a collagen matrix in a microfluidic platform. Real time imaging of tumor spheroid invasion revealed that the spatial distribution of the two cell types within the tumor spheroid critically regulated tumor invasion. This work linked tumor architecture with tumor invasion and highlighted the importance of the biophysical cues within the bulk of the tumor in tumor invasion.
Submitted 9 February, 2020;
originally announced February 2020.
-
Substituting Gadolinium in Brain MRI Using DeepContrast
Authors:
Haoran Sun,
Xueqing Liu,
Xinyang Feng,
Chen Liu,
Nanyan Zhu,
Sabrina J. Gjerswold-Selleck,
Hong-Jian Wei,
Pavan S. Upadhyayula,
Angeliki Mela,
Cheng-Chia Wu,
Peter D. Canoll,
Andrew F. Laine,
J. Thomas Vaughan,
Scott A. Small,
Jia Guo
Abstract:
Cerebral blood volume (CBV) is a hemodynamic correlate of oxygen metabolism and reflects brain activity and function. High-resolution CBV maps can be generated using the steady-state gadolinium-enhanced MRI technique. Such a technique requires an intravenous injection of exogenous gadolinium-based contrast agent (GBCA) and recent studies suggest that the GBCA can accumulate in the brain after frequent use. We hypothesize that endogenous sources of contrast might exist within the most conventional and commonly acquired structural MRI, potentially obviating the need for exogenous contrast. Here, we test this hypothesis by developing and optimizing a deep learning algorithm, which we call DeepContrast, in mice. We find that DeepContrast performs equally well as exogenous GBCA in mapping CBV of the normal brain tissue and enhancing glioblastoma. Together, these studies validate our hypothesis that a deep learning approach can potentially replace the need for GBCAs in brain MRI.
Submitted 15 January, 2020;
originally announced January 2020.
-
Dynamic Prediction of Competing Risk Events using Landmark Sub-distribution Hazard Model with Multiple Longitudinal Biomarker
Authors:
Cai Wu,
Liang Li,
Ruosha Li
Abstract:
The cause-specific cumulative incidence function (CIF) quantifies the subject-specific disease risk with competing risk outcome. With longitudinally collected biomarker data, it is of interest to dynamically update the predicted CIF by incorporating the most recent biomarker as well as the cumulating longitudinal history. Motivated by a longitudinal cohort study of chronic kidney disease, we propose a framework for dynamic prediction of end stage renal disease using multivariate longitudinal biomarkers, accounting for the competing risk of death. The proposed framework extends the landmark survival modeling to competing risks data, and implies that a distinct sub-distribution hazard regression model is defined at each landmark time. The model parameters, prediction horizon, longitudinal history and at-risk population are allowed to vary over the landmark time. When the measurement times of biomarkers are irregularly spaced, the predictor variable may not be observed at the time of prediction. Local polynomial is used to estimate the model parameters without explicitly imputing the predictor or modeling its longitudinal trajectory. The proposed model leads to simple interpretation of the regression coefficients and closed-form calculation of the predicted CIF. The estimation and prediction can be implemented through standard statistical software with tractable computation. We conducted simulations to evaluate the performance of the estimation procedure and predictive accuracy. The methodology is illustrated with data from the African American Study of Kidney Disease and Hypertension.
Submitted 21 May, 2019;
originally announced June 2019.
-
Multi-directional dynamic model for traumatic brain injury detection
Authors:
Kaveh Laksari,
Michael Fanton,
Lyndia C. Wu,
Taylor H. Nguyen,
Mehmet Kurt,
Chiara Giordano,
Eoin Kelly,
Eoin O'Keeffe,
Eugene Wallace,
Colin Doherty,
Matthew Campbell,
Stephen Tiernan,
Gerald Grant,
Jesse Ruan,
Saeed Barbat,
David B. Camarillo
Abstract:
Traumatic brain injury (TBI) is a complex injury that is hard to predict and diagnose, with many studies focused on associating head kinematics to brain injury risk. Recently, there has been a push towards using computationally expensive finite element (FE) models of the brain to create tissue deformation metrics of brain injury. Here, we developed a 3 degree-of-freedom lumped-parameter brain model, built based on the measured natural frequencies of a FE brain model simulated with live human impact data, to be used to rapidly estimate peak brain strains experienced during head rotational accelerations. On our dataset, the simplified model correlates with peak principal FE strain with an R^2 of 0.80. Further, coronal and axial model displacement correlated with fiber-oriented peak strain in the corpus callosum with an R^2 of 0.77. Using the maximum displacement predicted by our brain model, we propose an injury criterion and compare it against a number of existing rotational and translational kinematic injury metrics on a dataset of head kinematics from 27 clinically diagnosed injuries and 887 non-injuries. We found that our proposed metric performed comparably to peak angular acceleration, linear acceleration, and angular velocity in classifying injury and non-injury events. Metrics which separated time traces into their directional components had improved deviance compared to those which combined components into a single time trace magnitude. Our brain model can be used in future work as a computationally efficient alternative to FE models for classifying injuries over a wide range of loading conditions.
Submitted 2 April, 2019; v1 submitted 18 December, 2018;
originally announced December 2018.
-
The unified maximum a posteriori (MAP) framework for neuronal system identification
Authors:
Michael C. -K. Wu,
Fatma Deniz,
Ryan J. Prenger,
Jack L. Gallant
Abstract:
The functional relationship between an input and a sensory neuron's response can be described by the neuron's stimulus-response mapping function. A general approach for characterizing the stimulus-response mapping function is called system identification. Many different names have been used for the stimulus-response mapping function: kernel or transfer function, transducer, spatiotemporal receptive field. Many algorithms have been developed to estimate a neuron's mapping function from an ensemble of stimulus-response pairs. These include the spike-triggered average, normalized reverse correlation, linearized reverse correlation, ridge regression, local spectral reverse correlation, spike-triggered covariance, artificial neural networks, maximally informative dimensions, kernel regression, boosting, and models based on leaky integrate-and-fire neurons. Because many of these system identification algorithms were developed in other disciplines, they seem very different superficially and bear little relationship with each other. Each algorithm makes different assumptions about the neuron and how the data is generated. Without a unified framework it is difficult to select the most suitable algorithm for estimating the neuron's mapping function. In this review, we present a unified framework for describing these algorithms called maximum a posteriori estimation (MAP). In the MAP framework, the implicit assumptions built into any system identification algorithm are made explicit in three MAP constituents: model class, noise distributions, and priors. Understanding the interplay between these three MAP constituents will simplify the task of selecting the most appropriate algorithms for a given data set. The MAP framework can also facilitate the development of novel system identification algorithms by incorporating biophysically plausible assumptions and mechanisms into the MAP constituents.
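As a small illustration of how the three MAP constituents determine an estimator, consider the textbook case of a linear model class, Gaussian noise, and a zero-mean Gaussian prior, for which the MAP estimate reduces to ridge regression. The sketch below uses a single scalar weight to keep the algebra visible. It is not taken from the review: the function name `map_linear_weight`, its parameters, and the toy data are assumptions for illustration only.

```python
def map_linear_weight(stimuli, responses, noise_var, prior_var):
    """MAP estimate of w in r = w * s + noise.

    Assumed constituents (illustrative, not from the review):
      model class : linear, one scalar weight w
      noise       : Gaussian with variance noise_var
      prior       : w ~ Normal(0, prior_var)
    Minimising the negative log posterior
      sum((r - w*s)^2) / (2*noise_var) + w^2 / (2*prior_var)
    gives the ridge-regression solution below.
    """
    sxy = sum(s * r for s, r in zip(stimuli, responses))
    sxx = sum(s * s for s in stimuli)
    ridge = noise_var / prior_var  # penalty strength implied by noise and prior
    return sxy / (sxx + ridge)


# Hypothetical stimulus-response pairs generated by w = 2 without noise.
stimuli = [1.0, 2.0, 3.0]
responses = [2.0, 4.0, 6.0]
w_map = map_linear_weight(stimuli, responses, noise_var=1.0, prior_var=1.0)
print(w_map)  # 28 / (14 + 1), shrunk towards 0 relative to the least-squares value 2
```

Widening the prior (larger `prior_var`) shrinks the penalty towards zero and recovers the ordinary least-squares estimate, which shows how changing one constituent changes the algorithm.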
Submitted 6 November, 2018; v1 submitted 31 October, 2018;
originally announced November 2018.
-
Mixed Neural Network Approach for Temporal Sleep Stage Classification
Authors:
Hao Dong,
Akara Supratak,
Wei Pan,
Chao Wu,
Paul M. Matthews,
Yike Guo
Abstract:
This paper proposes a practical approach to addressing limitations posed by use of single active electrodes in applications for sleep stage classification. Electroencephalography (EEG)-based characterizations of sleep stage progression contribute to the diagnosis and monitoring of the many pathologies of sleep. Several prior reports have explored ways of automating the analysis of sleep EEG and of reducing the complexity of the data needed for reliable discrimination of sleep stages in order to make it possible to perform sleep studies at lower cost in the home (rather than only in specialized clinical facilities). However, these reports have involved recordings from electrodes placed on the cranial vertex or occiput, which can be uncomfortable or difficult for subjects to position. Those that have utilized single EEG channels, which contain less sleep information, have shown poor classification performance. We have taken advantage of a Rectifier Neural Network for feature detection and a Long Short-Term Memory (LSTM) network for sequential data learning to optimize classification performance with single electrode recordings. After exploring alternative electrode placements, we found a comfortable configuration of a single-channel EEG on the forehead and have shown that it can be integrated with additional electrodes for simultaneous recording of the electrooculogram (EOG). Evaluation of data from 62 people (with 494 hours of sleep) demonstrated better performance of our analytical algorithm for automated sleep classification than existing approaches using vertex or occipital electrode placements. Use of this recording configuration with neural network deconvolution promises to make clinically indicated home sleep studies practical.
Submitted 3 August, 2017; v1 submitted 15 October, 2016;
originally announced October 2016.
-
SCOTTI: Efficient Reconstruction of Transmission within Outbreaks with the Structured Coalescent
Authors:
Nicola De Maio,
Chieh-Hsi Wu,
Daniel J Wilson
Abstract:
Exploiting pathogen genomes to reconstruct transmission represents a powerful tool in the fight against infectious disease. However, their interpretation rests on a number of simplifying assumptions that regularly ignore important complexities of real data, in particular within-host evolution and non-sampled patients.
Here we propose a new approach to transmission inference called SCOTTI (Structured COalescent Transmission Tree Inference). This method is based on a statistical framework that models each host as a distinct population, and transmissions between hosts as migration events. Our computationally efficient implementation of this model enables the inference of host-to-host transmission while accommodating within-host evolution and non-sampled hosts. SCOTTI is distributed as an open source package for the phylogenetic software BEAST2.
We show that SCOTTI can generally infer transmission events even in the presence of considerable within-host variation, can account for the uncertainty associated with the possible presence of non-sampled hosts, and can efficiently use data from multiple samples of the same host, although there is some reduction in accuracy when samples are collected very close to the infection time.
We illustrate the features of our approach by investigating transmission from genetic and epidemiological data in a Foot and Mouth Disease Virus (FMDV) veterinary outbreak in England and a Klebsiella pneumoniae outbreak in a Nepali neonatal unit. Transmission histories inferred with SCOTTI will be important in devising effective measures to prevent and halt transmission.
Submitted 7 March, 2016;
originally announced March 2016.
-
Conversion of the chemical concentration of odorous mixtures into odour concentration and odour intensity: a comparison of methods
Authors:
C. Wu,
J. Liu,
P. Zhao,
M. Piringer,
G. Schauberger
Abstract:
Continuous odour measurements both of emissions as well as ambient concentrations are seldom realised, mainly because of their high costs. They are therefore often substituted by concentration measurements of odorous substances. Then a conversion of the chemical concentrations C (mg m^-3) into odour concentrations C_OD (ou_E m^-3) and odour intensities OI is necessary. Four methods to convert the concentrations of single substances to the odour concentrations and odour intensities of an odorous mixture are investigated: (1) direct use of measured concentrations, (2) the sum of the odour activity value SOAV, (3) the sum of the odour intensities SOI, and (4) the equivalent odour concentration EOC, as a new method. The methods are evaluated with olfactometric measurements of seven substances as well as their mixtures. The results indicate that the SOI and EOC conversion methods deliver reliable values. These methods use not only the odour threshold concentration but also the slope of the Weber-Fechner law to include the sensitivity of the odour perception of the individual substances. They fulfil the criteria of an objective conversion without the need of a further calibration by additional olfactometric measurements.
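To make method (2) concrete, here is a minimal sketch of the sum of odour activity values, using the usual definition of the odour activity value of a substance as its chemical concentration divided by its odour threshold concentration. The abstract does not give formulas, so this follows the common definition rather than the paper's exact procedure. The function name `sum_odour_activity_value`, the substance labels, and all numbers are hypothetical and are not measurements from the study.

```python
def sum_odour_activity_value(concentrations, thresholds):
    """SOAV = sum over substances of C_i / C_OT,i.

    concentrations : dict of substance -> chemical concentration (mg m^-3)
    thresholds     : dict of substance -> odour threshold concentration (mg m^-3)
    Both dictionaries must use the same unit so that each ratio is dimensionless.
    """
    missing = set(concentrations) - set(thresholds)
    if missing:
        raise KeyError(f"no odour threshold for: {sorted(missing)}")
    return sum(c / thresholds[name] for name, c in concentrations.items())


# Made-up illustrative values, not measurements from the paper.
concentrations = {"substance_A": 0.2, "substance_B": 1.5}
thresholds = {"substance_A": 0.05, "substance_B": 0.5}
soav = sum_odour_activity_value(concentrations, thresholds)
print(soav)  # 0.2/0.05 + 1.5/0.5 = 4 + 3 = 7
```

This sum uses only the odour threshold of each substance. The SOI and EOC methods described in the abstract additionally use the slope of the Weber-Fechner law, which this sketch does not attempt to reproduce.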
Submitted 21 December, 2015;
originally announced December 2015.
-
Identifying lineage effects when controlling for population structure improves power in bacterial association studies
Authors:
Sarah G Earle,
Chieh-Hsi Wu,
Jane Charlesworth,
Nicole Stoesser,
N Claire Gordon,
Timothy M Walker,
Chris C A Spencer,
Zamin Iqbal,
David A Clifton,
Katie L Hopkins,
Neil Woodford,
E Grace Smith,
Nazir Ismail,
Martin J Llewelyn,
Tim E Peto,
Derrick W Crook,
Gil McVean,
A Sarah Walker,
Daniel J Wilson
Abstract:
Bacteria pose unique challenges for genome-wide association studies (GWAS) because of strong structuring into distinct strains and substantial linkage disequilibrium across the genome. While methods developed for human studies can correct for strain structure, this risks considerable loss of power because genetic differences between strains often contribute substantial phenotypic variability. Here we propose a new method that captures lineage-level associations even when locus-specific associations cannot be fine-mapped. We demonstrate its ability to detect genes and genetic variants underlying resistance to 17 antimicrobials in 3144 isolates from four taxonomically diverse clonal and recombining bacteria: Mycobacterium tuberculosis, Staphylococcus aureus, Escherichia coli and Klebsiella pneumoniae. Strong selection, recombination and penetrance confer high power to recover known antimicrobial resistance mechanisms, and reveal a candidate association between the outer membrane porin nmpC and cefazolin resistance in E. coli. Hence our method pinpoints locus-specific effects where possible, and boosts power by detecting lineage-level differences when fine-mapping is intractable.
Submitted 8 February, 2016; v1 submitted 23 October, 2015;
originally announced October 2015.
-
New Routes to Phylogeography
Authors:
Nicola De Maio,
Chieh-Hsi Wu,
Kathleen M O'Reilly,
Daniel Wilson
Abstract:
Phylogeographic methods aim to infer migration trends and the history of sampled lineages from genetic data. Applications of phylogeography are broad, and in the context of pathogens include the reconstruction of transmission histories and the origin and emergence of outbreaks. Phylogeographic inference based on bottom-up population genetics models is computationally expensive, and as a result faster alternatives based on the evolution of discrete traits have become popular. In this paper, we show that inference of migration rates and root locations based on discrete trait models is extremely unreliable and sensitive to biased sampling. To address this problem, we introduce BASTA (BAyesian STructured coalescent Approximation), a new approach implemented in BEAST2 that combines the accuracy of methods based on the structured coalescent with the computational efficiency required to handle more than just a few populations. We illustrate the potentially severe implications of poor model choice for phylogeographic analyses by investigating the zoonotic transmission of Ebola virus. Whereas the structured coalescent analysis correctly infers that successive human Ebola outbreaks have been seeded by a large unsampled non-human reservoir population, the discrete trait analysis implausibly concludes that undetected human-to-human transmission has allowed the virus to persist over the past four decades. As genomics takes on an increasingly prominent role informing the control and prevention of infectious diseases, it will be vital that phylogeographic inference provides robust insights into transmission history.
Submitted 27 March, 2015;
originally announced March 2015.
-
In vivo evaluation of wearable head impact sensors
Authors:
Lyndia C. Wu,
Vaibhav Nangia,
Kevin Bui,
Bradley Hammoor,
Mehmet Kurt,
Fidel Hernandez,
Calvin Kuo,
David B. Camarillo
Abstract:
Inertial sensors are commonly used to measure human head motion. Some sensors have been validated with dummy or cadaver experiments, but methods to evaluate sensors in vivo are lacking. Here we present an in vivo method using high speed video to evaluate teeth-mounted (mouthguard), soft tissue-mounted (skin patch), and headgear-mounted (skull cap) sensors during 6-13g sagittal soccer head impacts. Sensor coupling to the skull is quantified by displacement from an ear-canal reference. Mouthguard displacements were within video measurement error (<1mm), while the skin patch and skull cap displaced up to 4mm and 13mm from the ear-canal reference, respectively. We used the mouthguard, which had the least displacement from skull, as the reference to assess 6-degree-of-freedom skin patch and skull cap measurements. Linear and rotational acceleration magnitudes were over-predicted by both the skin patch (with 120% NRMS error for a_mag, 290% for alpha_mag) and the skull cap (320% NRMS error for a_mag, 500% for alpha_mag). Such over-predictions were largely due to out-of-plane motion. To model sensor error, we found that in-plane acceleration peaks from the skin patch in the anterior-posterior direction could be modeled by an underdamped viscoelastic system. In summary, the mouthguard showed tighter skull coupling in vivo than the other sensors. Furthermore, the in vivo methods presented are valuable for investigating skull acceleration sensor technologies.
Submitted 20 August, 2015; v1 submitted 13 March, 2015;
originally announced March 2015.
-
The Cure: Making a game of gene selection for breast cancer survival prediction
Authors:
Benjamin M. Good,
Salvatore Loguercio,
Obi L. Griffith,
Max Nanis,
Chunlei Wu,
Andrew I. Su
Abstract:
Motivation: Molecular signatures for predicting breast cancer prognosis could greatly improve care through personalization of treatment. Computational analyses of genome-wide expression datasets have identified such signatures, but these signatures leave much to be desired in terms of accuracy, reproducibility and biological interpretability. Methods that take advantage of structured prior knowledge (e.g. protein interaction networks) show promise in helping to define better signatures but most knowledge remains unstructured.
Crowdsourcing via scientific discovery games is an emerging methodology that has the potential to tap into human intelligence at scales and in modes previously unheard of. Here, we developed and evaluated a game called The Cure on the task of gene selection for breast cancer survival prediction. Our central hypothesis was that knowledge linking expression patterns of specific genes to breast cancer outcomes could be captured from game players. We envisioned capturing knowledge both from the players' prior experience and from their ability to interpret text related to candidate genes presented to them in the context of the game.
Results: Between its launch in Sept. 2012 and Sept. 2013, The Cure attracted more than 1,000 registered players who collectively played nearly 10,000 games. Gene sets assembled through aggregation of the collected data clearly demonstrated the accumulation of relevant expert knowledge. In terms of predictive accuracy, these gene sets provided comparable performance to gene sets generated using other methods including those used in commercial tests. The Cure is available at http://genegames.org/cure/
Submitted 14 February, 2014;
originally announced February 2014.
-
A Method for Neuronal Source Identification
Authors:
Chang Won Lee,
Agnieszka A. Szymanska,
Shun Chi Wu,
A. Lee Swindlehurst,
Zoran Nenadic
Abstract:
Multi-sensor microelectrodes for extracellular action potential recording have significantly improved the quality of in vivo recorded neuronal signals. These microelectrodes have also been instrumental in the localization of neuronal signal sources. However, existing neuron localization methods have been mostly utilized in vivo, where the true neuron location remains unknown. Therefore, these methods could not be experimentally validated. This article presents experimental validation of a method capable of estimating both the location and intensity of an electrical signal source. A four-sensor microelectrode (tetrode) immersed in a saline solution was used to record stimulus patterns at multiple intensity levels generated by a stimulating electrode. The location of the tetrode was varied with respect to the stimulator. The location and intensity of the stimulator were estimated using the Multiple Signal Classification (MUSIC) algorithm, and the results were quantified by comparison to the true values. The localization results, with an accuracy and precision of ~ 10 microns, and ~ 11 microns respectively, imply that MUSIC can resolve individual neuronal sources. Similarly, source intensity estimations indicate that this approach can track changes in signal amplitude over time. Together, these results suggest that MUSIC can be used to characterize neuronal signal sources in vivo.
Submitted 15 November, 2013;
originally announced November 2013.
-
Topological conditions of scale-free networks for cooperation to evolve
Authors:
Dong-Ping Yang,
Hai Lin,
Chen-Xu Wu,
Jianwei Shuai
Abstract:
Evolutionary game theory is employed to study topological conditions of scale-free networks for the evolution of cooperation. We show that Apollonian Networks (ANs) are perfect scale-free networks, on which cooperation can spread to all individuals, even though there are initially only 3 or 4 hubs occupied by cooperators and all the others by defectors. Local topological features such as degree, clustering coefficient, gradient, and topology potential are adopted to analyze the advantages of ANs in cooperation enhancement. Furthermore, a degree-skeleton underlying ANs is uncovered for understanding the cooperation diffusion. Constructing this kind of degree-skeleton for random scale-free networks promotes the cooperation level close to that of Barabási-Albert networks, which gives deeper insights into the origin of the latter on organization and further promotion of cooperation.
Submitted 8 July, 2011; v1 submitted 27 June, 2011;
originally announced June 2011.
-
Optimal atomic-resolution structures of prion AGAAAAGA amyloid fibrils
Authors:
Jiapu Zhang,
Jie Sun,
Changzhi Wu
Abstract:
X-ray crystallography is a powerful tool to determine the protein 3D structure. However, it is time-consuming and expensive, and not all proteins can be successfully crystallized, particularly membrane proteins. Although nuclear magnetic resonance (NMR) spectroscopy is indeed a very powerful tool in determining the 3D structures of membrane proteins, it is also time-consuming and costly. To the best of the authors' knowledge, there is little structural data available on the AGAAAAGA palindrome in the hydrophobic region (113-120) of prion proteins due to the noncrystalline and insoluble nature of the amyloid fibril, although many experimental studies have shown that this region has amyloid fibril forming properties and plays an important role in prion diseases. In view of this, the present study is devoted to addressing this problem with computational approaches such as global energy optimization, simulated annealing, and structural bioinformatics. The optimal atomic-resolution structures of prion AGAAAAGA amyloid fibrils reported in this paper are of value to the scientific community in its drive to find treatments for prion diseases.
Submitted 11 April, 2011; v1 submitted 11 December, 2010;
originally announced December 2010.
-
A proposal for a coordinated effort for the determination of brainwide neuroanatomical connectivity in model organisms at a mesoscopic scale
Authors:
Jason W. Bohland,
Caizhi Wu,
Helen Barbas,
Hemant Bokil,
Mihail Bota,
Hans C. Breiter,
Hollis T. Cline,
John C. Doyle,
Peter J. Freed,
Ralph J. Greenspan,
Suzanne N. Haber,
Michael Hawrylycz,
Daniel G. Herrera,
Claus C. Hilgetag,
Z. Josh Huang,
Allan Jones,
Edward G. Jones,
Harvey J. Karten,
David Kleinfeld,
Rolf Kotter,
Henry A. Lester,
John M. Lin,
Brett D. Mensh,
Shawn Mikula,
Jaak Panksepp
, et al. (12 additional authors not shown)
Abstract:
In this era of complete genomes, our knowledge of neuroanatomical circuitry remains surprisingly sparse. Such knowledge is however critical both for basic and clinical research into brain function. Here we advocate for a concerted effort to fill this gap, through systematic, experimental mapping of neural circuits at a mesoscopic scale of resolution suitable for comprehensive, brain-wide coverage, using injections of tracers or viral vectors. We detail the scientific and medical rationale and briefly review existing knowledge and experimental techniques. We define a set of desiderata, including brain-wide coverage; validated and extensible experimental techniques suitable for standardization and automation; centralized, open access data repository; compatibility with existing resources, and tractability with current informatics technology. We discuss a hypothetical but tractable plan for mouse, additional efforts for the macaque, and technique development for human. We estimate that the mouse connectivity project could be completed within five years with a comparatively modest budget.
Submitted 28 January, 2009;
originally announced January 2009.
-
Topological basis of signal integration in the transcriptional-regulatory network of the yeast, Saccharomyces cerevisiae
Authors:
Illes J. Farkas,
Chuang Wu,
Chakra Chennubhotla,
Ivet Bahar,
Zoltan N. Oltvai
Abstract:
BACKGROUND. Signal recognition and information processing is a fundamental cellular function, which in part involves comprehensive transcriptional regulatory (TR) mechanisms carried out in response to complex environmental signals in the context of the cell's own internal state. However, the network topological basis of developing such integrated responses remains poorly understood.
RESULTS. By studying the TR network of the yeast Saccharomyces cerevisiae we show that an intermediate layer of transcription factors naturally segregates into distinct subnetworks. In these topological units transcription factors are densely interlinked in a largely hierarchical manner and respond to external signals by utilizing a fraction of these subnets.
CONCLUSIONS. As transcriptional regulation represents the "slow" component of overall information processing, the identified topology suggests a model in which successive waves of transcriptional regulation originating from distinct fractions of the TR network control robust integrated responses to complex stimuli.
Submitted 30 October, 2006;
originally announced October 2006.
-
Computational Fluid Dynamic Approach for Biological System Modeling
Authors:
Weidong Huang,
Chundu Wu,
Bingjia Xiao,
Weidong Xia
Abstract:
Various biological system models have been proposed in systems biology, based on the complex reaction kinetics of various components. These models are not practical because we lack kinetic information. In this paper, it is found that the rates of enzymatic and multi-order reactions are often controlled by the transport of the reactants in biological systems. A Computational Fluid Dynamic (CFD) approach, which is based on transport of the components and kinetics of biological reactions, is introduced for biological system modeling. We apply this approach to a biological wastewater treatment system to study the metabolism of organic carbon substrates and the microbial population. The results show that a CFD model coupled with reaction kinetics is more accurate and more feasible than kinetic models for biological system modeling.
Submitted 2 August, 2005;
originally announced August 2005.