Search | arXiv e-print repository

arXiv:2406.11900 [pdf, other]

Horizon-wise Learning Paradigm Promotes Gene Splicing Identification

Authors: Qi-Jie Li, Qian Sun, Shao-Qun Zhang

Abstract: Identifying gene splicing is a core and significant task confronted in modern collaboration between artificial intelligence and bioinformatics. Past decades have witnessed great efforts on this concern, such as the bio-plausible splicing pattern AT-CG and the famous SpliceAI. In this paper, we propose a novel framework for the task of gene splicing identification, named Horizon-wise Gene Splicing… ▽ More Identifying gene splicing is a core and significant task confronted in modern collaboration between artificial intelligence and bioinformatics. Past decades have witnessed great efforts on this concern, such as the bio-plausible splicing pattern AT-CG and the famous SpliceAI. In this paper, we propose a novel framework for the task of gene splicing identification, named Horizon-wise Gene Splicing Identification (H-GSI). The proposed H-GSI follows the horizon-wise identification paradigm and comprises four components: the pre-processing procedure transforming string data into tensors, the sliding window technique handling long sequences, the SeqLab model, and the predictor. In contrast to existing studies that process gene information with a truncated fixed-length sequence, H-GSI employs a horizon-wise identification paradigm in which all positions in a sequence are predicted with only one forward computation, improving accuracy and efficiency. The experiments conducted on the real-world Human dataset show that our proposed H-GSI outperforms SpliceAI and achieves the best accuracy of 97.20\%. The source code is available from this link. △ Less

Submitted 15 June, 2024; originally announced June 2024.

arXiv:2403.12827 [pdf, ps, other]

Predicting the stability of profiling signals of small RNAs

Authors: Qiuyun Li, Manda Riehl

Abstract: Profiling is a process that finds similarities between different RNA secondary structures by extracting signals from the Boltzmann sampling. The reproducibility of profiling can be identified by the standard deviation of number of features among Boltzmann samples. We found a strong relationship between the frequency of each helix class and its standard deviation of the frequency upon repeated Bolt… ▽ More Profiling is a process that finds similarities between different RNA secondary structures by extracting signals from the Boltzmann sampling. The reproducibility of profiling can be identified by the standard deviation of number of features among Boltzmann samples. We found a strong relationship between the frequency of each helix class and its standard deviation of the frequency upon repeated Boltzmann sampling. We developed a perturbation technique to predict the stability of these featured helix classes without the need for repeated Boltzmann sampling, with accuracy between 84% and 94%, depending on the type of RNA. Our technique only requires 0.2% of the computation time compared to one profiling process. △ Less

Submitted 19 March, 2024; originally announced March 2024.

Comments: 14 pages

MSC Class: 92B05

arXiv:2403.06940 [pdf, other]

Conditional Score-Based Diffusion Model for Cortical Thickness Trajectory Prediction

Authors: Qing Xiao, Siyeop Yoon, Hui Ren, Matthew Tivnan, Lichao Sun, Quanzheng Li, Tianming Liu, Yu Zhang, Xiang Li

Abstract: Alzheimer's Disease (AD) is a neurodegenerative condition characterized by diverse progression rates among individuals, with changes in cortical thickness (CTh) closely linked to its progression. Accurately forecasting CTh trajectories can significantly enhance early diagnosis and intervention strategies, providing timely care. However, the longitudinal data essential for these studies often suffe… ▽ More Alzheimer's Disease (AD) is a neurodegenerative condition characterized by diverse progression rates among individuals, with changes in cortical thickness (CTh) closely linked to its progression. Accurately forecasting CTh trajectories can significantly enhance early diagnosis and intervention strategies, providing timely care. However, the longitudinal data essential for these studies often suffer from temporal sparsity and incompleteness, presenting substantial challenges in modeling the disease's progression accurately. Existing methods are limited, focusing primarily on datasets without missing entries or requiring predefined assumptions about CTh progression. To overcome these obstacles, we propose a conditional score-based diffusion model specifically designed to generate CTh trajectories with the given baseline information, such as age, sex, and initial diagnosis. Our conditional diffusion model utilizes all available data during the training phase to make predictions based solely on baseline information during inference without needing prior history about CTh progression. The prediction accuracy of the proposed CTh prediction pipeline using a conditional score-based model was compared for sub-groups consisting of cognitively normal, mild cognitive impairment, and AD subjects. The Bland-Altman analysis shows our diffusion-based prediction model has a near-zero bias with narrow 95% confidential interval compared to the ground-truth CTh in 6-36 months. In addition, our conditional diffusion model has a stochastic generative nature, therefore, we demonstrated an uncertainty analysis of patient-specific CTh prediction through multiple realizations. △ Less

Submitted 11 March, 2024; originally announced March 2024.

arXiv:2402.15515 [pdf]

Feasibility of Identifying Factors Related to Alzheimer's Disease and Related Dementia in Real-World Data

Authors: Aokun Chen, Qian Li, Yu Huang, Yongqiu Li, Yu-neng Chuang, Xia Hu, Serena Guo, Yonghui Wu, Yi Guo, Jiang Bian

Abstract: A comprehensive view of factors associated with AD/ADRD will significantly aid in studies to develop new treatments for AD/ADRD and identify high-risk populations and patients for prevention efforts. In our study, we summarized the risk factors for AD/ADRD by reviewing existing meta-analyses and review articles on risk and preventive factors for AD/ADRD. In total, we extracted 477 risk factors in… ▽ More A comprehensive view of factors associated with AD/ADRD will significantly aid in studies to develop new treatments for AD/ADRD and identify high-risk populations and patients for prevention efforts. In our study, we summarized the risk factors for AD/ADRD by reviewing existing meta-analyses and review articles on risk and preventive factors for AD/ADRD. In total, we extracted 477 risk factors in 10 categories from 537 studies. We constructed an interactive knowledge map to disseminate our study results. Most of the risk factors are accessible from structured Electronic Health Records (EHRs), and clinical narratives show promise as information sources. However, evaluating genomic risk factors using RWD remains a challenge, as genetic testing for AD/ADRD is still not a common practice and is poorly documented in both structured and unstructured EHRs. Considering the constantly evolving research on AD/ADRD risk factors, literature mining via NLP methods offers a solution to automatically update our knowledge map. △ Less

Submitted 3 February, 2024; originally announced February 2024.

arXiv:2402.04286 [pdf]

Progress and Opportunities of Foundation Models in Bioinformatics

Authors: Qing Li, Zhihang Hu, Yixuan Wang, Lei Li, Yimin Fan, Irwin King, Le Song, Yu Li

Abstract: Bioinformatics has witnessed a paradigm shift with the increasing integration of artificial intelligence (AI), particularly through the adoption of foundation models (FMs). These AI techniques have rapidly advanced, addressing historical challenges in bioinformatics such as the scarcity of annotated data and the presence of data noise. FMs are particularly adept at handling large-scale, unlabeled… ▽ More Bioinformatics has witnessed a paradigm shift with the increasing integration of artificial intelligence (AI), particularly through the adoption of foundation models (FMs). These AI techniques have rapidly advanced, addressing historical challenges in bioinformatics such as the scarcity of annotated data and the presence of data noise. FMs are particularly adept at handling large-scale, unlabeled data, a common scenario in biological contexts due to the time-consuming and costly nature of experimentally determining labeled data. This characteristic has allowed FMs to excel and achieve notable results in various downstream validation tasks, demonstrating their ability to represent diverse biological entities effectively. Undoubtedly, FMs have ushered in a new era in computational biology, especially in the realm of deep learning. The primary goal of this survey is to conduct a systematic investigation and summary of FMs in bioinformatics, tracing their evolution, current research status, and the methodologies employed. Central to our focus is the application of FMs to specific biological problems, aiming to guide the research community in choosing appropriate FMs for their research needs. We delve into the specifics of the problem at hand including sequence analysis, structure prediction, function annotation, and multimodal integration, comparing the structures and advancements against traditional methods. Furthermore, the review analyses challenges and limitations faced by FMs in biology, such as data noise, model explainability, and potential biases. Finally, we outline potential development paths and strategies for FMs in future biological research, setting the stage for continued innovation and application in this rapidly evolving field. This comprehensive review serves not only as an academic resource but also as a roadmap for future explorations and applications of FMs in biology. △ Less

Submitted 5 February, 2024; originally announced February 2024.

Comments: 27 pages, 3 figures, 2 tables

MSC Class: cs.CL; 92-02 ACM Class: I.2.1

arXiv:2311.11004 [pdf, other]

A Foundation Model for Cell Segmentation

Authors: Uriah Israel, Markus Marks, Rohit Dilip, Qilin Li, Morgan Schwartz, Elora Pradhan, Edward Pao, Shenyi Li, Alexander Pearson-Goulart, Pietro Perona, Georgia Gkioxari, Ross Barnowski, Yisong Yue, David Van Valen

Abstract: Cells are the fundamental unit of biological organization, and identifying them in imaging data - cell segmentation - is a critical task for various cellular imaging experiments. While deep learning methods have led to substantial progress on this problem, models that have seen wide use are specialist models that work well for specific domains. Methods that have learned the general notion of "what… ▽ More Cells are the fundamental unit of biological organization, and identifying them in imaging data - cell segmentation - is a critical task for various cellular imaging experiments. While deep learning methods have led to substantial progress on this problem, models that have seen wide use are specialist models that work well for specific domains. Methods that have learned the general notion of "what is a cell" and can identify them across different domains of cellular imaging data have proven elusive. In this work, we present CellSAM, a foundation model for cell segmentation that generalizes across diverse cellular imaging data. CellSAM builds on top of the Segment Anything Model (SAM) by develo** a prompt engineering approach to mask generation. We train an object detector, CellFinder, to automatically detect cells and prompt SAM to generate segmentations. We show that this approach allows a single model to achieve state-of-the-art performance for segmenting images of mammalian cells (in tissues and cell culture), yeast, and bacteria collected with various imaging modalities. To enable accessibility, we integrate CellSAM into DeepCell Label to further accelerate human-in-the-loop labeling strategies for cellular imaging data. A deployed version of CellSAM is available at https://label-dev.deepcell.org/. △ Less

Submitted 18 November, 2023; originally announced November 2023.

arXiv:2310.17445 [pdf, other]

Aberrant High-Order Dependencies in Schizophrenia Resting-State Functional MRI Networks

Authors: Qiang Li, Vince D. Calhoun, Adithya Ram Ballem, Shujian Yu, Jesus Malo, Armin Iraji

Abstract: The human brain has a complex, intricate functional architecture. While many studies primarily emphasize pairwise interactions, delving into high-order associations is crucial for a comprehensive understanding of how functional brain networks intricately interact beyond simple pairwise connections. Analyzing high-order statistics allows us to explore the nuanced and complex relationships across th… ▽ More The human brain has a complex, intricate functional architecture. While many studies primarily emphasize pairwise interactions, delving into high-order associations is crucial for a comprehensive understanding of how functional brain networks intricately interact beyond simple pairwise connections. Analyzing high-order statistics allows us to explore the nuanced and complex relationships across the brain, unraveling the heterogeneity and uncovering patterns of multilevel overlap on the psychosis continuum. Here, we employed high-order independent component analysis (ICA) plus multivariate information-theoretical metrics ($O$-information and $S$-information) to estimate high-order interaction to examine schizophrenia using resting-state fMRI. The results show that multiple brain regions networks may be altered in schizophrenia, such as temporal, subcortical, and higher-cognitive brain regions, and meanwhile, it also shows that revealed synergy gives more information than redundancy in diagnosing schizophrenia. All in all, we showed that high-order dependencies were altered in schizophrenia. Identification of these aberrant patterns will give us a new window to diagnose schizophrenia. △ Less

Submitted 27 October, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

Comments: 7 pages, 4 figures, Accepted to InfoCog@NeurIPS 2023 (https://sites.google.com/view/infocog-neurips-2023/home)

arXiv:2309.15018 [pdf, other]

Unidirectional brain-computer interface: Artificial neural network encoding natural images to fMRI response in the visual cortex

Authors: Ruixing Liang, Xiangyu Zhang, Qiong Li, Lai Wei, Hexin Liu, Avisha Kumar, Kelley M. Kempski Leadingham, Joshua Punnoose, Leibny Paola Garcia, Amir Manbachi

Abstract: While significant advancements in artificial intelligence (AI) have catalyzed progress across various domains, its full potential in understanding visual perception remains underexplored. We propose an artificial neural network dubbed VISION, an acronym for "Visual Interface System for Imaging Output of Neural activity," to mimic the human brain and show how it can foster neuroscientific inquiries… ▽ More While significant advancements in artificial intelligence (AI) have catalyzed progress across various domains, its full potential in understanding visual perception remains underexplored. We propose an artificial neural network dubbed VISION, an acronym for "Visual Interface System for Imaging Output of Neural activity," to mimic the human brain and show how it can foster neuroscientific inquiries. Using visual and contextual inputs, this multimodal model predicts the brain's functional magnetic resonance imaging (fMRI) scan response to natural images. VISION successfully predicts human hemodynamic responses as fMRI voxel values to visual inputs with an accuracy exceeding state-of-the-art performance by 45%. We further probe the trained networks to reveal representational biases in different visual areas, generate experimentally testable hypotheses, and formulate an interpretable metric to associate these hypotheses with cortical functions. With both a model and evaluation metric, the cost and time burdens associated with designing and implementing functional analysis on the visual cortex could be reduced. Our work suggests that the evolution of computational models may shed light on our fundamental understanding of the visual cortex and provide a viable approach toward reliable brain-machine interfaces. △ Less

Submitted 26 September, 2023; originally announced September 2023.

arXiv:2309.05004 [pdf, other]

Reconstructing the kinetic chemotaxis kernel using macroscopic data: well-posedness and ill-posedness

Authors: Kathrin Hellmuth, Christian Klingenberg, Qin Li, Min Tang

Abstract: Bacterial motion is guided by external stimuli (chemotaxis), and the motion described on the mesoscopic scale is uniquely determined by a parameter $K$ that models velocity change response from the bacteria. This parameter is termed chemotaxis kernel. In a practical setting, experimental data was collected to infer this kernel. In this article, a PDE-constrained optimization framework is deployed… ▽ More Bacterial motion is guided by external stimuli (chemotaxis), and the motion described on the mesoscopic scale is uniquely determined by a parameter $K$ that models velocity change response from the bacteria. This parameter is termed chemotaxis kernel. In a practical setting, experimental data was collected to infer this kernel. In this article, a PDE-constrained optimization framework is deployed to perform this reconstruction using velocity-averaged, localized data taken in the interior of the domain. The problem can be well-posed or ill-posed depending on the data preparation and the experimental setup. In particular, we propose one specific design that guarantees numerical reconstructability and local convergence. This design is adapted to the discretization of $K$ in space and decouples the reconstruction of local values of $K$ into smaller cell problems, opening up parallelization opportunities. Numerical evidences support the theoretical findings. △ Less

Submitted 16 April, 2024; v1 submitted 10 September, 2023; originally announced September 2023.

MSC Class: 35R30; 65M32; 92C17; 49M41; 49K40

arXiv:2308.06967 [pdf]

Intestinal Microecology in Pediatric Surgery-Related Gastrointestinal Diseases Current Insights and Future Perspectives

Authors: Yingchao Li, Yuqing Wu, Suolin Li, Lin Liu, Xiaoyi Zhang, Jiaxun Lv, Qinqin Li

Abstract: Intestinal microecology is established from birth and is constantly changing until homeostasis is reached. Intestinal microecology is involved in the immune inflammatory response of the intestine and regulates the intestinal barrier function. The imbalance of intestinal microecology is closely related to the occurrence and development of digestive system diseases. In some gastrointestinal diseases… ▽ More Intestinal microecology is established from birth and is constantly changing until homeostasis is reached. Intestinal microecology is involved in the immune inflammatory response of the intestine and regulates the intestinal barrier function. The imbalance of intestinal microecology is closely related to the occurrence and development of digestive system diseases. In some gastrointestinal diseases related to pediatric surgery, intestinal microecology and its metabolites undergo a series of changes, which can provide a certain basis for the diagnosis of diseases. The continuous development of microecological agents and fecal microbiota transplantation technology has provided a new means for its clinical treatment. We review the relationship between pathogenesis, diagnosis and treatment of pediatric surgery-related gastrointestinal diseases and intestinal microecology, in order to provide new ideas and methods for clinical diagnosis, treatment and research. △ Less

Submitted 14 August, 2023; originally announced August 2023.

arXiv:2303.11994 [pdf, other]

doi 10.1109/ICASSPW59220.2023.10193346

Higher-order Organization in the Human Brain from Matrix-Based Rényi's Entropy

Authors: Qiang Li, Shujian Yu, Kristoffer H Madsen, Vince D Calhoun, Armin Iraji

Abstract: Pairwise metrics are often employed to estimate statistical dependencies between brain regions, however they do not capture higher-order information interactions. It is critical to explore higher-order interactions that go beyond paired brain areas in order to better understand information processing in the human brain. To address this problem, we applied multivariate mutual information, specifica… ▽ More Pairwise metrics are often employed to estimate statistical dependencies between brain regions, however they do not capture higher-order information interactions. It is critical to explore higher-order interactions that go beyond paired brain areas in order to better understand information processing in the human brain. To address this problem, we applied multivariate mutual information, specifically, Total Correlation and Dual Total Correlation to reveal higher-order information in the brain. In this paper, we estimate these metrics using matrix-based Rényi's entropy, which offers a direct and easily interpretable approach that is not limited by direct assumptions about probability distribution functions of multivariate time series. We applied these metrics to resting-state fMRI data in order to examine higher-order interactions in the brain. Our results showed that the higher-order information interactions captured increase gradually as the interaction order increases. Furthermore, we observed a gradual increase in the correlation between the Total Correlation and Dual Total Correlation as the interaction order increased. In addition, the significance of Dual Total Correlation values compared to Total Correlation values also indicate that the human brain exhibits synergy dominance during the resting state. △ Less

Submitted 25 April, 2023; v1 submitted 21 March, 2023; originally announced March 2023.

Comments: 5 pages, 3 figures; Accepted to Data Science and Learning Workshop: Unraveling the Brain. A satellite workshop of ICASSP 2023

Journal ref: 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)

arXiv:2302.12563 [pdf, other]

Retrieved Sequence Augmentation for Protein Representation Learning

Authors: Chang Ma, Haiteng Zhao, Lin Zheng, Jiayi Xin, Qintong Li, Lijun Wu, Zhihong Deng, Yang Lu, Qi Liu, Lingpeng Kong

Abstract: Protein language models have excelled in a variety of tasks, ranging from structure prediction to protein engineering. However, proteins are highly diverse in functions and structures, and current state-of-the-art models including the latest version of AlphaFold rely on Multiple Sequence Alignments (MSA) to feed in the evolutionary knowledge. Despite their success, heavy computational overheads, a… ▽ More Protein language models have excelled in a variety of tasks, ranging from structure prediction to protein engineering. However, proteins are highly diverse in functions and structures, and current state-of-the-art models including the latest version of AlphaFold rely on Multiple Sequence Alignments (MSA) to feed in the evolutionary knowledge. Despite their success, heavy computational overheads, as well as the de novo and orphan proteins remain great challenges in protein representation learning. In this work, we show that MSAaugmented models inherently belong to retrievalaugmented methods. Motivated by this finding, we introduce Retrieved Sequence Augmentation(RSA) for protein representation learning without additional alignment or pre-processing. RSA links query protein sequences to a set of sequences with similar structures or properties in the database and combines these sequences for downstream prediction. We show that protein language models benefit from the retrieval enhancement on both structure prediction and property prediction tasks, with a 5% improvement on MSA Transformer on average while being 373 times faster. In addition, we show that our model can transfer to new protein domains better and outperforms MSA Transformer on de novo protein prediction. Our study fills a much-encountered gap in protein prediction and brings us a step closer to demystifying the domain knowledge needed to understand protein sequences. Code is available on https://github.com/HKUNLP/RSA. △ Less

Submitted 24 February, 2023; originally announced February 2023.

arXiv:2301.02607 [pdf, ps, other]

A Data-Driven Gaussian Process Filter for Electrocardiogram Denoising

Authors: Mircea Dumitru, Qiao Li, Erick Andres Perez Alday, Ali Bahrami Rad, Gari D. Clifford, Reza Sameni

Abstract: Objective: Gaussian Processes (GP)-based filters, which have been effectively used for various applications including electrocardiogram (ECG) filtering can be computationally demanding and the choice of their hyperparameters is typically ad hoc. Methods: We develop a data-driven GP filter to address both issues, using the notion of the ECG phase domain -- a time-warped representation of the ECG be… ▽ More Objective: Gaussian Processes (GP)-based filters, which have been effectively used for various applications including electrocardiogram (ECG) filtering can be computationally demanding and the choice of their hyperparameters is typically ad hoc. Methods: We develop a data-driven GP filter to address both issues, using the notion of the ECG phase domain -- a time-warped representation of the ECG beats onto a fixed number of samples and aligned R-peaks, which is assumed to follow a Gaussian distribution. Under this assumption, the computation of the sample mean and covariance matrix is simplified, enabling an efficient implementation of the GP filter in a data-driven manner, with no ad hoc hyperparameters. The proposed filter is evaluated and compared with a state-of-the-art wavelet-based filter, on the PhysioNet QT Database. The performance is evaluated by measuring the signal-to-noise ratio (SNR) improvement of the filter at SNR levels ranging from -5 to 30dB, in 5dB steps, using additive noise. For a clinical evaluation, the error between the estimated QT-intervals of the original and filtered signals is measured and compared with the benchmark filter. Results: It is shown that the proposed GP filter outperforms the benchmark filter for all the tested noise levels. It also outperforms the state-of-the-art filter in terms of QT-interval estimation error bias and variance. Conclusion: The proposed GP filter is a versatile technique for preprocessing the ECG in clinical and research applications, is applicable to ECG of arbitrary lengths and sampling frequencies, and provides confidence intervals for its performance. △ Less

Submitted 9 January, 2024; v1 submitted 6 January, 2023; originally announced January 2023.

arXiv:2210.03231 [pdf, other]

doi 10.3390/e24121725

Functional Connectome of the Human Brain with Total Correlation

Authors: Qiang Li, Greg Ver Steeg, Shujian Yu, Jesus Malo

Abstract: Recent studies proposed the use of Total Correlation to describe functional connectivity among brain regions as a multivariate alternative to conventional pair-wise measures such as correlation or mutual information. In this work we build on this idea to infer a large scale (whole brain) connectivity network based on Total Correlation and show the possibility of using this kind of networks as biom… ▽ More Recent studies proposed the use of Total Correlation to describe functional connectivity among brain regions as a multivariate alternative to conventional pair-wise measures such as correlation or mutual information. In this work we build on this idea to infer a large scale (whole brain) connectivity network based on Total Correlation and show the possibility of using this kind of networks as biomarkers of brain alterations. In particular, this work uses Correlation Explanation (CorEx) to estimate Total Correlation. First, we prove that CorEx estimates of total correlation and clustering results are trustable compared to ground truth values. Second, the inferred large scale connectivity network extracted from the more extensive open fMRI datasets is consistent with existing neuroscience studies but, interestingly, can estimate additional relations beyond pair-wise regions. And finally, we show how the connectivity graphs based on Total Correlation can also be an effective tool to aid in the discovery of brain diseases. △ Less

Submitted 14 November, 2022; v1 submitted 6 October, 2022; originally announced October 2022.

Comments: 22 pages, 13 figures

Journal ref: Entropy 2022, 24(12), 1725;

arXiv:2208.05770 [pdf, other]

doi 10.1016/j.neucom.2023.127143

Functional Connectivity via Total Correlation: Analytical results in Visual Areas

Authors: Qiang Li, Greg Ver Steeg, Jesus Malo

Abstract: Recent studies invoke the superiority of the multivariate Total Correlation concept over the conventional pairwise measures of functional connectivity in biological networks. Those seminal works certainly show that empirical measures of Total Correlation lead to connectivity patterns that differ from what is obtained using the most popular measure, linear correlation, or its higher order and nonli… ▽ More Recent studies invoke the superiority of the multivariate Total Correlation concept over the conventional pairwise measures of functional connectivity in biological networks. Those seminal works certainly show that empirical measures of Total Correlation lead to connectivity patterns that differ from what is obtained using the most popular measure, linear correlation, or its higher order and nonlinear alternative Mutual Information. However, they do not provide analytical results that explain the differences beyond the obvious multivariate versus bivariate definitions. Moreover, the accuracy of the empirical estimators could not be addressed directly because no controlled scenario with known analytical result was provided either. This point is critical because empirical estimation of information theory measures is always challenging. As opposed to previous empirical approaches, in this work we present analytical results to prove the advantages of Total Correlation over Mutual Information to describe the functional connectivity. In particular, we do it in neural networks for early vision (retina-LGN-cortex) which are realistic but simple enough to get analytical results. The presented analytical setting is also useful to check empirical estimates of Total Correlation. Therefore, once certain estimate can be trusted, one can explore the behavior with natural signals where the analytical results (that assume Gaussian signals), may not be valid. In this regard, as applications (a) we explore the effect of connectivity and feedback in the analytical retina-LGN-cortex network with natural images, and (b) we assess the functional connectivity in visual areas V1-V2-V3-V4 from actual fMRI recordings. △ Less

Submitted 11 December, 2023; v1 submitted 11 August, 2022; originally announced August 2022.

Comments: 31 pages, 14 figures, Accepted to Neurocomputing Journal

Journal ref: Neurocomputing 2023, 127143

arXiv:2206.01629 [pdf, other]

doi 10.1137/22M1499911

Kinetic chemotaxis tumbling kernel determined from macroscopic quantities

Authors: Kathrin Hellmuth, Christian Klingenberg, Qin Li, Min Tang

Abstract: Chemotaxis is the physical phenomenon that bacteria adjust their motions according to chemical stimulus. A classical model for this phenomenon is a kinetic equation that describes the velocity jump process whose tumbling/transition kernel uniquely determines the effect of chemical stimulus on bacteria. The model has been shown to be an accurate model that matches with bacteria motion qualitatively… ▽ More Chemotaxis is the physical phenomenon that bacteria adjust their motions according to chemical stimulus. A classical model for this phenomenon is a kinetic equation that describes the velocity jump process whose tumbling/transition kernel uniquely determines the effect of chemical stimulus on bacteria. The model has been shown to be an accurate model that matches with bacteria motion qualitatively. For a quantitative modeling, biophysicists and practitioners are also highly interested in determining the explicit value of the tumbling kernel. Due to the experimental limitations, measurements are typically macroscopic in nature. Do macroscopic quantities contain enough information to recover microscopic behavior? In this paper, we give a positive answer. We show that when given a special design of initial data, the population density, one specific macroscopic quantity as a function of time, contains sufficient information to recover the tumbling kernel and its associated dam** coefficient. Moreover, we can read off the chemotaxis tumbling kernel using the values of population density directly from this specific experimental design. This theoretical result using kinetic theory sheds light on how practitioners may conduct experiments in laboratories. △ Less

Submitted 4 October, 2023; v1 submitted 3 June, 2022; originally announced June 2022.

MSC Class: 92C17; 35R30; 35Q92; 35R09; 45K05

Journal ref: SIAM Journal on Mathematical Analysis, Vol. 56, Iss. 1 (2024)

arXiv:2205.13088 [pdf, other]

Towards future directions in data-integrative supervised prediction of human aging-related genes

Authors: Qi Li, Khalique Newaz, Tijana Milenković

Abstract: Identification of human genes involved in the aging process is critical due to the incidence of many diseases with age. A state-of-the-art approach for this purpose infers a weighted dynamic aging-specific subnetwork by map** gene expression (GE) levels at different ages onto the protein-protein interaction network (PPIN). Then, it analyzes this subnetwork in a supervised manner by training a pr… ▽ More Identification of human genes involved in the aging process is critical due to the incidence of many diseases with age. A state-of-the-art approach for this purpose infers a weighted dynamic aging-specific subnetwork by map** gene expression (GE) levels at different ages onto the protein-protein interaction network (PPIN). Then, it analyzes this subnetwork in a supervised manner by training a predictive model to learn how network topologies of known aging- vs. non-aging-related genes change across ages. Finally, it uses the trained model to predict novel aging-related genes. However, the best current subnetwork resulting from this approach still yields suboptimal prediction accuracy. This could be because it was inferred using outdated GE and PPIN data. Here, we evaluate whether analyzing a weighted dynamic aging-specific subnetwork inferred from newer GE and PPIN data improves prediction accuracy upon analyzing the best current subnetwork inferred from outdated data. Unexpectedly, we find that not to be the case. To understand this, we perform aging-related pathway and Gene Ontology (GO) term enrichment analyses. We find that the suboptimal prediction accuracy, regardless of which GE or PPIN data is used, may be caused by the current knowledge about which genes are aging-related being incomplete, or by the current methods for inferring or analyzing an aging-specific subnetwork being unable to capture all of the aging-related knowledge. These findings can potentially guide future directions towards improving supervised prediction of aging-related genes via -omics data integration. △ Less

Submitted 25 May, 2022; originally announced May 2022.

arXiv:2204.06071 [pdf, other]

Noise Perturbation for Saliency Prediction with Psychophysical Synthetic Images

Authors: Qiang Li

Abstract: Convolutional neural networks (CNNs) have achieved great success in natural image saliency prediction. The primary goal of this study is to investigate the performance of saliency prediction in CNN and classic models with psychophysical synthetic images under noise perturbation. Is it still as decent as natural images in terms of performance? In the meantime, it can be used to investigate the rela… ▽ More Convolutional neural networks (CNNs) have achieved great success in natural image saliency prediction. The primary goal of this study is to investigate the performance of saliency prediction in CNN and classic models with psychophysical synthetic images under noise perturbation. Is it still as decent as natural images in terms of performance? In the meantime, it can be used to investigate the relationship between CNNs and human vision, mainly low-level vision functions. On the other hand, are CNNs exact replicas of human visual function? This study used CNNs, Fourier, and spectral models inspired by low-level vision systems to investigate saliency prediction on psychophysical synthetic images rather than natural images. According to our findings, saliency prediction models inspired by Fourier and spectral theory outperformed current pre-trained deep neural networks on psychophysical images with noise perturbation. However, psychophysical models were more unstable in noise than pre-trained deep neural networks. Meanwhile, we suggested that investigating CNNs with psychophysical methods could benefit visual neuroscience and artificial neural network studies. △ Less

Submitted 28 September, 2023; v1 submitted 12 April, 2022; originally announced April 2022.

Comments: 7 pages, 6 figures

arXiv:2110.13787 [pdf, other]

doi 10.3390/computation9110119

Multiscale convergence of the inverse problem for chemotaxis in the Bayesian setting

Authors: Kathrin Hellmuth, Christian Klingenberg, Qin Li, Min Tang

Abstract: Chemotaxis describes the movement of an organism, such as single or multi-cellular organisms and bacteria, in response to a chemical stimulus. Two widely used models to describe the phenomenon are the celebrated Keller-Segel equation and a chemotaxis kinetic equation. These two equations describe the organism movement at the macro- and mesoscopic level respectively, and are asymptotically equivale… ▽ More Chemotaxis describes the movement of an organism, such as single or multi-cellular organisms and bacteria, in response to a chemical stimulus. Two widely used models to describe the phenomenon are the celebrated Keller-Segel equation and a chemotaxis kinetic equation. These two equations describe the organism movement at the macro- and mesoscopic level respectively, and are asymptotically equivalent in the parabolic regime. How the organism responds to a chemical stimulus is embedded in the diffusion/advection coefficients of the Keller-Segel equation or the turning kernel of the chemotaxis kinetic equation. Experiments are conducted to measure the time dynamics of the organisms' population level movement when reacting to certain stimulation. From this one infers the chemotaxis response, which constitutes an inverse problem. \\ In this paper we discuss the relation between both the macro- and mesoscopic inverse problems, each of which is associated to two different forward models. The discussion is presented in the Bayesian framework, where the posterior distribution of the turning kernel of the organism population is sought after. We prove the asymptotic equivalence of the two posterior distributions. △ Less

Submitted 6 December, 2021; v1 submitted 26 October, 2021; originally announced October 2021.

Journal ref: Computation 9, no. 11: 119 (2021)

arXiv:2104.13957 [pdf, other]

A Bayesian Modified Ising Model for Identifying Spatially Variable Genes from Spatial Transcriptomics Data

Authors: Xi Jiang, Qiwei Li, Guanghua Xiao

Abstract: A recent technology breakthrough in spatial molecular profiling has enabled the comprehensive molecular characterizations of single cells while preserving spatial information. It provides new opportunities to delineate how cells from different origins form tissues with distinctive structures and functions. One immediate question in spatial molecular profiling data analysis is to identify genes who… ▽ More A recent technology breakthrough in spatial molecular profiling has enabled the comprehensive molecular characterizations of single cells while preserving spatial information. It provides new opportunities to delineate how cells from different origins form tissues with distinctive structures and functions. One immediate question in spatial molecular profiling data analysis is to identify genes whose expressions exhibit spatially correlated patterns, called spatially variable genes. Most current methods to identify spatially variable genes are built upon the geostatistical model with Gaussian process to capture the spatial patterns, which rely on ad hoc kernels that could limit the models' ability to identify complex spatial patterns. In order to overcome this challenge and capture more types of spatial patterns, we introduce a Bayesian approach to identify spatially variable genes via a modified Ising model. The key idea is to use the energy interaction parameter of the Ising model to characterize spatial expression patterns. We use auxiliary variable Markov chain Monte Carlo algorithms to sample from the posterior distribution with an intractable normalizing constant in the model. Simulation studies using both simulated and synthetic data showed that the energy-based modeling approach led to higher accuracy in detecting spatially variable genes than those kernel-based methods. When applied to two real spatial transcriptomics datasets, the proposed method discovered novel spatial patterns that shed light on the biological mechanisms. In summary, the proposed method presents a new perspective for analyzing spatial transcriptomics data. △ Less

Submitted 5 October, 2021; v1 submitted 28 April, 2021; originally announced April 2021.

Comments: Version 3

arXiv:2103.15142 [pdf]

COSINE: A Web Server for Clonal and Subclonal Structure Inference and Evolution in Cancer Genomics

Authors: Xiguo Yuan, Yuan Zhao, Yang Guo, Linmei Ge, Wei Liu, Shiyu Wen, Qi Li, Zhangbo Wan, Peina Zheng, Tao Guo, Zhida Li, Martin Peifer, Yupeng Cun

Abstract: Cancers evolve from mutation of a single cell with sequential clonal and subclonal expansion of somatic mutation acquisition. Inferring clonal and subclonal structures from bulk or single cell tumor genomic sequencing data has a huge impact on cancer evolution studies. Clonal state and mutational order can provide detailed insight into tumor origin and its future development. In the past decade, a… ▽ More Cancers evolve from mutation of a single cell with sequential clonal and subclonal expansion of somatic mutation acquisition. Inferring clonal and subclonal structures from bulk or single cell tumor genomic sequencing data has a huge impact on cancer evolution studies. Clonal state and mutational order can provide detailed insight into tumor origin and its future development. In the past decade, a variety of methods have been developed for subclonal reconstruction using bulk tumor sequencing data. As these methods have been developed in different programming languages and using different input data formats, their use and comparison can be problematic. Therefore, we established a web server for clonal and subclonal structure inference and evolution of cancer genomic data (COSINE), which included 12 popular subclonal reconstruction methods. We decomposed each method via a detailed workflow of single processing steps with a user-friendly interface. To the best of our knowledge, this is the first web server providing online subclonal inference, including the most popular subclonal reconstruction methods. COSINE is freely accessible at www.clab-cosine.net or http://bio.rj.run:48996/cun-web. △ Less

Submitted 28 March, 2021; originally announced March 2021.

arXiv:2103.00481 [pdf, other]

doi 10.1167/jov.22.6.8.

Contrast Sensitivity Functions in Autoencoders

Authors: Qiang Li, Alex Gomez-Villa, Marcelo Bertalmio, Jesus Malo

Abstract: Three decades ago, Atick et al. suggested that human frequency sensitivity may emerge from the enhancement required for a more efficient analysis of retinal images. Here we reassess the relevance of low-level vision tasks in the explanation of the Contrast Sensitivity Functions (CSFs) in light of (1) the current trend of using artificial neural networks for studying vision, and (2) the current kno… ▽ More Three decades ago, Atick et al. suggested that human frequency sensitivity may emerge from the enhancement required for a more efficient analysis of retinal images. Here we reassess the relevance of low-level vision tasks in the explanation of the Contrast Sensitivity Functions (CSFs) in light of (1) the current trend of using artificial neural networks for studying vision, and (2) the current knowledge of retinal image representations. As a first contribution, we show that a very popular type of convolutional neural networks (CNNs), called autoencoders, may develop human-like CSFs in the spatio-temporal and chromatic dimensions when trained to perform some basic low-level vision tasks (like retinal noise and optical blur removal), but not others (like chromatic adaptation or pure reconstruction after simple bottlenecks). As an illustrative example, the best CNN (in the considered set of simple architectures for enhancement of the retinal signal) reproduces the CSFs with an RMSE error of 11\% of the maximum sensitivity. As a second contribution, we provide experimental evidence of the fact that, for some functional goals (at low abstraction level), deeper CNNs that are better in reaching the quantitative goal are actually worse in replicating human-like phenomena (such as the CSFs). This low-level result (for the explored networks) is not necessarily in contradiction with other works that report advantages of deeper nets in modeling higher-level vision goals. However, in line with a growing body of literature, our results suggests another word of caution about CNNs in vision science since the use of simplified units or unrealistic architectures in goal optimization may be a limitation for the modeling and understanding of human vision. △ Less

Submitted 4 March, 2022; v1 submitted 28 February, 2021; originally announced March 2021.

Comments: Accepted in the Journal of Vision

Journal ref: Journal of Vision 2022;22(6):8

arXiv:2101.00059 [pdf, other]

doi 10.1177/09622802211037076

CauchyCP: a powerful test under non-proportional hazards using Cauchy combination of change-point Cox regressions

Authors: Hong Zhang, Qing Li, Devan V. Mehrotra, Judong Shen

Abstract: Non-proportional hazards data are routinely encountered in randomized clinical trials. In such cases, classic Cox proportional hazards model can suffer from severe power loss, with difficulty in interpretation of the estimated hazard ratio since the treatment effect varies over time. We propose CauchyCP, an omnibus test of change-point Cox regression models, to overcome both challenges while detec… ▽ More Non-proportional hazards data are routinely encountered in randomized clinical trials. In such cases, classic Cox proportional hazards model can suffer from severe power loss, with difficulty in interpretation of the estimated hazard ratio since the treatment effect varies over time. We propose CauchyCP, an omnibus test of change-point Cox regression models, to overcome both challenges while detecting signals of non-proportional hazards patterns. Extensive simulation studies demonstrate that, compared to existing treatment comparison tests under non-proportional hazards, the proposed CauchyCP test 1) controls the type I error better at small $α$ levels ($< 0.01$); 2) increases the power of detecting time-varying effects; and 3) is more computationally efficient. The superior performance of CauchyCP is further illustrated using retrospective analyses of two randomized clinical trial datasets and a pharmacogenetic biomarker study dataset. The R package $\textit{CauchyCP}$ is publicly available on CRAN. △ Less

Submitted 31 December, 2020; originally announced January 2021.

Journal ref: Statistical Methods in Medical Research. 2021;30(11):2447-2458

arXiv:2009.09514 [pdf, other]

Early Indicators of COVID-19 Spread Risk Using Digital Trace Data of Population Activities

Authors: Xinyu Gao, Chao Fan, Yang Yang, Sanghyeon Lee, Qingchun Li, Mikel Maron, Ali Mostafavi

Abstract: The spread of pandemics such as COVID-19 is strongly linked to human activities. The objective of this paper is to specify and examine early indicators of disease spread risk in cities during the initial stages of outbreak based on patterns of human activities obtained from digital trace data. In this study, the Venables distance (D_v), and the activity density (D_a) are used to quantify and evalu… ▽ More The spread of pandemics such as COVID-19 is strongly linked to human activities. The objective of this paper is to specify and examine early indicators of disease spread risk in cities during the initial stages of outbreak based on patterns of human activities obtained from digital trace data. In this study, the Venables distance (D_v), and the activity density (D_a) are used to quantify and evaluate human activities for 193 US counties, whose cumulative number of confirmed cases was greater than 100 as of March 31, 2020. Venables distance provides a measure of the agglomeration of the level of human activities based on the average distance of human activities across a city or a county (less distance could lead to a greater contact risk). Activity density provides a measure of level of overall activity level in a county or a city (more activity could lead to a greater risk). Accordingly, Pearson correlation analysis is used to examine the relationship between the two human activity indicators and the basic reproduction number in the following weeks. The results show statistically significant correlations between the indicators of human activities and the basic reproduction number in all counties, as well as a significant leader-follower relationship (time lag) between them. The results also show one to two weeks' lag between the change in activity indicators and the decrease in the basic reproduction number. This result implies that the human activity indicators provide effective early indicators for the spread risk of the pandemic during the early stages of the outbreak. Hence, the results could be used by the authorities to proactively assess the risk of disease spread by monitoring the daily Venables distance and activity density in a proactive manner. △ Less

Submitted 20 September, 2020; originally announced September 2020.

Comments: 12 pages, 8 figures

arXiv:2007.05672 [pdf, other]

A critical survey on the kinetic assays of DNA polymerase fidelity from a new theoretical perspective

Authors: Qiu-Shi Li, Yao-Gen Shu, Zhong-Can Ou-Yang, Ming Li

Abstract: The high fidelity of DNA polymerase is critical for the faithful replication of genomic DNA. Several approaches were proposed to quantify the fidelity of DNA polymerase. Direct measurements of the error frequency of the replication products definitely give the true fidelity but turn out very hard to implement. Two biochemical kinetic approaches, the steady-state assay and the transient-state assay… ▽ More The high fidelity of DNA polymerase is critical for the faithful replication of genomic DNA. Several approaches were proposed to quantify the fidelity of DNA polymerase. Direct measurements of the error frequency of the replication products definitely give the true fidelity but turn out very hard to implement. Two biochemical kinetic approaches, the steady-state assay and the transient-state assay, were then suggested and widely adopted. In these assays, the error frequency is indirectly estimated by using the steady-state or the transient-state kinetic theory combined with the measured kinetic rates. However, whether these indirectly estimated fidelities are equivalent to the true fidelity has never been clarified theoretically, and in particular there are different strategies to quantify the proofreading efficiency of DNAP but often lead to inconsistent results. The reason for all these confusions is that it's mathematically challenging to formulate a rigorous and general theory of the true fidelity. Recently we have succeeded to establish such a theoretical framework. In this paper, we develop this theory to make a comprehensive examination on the theoretical foundation of the kinetic assays and the relation between fidelities obtained by different methods. We conclude that while the steady-state assay and the transient-state assay can always measure the true fidelity of exonuclease-deficient DNA polymerases, they only do so for exonuclease-efficient DNA polymerases conditionally (the proper way to use these assays to quantify the proofreading efficiency is also suggested). We thus propose a new kinetic approach, the single-molecule assay, which indirectly but precisely characterizes the true fidelity of either exonuclease-deficient or exonuclease-efficient DNA polymerases. △ Less

Submitted 10 July, 2020; originally announced July 2020.

Comments: 13 pages, 14 figures

arXiv:2007.05379 [pdf]

Population aging caused by rise in sex ratio at birth

Authors: Zhen Zhang, Qiang Li

Abstract: Despite its historical and biological stability, the sex ratio at birth (SRB) has risen in parts of the world in the last several decades. The resultant demographic consequences, mostly on sex imbalance, are well documented, typically including "missing girls/women" and "marriage squeeze." However, the SRB-induced impact on demographic dynamics, particularly its underlying mechanism, has not been… ▽ More Despite its historical and biological stability, the sex ratio at birth (SRB) has risen in parts of the world in the last several decades. The resultant demographic consequences, mostly on sex imbalance, are well documented, typically including "missing girls/women" and "marriage squeeze." However, the SRB-induced impact on demographic dynamics, particularly its underlying mechanism, has not been explored in depth. We aim to investigate the impact of the SRB rise on the size, structure, and growth of a population, particularly emphasizing on population aging. We provide a simple framework, derived from classical stable population models, to analyze how the SRB rise can reduce the population size and make the population old. We demonstrate that the cohorts born with a higher SRB are smaller in size than those with a lower SRB. As the affected cohorts are born into the population, their smaller size will reduce the total population size, thereby lifting the fraction of old people that were born with the original SRB and have the same size as before. The resultant population aging speed increases as the cohorts with the new SRB take an increasing share of the population. This study adds that, in addition to fertility and mortality, the SRB can be a driving factor of population dynamics, especially when it moves far above normal biological levels. △ Less

Submitted 10 July, 2020; originally announced July 2020.

arXiv:2006.08115 [pdf, other]

Minimax Dynamics of Optimally Balanced Spiking Networks of Excitatory and Inhibitory Neurons

Authors: Qianyi Li, Cengiz Pehlevan

Abstract: Excitation-inhibition (E-I) balance is ubiquitously observed in the cortex. Recent studies suggest an intriguing link between balance on fast timescales, tight balance, and efficient information coding with spikes. We further this connection by taking a principled approach to optimal balanced networks of excitatory (E) and inhibitory (I) neurons. By deriving E-I spiking neural networks from greedy… ▽ More Excitation-inhibition (E-I) balance is ubiquitously observed in the cortex. Recent studies suggest an intriguing link between balance on fast timescales, tight balance, and efficient information coding with spikes. We further this connection by taking a principled approach to optimal balanced networks of excitatory (E) and inhibitory (I) neurons. By deriving E-I spiking neural networks from greedy spike-based optimizations of constrained minimax objectives, we show that tight balance arises from correcting for deviations from the minimax optima. We predict specific neuron firing rates in the network by solving the minimax problem, going beyond statistical theories of balanced networks. Finally, we design minimax objectives for reconstruction of an input signal, associative memory, and storage of manifold attractors, and derive from them E-I networks that perform the computation. Overall, we present a novel normative modeling approach for spiking E-I networks, going beyond the widely-used energy minimizing networks that violate Dale's law. Our networks can be used to model cortical circuits and computations. △ Less

Submitted 30 April, 2021; v1 submitted 14 June, 2020; originally announced June 2020.

Comments: There was a typo in Eq. 3 for the definition of firing rates, where we had e^{-(t-t')/τ_E} in the integrand, which should be e^{-t'/τ_E}, it is corrected in this version

Journal ref: NeurIPS 2020

arXiv:2006.01054 [pdf, other]

Effects of Population Co-location Reduction on Cross-county Transmission Risk of COVID-19 in the United States

Authors: Chao Fan, Sanghyeon Lee, Yang Yang, Bora Oztekin, Qingchun Li, Ali Mostafavi

Abstract: The rapid spread of COVID-19 in the United States has imposed a major threat to public health, the real economy, and human well-being. With the absence of effective vaccines, the preventive actions of social distancing and travel reduction are recognized as essential non-pharmacologic approaches to control the spread of COVID-19. Prior studies demonstrated that human movement and mobility drove th… ▽ More The rapid spread of COVID-19 in the United States has imposed a major threat to public health, the real economy, and human well-being. With the absence of effective vaccines, the preventive actions of social distancing and travel reduction are recognized as essential non-pharmacologic approaches to control the spread of COVID-19. Prior studies demonstrated that human movement and mobility drove the spatiotemporal distribution of COVID-19 in China. Little is known, however, about the patterns and effects of co-location reduction on cross-county transmission risk of COVID-19. This study utilizes Facebook co-location data for all counties in the United States from March to early May 2020. The analysis examines the synchronicity and time lag between travel reduction and pandemic growth trajectory to evaluate the efficacy of social distancing in ceasing the population co-location probabilities, and subsequently the growth in weekly new cases. The results show that the mitigation effects of co-location reduction appear in the growth of weekly new cases with one week of delay. Furthermore, significant segregation is found among different county groups which are categorized based on numbers of cases. The results suggest that within-group co-location probabilities remain stable, and social distancing policies primarily resulted in reduced cross-group co-location probabilities (due to travel reduction from counties with large number of cases to counties with low numbers of cases). These findings could have important practical implications for local governments to inform their intervention measures for monitoring and reducing the spread of COVID-19, as well as for adoption in future pandemics. Public policy, economic forecasting, and epidemic modeling need to account for population co-location patterns in evaluating transmission risk of COVID-19 across counties. △ Less

Submitted 1 June, 2020; originally announced June 2020.

Comments: 12 pages, 7 figures

arXiv:2005.05784 [pdf, other]

A Graph Gaussian Embedding Method for Predicting Alzheimer's Disease Progression with MEG Brain Networks

Authors: Mengjia Xu, David Lopez Sanz, Pilar Garces, Fernando Maestu, Quanzheng Li, Dimitrios Pantazis

Abstract: Characterizing the subtle changes of functional brain networks associated with the pathological cascade of Alzheimer's disease (AD) is important for early diagnosis and prediction of disease progression prior to clinical symptoms. We developed a new deep learning method, termed multiple graph Gaussian embedding model (MG2G), which can learn highly informative network features by map** high-dimen… ▽ More Characterizing the subtle changes of functional brain networks associated with the pathological cascade of Alzheimer's disease (AD) is important for early diagnosis and prediction of disease progression prior to clinical symptoms. We developed a new deep learning method, termed multiple graph Gaussian embedding model (MG2G), which can learn highly informative network features by map** high-dimensional resting-state brain networks into a low-dimensional latent space. These latent distribution-based embeddings enable a quantitative characterization of subtle and heterogeneous brain connectivity patterns at different regions and can be used as input to traditional classifiers for various downstream graph analytic tasks, such as AD early stage prediction, and statistical evaluation of between-group significant alterations across brain regions. We used MG2G to detect the intrinsic latent dimensionality of MEG brain networks, predict the progression of patients with mild cognitive impairment (MCI) to AD, and identify brain regions with network alterations related to MCI. △ Less

Submitted 10 November, 2020; v1 submitted 7 May, 2020; originally announced May 2020.

arXiv:2005.03659 [pdf, other]

Improved supervised prediction of aging-related genes via weighted dynamic network analysis

Authors: Qi Li, Khalique Newaz, Tijana Milenković

Abstract: This study focuses on the task of supervised prediction of aging-related genes from -omics data. Unlike gene expression methods for this task that capture aging-specific information but ignore interactions between genes (i.e., their protein products), or protein-protein interaction (PPI) network methods for this task that account for PPIs but the PPIs are context-unspecific, we recently integrated… ▽ More This study focuses on the task of supervised prediction of aging-related genes from -omics data. Unlike gene expression methods for this task that capture aging-specific information but ignore interactions between genes (i.e., their protein products), or protein-protein interaction (PPI) network methods for this task that account for PPIs but the PPIs are context-unspecific, we recently integrated the two data types into an aging-specific PPI subnetwork, which yielded more accurate aging-related gene predictions. However, a dynamic aging-specific subnetwork did not improve prediction performance compared to a static aging-specific subnetwork, despite the aging process being dynamic. This could be because the dynamic subnetwork was inferred using a naive Induced subgraph approach. Instead, we recently inferred a dynamic aging-specific subnetwork using a methodologically more advanced notion of network propagation (NP), which improved upon Induced dynamic aging-specific subnetwork in a different task, that of unsupervised analyses of the aging process. Here, we evaluate whether our existing NP-based dynamic subnetwork will improve upon the dynamic as well as static subnetwork constructed by the Induced approach in the considered task of supervised prediction of aging-related genes. The existing NP-based subnetwork is unweighted, i.e., it gives equal importance to each of the aging-specific PPIs. Because accounting for aging-specific edge weights might be important, we additionally propose a weighted NP-based dynamic aging-specific subnetwork. We demonstrate that a predictive machine learning model trained and tested on the weighted subnetwork yields higher accuracy when predicting aging-related genes than predictive models run on the existing unweighted dynamic or static subnetworks, regardless of whether the existing subnetworks were inferred using NP or the Induced approach. △ Less

Submitted 13 April, 2021; v1 submitted 6 May, 2020; originally announced May 2020.

arXiv:2003.07586 [pdf]

A new theory of fluid-solid coupling in a porous medium for application to the ultrasonic evaluation of tissue remodeling using bioelastomers

Authors: Chuanyang Jiang, Yanying Zhu, Kaixuan Guo, Qing Li, Zhengwei You, Jiao Yu

Abstract: Bioelastomers have demonstrated tremendous value and potential in the field of tissue repair due to increasing health demands. Improved non-invasive methods are required for monitoring tissue development assisted by bioelastomers. In this paper, we present a novel theory of fluid-solid coupling in a porous medium for application to the ultrasonic evaluation of tissue remodeling using bioelastomers… ▽ More Bioelastomers have demonstrated tremendous value and potential in the field of tissue repair due to increasing health demands. Improved non-invasive methods are required for monitoring tissue development assisted by bioelastomers. In this paper, we present a novel theory of fluid-solid coupling in a porous medium for application to the ultrasonic evaluation of tissue remodeling using bioelastomers. The common assumption of equal solid and liquid displacements used in the conventional description of a fluid-saturated porous solid cannot be applied to soft media, such as bioelastomers. We revise the geoacoustic theory of Biot to allow for relative motion between a fluid and a solid in an aggregate and derive an expression for a characteristic fluid-solid coupling parameter. Unlike the conventional method, the propagation speed of shear waves observed by ultrasound shear wave elastography is considered a known quantity in the novel theory, and the calculated value of the coupling parameter is used to evaluate the status of tissue repair. The model is validated by analyzing selected cases. The conditions under which the model can be applied are identified. However, further development of the theory is required to extract dynamic parameters that can be used to monitor the entire tissue remodeling process. In this paper, a theoretical approach is developed that can be used to analyze the mechanics of tissue repair. The theory has potential applications in the field of acellular in situ tissue engineering for non-invasive monitoring of the complex mechanical remodeling process of tissue regeneration and bioelastomer degradation. △ Less

Submitted 17 March, 2020; originally announced March 2020.

arXiv:2002.05866 [pdf, other]

doi 10.1016/j.**f.2020.02.014

Trend and forecasting of the COVID-19 outbreak in China

Authors: Qiang Li, Wei Feng

Abstract: By using the public data from Jan. 20 to Feb. 11, 2020, we perform data-driven analysis and forecasting on the COVID-19 epidemic in mainland China, especially Hubei province. Our results show that the turning points of the daily infections are predicted to be Feb. 6 and Feb. 1, 2020, for Hubei and China other than Hubei, respectively. The epidemic in China is predicted to end up after Mar. 10, 202… ▽ More By using the public data from Jan. 20 to Feb. 11, 2020, we perform data-driven analysis and forecasting on the COVID-19 epidemic in mainland China, especially Hubei province. Our results show that the turning points of the daily infections are predicted to be Feb. 6 and Feb. 1, 2020, for Hubei and China other than Hubei, respectively. The epidemic in China is predicted to end up after Mar. 10, 2020, and the number of the total infections are predicted to be 51600. The data trends reveal that quick and active strategies taken by China to reduce human exposure have already had a good impact on the control of the epidemic. △ Less

Submitted 13 February, 2020; originally announced February 2020.

Comments: 12 figures, 10 pages

Report number: Online 27-FEB-2020 MSC Class: 65C20; 68U20 ACM Class: G.3; I.6.3

Journal ref: Journal of Infection 80 (2020) 469-496

arXiv:2002.02607 [pdf, ps, other]

Self-awareness based resource allocation strategy for containment of epidemic spreading

Authors: Xiaolong Chen, Quanhui Liu, Ruijie Wang, Qing Li, Wei Wang

Abstract: Resource support between individuals is of particular importance in controlling or mitigating epidemic spreading, especially during pandemics. Whereas there remains the question of how we can protect ourselves from being infected while hel** others by donating resources in fighting against the epidemic. To answer the question, we propose a novel resource allocation model by considering the aware… ▽ More Resource support between individuals is of particular importance in controlling or mitigating epidemic spreading, especially during pandemics. Whereas there remains the question of how we can protect ourselves from being infected while hel** others by donating resources in fighting against the epidemic. To answer the question, we propose a novel resource allocation model by considering the awareness of self-protection of individuals. In the model, a tuning parameter is introduced to quantify the reaction strength of individuals when they are aware of the disease. And then, a coupled model of resource allocation and disease spreading is proposed to study the impact of self-awareness on resource allocation and, its impact on the dynamics of epidemic spreading. Through theoretical analysis and extensive Monte Carlo simulations, we find that in the stationary state, the system converges to two states: the whole healthy or the completely infected, which indicates an abrupt increase in the prevalence when there is a shortage of resources. More importantly, we find that too cautious and too selfless for the people during the outbreak of an epidemic are both not suitable for disease control. Through extensive simulations, we find the optimal point, at which there is a maximum value of the epidemic threshold, and an outbreak can be delayed to the greatest extent. At last, we study further the effects of network structure on the coupled dynamics. We find that the degree heterogeneity promotes the outbreak of disease, and the network structure does not alter the optimal phenomenon in behavior response. △ Less

Submitted 6 February, 2020; originally announced February 2020.

arXiv:2001.03057 [pdf]

DS-GCNs: Connectome Classification Using Dynamic Spectral Graph Convolution Networks with Assistant Task Training

Authors: Xiaodan Xing, Qingfeng Li, Hao Wei, Minqing Zhang, Yiqiang Zhan, Xiang Sean Zhou, Zhong Xue, Feng Shi

Abstract: Functional Connectivity (FC) matrices measure the regional interactions in the brain and have been widely used in neurological brain disease classification. However, a FC matrix is neither a natural image which contains shape and texture information, nor a vector of independent features, which renders the extracting of efficient features from matrices as a challenging problem. A brain network, als… ▽ More Functional Connectivity (FC) matrices measure the regional interactions in the brain and have been widely used in neurological brain disease classification. However, a FC matrix is neither a natural image which contains shape and texture information, nor a vector of independent features, which renders the extracting of efficient features from matrices as a challenging problem. A brain network, also named as connectome, could forma a graph structure naturally, the nodes of which are brain regions and the edges are interregional connectivity. Thus, in this study, we proposed novel graph convolutional networks (GCNs) to extract efficient disease-related features from FC matrices. Considering the time-dependent nature of brain activity, we computed dynamic FC matrices with sliding-windows and implemented a graph convolution based LSTM (long short term memory) layer to process dynamic graphs. Moreover, the demographics of patients were also used to guide the classification. However, unlike in conventional methods where personal information, i.e., gender and age were added as extra inputs, we argue that this kind of approach may not actually improve the classification performance, for such personal information given in dataset was usually balanced distributed. In this paper, we proposed to utilize the demographic information as extra outputs and to share parameters among three networks predicting subject status, gender and age, which serve as assistant tasks. We tested the performance of the proposed architecture in ADNI II dataset to classify Alzheimer's disease patients from normal controls. The classification accuracy, sensitivity and specificity reach 0.90, 0.92 and 0.89 on ADNI II dataset. △ Less

Submitted 10 December, 2019; originally announced January 2020.

Comments: Number of pages: 22 Word count in abstract: 252, word count in manuscript: 3910 This manuscript includes 3 tables and 9 figures

arXiv:1910.13632 [pdf]

doi 10.1214/19-AOAS1249

RCRnorm: An integrated system of random-coefficient hierarchical regression models for normalizing NanoString nCounter data

Authors: Gaoxiang Jia, Xinlei Wang, Qiwei Li, Wei Lu, Ximing Tang, Ignacio Wistuba, Yang Xie

Abstract: Formalin-fixed paraffin-embedded (FFPE) samples have great potential for biomarker discovery, retrospective studies and diagnosis or prognosis of diseases. Their application, however, is hindered by the unsatisfactory performance of traditional gene expression profiling techniques on damaged RNAs. NanoString nCounter platform is well suited for profiling of FFPE samples and measures gene expressio… ▽ More Formalin-fixed paraffin-embedded (FFPE) samples have great potential for biomarker discovery, retrospective studies and diagnosis or prognosis of diseases. Their application, however, is hindered by the unsatisfactory performance of traditional gene expression profiling techniques on damaged RNAs. NanoString nCounter platform is well suited for profiling of FFPE samples and measures gene expression with high sensitivity which may greatly facilitate realization of scientific and clinical values of FFPE samples. However, methodological development for normalization, a critical step when analyzing this type of data, is far behind. Existing methods designed for the platform use information from different types of internal controls separately and rely on an overly-simplified assumption that expression of housekee** genes is constant across samples for global scaling. Thus, these methods are not optimized for the nCounter system, not mentioning that they were not developed for FFPE samples. We construct an integrated system of random-coefficient hierarchical regression models to capture main patterns and characteristics observed from NanoString data of FFPE samples and develop a Bayesian approach to estimate parameters and normalize gene expression across samples. Our method, labeled RCRnorm, incorporates information from all aspects of the experimental design and simultaneously removes biases from various sources. It eliminates the unrealistic assumption on housekee** genes and offers great interpretability. Furthermore, it is applicable to freshly frozen or like samples that can be generally viewed as a reduced case of FFPE samples. Simulation and applications showed the superior performance of RCRnorm. △ Less

Submitted 28 October, 2019; originally announced October 2019.

MSC Class: 97K80

Journal ref: Ann. Appl. Stat. 13 (2019), no. 3, 1617--1647. https://projecteuclid.org/euclid.aoas/1571277766

arXiv:1909.03109 [pdf, other]

Monolingual and bilingual language networks in healthy subjects using functional MRI and graph theory

Authors: Qiongge Li, Luca Pasquini, Gino Del Ferraro, Madeleine Gene, Kyung K. Peck, Hernán A. Makse, Andrei I. Holodny

Abstract: Pre-surgical language map** with functional magnetic resonance imaging (fMRI) is routinely conducted to assist the neurosurgeon in preventing damage to brain regions responsible for language. Functional differences exist between the monolingual versus the bilingual brain, whereas clinical fMRI tasks are typically conducted in a single language. The presence of secondary language processing mecha… ▽ More Pre-surgical language map** with functional magnetic resonance imaging (fMRI) is routinely conducted to assist the neurosurgeon in preventing damage to brain regions responsible for language. Functional differences exist between the monolingual versus the bilingual brain, whereas clinical fMRI tasks are typically conducted in a single language. The presence of secondary language processing mechanisms is a potential source of error in the inferred language map. From fMRI data of healthy bilingual and monolingual subjects we obtain language maps as functional networks. Our results show a sub-network "core" architecture consisting of the Broca's, pre-supplementary motor, and premotor areas present across all subjects. Wernicke's Area was found to connect to the "core" to a different extent across groups. The $k$ core centrality measure shows "core" areas belong to the maximum core while WA and other fROIs vary across groups. The results may provide a benchmark to preserve equal treatment outcomes for bilingual patients. △ Less

Submitted 15 June, 2020; v1 submitted 6 September, 2019; originally announced September 2019.

Comments: 17 pages, 8 figures

arXiv:1908.08135 [pdf, other]

Supervised prediction of aging-related genes from a context-specific protein interaction subnetwork

Authors: Qi Li, Tijana Milenković

Abstract: Background. Human aging is linked to many prevalent diseases. The aging process is highly influenced by genetic factors. Hence, it is important to identify human aging-related genes. We focus on supervised prediction of such genes. Gene expression-based methods for this purpose study genes in isolation from each other. While protein-protein interaction (PPI) network-based methods for this purpose… ▽ More Background. Human aging is linked to many prevalent diseases. The aging process is highly influenced by genetic factors. Hence, it is important to identify human aging-related genes. We focus on supervised prediction of such genes. Gene expression-based methods for this purpose study genes in isolation from each other. While protein-protein interaction (PPI) network-based methods for this purpose account for interactions between genes' protein products, current PPI network data are context-unspecific, spanning different biological conditions. Instead, here, we focus on an aging-specific subnetwork of the entire PPI network, obtained by integrating aging-specific gene expression data and PPI network data. The potential of such data integration has been recognized but mostly in the context of cancer. So, we are the first to propose a supervised learning framework for predicting aging-related genes from an aging-specific PPI subnetwork. Results. In a systematic and comprehensive evaluation, we find that in many of the evaluation tests: (i) using an aging-specific subnetwork indeed yields more accurate aging-related gene predictions than using the entire network, and (ii) predictive methods from our framework that have not previously been used for supervised prediction of aging-related genes outperform existing prominent methods for the same purpose. Conclusion. These results justify the need for our framework. △ Less

Submitted 25 April, 2020; v1 submitted 21 August, 2019; originally announced August 2019.

Comments: This is a Journal extension of "10.1109/BIBM47256.2019.8983063". So we use the same title as our conference paper

arXiv:1906.07546 [pdf, other]

Core language brain network for fMRI-language task used in clinical applications

Authors: Qiongge Li, Gino Del Ferraro, Luca Pasquini, Kyung K. Peck, Hernan A. Makse, Andrei I. Holodny

Abstract: Functional magnetic resonance imaging (fMRI) is widely used in clinical applications to highlight brain areas involved in specific cognitive processes. Brain impairments, such as tumors, suppress the fMRI activation of the anatomical areas they invade and, thus, brain-damaged functional networks present missing links/areas of activation. The identification of the missing circuitry components is of… ▽ More Functional magnetic resonance imaging (fMRI) is widely used in clinical applications to highlight brain areas involved in specific cognitive processes. Brain impairments, such as tumors, suppress the fMRI activation of the anatomical areas they invade and, thus, brain-damaged functional networks present missing links/areas of activation. The identification of the missing circuitry components is of crucial importance to estimate the damage extent. The study of functional networks associated to clinical tasks but performed by healthy individuals becomes, therefore, of paramount concern. These `healthy' networks can, indeed, be used as control networks for clinical studies. In this work we investigate the functional architecture of 20 healthy individuals performing a language task designed for clinical purposes. We unveil a common architecture persistent across all subjects under study, which involves Broca's area, Wernicke's area, the Premotor area, and the pre-Supplementary motor area. We study the connectivity weight of this circuitry by using the k-core centrality measure and we find that three of these areas belong to the most robust structure of the functional language network for the specific task under study. Our results provide useful insight for clinical applications on primarily important functional connections which, thus, should be preserved through brain surgery. △ Less

Submitted 12 June, 2019; originally announced June 2019.

Comments: 14 pages, 7 figures

arXiv:1710.08149 [pdf, other]

Image Segmentation and Classification for Sickle Cell Disease using Deformable U-Net

Authors: Mo Zhang, Xiang Li, Mengjia Xu, Quanzheng Li

Abstract: Reliable cell segmentation and classification from biomedical images is a crucial step for both scientific research and clinical practice. A major challenge for more robust segmentation and classification methods is the large variations in the size, shape and viewpoint of the cells, combining with the low image quality caused by noise and artifacts. To address this issue, in this work we propose a… ▽ More Reliable cell segmentation and classification from biomedical images is a crucial step for both scientific research and clinical practice. A major challenge for more robust segmentation and classification methods is the large variations in the size, shape and viewpoint of the cells, combining with the low image quality caused by noise and artifacts. To address this issue, in this work we propose a learning-based, simultaneous cell segmentation and classification method based on the deep U-Net structure with deformable convolution layers. The U-Net architecture for deep learning has been shown to offer a precise localization for image semantic segmentation. Moreover, deformable convolution layer enables the free form deformation of the feature learning process, thus makes the whole network more robust to various cell morphologies and image settings. The proposed method is tested on microscopic red blood cell images from patients with sickle cell disease. The results show that U-Net with deformable convolution achieves the highest accuracy for segmentation and classification, comparing with original U-Net structure. △ Less

Submitted 29 October, 2017; v1 submitted 23 October, 2017; originally announced October 2017.

arXiv:1610.01192 [pdf]

doi 10.1016/j.bpj.2016.04.029

Microtubule Defects Influence Kinesin-Based Transport In Vitro

Authors: Winnie H. Liang, Qiaochu Li, K. M. Rifat Faysal, Stephen J. King, Ajay Gopinathan, **g Xu

Abstract: Microtubules are protein polymers that form "molecular highways" for long-range transport within living cells. Molecular motors actively step along microtubules to shuttle cellular materials between the nucleus and the cell periphery; this transport is critical for the survival and health of all eukaryotic cells. Structural defects in microtubules exist, but whether these defects impact molecular… ▽ More Microtubules are protein polymers that form "molecular highways" for long-range transport within living cells. Molecular motors actively step along microtubules to shuttle cellular materials between the nucleus and the cell periphery; this transport is critical for the survival and health of all eukaryotic cells. Structural defects in microtubules exist, but whether these defects impact molecular motor-based transport remains unknown. Here, we report a new, to our knowledge, approach that allowed us to directly investigate the impact of such defects. Using a modified optical-trap** method, we examined the group function of a major molecular motor, conventional kinesin, when transporting cargos along individual microtubules. We found that microtubule defects influence kinesin-based transport in vitro. The effects depend on motor number: cargos driven by a few motors tended to unbind prematurely from the microtubule, whereas cargos driven by more motors tended to pause. To our knowledge, our study provides the first direct link between microtubule defects and kinesin function. The effects uncovered in our study may have physiological relevance in vivo. △ Less

Submitted 4 October, 2016; originally announced October 2016.

Journal ref: Biophysical Journal , Volume 110 , Issue 10 , 2229 - 2240 (2016)

arXiv:1610.01189 [pdf]

doi 10.1016/j.bpj.2016.05.015

Quantitative Determination of the Probability of Multiple-Motor Transport in Bead-Based Assays

Authors: Qiaochu Li, Stephen J. King, Ajay Gopinathan, **g Xu

Abstract: With their longest dimension typically being less than 100 nm, molecular motors are significantly below the optical-resolution limit. Despite substantial advances in fluorescence-based imaging methodologies, labeling with beads remains critical for optical-trap**-based investigations of molecular motors. A key experimental challenge in bead-based assays is that the number of motors on a bead is… ▽ More With their longest dimension typically being less than 100 nm, molecular motors are significantly below the optical-resolution limit. Despite substantial advances in fluorescence-based imaging methodologies, labeling with beads remains critical for optical-trap**-based investigations of molecular motors. A key experimental challenge in bead-based assays is that the number of motors on a bead is not well defined. Particularly for single-molecule investigations, the probability of single versus multiple-motor events has not been experimentally investigated. Here, we used bead travel distance as an indicator of multiple-motor transport and determined the lower-bound probability of bead transport by two or more motors. We limited the ATP concentration to increase our detection sensitivity for multiple- versus single-kinesin transport. Surprisingly, for all but the lowest motor number examined, our measurements exceeded estimations of a previous model by R2-fold. To bridge this apparent gap between theory and experiment, we derived a closed-form expression for the probability of bead transport by multiple motors, and constrained the only free parameter in this model using our experimental measurements. Our data indicate that kinesin extends to ~57 nm during bead transport, suggesting that kinesin exploits its conformational flexibility to interact with microtubules at highly curved interfaces such as those present for vesicle transport in cells. To our knowledge, our findings provide the first experimentally constrained guide for estimating the probability of multiple-motor transport in optical trap** studies. The experimental approach utilized here (limiting ATP concentration) may be generally applicable to studies in which molecular motors are labeled with cargos that are artificial or are purified from cellular extracts. △ Less

Submitted 4 October, 2016; originally announced October 2016.

Journal ref: Biophysical Journal , Volume 110 , Issue 12 , 2720 - 2728 (2016)

arXiv:1407.2110 [pdf]

Addressing the unmet need for visualizing Conditional Random Fields in Biological Data

Authors: William C. Ray, Samuel L. Wolock, Nicholas W Callahan, Min Dong, Q. Quinn Li, Chun Liang, Thomas J Magliery, Christopher W. Bartlett

Abstract: Background: The biological world is replete with phenomena that appear to be ideally modeled and analyzed by one archetypal statistical framework - the Graphical Probabilistic Model (GPM). The structure of GPMs is a uniquely good match for biological problems that range from aligning sequences to modeling the genome-to-phenome relationship. The fundamental questions that GPMs address involve makin… ▽ More Background: The biological world is replete with phenomena that appear to be ideally modeled and analyzed by one archetypal statistical framework - the Graphical Probabilistic Model (GPM). The structure of GPMs is a uniquely good match for biological problems that range from aligning sequences to modeling the genome-to-phenome relationship. The fundamental questions that GPMs address involve making decisions based on a complex web of interacting factors. Unfortunately, while GPMs ideally fit many questions in biology, they are not an easy solution to apply. Building a GPM is not a simple task for an end user. Moreover, applying GPMs is also impeded by the insidious fact that the complex web of interacting factors inherent to a problem might be easy to define and also intractable to compute upon. Discussion: We propose that the visualization sciences can contribute to many domains of the bio-sciences, by develo** tools to address archetypal representation and user interaction issues in GPMs, and in particular a variety of GPM called a Conditional Random Field(CRF). CRFs bring additional power, and additional complexity, because the CRF dependency network can be conditioned on the query data. Conclusions: In this manuscript we examine the shared features of several biological problems that are amenable to modeling with CRFs, highlight the challenges that existing visualization and visual analytics paradigms induce for these data, and document an experimental solution called StickWRLD which, while leaving room for improvement, has been successfully applied in several biological research projects. △ Less

Submitted 8 July, 2014; originally announced July 2014.

Comments: BioVis 2014 conference

arXiv:1403.5241 [pdf]

Spatiotemporal Dissociation of Brain Activity underlying Subjective Awareness, Objective Performance and Confidence

Authors: Qi Li, Zachary Hill, Biyu J. He

Abstract: Despite intense recent research, the neural correlates of conscious visual perception remain elusive. The most established paradigm for studying brain mechanisms underlying conscious perception is to keep the physical sensory inputs constant and identify brain activities that correlate with the changing content of conscious awareness. However, such a contrast based on conscious content alone would… ▽ More Despite intense recent research, the neural correlates of conscious visual perception remain elusive. The most established paradigm for studying brain mechanisms underlying conscious perception is to keep the physical sensory inputs constant and identify brain activities that correlate with the changing content of conscious awareness. However, such a contrast based on conscious content alone would not only reveal brain activities directly contributing to conscious perception, but also include brain activities that precede or follow it. To address this issue, we devised a paradigm whereby we collected, trial-by-trial, measures of objective performance, subjective awareness, and the confidence level of subjective awareness. Using magnetoencephalography recordings in healthy human volunteers, we dissociated brain activities underlying these different cognitive phenomena. Our results provide strong evidence that widely distributed slow cortical potentials (SCPs) correlate with subjective awareness, even after the effects of objective performance and confidence were both removed. The SCP correlate of conscious perception manifests strongly in its waveform, phase, and power. In contrast, objective performance and confidence were both contributed by relatively transient brain activity. These results shed new light on the brain mechanisms of conscious, unconscious, and metacognitive processing. △ Less

Submitted 20 March, 2014; originally announced March 2014.

Comments: Published version of J. Neurosci

arXiv:1310.0656 [pdf, ps, other]

Piezoelectric Drop-on-Demand Inkjet Printing of Rat Fibroblast Cells: Survivability Study and Pattern Printing

Authors: Er Qiang Li, Eng Khoon Tan, Sigurdur Tryggvi Thoroddsen

Abstract: A novel piezoelectric, drop-on-demand (DOD) inkjet system has been developed and used to print L929 rat fibroblast cells. We investigate the survivability of the cells subjected to the large stresses during the printing process. These stresses are varied by changing the diameter of the orifice (36 to 119 microns) through which the cells are dispensed, as well as changing the electrical pulse used… ▽ More A novel piezoelectric, drop-on-demand (DOD) inkjet system has been developed and used to print L929 rat fibroblast cells. We investigate the survivability of the cells subjected to the large stresses during the printing process. These stresses are varied by changing the diameter of the orifice (36 to 119 microns) through which the cells are dispensed, as well as changing the electrical pulse used to drive the piezoelectric element. It is shown that for the smallest 36 microns diameter orifice, cell survival rates fall from 95% to approximately 76% when the ejection velocity is increased from 2 to 16 m/s. This decrease in survival rates is less significant when the larger orifice diameters of 81 microns and 119 microns are used. Analysis shows that there is a clear inverse relationship between cell survival rates and the mean shear rates during drop formation. By using the same printing set-up, fibroblast cells are printed onto alginate and collagen into patterns. Printed cells are cultured over a period of days to verify their long-term viability. Fibroblasts printed onto the collagen are found to successfully adhere, spread and proliferate, subsequently forming a denser patterns after 5 days in culture. Cell agglomeration is found to affect the printing performance, especially for the printhead with the smallest orifice, leading to frequent clogging of the nozzle. We also study the number of cells in each droplet, when printed under optimal conditions. The probability density of this number follows a binomial distribution, which consistent with a uniform distribution of cells in the medium and within the printhead. △ Less

Submitted 2 October, 2013; originally announced October 2013.

arXiv:1011.2087 [pdf, ps, other]

doi 10.1214/09-AOAS316

A nested mixture model for protein identification using mass spectrometry

Authors: Qunhua Li, Michael J. MacCoss, Matthew Stephens

Abstract: Mass spectrometry provides a high-throughput way to identify proteins in biological samples. In a typical experiment, proteins in a sample are first broken into their constituent peptides. The resulting mixture of peptides is then subjected to mass spectrometry, which generates thousands of spectra, each characteristic of its generating peptide. Here we consider the problem of inferring, from thes… ▽ More Mass spectrometry provides a high-throughput way to identify proteins in biological samples. In a typical experiment, proteins in a sample are first broken into their constituent peptides. The resulting mixture of peptides is then subjected to mass spectrometry, which generates thousands of spectra, each characteristic of its generating peptide. Here we consider the problem of inferring, from these spectra, which proteins and peptides are present in the sample. We develop a statistical approach to the problem, based on a nested mixture model. In contrast to commonly used two-stage approaches, this model provides a one-stage solution that simultaneously identifies which proteins are present, and which peptides are correctly identified. In this way our model incorporates the evidence feedback between proteins and their constituent peptides. Using simulated data and a yeast data set, we compare and contrast our method with existing widely used approaches (PeptideProphet/ProteinProphet) and with a recently published new approach, HSM. For peptide identification, our single-stage approach yields consistently more accurate results. For protein identification the methods have similar accuracy in most settings, although we exhibit some scenarios in which the existing methods perform poorly. △ Less

Submitted 9 November, 2010; originally announced November 2010.

Comments: Published in at http://dx.doi.org/10.1214/09-AOAS316 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-AOAS-AOAS316

Journal ref: Annals of Applied Statistics 2010, Vol. 4, No. 2, 962-987

Showing 1–45 of 45 results for author: Li, Q