
Showing 1–19 of 19 results for author: Hu, B

Searching in archive q-bio.
  1. arXiv:2405.03799  [pdf, other]

    cs.LG cs.AI q-bio.QM

    Synthetic Data from Diffusion Models Improve Drug Discovery Prediction

    Authors: Bing Hu, Ashish Saragadam, Anita Layton, Helen Chen

    Abstract: Artificial intelligence (AI) is increasingly used in every stage of drug development. Continuing breakthroughs in AI-based methods for drug discovery require the creation, improvement, and refinement of drug discovery data. We posit a new data challenge that slows the advancement of drug discovery AI: datasets are often collected independently from each other, often with little overlap, creating d…

    Submitted 6 May, 2024; originally announced May 2024.

  2. arXiv:2403.05314  [pdf, other]

    q-bio.BM

    Advances of Deep Learning in Protein Science: A Comprehensive Survey

    Authors: Bozhen Hu, Cheng Tan, Lirong Wu, Jiangbin Zheng, Jun Xia, Zhangyang Gao, Zicheng Liu, Fandi Wu, Guijun Zhang, Stan Z. Li

    Abstract: Protein representation learning plays a crucial role in understanding the structure and function of proteins, which are essential biomolecules involved in various biological processes. In recent years, deep learning has emerged as a powerful tool for protein modeling due to its ability to learn complex patterns and representations from large-scale protein data. This comprehensive survey aims to pr…

    Submitted 8 March, 2024; originally announced March 2024.

  3. arXiv:2402.09416  [pdf, other]

    q-bio.BM cs.LG

    Deep Manifold Transformation for Protein Representation Learning

    Authors: Bozhen Hu, Zelin Zang, Cheng Tan, Stan Z. Li

    Abstract: Protein representation learning is critical in various tasks in biology, such as drug design and protein structure or function prediction, which has primarily benefited from protein language models and graph neural networks. These models can capture intrinsic patterns from protein sequences and structures through masking and task-related losses. However, the learned protein representations are usu…

    Submitted 12 January, 2024; originally announced February 2024.

    Comments: This work has been accepted by ICASSP 2024

  4. arXiv:2402.08198  [pdf, other]

    q-bio.BM cs.AI cs.LG

    PSC-CPI: Multi-Scale Protein Sequence-Structure Contrasting for Efficient and Generalizable Compound-Protein Interaction Prediction

    Authors: Lirong Wu, Yufei Huang, Cheng Tan, Zhangyang Gao, Bozhen Hu, Haitao Lin, Zicheng Liu, Stan Z. Li

    Abstract: Compound-Protein Interaction (CPI) prediction aims to predict the pattern and strength of compound-protein interactions for rational drug discovery. Existing deep learning-based methods utilize only the single modality of protein sequences or structures and lack the co-modeling of the joint distribution of the two modalities, which may lead to significant performance drops in complex real-world sc…

    Submitted 12 February, 2024; originally announced February 2024.

  5. arXiv:2305.09480  [pdf, other]

    q-bio.BM cs.AI cs.LG

    Cross-Gate MLP with Protein Complex Invariant Embedding is A One-Shot Antibody Designer

    Authors: Cheng Tan, Zhangyang Gao, Lirong Wu, Jun Xia, Jiangbin Zheng, Xihong Yang, Yue Liu, Bozhen Hu, Stan Z. Li

    Abstract: Antibodies are crucial proteins produced by the immune system in response to foreign substances or antigens. The specificity of an antibody is determined by its complementarity-determining regions (CDRs), which are located in the variable domains of the antibody chains and form the antigen-binding site. Previous studies have utilized complex techniques to generate CDRs, but they suffer from inadeq…

    Submitted 10 January, 2024; v1 submitted 21 April, 2023; originally announced May 2023.

    Comments: Accepted by AAAI 2024

  6. arXiv:2303.11783  [pdf, other]

    q-bio.BM cs.AI cs.LG

    Lightweight Contrastive Protein Structure-Sequence Transformation

    Authors: Jiangbin Zheng, Ge Wang, Yufei Huang, Bozhen Hu, Siyuan Li, Cheng Tan, Xinwen Fan, Stan Z. Li

    Abstract: Pretrained protein structure models without labels are crucial foundations for the majority of protein downstream applications. The conventional structure pretraining methods follow the mature natural language pretraining methods such as denoised reconstruction and masked language modeling but usually destroy the real representation of spatial structures. The other common pretraining methods might…

    Submitted 19 March, 2023; originally announced March 2023.

  7. arXiv:2301.10774  [pdf, other]

    q-bio.BM cs.AI cs.LG

    RDesign: Hierarchical Data-efficient Representation Learning for Tertiary Structure-based RNA Design

    Authors: Cheng Tan, Yijie Zhang, Zhangyang Gao, Bozhen Hu, Siyuan Li, Zicheng Liu, Stan Z. Li

    Abstract: While artificial intelligence has made remarkable strides in revealing the relationship between biological macromolecules' primary sequence and tertiary structure, designing RNA sequences based on specified tertiary structures remains challenging. Though existing approaches in protein design have thoroughly explored structure-to-sequence dependencies in proteins, RNA design still confronts difficu…

    Submitted 6 March, 2024; v1 submitted 25 January, 2023; originally announced January 2023.

    Comments: 30 pages, 28 figures, 16 tables

  8. arXiv:2211.16742  [pdf, other]

    q-bio.QM cs.AI cs.LG

    Protein Language Models and Structure Prediction: Connection and Progression

    Authors: Bozhen Hu, Jun Xia, Jiangbin Zheng, Cheng Tan, Yufei Huang, Yongjie Xu, Stan Z. Li

    Abstract: The prediction of protein structures from sequences is an important task for function prediction, drug design, and related biological processes understanding. Recent advances have proved the power of language models (LMs) in processing the protein sequence databases, which inherit the advantages of attention networks and capture useful information in learning representations for proteins. The past…

    Submitted 29 November, 2022; originally announced November 2022.

  9. arXiv:2204.10673  [pdf, other]

    q-bio.BM cs.AI cs.LG

    Generative De Novo Protein Design with Global Context

    Authors: Cheng Tan, Zhangyang Gao, Jun Xia, Bozhen Hu, Stan Z. Li

    Abstract: The linear sequence of amino acids determines protein structure and function. Protein design, known as the inverse of protein structure prediction, aims to obtain a novel protein sequence that will fold into the defined structure. Recent works on computational protein design have studied designing sequences for the desired backbone structure with local positional information and achieved competiti…

    Submitted 20 February, 2023; v1 submitted 20 April, 2022; originally announced April 2022.

    Comments: ICASSP 2023

  10. arXiv:2111.01351  [pdf, other]

    q-bio.NC cs.LG

    Major Depressive Disorder Recognition and Cognitive Analysis Based on Multi-layer Brain Functional Connectivity Networks

    Authors: Xiaofang Sun, Xiangwei Zheng, Yonghui Xu, Lizhen Cui, Bin Hu

    Abstract: On the increase of major depressive disorders (MDD), many researchers paid attention to their recognition and treatment. Existing MDD recognition algorithms always use a single time-frequency domain method, but the single time-frequency domain method is too simple and is not conducive to simulating the complex link relationship between brain functions. To solve this problem, this paper prop…

    Submitted 1 November, 2021; originally announced November 2021.

    Journal ref: International Workshop on AI for Cognitive and Physical Frailty Workshop in Conjunction with IJCAI 2021 (AIF-IJCAI'21)

  11. arXiv:2006.08058  [pdf]

    q-bio.GN

    EDGE COVID-19: A Web Platform to generate submission-ready genomes for SARS-CoV-2 sequencing efforts

    Authors: Chien-Chi Lo, Migun Shakya, Karen Davenport, Mark Flynn, Adán Myers y Gutiérrez, Bin Hu, Po-E Li, Elais Player Jackson, Yan Xu, Patrick S. G. Chain

    Abstract: Genomics has become an essential technology for surveilling emerging infectious disease outbreaks. A wide range of technologies and strategies for pathogen genome enrichment and sequencing are being used by laboratories worldwide, together with different, and sometimes ad hoc, analytical procedures for generating genome sequences. As a result, public repositories now contain non-standard entries o…

    Submitted 24 June, 2021; v1 submitted 14 June, 2020; originally announced June 2020.

  12. arXiv:2006.04566  [pdf]

    q-bio.GN q-bio.QM

    A Public Website for the Automated Assessment and Validation of SARS-CoV-2 Diagnostic PCR Assays

    Authors: Po-E Li, Adán Myers y Gutiérrez, Karen Davenport, Mark Flynn, Bin Hu, Chien-Chi Lo, Elais Player Jackson, Migun Shakya, Yan Xu, Jason Gans, Patrick S. G. Chain

    Abstract: Summary: Polymerase chain reaction-based assays are the current gold standard for detecting and diagnosing SARS-CoV-2. However, as SARS-CoV-2 mutates, we need to constantly assess whether existing PCR-based assays will continue to detect all known viral strains. To enable the continuous monitoring of SARS-CoV-2 assays, we have developed a web-based assay validation algorithm that checks existing P…

    Submitted 8 June, 2020; originally announced June 2020.

    Comments: Application Note. Main: 2 pages, 1 figure. Supplementary: 6 pages, 8 figures, 1 table. Total: 8 pages, 9 figures, 1 table. Application URL: https://covid19.edgebioinformatics.org/#/assayValidation Contact: Jason Gans and Patrick Chain. Submitted to: Bioinformatics

  13. arXiv:2002.12759  [pdf]

    eess.AS cs.LG cs.SD q-bio.QM stat.ML

    A Novel Decision Tree for Depression Recognition in Speech

    Authors: Zhenyu Liu, Dongyu Wang, Lan Zhang, Bin Hu

    Abstract: Depression is a common mental disorder worldwide which causes a range of serious outcomes. The diagnosis of depression relies on patient-reported scales and psychiatrist interview which may lead to subjective bias. In recent years, more and more researchers are devoted to depression recognition in speech, which may be an effective and objective indicator. This study proposes a new speech segment…

    Submitted 22 February, 2020; originally announced February 2020.

  14. arXiv:2002.09283  [pdf]

    cs.DL cs.LG q-bio.NC

    MODMA dataset: a Multi-modal Open Dataset for Mental-disorder Analysis

    Authors: Hanshu Cai, Yiwen Gao, Shuting Sun, Na Li, Fuze Tian, Han Xiao, Jianxiu Li, Zhengwu Yang, Xiaowei Li, Qinglin Zhao, Zhenyu Liu, Zhijun Yao, Minqiang Yang, Hong Peng, Jing Zhu, Xiaowei Zhang, Guoping Gao, Fang Zheng, Rui Li, Zhihua Guo, Rong Ma, Jing Yang, Lan Zhang, Xiping Hu, Yumin Li, et al. (1 additional author not shown)

    Abstract: According to the World Health Organization, the number of mental disorder patients, especially depression patients, has grown rapidly and become a leading contributor to the global burden of disease. However, the present common practice of depression diagnosis is based on interviews and clinical scales carried out by doctors, which is not only labor-consuming but also time-consuming. One important…

    Submitted 4 March, 2020; v1 submitted 20 February, 2020; originally announced February 2020.

    Journal ref: Sci Data 9, 178 (2022)

  15. arXiv:1810.11594  [pdf, other]

    q-bio.NC cs.CV

    Convolutional neural networks with extra-classical receptive fields

    Authors: Brian Hu, Stefan Mihalas

    Abstract: Convolutional neural networks (CNNs) have had great success in many real-world applications and have also been used to model visual processing in the brain. However, these networks are quite brittle - small changes in the input image can dramatically change a network's output prediction. In contrast to what is known from biology, these networks largely rely on feedforward connections, ignoring the…

    Submitted 27 October, 2018; originally announced October 2018.

  16. arXiv:1210.4616  [pdf, other]

    q-bio.MN physics.bio-ph q-bio.QM

    How input fluctuations reshape the dynamics of a biological switching system

    Authors: Bo Hu, David A. Kessler, Wouter-Jan Rappel, Herbert Levine

    Abstract: An important task in quantitative biology is to understand the role of stochasticity in biochemical regulation. Here, as an extension of our recent work [Phys. Rev. Lett. 107, 148101 (2011)], we study how input fluctuations affect the stochastic dynamics of a simple biological switch. In our model, the on transition rate of the switch is directly regulated by a noisy input signal, which is describ…

    Submitted 16 October, 2012; originally announced October 2012.

    Comments: 7 pages, 4 figures, submitted to Physical Review E

  17. arXiv:1006.0507  [pdf, ps, other]

    q-bio.CB physics.bio-ph q-bio.QM

    Determining the accuracy of spatial gradient sensing using statistical mechanics

    Authors: Bo Hu, Wen Chen, Wouter-Jan Rappel, Herbert Levine

    Abstract: Many eukaryotic cells are able to sense chemical gradients by directly measuring spatial concentration differences. The precision of such gradient sensing is limited by fluctuations in the binding of diffusing particles to specific receptors on the cell surface. Here, we explore the physical limits of the spatial sensing mechanism by modeling the chemotactic cell as an Ising spin chain subject to…

    Submitted 2 June, 2010; originally announced June 2010.

    Comments: 4 pages, 2 figures

  18. arXiv:0709.0443  [pdf]

    q-bio.PE q-bio.CB

    Similar self-organizing scale-invariant properties characterize early cancer invasion and long range species spread

    Authors: D. E. Marco, S. A. Cannas, M. A. Montemurro, B. Hu, S. Cheng

    Abstract: Occupancy of new habitats through dispersion is a central process in nature. In particular, long range dispersal is involved in the spread of species and epidemics, although it has not been previously related with cancer invasion, a process that involves spread to new tissues. We show that the early spread of cancer cells is similar to the species individuals spread and that both processes are r…

    Submitted 4 September, 2007; originally announced September 2007.

    Comments: 21 pages, 2 figures

    Journal ref: Journal of Theoretical Biology 256: 65-75 (2008)

  19. arXiv:cond-mat/0211459  [pdf]

    cond-mat.dis-nn q-bio

    Charge Transport in DNA Segments with fractal structures

    Authors: Huijie Yang, Fangcui Zhao, Chunchun Liu, Yingli Zhao, Wenxiu Yang, Beilai Hu

    Abstract: By means of the concept of factorial moment the charge transfer rates in DNA segments with fractal structures are investigated. An analytical form for the electron transfer rate is obtained.

    Submitted 20 November, 2002; originally announced November 2002.

    Comments: 10 pages, 2 figures