
Showing 1–11 of 11 results for author: Cao, H

Searching in archive q-bio.
  1. arXiv:2405.09886  [pdf]

    cs.LG cs.AI q-bio.BM

    MTLComb: multi-task learning combining regression and classification tasks for joint feature selection

    Authors: Han Cao, Sivanesan Rajan, Bianka Hahn, Ersoy Kocak, Daniel Durstewitz, Emanuel Schwarz, Verena Schneider-Lindner

    Abstract: Multi-task learning (MTL) is a learning paradigm that enables the simultaneous training of multiple communicating algorithms. Although MTL has been successfully applied to either regression or classification tasks alone, incorporating mixed types of tasks into a unified MTL framework remains challenging, primarily due to variations in the magnitudes of losses associated with different tasks. This c…

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 33 pages, 3 figures, 5 tables

    ACM Class: J.3; I.2.6

  2. arXiv:2403.08192  [pdf, other]

    cs.CL q-bio.BM

    MoleculeQA: A Dataset to Evaluate Factual Accuracy in Molecular Comprehension

    Authors: Xingyu Lu, He Cao, Zijing Liu, Shengyuan Bai, Leqing Chen, Yuan Yao, Hai-Tao Zheng, Yu Li

    Abstract: Large language models are playing an increasingly significant role in molecular research, yet existing models often generate erroneous information, posing challenges to accurate molecular comprehension. Traditional evaluation metrics for generated content fail to assess a model's accuracy in molecular understanding. To rectify the absence of factual evaluation, we present MoleculeQA, a novel quest…

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 19 pages, 8 figures

  3. arXiv:2402.12993  [pdf, other]

    cs.IR cs.AI cs.LG q-bio.QM

    An Autonomous Large Language Model Agent for Chemical Literature Data Mining

    Authors: Kexin Chen, Hanqun Cao, Junyou Li, Yuyang Du, Menghao Guo, Xin Zeng, Lanqing Li, Jiezhong Qiu, Pheng Ann Heng, Guangyong Chen

    Abstract: Chemical synthesis, which is crucial for advancing material synthesis and drug discovery, impacts various sectors including environmental science and healthcare. The rise of technology in chemistry has generated extensive chemical data, challenging researchers to discern patterns and refine synthesis processes. Artificial intelligence (AI) helps by analyzing data to optimize synthesis and increase…

    Submitted 20 February, 2024; originally announced February 2024.

  4. arXiv:2311.16208  [pdf, other]

    q-bio.BM cs.AI cs.LG

    InstructMol: Multi-Modal Integration for Building a Versatile and Reliable Molecular Assistant in Drug Discovery

    Authors: He Cao, Zijing Liu, Xingyu Lu, Yuan Yao, Yu Li

    Abstract: The rapid evolution of artificial intelligence in drug discovery encounters challenges with generalization and extensive training, yet Large Language Models (LLMs) offer promise in reshaping interactions with complex molecular data. Our novel contribution, InstructMol, a multi-modal LLM, effectively aligns molecular structures with natural language via an instruction-tuning approach, utilizing a t…

    Submitted 27 November, 2023; originally announced November 2023.

  5. arXiv:2309.16684  [pdf, other]

    q-bio.BM cs.LG physics.chem-ph

    Leveraging Side Information for Ligand Conformation Generation using Diffusion-Based Approaches

    Authors: Jiamin Wu, He Cao, Yuan Yao

    Abstract: Ligand molecule conformation generation is a critical challenge in drug discovery. Deep learning models have been developed to tackle this problem, particularly through the use of generative models in recent years. However, these models often generate conformations that lack meaningful structure and randomness due to the absence of essential side information. Examples of such side information incl…

    Submitted 2 August, 2023; originally announced September 2023.

  6. arXiv:2302.00652  [pdf, other]

    q-bio.NC nlin.PS

    Breathing cluster in complex neuron-astrocyte networks

    Authors: Ya Wang, Liang Wang, Huawei Fan, Jun Ma, Hui Cao, Xingang Wang

    Abstract: Brain activity features spatially distributed neural clusters of coherent firing and spontaneous switching of the clusters between synchronous and asynchronous states. Evidence from in vivo experiments suggests that astrocytes, a type of glial cell previously regarded as providing only structural and metabolic support to neurons, participate actively in brain functions and play…

    Submitted 26 January, 2023; originally announced February 2023.

    Comments: 14 pages, 6 figures

  7. arXiv:2212.14041  [pdf, other]

    q-bio.BM cs.AI cs.LG

    Deciphering RNA Secondary Structure Prediction: A Probabilistic K-Rook Matching Perspective

    Authors: Cheng Tan, Zhangyang Gao, Hanqun Cao, Xingran Chen, Ge Wang, Lirong Wu, Jun Xia, Jiangbin Zheng, Stan Z. Li

    Abstract: The secondary structure of ribonucleic acid (RNA) is more stable and accessible in the cell than its tertiary structure, making it essential for functional prediction. Although deep learning has shown promising results in this field, current methods suffer from poor generalization and high complexity. In this work, we reformulate the RNA secondary structure prediction as a K-Rook problem, thereby…

    Submitted 19 June, 2024; v1 submitted 2 December, 2022; originally announced December 2022.

    Comments: Accepted by ICML 2024

  8. arXiv:2210.07377  [pdf]

    q-bio.BM

    Scalable lipid droplet microarray fabrication, validation, and screening

    Authors: Tracey N. Bell, Aubrey E. Kusi-Appiaha, Pengfei Lyu, L. Zhu, F. Zhu, David Van Winkle, Hongyuan Cao, M. Singh, Steven Lenhert

    Abstract: High throughput screening of small molecules and natural products is costly, requiring significant amounts of time, reagents, and operating space. Although microarrays have proven effective in the miniaturization of screening for certain biochemical assays, such as nucleic acid hybridization or antibody binding, they are not widely used for drug discovery in cell culture due to the need for cells…

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: 13 pages, 9 figures

  9. arXiv:1801.07130  [pdf]

    q-bio.QM q-bio.BM

    Computational Protein Design with Deep Learning Neural Networks

    Authors: Jingxue Wang, Huali Cao, John Z. H. Zhang, Yifei Qi

    Abstract: Computational protein design has a wide variety of applications. Despite its remarkable success, designing a protein for a given structure and function is still a challenging task. On the other hand, the number of solved protein structures is rapidly increasing while the number of unique protein folds has reached a steady number, suggesting more structural information is being accumulated on each…

    Submitted 23 February, 2018; v1 submitted 22 January, 2018; originally announced January 2018.

    Comments: 16 pages, 5 figures, 3 tables

    Journal ref: Scientific Reports 8: 6349 (2018)

  10. arXiv:1510.06115  [pdf]

    q-bio.NC cond-mat.mtrl-sci cs.ET

    Proton Conducting Graphene Oxide Coupled Neuron Transistors for Brain-Inspired Cognitive Systems

    Authors: Changjin Wan, Liqiang Zhu, Yanghui Liu, Ping Feng, Zhaoping Liu, Hailiang Cao, Peng Xiao, Yi Shi, Qing Wan

    Abstract: The neuron is the most important building block in our brain, and information processing in an individual neuron involves the transformation of input synaptic spike trains into an appropriate output spike train. Hardware implementation of a neuron by an individual ionic/electronic hybrid device is of great significance for enhancing our understanding of the brain and solving sensory processing and complex rec…

    Submitted 20 October, 2015; originally announced October 2015.

    Comments: arXiv admin note: text overlap with arXiv:1506.04658

  11. On the Origins and Control of Community Types in the Human Microbiome

    Authors: Travis E. Gibson, Amir Bashan, Hong-Tai Cao, Scott T. Weiss, Yang-Yu Liu

    Abstract: Microbiome-based stratification of healthy individuals into compositional categories, referred to as "community types", holds promise for drastically improving personalized medicine. Despite this potential, the existence of community types and the degree of their distinctness have been highly debated. Here we adopted a dynamic systems approach and found that heterogeneity in the interspecific inte…

    Submitted 21 January, 2016; v1 submitted 16 June, 2015; originally announced June 2015.

    Comments: Main Text, Figures, Methods, Supplementary Figures, and Supplementary Text