Skip to main content

Showing 1–4 of 4 results for author: Zhan, H

Searching in archive q-bio. Search in all archives.
.
  1. arXiv:2406.00164  [pdf, other

    q-bio.GN cs.AI

    DYNA: Disease-Specific Language Model for Variant Pathogenicity

    Authors: Huixin Zhan, Zijun Zhang

    Abstract: Clinical variant classification of pathogenic versus benign genetic variants remains a challenge in clinical genetics. Recently, the proposition of genomic foundation models has improved the generic variant effect prediction (VEP) accuracy via weakly-supervised or unsupervised training. However, these VEPs are not disease-specific, limiting their adaptation at the point of care. To address this pr… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

  2. arXiv:2402.08075  [pdf, other

    q-bio.GN cs.AI cs.LG

    Efficient and Scalable Fine-Tune of Language Models for Genome Understanding

    Authors: Huixin Zhan, Ying Nian Wu, Zijun Zhang

    Abstract: Although DNA foundation models have advanced the understanding of genomes, they still face significant challenges in the limited scale and diversity of genomic data. This limitation starkly contrasts with the success of natural language foundation models, which thrive on substantially larger scales. Furthermore, genome understanding involves numerous downstream genome annotation tasks with inheren… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  3. arXiv:2311.03429  [pdf, other

    q-bio.GN cs.AI cs.LG

    ProPath: Disease-Specific Protein Language Model for Variant Pathogenicity

    Authors: Huixin Zhan, Zijun Zhang

    Abstract: Clinical variant classification of pathogenic versus benign genetic variants remains a pivotal challenge in clinical genetics. Recently, the proposition of protein language models has improved the generic variant effect prediction (VEP) accuracy via weakly-supervised or unsupervised training. However, these VEPs are not disease-specific, limiting their adaptation at point-of-care. To address this… ▽ More

    Submitted 7 November, 2023; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: Accepted by MLCB 2023

  4. arXiv:1811.04987  [pdf

    stat.AP q-bio.GN

    Prediction of Alzheimer's disease-associated genes by integration of GWAS summary data and expression data

    Authors: Sicheng Hao, Rui Wang, Yu Zhang, Hui Zhan

    Abstract: Alzheimer's disease is the most common cause of dementia. It is the fifth-leading cause of death among elderly people. With high genetic heritability (79%), finding disease causal genes is a crucial step in find treatment for AD. Following the International Genomics of Alzheimer's Project (IGAP), many disease-associated genes have been identified; however, we don't have enough knowledge about how… ▽ More

    Submitted 12 November, 2018; originally announced November 2018.

    Comments: 11 pages, 3 figures