Skip to main content

Showing 1–9 of 9 results for author: Sun, R

Searching in archive q-bio. Search in all archives.
.
  1. arXiv:2403.13081  [pdf, other

    stat.AP math.PR q-bio.PE

    Parameter Estimation from Single Patient, Single Time-Point Sequencing Data of Recurrent Tumors

    Authors: Kevin Leder, Ru** Sun, Zicheng Wang, Xuanming Zhang

    Abstract: In this study, we develop consistent estimators for key parameters that govern the dynamics of tumor cell populations when subjected to pharmacological treatments. While these treatments often lead to an initial reduction in the abundance of drug-sensitive cells, a population of drug-resistant cells frequently emerges over time, resulting in cancer recurrence. Samples from recurrent tumors present… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  2. arXiv:2403.00875  [pdf, other

    q-bio.QM cs.AI cs.LG q-bio.BM

    Enhancing Protein Predictive Models via Proteins Data Augmentation: A Benchmark and New Directions

    Authors: Rui Sun, Lirong Wu, Haitao Lin, Yufei Huang, Stan Z. Li

    Abstract: Augmentation is an effective alternative to utilize the small amount of labeled protein data. However, most of the existing work focuses on design-ing new architectures or pre-training tasks, and relatively little work has studied data augmentation for proteins. This paper extends data augmentation techniques previously used for images and texts to proteins and then benchmarks these techniques on… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  3. arXiv:2207.06010  [pdf, other

    cs.LG q-bio.BM

    Does GNN Pretraining Help Molecular Representation?

    Authors: Ruoxi Sun, Hanjun Dai, Adams Wei Yu

    Abstract: Extracting informative representations of molecules using Graph neural networks (GNNs) is crucial in AI-driven drug discovery. Recently, the graph research community has been trying to replicate the success of self-supervised pretraining in natural language processing, with several successes claimed. However, we find the benefit brought by self-supervised pretraining on small molecular data can be… ▽ More

    Submitted 2 November, 2022; v1 submitted 13 July, 2022; originally announced July 2022.

  4. arXiv:2102.07713  [pdf, other

    q-bio.GN cs.LG

    Cancer Gene Profiling through Unsupervised Discovery

    Authors: Enzo Battistella, Maria Vakalopoulou, Roger Sun, Théo Estienne, Marvin Lerousseau, Sergey Nikolaev, Emilie Alvarez Andres, Alexandre Carré, Stéphane Niyoteka, Charlotte Robert, Nikos Paragios, Eric Deutsch

    Abstract: Precision medicine is a paradigm shift in healthcare relying heavily on genomics data. However, the complexity of biological interactions, the large number of genes as well as the lack of comparisons on the analysis of data, remain a tremendous bottleneck regarding clinical adoption. In this paper, we introduce a novel, automatic and unsupervised framework to discover low-dimensional gene biomarke… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

  5. arXiv:2010.15191  [pdf

    q-bio.NC

    Chronic, cortex-wide imaging of specific cell populations during behavior

    Authors: Joao Couto, Simon Musall, Xiaonan R Sun, Anup Khanal, Steven Gluf, Shreya Saxena, Ian Kinsella, Taiga Abe, John P. Cunningham, Liam Paninski, Anne K Churchland

    Abstract: Measurements of neuronal activity across brain areas are important for understanding the neural correlates of cognitive and motor processes like attention, decision-making, and action selection. However, techniques that allow cellular resolution measurements are expensive and require a high degree of technical expertise, which limits their broad use. Widefield imaging of genetically encoded indica… ▽ More

    Submitted 28 October, 2020; originally announced October 2020.

    Comments: 36 pages, 7 figures, 2 supplementary figures

  6. arXiv:2007.13437  [pdf, other

    physics.chem-ph cs.LG q-bio.QM

    Energy-based View of Retrosynthesis

    Authors: Ruoxi Sun, Hanjun Dai, Li Li, Steven Kearnes, Bo Dai

    Abstract: Retrosynthesis -- the process of identifying a set of reactants to synthesize a target molecule -- is of vital importance to material design and drug discovery. Existing machine learning approaches based on language models and graph neural networks have achieved encouraging results. In this paper, we propose a framework that unifies sequence- and graph-based methods as energy-based models (EBMs) w… ▽ More

    Submitted 8 December, 2021; v1 submitted 14 July, 2020; originally announced July 2020.

  7. arXiv:1610.03182  [pdf

    stat.CO q-bio.QM

    wtest: an R Package for Testing Main and Interaction Effect in Genotype Data with Binary Traits

    Authors: Rui Sun, Billy Chang, Benny Chung-Ying Zee, Maggie Haitian Wang

    Abstract: This R package evaluates main and pair-wise interaction effect of single nucleotide polymorphisms (SNPs) via the W-test, scalable to whole genome-wide data sets. The package provides fast and accurate p-value estimation of genetic markers, as well as diagnostic checking on the probability distributions. It allows flexible stage-wise or exhaustive association testing in a user-friendly interface. A… ▽ More

    Submitted 11 October, 2016; originally announced October 2016.

    Comments: 7 pages, 1 figure

  8. arXiv:1607.07834  [pdf

    q-bio.QM stat.ME

    A W-test collapsing method for rare variant testing with applications to exome sequencing data of hypertensive disorder

    Authors: Rui Sun, Haoyi Weng, Inchi Hu, Junfeng Guo, William K. K. Wu, Benny Chung-Ying Zee, Maggie Haitian Wang

    Abstract: Advancement in sequencing technology enables the study of association between complex disorders and rare variants with low minor allele frequencies. One of the major challenges in rare variant testing is lack of statistical power of traditional testing methods due to extremely low variances of single nucleotide polymorphisms. In this paper, we introduce a W-test collapsing method that evaluates th… ▽ More

    Submitted 26 July, 2016; originally announced July 2016.

    Comments: 18 pages, 1 figure, 4 tables. Genetic Epidemiology accepted

  9. arXiv:1606.08941  [pdf

    q-bio.GN

    Enhancing power of rare variant association test by Zoom-Focus Algorithm (ZFA) to locate optimal testing region

    Authors: Maggie Haitian Wang, Haoyi Weng, Rui Sun, Benny Chung-Ying Zee

    Abstract: Motivation: Exome or targeted sequencing data exerts analytical challenge to test single nucleotide polymorphisms (SNPs) with extremely small minor allele frequency (MAF). Various rare variant tests were proposed to increase power by aggregating SNPs within a fixed genomic region, such as a gene or pathway. However, a gene could contain from several to thousands of markers, and not all of them may… ▽ More

    Submitted 28 June, 2016; originally announced June 2016.

    Comments: Main paper: 13 pages, 2 figures, 3 tables, 3 diagrams; Submitted to Bioinformatics, and the 27th International Conference on Genome Informatics