Skip to main content

Showing 1–8 of 8 results for author: Wang, Y X R

Searching in archive stat. Search in all archives.
.
  1. arXiv:2309.05092  [pdf, other

    stat.ME cs.LG math.ST

    Adaptive conformal classification with noisy labels

    Authors: Matteo Sesia, Y. X. Rachel Wang, Xin Tong

    Abstract: This paper develops novel conformal prediction methods for classification tasks that can automatically adapt to random label contamination in the calibration sample, leading to more informative prediction sets with stronger coverage guarantees compared to state-of-the-art approaches. This is made possible by a precise characterization of the effective coverage inflation (or deflation) suffered by… ▽ More

    Submitted 21 February, 2024; v1 submitted 10 September, 2023; originally announced September 2023.

    Comments: 28 pages (127 pages including references and appendices)

  2. arXiv:2210.02197  [pdf, other

    cs.LG stat.AP

    Hierarchical Neyman-Pearson Classification for Prioritizing Severe Disease Categories in COVID-19 Patient Data

    Authors: Lijia Wang, Y. X. Rachel Wang, **gyi Jessica Li, Xin Tong

    Abstract: COVID-19 has a spectrum of disease severity, ranging from asymptomatic to requiring hospitalization. Understanding the mechanisms driving disease severity is crucial for develo** effective treatments and reducing mortality rates. One way to gain such understanding is using a multi-class classification framework, in which patients' biological features are used to predict patients' severity classe… ▽ More

    Submitted 29 September, 2023; v1 submitted 1 October, 2022; originally announced October 2022.

  3. arXiv:2110.08605  [pdf, other

    cs.DL stat.AP

    Statistics in everyone's backyard: an impact study via citation network analysis

    Authors: Lijia Wang, Xin Tong, Y. X. Rachel Wang

    Abstract: The increasing availability of curated citation data provides a wealth of resources for analyzing and understanding the intellectual influence of scientific publications. In the field of statistics, current studies of citation data have mostly focused on the interactions between statistical journals and papers, limiting the measure of influence to mainly within statistics itself. In this paper, we… ▽ More

    Submitted 16 October, 2021; originally announced October 2021.

  4. arXiv:2010.00729  [pdf, other

    stat.ME

    Individual-centered partial information in social networks

    Authors: Xiao Han, Y. X. Rachel Wang, Qing Yang, Xin Tong

    Abstract: In statistical network analysis, we often assume either the full network is available or multiple subgraphs can be sampled to estimate various global properties of the network. However, in a real social network, people frequently make decisions based on their local view of the network alone. Here, we consider a partial information framework that characterizes the local network centered at a given… ▽ More

    Submitted 2 July, 2024; v1 submitted 1 October, 2020; originally announced October 2020.

  5. arXiv:1910.08018  [pdf, other

    stat.ML cs.LG math.ST

    A Unified Framework for Tuning Hyperparameters in Clustering Problems

    Authors: Xinjie Fan, Yuguang Yue, Purnamrita Sarkar, Y. X. Rachel Wang

    Abstract: Selecting hyperparameters for unsupervised learning problems is challenging in general due to the lack of ground truth for validation. Despite the prevalence of this issue in statistics and machine learning, especially in clustering problems, there are not many methods for tuning these hyperparameters with theoretical guarantees. In this paper, we provide a framework with provable guarantees for s… ▽ More

    Submitted 1 February, 2020; v1 submitted 17 October, 2019; originally announced October 2019.

  6. arXiv:1908.02910  [pdf, other

    stat.ML cs.LG

    Mini-batch Metropolis-Hastings MCMC with Reversible SGLD Proposal

    Authors: Tung-Yu Wu, Y. X. Rachel Wang, Wing H. Wong

    Abstract: Traditional MCMC algorithms are computationally intensive and do not scale well to large data. In particular, the Metropolis-Hastings (MH) algorithm requires passing over the entire dataset to evaluate the likelihood ratio in each iteration. We propose a general framework for performing MH-MCMC using mini-batches of the whole dataset and show that this gives rise to approximately a tempered statio… ▽ More

    Submitted 28 August, 2019; v1 submitted 7 August, 2019; originally announced August 2019.

  7. arXiv:1707.09587  [pdf, other

    stat.AP q-bio.GN

    Network modelling of topological domains using Hi-C data

    Authors: Y. X. Rachel Wang, Purnamrita Sarkar, Oana Ursu, Anshul Kundaje, Peter J. Bickel

    Abstract: Chromosome conformation capture experiments such as Hi-C are used to map the three-dimensional spatial organization of genomes. One specific feature of the 3D organization is known as topologically associating domains (TADs), which are densely interacting, contiguous chromatin regions playing important roles in regulating gene expression. A few algorithms have been proposed to detect TADs. In part… ▽ More

    Submitted 17 October, 2019; v1 submitted 30 July, 2017; originally announced July 2017.

    Journal ref: Annals of Applied Statistics 2019, Vol. 13, No. 3, 1511-1536

  8. Inferring gene-gene interactions and functional modules using sparse canonical correlation analysis

    Authors: Y. X. Rachel Wang, Keni Jiang, Lewis J. Feldman, Peter J. Bickel, Haiyan Huang

    Abstract: Networks pervade many disciplines of science for analyzing complex systems with interacting components. In particular, this concept is commonly used to model interactions between genes and identify closely associated genes forming functional modules. In this paper, we focus on gene group interactions and infer these interactions using appropriate partial correlations between genes, that is, the co… ▽ More

    Submitted 1 June, 2015; v1 submitted 25 January, 2014; originally announced January 2014.

    Comments: Published at http://dx.doi.org/10.1214/14-AOAS792 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS792

    Journal ref: Annals of Applied Statistics 2015, Vol. 9, No. 1, 300-323