Skip to main content

Showing 1–11 of 11 results for author: Xie, R

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.07449  [pdf, other

    stat.ME stat.ML

    Boosted Conformal Prediction Intervals

    Authors: Ran Xie, Rina Foygel Barber, Emmanuel J. Candès

    Abstract: This paper introduces a boosted conformal procedure designed to tailor conformalized prediction intervals toward specific desired properties, such as enhanced conditional coverage or reduced interval length. We employ machine learning techniques, notably gradient boosting, to systematically improve upon a predefined conformity score function. This process is guided by carefully constructed loss fu… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 22 pages, 9 figures

  2. arXiv:2405.18979  [pdf, other

    cs.LG stat.ML

    MANO: Exploiting Matrix Norm for Unsupervised Accuracy Estimation Under Distribution Shifts

    Authors: Renchunzi Xie, Ambroise Odonnat, Vasilii Feofanov, Weijian Deng, Jianfeng Zhang, Bo An

    Abstract: Leveraging the models' outputs, specifically the logits, is a common approach to estimating the test accuracy of a pre-trained neural network on out-of-distribution (OOD) samples without requiring access to the corresponding ground truth labels. Despite their ease of implementation and computational efficiency, current logit-based methods are vulnerable to overconfidence issues, leading to predict… ▽ More

    Submitted 24 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: The three first authors contributed equally

  3. arXiv:2405.00742  [pdf, other

    cs.CR cs.LG stat.ML

    Federated Graph Learning for EV Charging Demand Forecasting with Personalization Against Cyberattacks

    Authors: Yi Li, Renyou Xie, Chaojie Li, Yi Wang, Zhaoyang Dong

    Abstract: Mitigating cybersecurity risk in electric vehicle (EV) charging demand forecasting plays a crucial role in the safe operation of collective EV chargings, the stability of the power grid, and the cost-effective infrastructure expansion. However, existing methods either suffer from the data privacy issue and the susceptibility to cyberattacks or fail to consider the spatial correlation among differe… ▽ More

    Submitted 30 April, 2024; originally announced May 2024.

    Comments: 11 pages,4 figures

  4. arXiv:2403.04015  [pdf, other

    cs.LG cs.AI stat.ML

    Knockoff-Guided Feature Selection via A Single Pre-trained Reinforced Agent

    Authors: Xinyuan Wang, Dongjie Wang, Wangyang Ying, Rui Xie, Haifeng Chen, Yanjie Fu

    Abstract: Feature selection prepares the AI-readiness of data by eliminating redundant features. Prior research falls into two primary categories: i) Supervised Feature Selection, which identifies the optimal feature subset based on their relevance to the target variable; ii) Unsupervised Feature Selection, which reduces the feature space dimensionality by capturing the essential information within the feat… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  5. arXiv:2306.07513  [pdf, other

    stat.AP

    Smoothing spline analysis of variance models: A new tool for the analysis of accelerometer data

    Authors: Rui Xie, Lulu Chen, Joon-Hyuk Park, Jeffrey Stout, Ladda Thiamwong

    Abstract: Accelerometer data is commonplace in physical activity research, exercise science, and public health studies, where the goal is to understand and compare physical activity differences between groups and/or subject populations, and to identify patterns and trends in physical activity behavior to inform interventions for improving public health. We propose using mixed-effects smoothing spline analys… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: Accepted by 2023 International Conference on Intelligent Biology and Medicine (ICIBM 2023)

  6. arXiv:2303.08242  [pdf, other

    stat.ML cs.LG stat.AP

    Optimal Sampling Designs for Multi-dimensional Streaming Time Series with Application to Power Grid Sensor Data

    Authors: Rui Xie, Shuyang Bai, ** Ma

    Abstract: The Internet of Things (IoT) system generates massive high-speed temporally correlated streaming data and is often connected with online inference tasks under computational or energy constraints. Online analysis of these streaming time series data often faces a trade-off between statistical efficiency and computational cost. One important approach to balance this trade-off is sampling, where only… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: Accepted by The Annals of Applied Statistics

  7. arXiv:2112.09420  [pdf, other

    cond-mat.dis-nn cs.LG stat.ML

    A random energy approach to deep learning

    Authors: Rongrong Xie, Matteo Marsili

    Abstract: We study a generic ensemble of deep belief networks which is parametrized by the distribution of energy levels of the hidden states of each layer. We show that, within a random energy approach, statistical dependence can propagate from the visible to deep layers only if each layer is tuned close to the critical point during learning. As a consequence, efficiently trained learning machines are char… ▽ More

    Submitted 17 December, 2021; originally announced December 2021.

    Comments: 16 pages, 4 figures

  8. arXiv:2108.09888  [pdf, other

    stat.ME

    Model-based Sparse Coding beyond Gaussian Independent Model

    Authors: Xin Xing, Rui Xie, Wenxuan Zhong

    Abstract: Sparse coding aims to model data vectors as sparse linear combinations of basis elements, but a majority of related studies are restricted to continuous data without spatial or temporal structure. A new model-based sparse coding (MSC) method is proposed to provide an effective and flexible framework for learning features from different data types: continuous, discrete, or categorical, and modeling… ▽ More

    Submitted 22 August, 2021; originally announced August 2021.

  9. arXiv:2010.12178  [pdf, other

    stat.ME

    LowCon: A design-based subsampling approach in a misspecified linear modeL

    Authors: Cheng Meng, Rui Xie, Abhyuday Mandal, Xinlian Zhang, Wenxuan Zhong, ** Ma

    Abstract: We consider a measurement constrained supervised learning problem, that is, (1) full sample of the predictors are given; (2) the response observations are unavailable and expensive to measure. Thus, it is ideal to select a subsample of predictor observations, measure the corresponding responses, and then fit the supervised learning model on the subsample of the predictors and responses. However, m… ▽ More

    Submitted 23 October, 2020; originally announced October 2020.

    Comments: 37pages, 10 figures

  10. arXiv:1910.03434  [pdf, other

    cs.LG stat.ML

    ATL: Autonomous Knowledge Transfer from Many Streaming Processes

    Authors: Mahardhika Pratama, Marcus de Carvalho, Renchunzi Xie, Edwin Lughofer, Jie Lu

    Abstract: Transferring knowledge across many streaming processes remains an uncharted territory in the existing literature and features unique characteristics: no labelled instance of the target domain, covariate shift of source and target domain, different period of drifts in the source and target domains. Autonomous transfer learning (ATL) is proposed in this paper as a flexible deep learning approach for… ▽ More

    Submitted 19 October, 2019; v1 submitted 8 October, 2019; originally announced October 2019.

    Comments: This paper has been accepted for publication in CIKM 2019

  11. arXiv:1908.08144  [pdf, other

    stat.AP cs.CR cs.CY

    They may look and look, yet not see: BMDs cannot be tested adequately

    Authors: Philip B. Stark, Ran Xie

    Abstract: Bugs, misconfiguration, and malware can cause ballot-marking devices (BMDs) to print incorrect votes. Several approaches to testing BMDs have been proposed. In logic and accuracy testing (LAT) and parallel or live testing, auditors input known test votes into the BMD and check the printout. Passive testing monitors the rate of "spoiled" BMD printout, on the theory that if BMDs malfunction, the rat… ▽ More

    Submitted 25 July, 2022; v1 submitted 21 August, 2019; originally announced August 2019.