Showing 1–50 of 108 results for author: Xue, C

  1. arXiv:2406.13936  [pdf, other]

    stat.ML cs.LG math.OC

    Communication-Efficient Adaptive Batch Size Strategies for Distributed Local Gradient Methods

    Authors: Tim Tsz-Kit Lau, Weijian Li, Chenwei Xu, Han Liu, Mladen Kolar

    Abstract: Modern deep neural networks often require distributed training with many workers due to their large size. As worker numbers increase, communication overheads become the main bottleneck in data-parallel minibatch stochastic gradient methods with per-iteration gradient synchronization. Local gradient methods like Local SGD reduce communication by only syncing after several local steps. Despite under…

    Submitted 19 June, 2024; originally announced June 2024.

  2. arXiv:2406.04619  [pdf, other]

    cs.LG stat.ML

    CTSyn: A Foundational Model for Cross Tabular Data Generation

    Authors: Xiaofeng Lin, Chenheng Xu, Matthew Yang, Guang Cheng

    Abstract: Generative Foundation Models (GFMs) have produced synthetic data with remarkable quality in modalities such as images and text. However, applying GFMs to tabular data poses significant challenges due to the inherent heterogeneity of table features. Existing cross-table learning frameworks are hindered by the absence of both a generative model backbone and a decoding mechanism for heterogeneous fea…

    Submitted 7 June, 2024; originally announced June 2024.

  3. arXiv:2406.01335  [pdf, other]

    quant-ph q-fin.ST stat.ML

    Statistics-Informed Parameterized Quantum Circuit via Maximum Entropy Principle for Data Science and Finance

    Authors: Xi-Ning Zhuang, Zhao-Yun Chen, Cheng Xue, Xiao-Fan Xu, Chao Wang, Huan-Yu Liu, Tai-Ping Sun, Yun-Jie Wang, Yu-Chun Wu, Guo-Ping Guo

    Abstract: Quantum machine learning has demonstrated significant potential in solving practical problems, particularly in statistics-focused areas such as data science and finance. However, challenges remain in preparing and learning statistical models on a quantum processor due to issues with trainability and interpretability. In this letter, we utilize the maximum entropy principle to design a statistics-i…

    Submitted 18 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: 19 pages, 5 figures

  4. arXiv:2405.16828  [pdf, other]

    cs.LG math.ST stat.ML

    Kernel-based optimally weighted conformal prediction intervals

    Authors: Jonghyeok Lee, Chen Xu, Yao Xie

    Abstract: Conformal prediction has been a popular distribution-free framework for uncertainty quantification. In this paper, we present a novel conformal prediction method for time-series, which we call Kernel-based Optimally Weighted Conformal Prediction Intervals (KOWCPI). Specifically, KOWCPI adapts the classic Reweighted Nadaraya-Watson (RNW) estimator for quantile regression on dependent data and learn…

    Submitted 27 May, 2024; originally announced May 2024.

  5. arXiv:2404.06676  [pdf]

    cs.LG eess.SP stat.AP

    Topological Feature Search Method for Multichannel EEG: Application in ADHD classification

    Authors: Tianming Cai, Guoying Zhao, Junbin Zang, Chen Zong, Zhidong Zhang, Chenyang Xue

    Abstract: In recent years, the preliminary diagnosis of Attention Deficit Hyperactivity Disorder (ADHD) using electroencephalography (EEG) has garnered attention from researchers. EEG, known for its expediency and efficiency, plays a pivotal role in the diagnosis and treatment of ADHD. However, the non-stationarity of EEG signals and inter-subject variability pose challenges to the diagnostic and classifica…

    Submitted 9 April, 2024; originally announced April 2024.

  6. arXiv:2404.03830  [pdf, other]

    cs.LG cs.AI stat.ML

    BiSHop: Bi-Directional Cellular Learning for Tabular Data with Generalized Sparse Modern Hopfield Model

    Authors: Chenwei Xu, Yu-Chao Huang, Jerry Yao-Chieh Hu, Weijian Li, Ammar Gilani, Hsi-Sheng Goan, Han Liu

    Abstract: We introduce the \textbf{B}i-Directional \textbf{S}parse \textbf{Hop}field Network (\textbf{BiSHop}), a novel end-to-end framework for deep tabular learning. BiSHop handles the two major challenges of deep tabular learning: non-rotationally invariant data structure and feature sparsity in tabular data. Our key motivation comes from the recent established connection between associative memory and a…

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: 40 page; Code available at https://github.com/MAGICS-LAB/BiSHop

  7. arXiv:2403.16260  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Out-of-Distribution Detection via Deep Multi-Comprehension Ensemble

    Authors: Chenhui Xu, Fuxun Yu, Zirui Xu, Nathan Inkawhich, Xiang Chen

    Abstract: Recent research underscores the pivotal role of the Out-of-Distribution (OOD) feature representation field scale in determining the efficacy of models in OOD detection. Consequently, the adoption of model ensembles has emerged as a prominent strategy to augment this feature representation field, capitalizing on anticipated model diversity. However, our introduction of novel qualitative and quant…

    Submitted 24 March, 2024; originally announced March 2024.

  8. arXiv:2403.03850  [pdf, other]

    stat.ML cs.LG

    Conformal prediction for multi-dimensional time series by ellipsoidal sets

    Authors: Chen Xu, Hanyang Jiang, Yao Xie

    Abstract: Conformal prediction (CP) has been a popular method for uncertainty quantification because it is distribution-free, model-agnostic, and theoretically sound. For forecasting problems in supervised learning, most CP methods focus on building prediction intervals for univariate responses. In this work, we develop a sequential CP method called $\texttt{MultiDimSPCI}$ that builds prediction…

    Submitted 23 May, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: Accepted by the Forty-first International Conference on Machine Learning (ICML 2024)

  9. arXiv:2402.02687  [pdf, other]

    cs.LG cs.AI stat.ML

    Poisson Process for Bayesian Optimization

    Authors: Xiaoxing Wang, Jiaxing Li, Chao Xue, Wei Liu, Weifeng Liu, Xiaokang Yang, Junchi Yan, Dacheng Tao

    Abstract: Bayesian Optimization (BO) is a sample-efficient black-box optimizer, and extensive methods have been proposed to build the absolute function response of the black-box function through a probabilistic surrogate model, including Tree-structured Parzen Estimator (TPE), random forest (SMAC), and Gaussian process (GP). However, few methods have been explored to estimate the relative rankings of candidat…

    Submitted 4 February, 2024; originally announced February 2024.

  10. arXiv:2402.00743  [pdf, other]

    cs.LG cs.CL stat.ML

    Theoretical Understanding of In-Context Learning in Shallow Transformers with Unstructured Data

    Authors: Yue Xing, Xiaofeng Lin, Chenheng Xu, Namjoon Suh, Qifan Song, Guang Cheng

    Abstract: Large language models (LLMs) are powerful models that can learn concepts at the inference stage via in-context learning (ICL). While theoretical studies, e.g., \cite{zhang2023trained}, attempt to explain the mechanism of ICL, they assume the input $x_i$ and the output $y_i$ of each demonstration example are in the same token (i.e., structured data). However, in real practice, the examples are usua…

    Submitted 18 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  11. arXiv:2401.10269  [pdf, ps, other]

    cs.IT eess.SP stat.ME

    Robust Multi-Sensor Multi-Target Tracking Using Possibility Labeled Multi-Bernoulli Filter

    Authors: Han Cai, Chenbao Xue, Jeremie Houssineau, Zhirun Xue

    Abstract: With the increasing complexity of multiple target tracking scenes, a single sensor may not be able to effectively monitor a large number of targets. Therefore, it is imperative to extend the single-sensor technique to Multi-Sensor Multi-Target Tracking (MSMTT) for enhanced functionality. Typical MSMTT methods presume complete randomness of all uncertain components, and therefore effective solution…

    Submitted 4 January, 2024; originally announced January 2024.

  12. arXiv:2311.07624  [pdf]

    q-bio.PE stat.AP

    Disordered hyperuniformity signals functioning and resilience of self-organized vegetation patterns

    Authors: Wensi Hu, Quan-Xing Liu, Bo Wang, Nuo Xu, Lijuan Cui, Chi Xu

    Abstract: In harsh environments, organisms may self-organize into spatially patterned systems in various ways. So far, studies of ecosystem spatial self-organization have primarily focused on apparent orders reflected by regular patterns. However, self-organized ecosystems may also have cryptic orders that can be unveiled only through certain quantitative analyses. Here we show that disordered hyperuniformi…

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: 34 pages, 6 figures; Supplementary Materials, 19 pages, 10 figures, 2 tables

  13. arXiv:2310.19253  [pdf, other]

    cs.LG stat.ME stat.ML

    Flow-based Distributionally Robust Optimization

    Authors: Chen Xu, Jonghyeok Lee, Xiuyuan Cheng, Yao Xie

    Abstract: We present a computationally efficient framework, called $\texttt{FlowDRO}$, for solving flow-based distributionally robust optimization (DRO) problems with Wasserstein uncertainty sets while aiming to find continuous worst-case distribution (also called the Least Favorable Distribution, LFD) and sample from it. The requirement for LFD to be continuous is so that the algorithm can be scalable to p…

    Submitted 24 February, 2024; v1 submitted 29 October, 2023; originally announced October 2023.

    Comments: IEEE Journal on Selected Areas in Information Theory (JSAIT). Accepted. 2024

  14. arXiv:2310.18572  [pdf, ps, other]

    stat.AP

    Where to serve and return in Badminton Men's Double?

    Authors: Xuelin Zhu, Yu Sun, Yumin Zeng, Cong Xu

    Abstract: This study aims to analyze the service and return landing areas in badminton men's double, based on data extracted from 20 badminton matches. We find that most services land near the center-line, while returns tend to land in the crossing areas of the serving team's court. Using generalized logit models, we are able to predict the return landing area based on features of the service and return rou…

    Submitted 27 October, 2023; originally announced October 2023.

  15. arXiv:2309.12673  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    On Sparse Modern Hopfield Model

    Authors: Jerry Yao-Chieh Hu, Donglin Yang, Dennis Wu, Chenwei Xu, Bo-Yu Chen, Han Liu

    Abstract: We introduce the sparse modern Hopfield model as a sparse extension of the modern Hopfield model. Like its dense counterpart, the sparse modern Hopfield model equips a memory-retrieval dynamics whose one-step approximation corresponds to the sparse attention mechanism. Theoretically, our key contribution is a principled derivation of a closed-form sparse Hopfield energy using the convex conjugate…

    Submitted 29 November, 2023; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: 37 pages, accepted at NeurIPS 2023. [v2] updated to match with camera-ready version. Code is available at https://github.com/MAGICS-LAB/SparseModernHopfield

  16. arXiv:2309.09924  [pdf, other]

    cs.LG eess.SP stat.ML

    Learning graph geometry and topology using dynamical systems based message-passing

    Authors: Dhananjay Bhaskar, Yanlei Zhang, Charles Xu, Xingzhi Sun, Oluwadamilola Fasina, Guy Wolf, Maximilian Nickel, Michael Perlmutter, Smita Krishnaswamy

    Abstract: In this paper we introduce DYMAG: a message passing paradigm for GNNs built on the expressive power of continuous, multiscale graph-dynamics. Standard discrete-time message passing algorithms implicitly make use of simplistic graph dynamics and aggregation schemes which limit their ability to capture fundamental graph topological properties. By contrast, DYMAG makes use of complex graph dynamics b…

    Submitted 12 June, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

  17. arXiv:2308.11838  [pdf, other]

    cs.LG cs.AI stat.ML

    A Benchmark Study on Calibration

    Authors: Linwei Tao, Younan Zhu, Haolan Guo, Minjing Dong, Chang Xu

    Abstract: Deep neural networks are increasingly utilized in various machine learning tasks. However, as these models grow in complexity, they often face calibration issues, despite enhanced prediction accuracy. Many studies have endeavored to improve calibration performance through the use of specific loss functions, data preprocessing and training frameworks. Yet, investigations into calibration properties…

    Submitted 22 March, 2024; v1 submitted 22 August, 2023; originally announced August 2023.

    Comments: ICLR 2024 poster

  18. arXiv:2307.09725  [pdf]

    stat.AP

    Global Inequality in Cooling from Urban Green Spaces and its Climate Change Adaptation Potential

    Authors: Yuxiang Li, Jens-Christian Svenning, Weiqi Zhou, Kai Zhu, Jesse F. Abrams, Timothy M. Lenton, Shuqing N. Teng, Robert R. Dunn, Chi Xu

    Abstract: Heat extremes are projected to severely impact humanity and with increasing geographic disparities. Global South countries are more exposed to heat extremes and have reduced adaptation capacity. One documented source of such adaptation inequality is a lack of resources to cool down indoor temperatures. Less is known about the capacity to ameliorate outdoor heat stress. Here, we assess global inequ…

    Submitted 18 July, 2023; originally announced July 2023.

    Comments: 56 pages, 28 figures

  19. arXiv:2306.06252  [pdf, other]

    cs.LG stat.ML

    Feature Programming for Multivariate Time Series Prediction

    Authors: Alex Reneau, Jerry Yao-Chieh Hu, Chenwei Xu, Weijian Li, Ammar Gilani, Han Liu

    Abstract: We introduce the concept of programmable feature engineering for time series modeling and propose a feature programming framework. This framework generates large amounts of predictive features for noisy multivariate time series while allowing users to incorporate their inductive bias with minimal effort. The key motivation of our framework is to view any multivariate time series as a cumulative su…

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: 21 pages, accepted to ICML2023. Code is available at https://github.com/SirAlex900/FeatureProgramming

  20. arXiv:2305.11857  [pdf, other]

    stat.ML cs.LG stat.ME

    Computing high-dimensional optimal transport by flow neural networks

    Authors: Chen Xu, Xiuyuan Cheng, Yao Xie

    Abstract: Flow-based models are widely used in generative tasks, including normalizing flow, where a neural network transports from a data distribution $P$ to a normal distribution. This work develops a flow-based model that transports from $P$ to an arbitrary $Q$ where both distributions are only accessible via finite samples. We propose to learn the dynamic optimal transport between $P$ and $Q$ by trainin…

    Submitted 4 February, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

  21. arXiv:2304.13793  [pdf, other]

    stat.ME cs.LG math.ST stat.ML

    Generalized generalized linear models: Convex estimation and online bounds

    Authors: Anatoli Juditsky, Arkadi Nemirovski, Yao Xie, Chen Xu

    Abstract: We introduce a new computational framework for estimating parameters in generalized generalized linear models (GGLM), a class of models that extends the popular generalized linear models (GLM) to account for dependencies among observations in spatio-temporal data. The proposed approach uses a monotone operator-based variational inequality method to overcome non-convexity in parameter estimation an…

    Submitted 26 April, 2023; originally announced April 2023.

  22. arXiv:2303.17791  [pdf]

    stat.AP

    Analysis of the current status of tuberculosis transmission in China based on a heterogeneity model

    Authors: Chuanqing Xu, Kedeng Cheng, Yu Wang, Songbai Guo, Maoxing Liu, Xiaojing Wang, Zhiguo Zhang

    Abstract: Tuberculosis (TB) is an infectious disease transmitted through the respiratory system. China is one of the countries with a high burden of TB. Since 2004, an average of more than 800,000 cases of active TB have been reported each year in China. Analyzing the case data from 2004-2018, we find significant differences in TB incidence by age group. Therefore, the effect of age heterogeneous structure…

    Submitted 30 March, 2023; originally announced March 2023.

    Comments: We think this is a very interesting work that gives a good understanding of the current TB transmission in China and assesses the possibility of China achieving the 2035 TB control target and also explores possible ways for how to prevent and control the TB in China

  23. arXiv:2301.09801  [pdf, other]

    cs.CR cs.CY cs.LG stat.ML

    Heterogeneous Domain Adaptation for IoT Intrusion Detection: A Geometric Graph Alignment Approach

    Authors: Jiashu Wu, Hao Dai, Yang Wang, Kejiang Ye, Chengzhong Xu

    Abstract: Data scarcity hinders the usability of data-dependent algorithms when tackling IoT intrusion detection (IID). To address this, we utilise the data rich network intrusion detection (NID) domain to facilitate more accurate intrusion detection for IID domains. In this paper, a Geometric Graph Alignment (GGA) approach is leveraged to mask the geometric heterogeneities between domains for better intrus…

    Submitted 23 January, 2023; originally announced January 2023.

    Comments: Accepted by IEEE Internet of Things Journal

  24. arXiv:2212.14424  [pdf, other]

    stat.ML cs.LG

    Normalizing flow neural networks by JKO scheme

    Authors: Chen Xu, Xiuyuan Cheng, Yao Xie

    Abstract: Normalizing flow is a class of deep generative models for efficient sampling and likelihood estimation, which achieves attractive performance, particularly in high dimensions. The flow is often implemented using a sequence of invertible residual blocks. Existing works adopt special network architectures and regularization of flow trajectories. In this paper, we develop a neural ODE flow network ca…

    Submitted 15 February, 2024; v1 submitted 29 December, 2022; originally announced December 2022.

    Comments: NeurIPS 2023 spotlight

  25. arXiv:2212.03463  [pdf, other]

    stat.ML cs.LG stat.ME

    Sequential Predictive Conformal Inference for Time Series

    Authors: Chen Xu, Yao Xie

    Abstract: We present a new distribution-free conformal prediction algorithm for sequential data (e.g., time series), called the \textit{sequential predictive conformal inference} (\texttt{SPCI}). We specifically account for the nature that time series data are non-exchangeable, and thus many existing conformal prediction algorithms are not applicable. The main idea is to adaptively re-estimate the condition…

    Submitted 30 May, 2023; v1 submitted 7 December, 2022; originally announced December 2022.

  26. Statistics for Spatially Stratified Heterogeneous Data

    Authors: Jinfeng Wang, Robert Haining, Tonglin Zhang, Chengdong Xu, Maogui Hu

    Abstract: Spatial statistics is dominated by spatial autocorrelation (SAC) based Kriging and BHM, and spatial local heterogeneity based hotspots and geographical regression methods, appraised as the first and second laws of Geography (Tobler 1970; Goodchild 2004), respectively. Spatial stratified heterogeneity (SSH), the phenomena of a partition that within strata is more similar than between strata, exampl…

    Submitted 30 November, 2022; originally announced November 2022.

    Journal ref: Annals of the American Association of Geographers 2024

  27. Geostatistics in the presence of multivariate complexities: comparison of multi-Gaussian transforms

    Authors: Sultan Abulkhair, Peter A. Dowd, Chaoshui Xu

    Abstract: One of the most challenging aspects of multivariate geostatistics is dealing with complex relationships between variables. Geostatistical co-simulation and spatial decorrelation methods, commonly used for modelling multiple variables, are ineffective in the presence of multivariate complexities. On the other hand, multi-Gaussian transforms are designed to deal with complex multivariate relationshi…

    Submitted 21 February, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

  28. arXiv:2208.02364  [pdf, other]

    quant-ph q-fin.PR q-fin.ST stat.AP

    Quantum Encoding and Analysis on Continuous Time Stochastic Process with Financial Applications

    Authors: Xi-Ning Zhuang, Zhao-Yun Chen, Cheng Xue, Yu-Chun Wu, Guo-Ping Guo

    Abstract: The continuous time stochastic process is a mainstream mathematical instrument modeling the random world with a wide range of applications involving finance, statistics, physics, and time series analysis, while the simulation and analysis of the continuous time stochastic process is a challenging problem for classical computers. In this work, a general framework is established to prepare the path…

    Submitted 27 September, 2023; v1 submitted 3 August, 2022; originally announced August 2022.

    Comments: 37 pages, 15 figures

    Journal ref: Quantum 7, 1127 (2023)

  29. arXiv:2207.13250  [pdf, other]

    stat.AP stat.ME

    Spatio-Temporal Wildfire Prediction using Multi-Modal Data

    Authors: Chen Xu, Yao Xie, Daniel A. Zuniga Vazquez, Rui Yao, Feng Qiu

    Abstract: Due to severe societal and environmental impacts, wildfire prediction using multi-modal sensing data has become a highly sought-after data-analytical tool by various stakeholders (such as state governments and power utility companies) to achieve a more informed understanding of wildfire activities and plan preventive measures. A desirable algorithm should precisely predict fire risk and magnitude…

    Submitted 10 October, 2023; v1 submitted 26 July, 2022; originally announced July 2022.

  30. arXiv:2207.05195  [pdf, other]

    cs.CV stat.ML

    Collaborative Uncertainty Benefits Multi-Agent Multi-Modal Trajectory Forecasting

    Authors: Bohan Tang, Yiqi Zhong, Chenxin Xu, Wei-Tao Wu, Ulrich Neumann, Yanfeng Wang, Ya Zhang, Siheng Chen

    Abstract: In multi-modal multi-agent trajectory forecasting, two major challenges have not been fully tackled: 1) how to measure the uncertainty brought by the interaction module that causes correlations among the predicted trajectories of multiple agents; 2) how to rank the multiple predictions and select the optimal predicted trajectory. In order to handle these challenges, this work first proposes a nove…

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: arXiv admin note: text overlap with arXiv:2110.13947

  31. arXiv:2206.07851  [pdf, other]

    stat.ML cs.LG stat.ME

    Conformal prediction set for time-series

    Authors: Chen Xu, Yao Xie

    Abstract: When building either prediction intervals for regression (with real-valued response) or prediction sets for classification (with categorical responses), uncertainty quantification is essential to studying complex machine learning methods. In this paper, we develop Ensemble Regularized Adaptive Prediction Set (ERAPS) to construct prediction sets for time-series (with categorical responses), based o…

    Submitted 15 June, 2022; originally announced June 2022.

    Comments: Strongly accepted by the Workshop on Distribution-Free Uncertainty Quantification at ICML 2022

  32. arXiv:2206.01163  [pdf, other]

    stat.ML cs.LG

    Invertible Neural Networks for Graph Prediction

    Authors: Chen Xu, Xiuyuan Cheng, Yao Xie

    Abstract: Graph prediction problems prevail in data analysis and machine learning. The inverse prediction problem, namely to infer input data from given output labels, is of emerging interest in various applications. In this work, we develop \textit{invertible graph neural network} (iGNN), a deep generative model to tackle the inverse prediction problem on graphs by casting it as a conditional generative ta…

    Submitted 21 November, 2022; v1 submitted 2 June, 2022; originally announced June 2022.

    Comments: Accepted at IEEE Journal on Selected Areas in Information Theory (JSAIT)---Special Issue Deep Learning for Inverse Problems

  33. arXiv:2205.09680  [pdf, other]

    math.ST cs.LG stat.ME

    Metrics of calibration for probabilistic predictions

    Authors: Imanol Arrieta-Ibarra, Paman Gujral, Jonathan Tannen, Mark Tygert, Cherie Xu

    Abstract: Predictions are often probabilities; e.g., a prediction could be for precipitation tomorrow, but with only a 30% chance. Given such probabilistic predictions together with the actual outcomes, "reliability diagrams" help detect and diagnose statistically significant discrepancies -- so-called "miscalibration" -- between the predictions and the outcomes. The canonical reliability diagrams histogram…

    Submitted 12 June, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

    Comments: 50 pages, 36 figures

    Journal ref: Journal of Machine Learning Research, 23: 1-54, 2022

  34. arXiv:2202.08876  [pdf, other]

    stat.ML cs.LG

    An alternative approach to train neural networks using monotone variational inequality

    Authors: Chen Xu, Xiuyuan Cheng, Yao Xie

    Abstract: We propose an alternative approach to neural network training using the monotone vector field, an idea inspired by the seminal work of Juditsky and Nemirovski [Juditsky & Nemirovsky, 2019] developed originally to solve parameter estimation problems for generalized linear models (GLM) by reducing the original non-convex problem to a convex problem of solving a monotone variational inequality (VI).…

    Submitted 11 March, 2024; v1 submitted 17 February, 2022; originally announced February 2022.

  35. arXiv:2201.03512  [pdf, other]

    stat.OT stat.AP

    SMLE: An R Package for Joint Feature Screening in Ultrahigh-dimensional GLMs

    Authors: Qianxiang Zang, Chen Xu, Kelly Burkett

    Abstract: The sparsity-restricted maximum likelihood estimator (SMLE) has received considerable attention for feature screening in ultrahigh-dimensional regression. SMLE is a computationally convenient method that naturally incorporates the joint effects among features in the screening process. We develop a publicly available R package SMLE, which provides a user-friendly environment to carry out SMLE in ge…

    Submitted 10 January, 2022; originally announced January 2022.

  36. arXiv:2105.11886  [pdf, other]

    stat.AP stat.ME stat.ML

    Conformal Anomaly Detection on Spatio-Temporal Observations with Missing Data

    Authors: Chen Xu, Yao Xie

    Abstract: We develop a distribution-free, unsupervised anomaly detection method called ECAD, which wraps around any regression algorithm and sequentially detects anomalies. Rooted in conformal prediction, ECAD does not require data exchangeability but approximately controls the Type-I error when data are normal. Computationally, it involves no data-splitting and efficiently trains ensemble predictors to inc…

    Submitted 2 June, 2021; v1 submitted 25 May, 2021; originally announced May 2021.

    Comments: Submitted to ICML 2021 Workshop--Distribution-free Uncertainty Quantification

  37. arXiv:2103.00719  [pdf, ps, other]

    cs.LG cs.AI stat.ML

    LocalDrop: A Hybrid Regularization for Deep Neural Networks

    Authors: Ziqing Lu, Chang Xu, Bo Du, Takashi Ishida, Lefei Zhang, Masashi Sugiyama

    Abstract: In neural networks, developing regularization algorithms to settle overfitting is one of the major study areas. We propose a new approach for the regularization of neural networks by the local Rademacher complexity called LocalDrop. A new regularization function for both fully-connected networks (FCNs) and convolutional neural networks (CNNs), including drop rates and weight matrices, has been dev…

    Submitted 28 February, 2021; originally announced March 2021.

  38. arXiv:2101.11769  [pdf, other]

    stat.ML cs.LG

    Learning Matching Representations for Individualized Organ Transplantation Allocation

    Authors: Can Xu, Ahmed M. Alaa, Ioana Bica, Brent D. Ershoff, Maxime Cannesson, Mihaela van der Schaar

    Abstract: Organ transplantation is often the last resort for treating end-stage illness, but the probability of a successful transplantation depends greatly on compatibility between donors and recipients. Current medical practice relies on coarse rules for donor-recipient matching, but is short of domain knowledge regarding the complex factors underlying organ compatibility. In this paper, we formulate the…

    Submitted 1 February, 2021; v1 submitted 27 January, 2021; originally announced January 2021.

    Comments: Accepted to AISTATS 2021

  39. arXiv:2101.11552  [pdf, other]

    cs.LG stat.ML

    Efficient Graph Deep Learning in TensorFlow with tf_geometric

    Authors: Jun Hu, Shengsheng Qian, Quan Fang, Youze Wang, Quan Zhao, Huaiwen Zhang, Changsheng Xu

    Abstract: We introduce tf_geometric, an efficient and friendly library for graph deep learning, which is compatible with both TensorFlow 1.x and 2.x. tf_geometric provides kernel libraries for building Graph Neural Networks (GNNs) as well as implementations of popular GNNs. The kernel libraries consist of infrastructures for building efficient GNNs, including graph data structures, graph map-reduce framewor…

    Submitted 27 January, 2021; originally announced January 2021.

    Comments: 7 pages, 5 figures

  40. arXiv:2101.11202  [pdf, other]

    astro-ph.IM stat.AP stat.ME

    Change point detection and image segmentation for time series of astrophysical images

    Authors: Cong Xu, Hans Moritz Günther, Vinay L. Kashyap, Thomas C. M. Lee, Andreas Zezas

    Abstract: Many astrophysical phenomena are time-varying, in the sense that their intensity, energy spectrum, and/or the spatial distribution of the emission suddenly change. This paper develops a method for modeling a time series of images. Under the assumption that the arrival times of the photons follow a Poisson process, the data are binned into 4D grids of voxels (time, energy band, and x-y coordinates)…

    Submitted 26 January, 2021; originally announced January 2021.

    Comments: 22 pages, 10 figures

  41. arXiv:2101.11179  [pdf, other]

    stat.AP stat.ML

    Solar Radiation Ramping Events Modeling Using Spatio-temporal Point Processes

    Authors: Minghe Zhang, Chen Xu, Andy Sun, Feng Qiu, Yao Xie

    Abstract: Modeling and predicting solar events, particularly the solar ramping event, is critical for improving situational awareness for solar power generation systems. It has been acknowledged that weather conditions such as temperature, humidity, and cloud density can significantly impact the emergence and position of solar ramping events. As a result, modeling these events with complex spatio-temporal c…

    Submitted 16 June, 2022; v1 submitted 26 January, 2021; originally announced January 2021.

  42. Seasonal association between viral causes of hospitalised acute lower respiratory infections and meteorological factors in China: a retrospective study

    Authors: Bing Xu, Jinfeng Wang, Zhongjie Li, Chengdong Xu, Yilan Liao, Maogui Hu, Jing Yang, Shengjie Lai, Liping Wang, Weizhong Yang

    Abstract: Acute lower respiratory infections caused by respiratory viruses are common and persistent infectious diseases worldwide and in China, which have pronounced seasonal patterns. Meteorological factors have important roles in the seasonality of some major viruses. Our aim was to identify the dominant meteorological factors and to model their effects on common respiratory viruses in different regions…

    Submitted 15 April, 2021; v1 submitted 30 November, 2020; originally announced December 2020.

    Comments: 6 figures and tables

    Journal ref: The Lancet Planetary Health, 2021

  43. arXiv:2010.12367  [pdf, other]

    cs.LG cs.AI stat.ML

    Learning to Dispatch for Job Shop Scheduling via Deep Reinforcement Learning

    Authors: Cong Zhang, Wen Song, Zhiguang Cao, Jie Zhang, Puay Siew Tan, Chi Xu

    Abstract: Priority dispatching rule (PDR) is widely used for solving real-world Job-shop scheduling problem (JSSP). However, the design of effective PDRs is a tedious task, requiring a myriad of specialized knowledge and often delivering limited performance. In this paper, we propose to automatically learn PDRs via an end-to-end deep reinforcement learning agent. We exploit the disjunctive graph representat…

    Submitted 23 October, 2020; originally announced October 2020.

  44. arXiv:2010.09107  [pdf, other]

    stat.ME stat.AP stat.ML

    Conformal prediction for time series

    Authors: Chen Xu, Yao Xie

    Abstract: We develop a general framework for constructing distribution-free prediction intervals for time series. Theoretically, we establish explicit bounds on conditional and marginal coverage gaps of estimated prediction intervals, which asymptotically converge to zero under additional assumptions. We obtain similar bounds on the size of set differences between oracle and estimated prediction intervals.…

    Submitted 16 February, 2023; v1 submitted 18 October, 2020; originally announced October 2020.

    Comments: Journal version, under review. A preliminary conference version was accepted as a long talk/oral (3% of total 5513 submissions) in the Proceedings of the 38th International Conference on Machine Learning, PMLR 139, 2021 (ICML 2021). The title is "Conformal prediction interval for dynamic time-series"

  45. arXiv:2010.01381  [pdf, other]

    cs.LG stat.ML

    Cubic Spline Smoothing Compensation for Irregularly Sampled Sequences

    Authors: Jing Shi, Jing Bi, Yingru Liu, Chenliang Xu

    Abstract: The marriage of recurrent neural networks and neural ordinary differential networks (ODE-RNN) is effective in modeling irregularly-observed sequences. While ODE produces the smooth hidden states between observation intervals, the RNN will trigger a hidden state jump when a new observation arrives, thus cause the interpolation discontinuity problem. To address this issue, we propose the cubic splin…

    Submitted 3 October, 2020; originally announced October 2020.

  46. arXiv:2010.00989  [pdf, other]

    cs.LG cs.AI stat.ML

    Knowledge Graph Embeddings in Geometric Algebras

    Authors: Chengjin Xu, Mojtaba Nayyeri, Yung-Yu Chen, Jens Lehmann

    Abstract: Knowledge graph (KG) embedding aims at embedding entities and relations in a KG into a low-dimensional latent representation space. Existing KG embedding approaches model entities and relations in a KG by utilizing real-valued, complex-valued, or hypercomplex-valued (Quaternion or Octonion) representations, all of which are subsumed into a geometric algebra. In this work, we introduce a novel geometr…

    Submitted 22 March, 2021; v1 submitted 2 October, 2020; originally announced October 2020.

    Comments: This paper is accepted by COLING2020

  47. arXiv:2009.03510  [pdf, other]

    cs.LG cs.CR stat.ML

    FedCM: A Real-time Contribution Measurement Method for Participants in Federated Learning

    Authors: Boyi Liu, Bingjie Yan, Yize Zhou, Zhixuan Liang, Cheng-Zhong Xu

    Abstract: Federated Learning (FL) creates an ecosystem for multiple agents to collaborate on building models with data privacy consideration. The method for contribution measurement of each agent in the FL system is critical for fair credits allocation but few are proposed. In this paper, we develop a real-time contribution measurement method FedCM that is simple but powerful. The method defines the impact…

    Submitted 11 February, 2021; v1 submitted 8 September, 2020; originally announced September 2020.

  48. arXiv:2007.10252  [pdf, other]

    cs.LG cs.CV stat.ML

    XMixup: Efficient Transfer Learning with Auxiliary Samples by Cross-domain Mixup

    Authors: Xingjian Li, Haoyi Xiong, Haozhe An, Chengzhong Xu, Dejing Dou

    Abstract: Transferring knowledge from large source datasets is an effective way to fine-tune the deep neural networks of the target task with a small sample size. A great number of algorithms have been proposed to facilitate deep transfer learning, and these techniques could be generally categorized into two groups - Regularized Learning of the target task using models that have been pre-trained from source…

    Submitted 20 July, 2020; originally announced July 2020.

  49. arXiv:2007.04921  [pdf, other]

    q-bio.QM cs.LG stat.ML

    Graph Neural Network Based Coarse-Grained Mapping Prediction

    Authors: Zhiheng Li, Geemi P. Wellawatte, Maghesree Chakraborty, Heta A. Gandhi, Chenliang Xu, Andrew D. White

    Abstract: The selection of coarse-grained (CG) mapping operators is a critical step for CG molecular dynamics (MD) simulation. It is still an open question about what is optimal for this choice and there is a need for theory. The current state-of-the art method is mapping operators manually selected by experts. In this work, we demonstrate an automated approach by viewing this problem as supervised learning…

    Submitted 19 August, 2021; v1 submitted 24 June, 2020; originally announced July 2020.

  50. arXiv:2007.03349  [pdf, other]

    cs.LG cs.CV stat.ML

    RIFLE: Backpropagation in Depth for Deep Transfer Learning through Re-Initializing the Fully-connected LayEr

    Authors: Xingjian Li, Haoyi Xiong, Haozhe An, Chengzhong Xu, Dejing Dou

    Abstract: Fine-tuning the deep convolution neural network(CNN) using a pre-trained model helps transfer knowledge learned from larger datasets to the target task. While the accuracy could be largely improved even when the training dataset is small, the transfer learning outcome is usually constrained by the pre-trained model with close CNN weights (Liu et al., 2019), as the backpropagation here brings small…

    Submitted 7 July, 2020; originally announced July 2020.

    Comments: Accepted by ICML'2020