Showing 1–27 of 27 results for author: Cui, P

  1. arXiv:2405.03198  [pdf, other]

    stat.ML cs.LG math.OC

    Stability Evaluation via Distributional Perturbation Analysis

    Authors: Jose Blanchet, Peng Cui, Jiajin Li, Jiashuo Liu

    Abstract: The performance of learning models often deteriorates when deployed in out-of-sample environments. To ensure reliable deployment, we propose a stability evaluation criterion based on distributional perturbations. Conceptually, our stability evaluation criterion is defined as the minimal perturbation required on our observed dataset to induce a prescribed deterioration in risk evaluation. In this p…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: Accepted by ICML 2024

  2. arXiv:2312.01294  [pdf, other]

    cs.LG stat.ML

    Deep Ensembles Meets Quantile Regression: Uncertainty-aware Imputation for Time Series

    Authors: Ying Liu, Peng Cui, Wenbo Hu, Richang Hong

    Abstract: Multivariate time series are everywhere. Nevertheless, real-world time series data often exhibit numerous missing values, which is the time series imputation task. Although previous deep learning methods have been shown to be effective for time series imputation, they are shown to produce overconfident imputations, which might be a potentially overlooked threat to the reliability of the intelligen…

    Submitted 3 December, 2023; originally announced December 2023.

  3. arXiv:2308.15364  [pdf, other]

    cs.LG stat.ML

    Heterogeneous Multi-Task Gaussian Cox Processes

    Authors: Feng Zhou, Quyu Kong, Zhijie Deng, Fengxiang He, Peng Cui, Jun Zhu

    Abstract: This paper presents a novel extension of multi-task Gaussian Cox processes for modeling multiple heterogeneous correlated tasks jointly, e.g., classification and regression, via multi-output Gaussian processes (MOGP). A MOGP prior over the parameters of the dedicated likelihoods for classification, regression and point process tasks can facilitate sharing of information between heterogeneous tasks…

    Submitted 29 August, 2023; originally announced August 2023.

  4. arXiv:2212.00992  [pdf, other]

    cs.LG stat.ML

    Stable Learning via Sparse Variable Independence

    Authors: Han Yu, Peng Cui, Yue He, Zheyan Shen, Yong Lin, Renzhe Xu, Xingxuan Zhang

    Abstract: The problem of covariate-shift generalization has attracted intensive research attention. Previous stable learning algorithms employ sample reweighting schemes to decorrelate the covariates when there is no explicit domain information about training data. However, with finite samples, it is difficult to achieve the desirable weights that ensure perfect independence to get rid of the unstable varia…

    Submitted 2 December, 2022; originally announced December 2022.

    Comments: Accepted by AAAI 2023

  5. arXiv:2111.02355  [pdf, other]

    cs.LG stat.ML

    A Theoretical Analysis on Independence-driven Importance Weighting for Covariate-shift Generalization

    Authors: Renzhe Xu, Xingxuan Zhang, Zheyan Shen, Tong Zhang, Peng Cui

    Abstract: Covariate-shift generalization, a typical case in out-of-distribution (OOD) generalization, requires a good performance on the unknown test distribution, which varies from the accessible training distribution in the form of covariate shift. Recently, independence-driven importance weighting algorithms in stable learning literature have shown empirical effectiveness to deal with covariate-shift gen…

    Submitted 17 October, 2023; v1 submitted 3 November, 2021; originally announced November 2021.

    Comments: ICML 2022

  6. arXiv:2010.12408  [pdf, other]

    cs.LG stat.ML

    On the Equivalence of Decoupled Graph Convolution Network and Label Propagation

    Authors: Hande Dong, Jiawei Chen, Fuli Feng, Xiangnan He, Shuxian Bi, Zhaolin Ding, Peng Cui

    Abstract: The original design of Graph Convolution Network (GCN) couples feature transformation and neighborhood aggregation for node representation learning. Recently, some work shows that coupling is inferior to decoupling, which supports deep graph propagation better and has become the latest paradigm of GCN (e.g., APPNP and SGCN). Despite effectiveness, the working mechanisms of the decoupled GCN are no…

    Submitted 15 February, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: Accepted by WWW 2021

  7. arXiv:2009.02562  [pdf, other]

    cs.LG stat.ML

    Permutation-equivariant and Proximity-aware Graph Neural Networks with Stochastic Message Passing

    Authors: Ziwei Zhang, Chenhao Niu, Peng Cui, Jian Pei, Bo Zhang, Wenwu Zhu

    Abstract: Graph neural networks (GNNs) are emerging machine learning models on graphs. Permutation-equivariance and proximity-awareness are two important properties highly desirable for GNNs. Both properties are needed to tackle some challenging graph problems, such as finding communities and leaders. In this paper, we first analytically show that the existing GNNs, mostly based on the message-passing mecha…

    Submitted 22 February, 2022; v1 submitted 5 September, 2020; originally announced September 2020.

    Comments: Accepted by IEEE Transactions on Knowledge and Data Engineering

  8. arXiv:2009.00097  [pdf, other]

    cs.LG cs.CR cs.CV stat.ML

    Adversarial Eigen Attack on Black-Box Models

    Authors: Linjun Zhou, Peng Cui, Yinan Jiang, Shiqiang Yang

    Abstract: Black-box adversarial attack has attracted a lot of research interests for its practical use in AI safety. Compared with the white-box attack, a black-box setting is more difficult for less available information related to the attacked model and the additional constraint on the query budget. A general way to improve the attack efficiency is to draw support from a pre-trained transferable white-box…

    Submitted 27 August, 2020; originally announced September 2020.

  9. Algorithmic Decision Making with Conditional Fairness

    Authors: Renzhe Xu, Peng Cui, Kun Kuang, Bo Li, Linjun Zhou, Zheyan Shen, Wei Cui

    Abstract: Nowadays fairness issues have raised great concerns in decision-making systems. Various fairness notions have been proposed to measure the degree to which an algorithm is unfair. In practice, there frequently exist a certain set of variables we term as fair variables, which are pre-decision covariates such as users' choices. The effects of fair variables are irrelevant in assessing the fairness of…

    Submitted 18 July, 2021; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: KDD 2020

  10. arXiv:2006.10255  [pdf, other]

    cs.LG stat.ML

    Calibrated Reliable Regression using Maximum Mean Discrepancy

    Authors: Peng Cui, Wenbo Hu, Jun Zhu

    Abstract: Accurate quantification of uncertainty is crucial for real-world applications of machine learning. However, modern deep neural networks still produce unreliable predictive uncertainty, often yielding over-confident predictions. In this paper, we are concerned with getting well-calibrated predictions in regression tasks. We propose the calibrated regression method using the maximum mean discrepancy…

    Submitted 27 October, 2020; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: Accepted to NeurIPS'2020. Full version with appendix

  11. arXiv:2006.05076  [pdf, other]

    cs.LG cs.AI stat.ML

    Stable Prediction via Leveraging Seed Variable

    Authors: Kun Kuang, Bo Li, Peng Cui, Yue Liu, Jianrong Tao, Yueting Zhuang, Fei Wu

    Abstract: In this paper, we focus on the problem of stable prediction across unknown test data, where the test distribution is agnostic and might be totally different from the training one. In such a case, previous machine learning methods might exploit subtly spurious correlations in training data induced by non-causal variables for prediction. Those spurious correlations are changeable across data, leadin…

    Submitted 9 June, 2020; originally announced June 2020.

  12. arXiv:2006.04414  [pdf, other]

    cs.LG stat.ML

    Stable Adversarial Learning under Distributional Shifts

    Authors: Jiashuo Liu, Zheyan Shen, Peng Cui, Linjun Zhou, Kun Kuang, Bo Li, Yishi Lin

    Abstract: Machine learning algorithms with empirical risk minimization are vulnerable under distributional shifts due to the greedy adoption of all the correlations found in training data. Recently, there are robust learning methods aiming at this problem by minimizing the worst-case risk over an uncertainty set. However, they equally treat all covariates to form the decision sets regardless of the stabilit…

    Submitted 10 May, 2021; v1 submitted 8 June, 2020; originally announced June 2020.

    Comments: 11 pages

    Journal ref: Association for the Advancement of Artificial Intelligence, 2021

  13. arXiv:2006.04330  [pdf, other]

    cs.LG stat.ML

    Eigen-GNN: A Graph Structure Preserving Plug-in for GNNs

    Authors: Ziwei Zhang, Peng Cui, Jian Pei, Xin Wang, Wenwu Zhu

    Abstract: Graph Neural Networks (GNNs) are emerging machine learning models on graphs. Although sufficiently deep GNNs are shown theoretically capable of fully preserving graph structures, most existing GNN models in practice are shallow and essentially feature-centric. We show empirically and analytically that the existing shallow GNNs cannot preserve graph structures well. To overcome this fundamental cha…

    Submitted 7 June, 2020; originally announced June 2020.

    Comments: 11 pages plus 8 pages for appendices, 4 figures

  14. arXiv:2004.00315  [pdf, other]

    cs.CV cs.LG stat.ML

    Learning to Select Base Classes for Few-shot Classification

    Authors: Linjun Zhou, Peng Cui, Xu Jia, Shiqiang Yang, Qi Tian

    Abstract: Few-shot learning has attracted intensive research attention in recent years. Many methods have been proposed to generalize a model learned from provided base classes to novel classes, but no previous work studies how to select base classes, or even whether different base classes will result in different generalization performance of the learned model. In this paper, we utilize a simple yet effect…

    Submitted 1 April, 2020; originally announced April 2020.

  15. arXiv:2003.01171  [pdf, other]

    cs.SI cs.CR cs.LG stat.ML

    A Semi-supervised Graph Attentive Network for Financial Fraud Detection

    Authors: Daixin Wang, Jianbin Lin, Peng Cui, Quanhui Jia, Zhen Wang, Yanming Fang, Quan Yu, Jun Zhou, Shuang Yang, Yuan Qi

    Abstract: With the rapid growth of financial services, fraud detection has been a very important problem to guarantee a healthy environment for both users and providers. Conventional solutions for fraud detection mainly use some rule-based methods or distract some features manually to perform prediction. However, in financial services, users have rich interactions and they themselves always show multifacete…

    Submitted 28 February, 2020; originally announced March 2020.

    Comments: icdm

  16. One-Class Graph Neural Networks for Anomaly Detection in Attributed Networks

    Authors: Xuhong Wang, Baihong Jin, Ying Du, Ping Cui, Yupu Yang

    Abstract: Nowadays, graph-structured data are increasingly used to model complex systems. Meanwhile, detecting anomalies from graph has become a vital research problem of pressing societal concerns. Anomaly detection is an unsupervised learning task of identifying rare data that differ from the majority. As one of the dominant anomaly detection algorithms, One Class Support Vector Machine has been widely us…

    Submitted 6 June, 2020; v1 submitted 21 February, 2020; originally announced February 2020.

    Comments: 16 pages, 4 figures. Neural Comput & Applic (2021)

  17. Structural Deep Clustering Network

    Authors: Deyu Bo, Xiao Wang, Chuan Shi, Meiqi Zhu, Emiao Lu, Peng Cui

    Abstract: Clustering is a fundamental task in data analysis. Recently, deep clustering, which derives inspiration primarily from deep learning approaches, achieves state-of-the-art performance and has attracted considerable attention. Current deep clustering methods usually boost the clustering results by means of the powerful representation ability of deep learning, e.g., autoencoder, suggesting that learn…

    Submitted 12 February, 2020; v1 submitted 4 February, 2020; originally announced February 2020.

    Comments: Published at The Web Conference (WWW) 2020, full paper

  18. arXiv:2001.11713  [pdf, other]

    cs.LG stat.ML

    Stable Prediction with Model Misspecification and Agnostic Distribution Shift

    Authors: Kun Kuang, Ruoxuan Xiong, Peng Cui, Susan Athey, Bo Li

    Abstract: For many machine learning algorithms, two main assumptions are required to guarantee performance. One is that the test data are drawn from the same distribution as the training data, and the other is that the model is correctly specified. In real applications, however, we often have little prior knowledge on the test data and on the underlying true model. Under model misspecification, agnostic dis…

    Submitted 31 January, 2020; originally announced January 2020.

  19. arXiv:2001.00293  [pdf, other]

    cs.LG cs.IR cs.SI stat.ML

    Deep Learning for Learning Graph Representations

    Authors: Wenwu Zhu, Xin Wang, Peng Cui

    Abstract: Mining graph data has become a popular research topic in computer science and has been widely studied in both academia and industry given the increasing amount of network data in the recent years. However, the huge amount of network data has posed great challenges for efficient analysis. This motivates the advent of graph representation which maps the graph into a low-dimension vector space, keepi…

    Submitted 1 January, 2020; originally announced January 2020.

    Comments: 51 pages, 8 figures

  20. arXiv:1911.12580  [pdf, other]

    cs.LG stat.ML

    Stable Learning via Sample Reweighting

    Authors: Zheyan Shen, Peng Cui, Tong Zhang, Kun Kuang

    Abstract: We consider the problem of learning linear prediction models with model misspecification bias. In such case, the collinearity among input variables may inflate the error of parameter estimation, resulting in instability of prediction results when training and test distributions do not match. In this paper we theoretically analyze this fundamental problem and propose a sample reweighting method tha…

    Submitted 28 November, 2019; originally announced November 2019.

    Comments: Accepted as poster paper at AAAI2020

  21. arXiv:1910.14238  [pdf, other]

    cs.LG cs.IR stat.ML

    Learning Disentangled Representations for Recommendation

    Authors: Jianxin Ma, Chang Zhou, Peng Cui, Hongxia Yang, Wenwu Zhu

    Abstract: User behavior data in recommender systems are driven by the complex interactions of many latent factors behind the users' decision making processes. The factors are highly entangled, and may range from high-level ones that govern user intentions, to low-level ones that characterize a user's preference when executing an intention. Learning representations that uncover and disentangle these latent f…

    Submitted 30 October, 2019; originally announced October 2019.

    Comments: To appear in the Proceedings of the Thirty-third Conference on Neural Information Processing Systems (NeurIPS 2019)

  22. arXiv:1908.01297  [pdf, other]

    cs.SI cs.CR cs.LG stat.ML

    A Restricted Black-box Adversarial Framework Towards Attacking Graph Embedding Models

    Authors: Heng Chang, Yu Rong, Tingyang Xu, Wenbing Huang, Honglei Zhang, Peng Cui, Wenwu Zhu, Junzhou Huang

    Abstract: With the great success of graph embedding model on both academic and industry area, the robustness of graph embedding against adversarial attack inevitably becomes a central problem in graph learning domain. Regardless of the fruitful progress, most of the current works perform the attack in a white-box fashion: they need to access the model predictions and labels to construct their adversarial lo…

    Submitted 17 December, 2019; v1 submitted 4 August, 2019; originally announced August 2019.

    Comments: Accepted by the AAAI 2020

  23. adVAE: A self-adversarial variational autoencoder with Gaussian anomaly prior knowledge for anomaly detection

    Authors: Xuhong Wang, Ying Du, Shijie Lin, Ping Cui, Yuntian Shen, Yupu Yang

    Abstract: Recently, deep generative models have become increasingly popular in unsupervised anomaly detection. However, deep generative models aim at recovering the data distribution rather than detecting anomalies. Besides, deep generative models have the risk of overfitting training samples, which has disastrous effects on anomaly detection performance. To solve the above two problems, we propose a Self-a…

    Submitted 14 November, 2019; v1 submitted 3 March, 2019; originally announced March 2019.

    Comments: This paper has been accepted by Knowledge-based Systems

  24. arXiv:1812.04202  [pdf, other]

    cs.LG cs.SI stat.ML

    Deep Learning on Graphs: A Survey

    Authors: Ziwei Zhang, Peng Cui, Wenwu Zhu

    Abstract: Deep learning has been shown to be successful in a number of domains, ranging from acoustics, images, to natural language processing. However, applying deep learning to the ubiquitous graph data is non-trivial because of the unique characteristics of graphs. Recently, substantial research efforts have been devoted to applying deep learning methods to graphs, resulting in beneficial advances in gra…

    Submitted 13 March, 2020; v1 submitted 10 December, 2018; originally announced December 2018.

    Comments: Accepted by Transactions on Knowledge and Data Engineering. 24 pages, 11 figures

  25. arXiv:1806.06270  [pdf, other]

    cs.LG stat.ML

    Stable Prediction across Unknown Environments

    Authors: Kun Kuang, Ruoxuan Xiong, Peng Cui, Susan Athey, Bo Li

    Abstract: In many important machine learning applications, the training distribution used to learn a probabilistic classifier differs from the testing distribution on which the classifier will be used to make predictions. Traditional methods correct the distribution shift by reweighting the training data with the ratio of the density between test and training data. In many applications training takes place…

    Submitted 10 July, 2018; v1 submitted 16 June, 2018; originally announced June 2018.

  26. arXiv:1805.02396  [pdf, ps, other]

    cs.SI cs.LG stat.ML

    Billion-scale Network Embedding with Iterative Random Projection

    Authors: Ziwei Zhang, Peng Cui, Haoyang Li, Xiao Wang, Wenwu Zhu

    Abstract: Network embedding, which learns low-dimensional vector representation for nodes in the network, has attracted considerable research attention recently. However, the existing methods are incapable of handling billion-scale networks, because they are computationally expensive and, at the same time, difficult to be accelerated by distributed computing schemes. To address these problems, we propose Ra…

    Submitted 10 September, 2018; v1 submitted 7 May, 2018; originally announced May 2018.

    Comments: Accepted by ICDM 2018. 10 pages, 8 figures, 2018 IEEE International Conference on Data Mining (ICDM)

  27. arXiv:1708.06656  [pdf, ps, other]

    cs.CV cs.MM stat.ML

    Causally Regularized Learning with Agnostic Data Selection Bias

    Authors: Zheyan Shen, Peng Cui, Kun Kuang, Bo Li, Peixuan Chen

    Abstract: Most of previous machine learning algorithms are proposed based on the i.i.d. hypothesis. However, this ideal assumption is often violated in real applications, where selection bias may arise between training and testing process. Moreover, in many scenarios, the testing data is not even available during the training process, which makes the traditional methods like transfer learning infeasible due…

    Submitted 19 August, 2018; v1 submitted 22 August, 2017; originally announced August 2017.

    Comments: Oral paper of 2018 ACM Multimedia Conference (MM'18)