Showing 1–36 of 36 results for author: Chang, X

  1. arXiv:2311.10101  [pdf, other]

    cs.CR cs.DS cs.LG stat.ML stat.OT

    Gaussian Differential Privacy on Riemannian Manifolds

    Authors: Yangdi Jiang, Xiaotian Chang, Yi Liu, Lei Ding, Linglong Kong, Bei Jiang

    Abstract: We develop an advanced approach for extending Gaussian Differential Privacy (GDP) to general Riemannian manifolds. The concept of GDP stands out as a prominent privacy definition that strongly warrants extension to manifold settings, due to its central limit properties. By harnessing the power of the renowned Bishop-Gromov theorem in geometric analysis, we propose a Riemannian Gaussian distributio…

    Submitted 8 November, 2023; originally announced November 2023.

  2. arXiv:2310.06746  [pdf, other]

    cs.LG stat.ME stat.ML

    Causal Rule Learning: Enhancing the Understanding of Heterogeneous Treatment Effect via Weighted Causal Rules

    Authors: Ying Wu, Hanzhong Liu, Kai Ren, Xiangyu Chang

    Abstract: Interpretability is a key concern in estimating heterogeneous treatment effects using machine learning methods, especially for healthcare applications where high-stake decisions are often made. Inspired by the Predictive, Descriptive, Relevant framework of interpretability, we propose causal rule learning which finds a refined set of causal rules characterizing potential subgroups to estimate and…

    Submitted 10 October, 2023; originally announced October 2023.

  3. Spectral co-Clustering in Multi-layer Directed Networks

    Authors: Wenqing Su, Xiao Guo, Xiangyu Chang, Ying Yang

    Abstract: Modern network analysis often involves multi-layer network data in which the nodes are aligned, and the edges on each layer represent one of the multiple relations among the nodes. Current literature on multi-layer network data is mostly limited to undirected relations. However, directed relations are more common and may introduce extra information. This study focuses on community detection (or clus…

    Submitted 16 June, 2024; v1 submitted 20 July, 2023; originally announced July 2023.

    Journal ref: Computational Statistics & Data Analysis (2024) 107987

  4. arXiv:2306.15709  [pdf, other]

    cs.SI cs.LG stat.ME stat.ML

    Privacy-Preserving Community Detection for Locally Distributed Multiple Networks

    Authors: Xiao Guo, Xiang Li, Xiangyu Chang, Shujie Ma

    Abstract: Modern multi-layer networks are commonly stored and analyzed in a local and distributed fashion because of the privacy, ownership, and communication costs. The literature on the model-based statistical methods for community detection based on these data is still limited. This paper proposes a new method for consensus community detection and estimation in a multi-layer stochastic block model using…

    Submitted 27 June, 2023; originally announced June 2023.

  5. arXiv:2304.06900  [pdf, other]

    stat.ME

    Subsampling-Based Modified Bayesian Information Criterion for Large-Scale Stochastic Block Models

    Authors: Jiayi Deng, Danyang Huang, Xiangyu Chang, Bo Zhang

    Abstract: Identifying the number of communities is a fundamental problem in community detection, which has received increasing attention recently. However, rapid advances in technology have led to the emergence of large-scale networks in various disciplines, thereby making existing methods computationally infeasible. To address this challenge, we propose a novel subsampling-based modified Bayesian informati…

    Submitted 13 April, 2023; originally announced April 2023.

  6. arXiv:2303.05223  [pdf, other]

    stat.ME

    LEAP: The latent exchangeability prior for borrowing information from historical data

    Authors: Ethan M. Alt, Xiuya Chang, Xun Jiang, Qing Liu, May Mo, H. Amy Xia, Joseph G. Ibrahim

    Abstract: It is becoming increasingly popular to elicit informative priors on the basis of historical data. Popular existing priors, including the power prior, commensurate prior, and robust meta-analytic prior provide blanket discounting. Thus, if only a subset of participants in the historical data are exchangeable with the current data, these priors may not be appropriate. In order to combat this issue,…

    Submitted 9 March, 2023; originally announced March 2023.

  7. arXiv:2210.16835  [pdf, other]

    stat.ML cs.LG

    Variance reduced Shapley value estimation for trustworthy data valuation

    Authors: Mengmeng Wu, Ruoxi Jia, Changle Lin, Wei Huang, Xiangyu Chang

    Abstract: Data valuation, especially quantifying data value in algorithmic prediction and decision-making, is a fundamental problem in data trading scenarios. The most widely used method is to define the data Shapley and approximate it by means of the permutation sampling algorithm. To make up for the large estimation variance of the permutation sampling that hinders the development of the data marketplace,…

    Submitted 22 May, 2023; v1 submitted 30 October, 2022; originally announced October 2022.

  8. arXiv:2209.13807  [pdf, other]

    stat.ME stat.ML

    Asynchronous and Error-prone Longitudinal Data Analysis via Functional Calibration

    Authors: Xinyue Chang, Yehua Li, Yi Li

    Abstract: In many longitudinal settings, time-varying covariates may not be measured at the same time as responses and are often prone to measurement error. Naive last-observation-carried-forward methods incur estimation biases, and existing kernel-based methods suffer from slow convergence rates and large variations. To address these challenges, we propose a new functional calibration approach to efficient…

    Submitted 8 March, 2023; v1 submitted 27 September, 2022; originally announced September 2022.

  9. arXiv:2209.12401  [pdf, ps, other]

    math.NA eess.SY math.OC stat.AP

    Elevator Optimization: Application of Spatial Process and Gibbs Random Field Approaches for Dumbwaiter Modeling and Multi-Dumbwaiter Systems

    Authors: Zheng Cao, Benjamin Lu Davis, Wanchaloem Wunkaew, Xinyu Chang

    Abstract: This research investigates analytical and quantitative methods for simulating elevator optimizations. To maximize overall elevator usage, we concentrate on creating a multiple-user positive-sum system that is inspired by agent-based game theory. We define and create basic "Dumbwaiter" models by attempting both the Spatial Process Approach and the Gibbs Random Field Approach. These two mathematical…

    Submitted 23 December, 2022; v1 submitted 25 September, 2022; originally announced September 2022.

    Comments: 14 pages

    MSC Class: 93-10; 60J05; 90B36 ACM Class: G.1.6; G.3; I.6.5

  10. arXiv:2206.15379  [pdf, ps, other]

    stat.ME

    On the efficacy of higher-order spectral clustering under weighted stochastic block models

    Authors: Xiao Guo, Hai Zhang, Xiangyu Chang

    Abstract: Higher-order structures of networks, namely, small subgraphs of networks (also called network motifs), are widely known to be crucial and essential to the organization of networks. There have been a few works studying the community detection problem -- a fundamental problem in network analysis -- at the level of motifs. In particular, higher-order spectral clustering has been developed, where the noti…

    Submitted 13 April, 2023; v1 submitted 30 June, 2022; originally announced June 2022.

  11. arXiv:2205.05343  [pdf, other]

    stat.ML cs.LG

    Learning Multitask Gaussian Bayesian Networks

    Authors: Shuai Liu, Yixuan Qiu, Baojuan Li, Huaning Wang, Xiangyu Chang

    Abstract: Major depressive disorder (MDD) requires study of brain functional connectivity alterations for patients, which can be uncovered by resting-state functional magnetic resonance imaging (rs-fMRI) data. We consider the problem of identifying alterations of brain functional connectivity for a single MDD patient. This is particularly difficult since the amount of data collected during an fMRI scan is t…

    Submitted 8 June, 2022; v1 submitted 11 May, 2022; originally announced May 2022.

  12. arXiv:2109.10053  [pdf, other]

    cs.LG stat.ME

    Toward a Fairness-Aware Scoring System for Algorithmic Decision-Making

    Authors: Yi Yang, Ying Wu, Mei Li, Xiangyu Chang, Yong Tan

    Abstract: Scoring systems, as a type of predictive model, have significant advantages in interpretability and transparency and facilitate quick decision-making. As such, scoring systems have been extensively used in a wide variety of industries such as healthcare and criminal justice. However, the fairness issues in these models have long been criticized, and the use of big data and machine learning algorit…

    Submitted 22 November, 2022; v1 submitted 21 September, 2021; originally announced September 2021.

  13. arXiv:2109.01326  [pdf, other]

    stat.ML cs.LG

    Statistical Estimation and Inference via Local SGD in Federated Learning

    Authors: Xiang Li, Jiadong Liang, Xiangyu Chang, Zhihua Zhang

    Abstract: Federated Learning (FL) makes a large number of edge computing devices (e.g., mobile phones) jointly learn a global model without data sharing. In FL, data are generated in a decentralized manner with high heterogeneity. This paper studies how to perform statistical estimation and inference in the federated setting. We analyze the so-called Local SGD, a multi-round estimation procedure that uses i…

    Submitted 17 December, 2021; v1 submitted 3 September, 2021; originally announced September 2021.

  14. arXiv:2103.00704  [pdf, other]

    stat.ML cs.LG

    FedPower: Privacy-Preserving Distributed Eigenspace Estimation

    Authors: Xiao Guo, Xiang Li, Xiangyu Chang, Shusen Wang, Zhihua Zhang

    Abstract: Eigenspace estimation is fundamental in machine learning and statistics, which has found applications in PCA, dimension reduction, and clustering, among others. The modern machine learning community usually assumes that data come from and belong to different organizations. The low communication power and the possible privacy breaches of data make the computation of eigenspace challenging. To addre…

    Submitted 27 June, 2023; v1 submitted 28 February, 2021; originally announced March 2021.

  15. arXiv:2101.09418  [pdf, other]

    stat.AP

    A Geospatial Functional Model For OCO-2 Data with Application on Imputation and Land Fraction Estimation

    Authors: Xinyue Chang, Zhengyuan Zhu, Xiongtao Dai, Jonathan Hobbs

    Abstract: Data from NASA's Orbiting Carbon Observatory-2 (OCO-2) satellite is essential to many carbon management strategies. A retrieval algorithm is used to estimate CO2 concentration using the radiance data measured by OCO-2. However, due to factors such as cloud cover and cosmic rays, the spatial coverage of the retrieval algorithm is limited in some areas of critical importance for carbon cycle science…

    Submitted 23 January, 2021; originally announced January 2021.

  16. arXiv:2012.08749  [pdf, other]

    cs.LG stat.ML

    Provable Benefits of Overparameterization in Model Compression: From Double Descent to Pruning Neural Networks

    Authors: Xiangyu Chang, Yingcong Li, Samet Oymak, Christos Thrampoulidis

    Abstract: Deep networks are typically trained with many more parameters than the size of the training dataset. Recent empirical evidence indicates that the practice of overparameterization not only benefits training large models, but also assists - perhaps counterintuitively - building lightweight models. Specifically, it suggests that overparameterization benefits model pruning / sparsification. This paper…

    Submitted 16 December, 2020; originally announced December 2020.

    Comments: to appear at AAAI 2021

  17. arXiv:2009.12362  [pdf, other]

    cs.LG stat.ML

    Self-Weighted Robust LDA for Multiclass Classification with Edge Classes

    Authors: Caixia Yan, Xiaojun Chang, Minnan Luo, Qinghua Zheng, Xiaoqin Zhang, Zhihui Li, Feiping Nie

    Abstract: Linear discriminant analysis (LDA) is a popular technique to learn the most discriminative features for multi-class classification. A vast majority of existing LDA algorithms are prone to be dominated by the class with very large deviation from the others, i.e., edge class, which occurs frequently in multi-class classification. First, the existence of edge classes often makes the total mean biased…

    Submitted 24 September, 2020; originally announced September 2020.

    Comments: 17 pages, has been accepted by ACM TIST

  18. arXiv:2009.01514  [pdf, other]

    math.NA stat.ML

    Kernel Interpolation of High Dimensional Scattered Data

    Authors: Shao-Bo Lin, Xiangyu Chang, Xingping Sun

    Abstract: Data sites selected from modeling high-dimensional problems often appear scattered in non-paternalistic ways. Except for sporadic clustering at some spots, they become relatively far apart as the dimension of the ambient space grows. These features defy any theoretical treatment that requires local or global quasi-uniformity of distribution of data sites. Incorporating a recently-developed applica…

    Submitted 27 September, 2021; v1 submitted 3 September, 2020; originally announced September 2020.

    Comments: 33 pages, 5 figures

  19. arXiv:2009.00236  [pdf, other]

    cs.LG stat.ML

    A Survey of Deep Active Learning

    Authors: Pengzhen Ren, Yun Xiao, Xiaojun Chang, Po-Yao Huang, Zhihui Li, Brij B. Gupta, Xiaojiang Chen, Xin Wang

    Abstract: Active learning (AL) attempts to maximize the performance gain of the model by marking the fewest samples. Deep learning (DL) is greedy for data and requires a large amount of data supply to optimize massive parameters, so that the model learns how to extract high-quality features. In recent years, due to the rapid development of internet technology, we are in an era of information torrents and we…

    Submitted 5 December, 2021; v1 submitted 30 August, 2020; originally announced September 2020.

  20. arXiv:2008.08844  [pdf, other]

    cs.LG stat.ML

    Complete the Missing Half: Augmenting Aggregation Filtering with Diversification for Graph Convolutional Networks

    Authors: Sitao Luan, Mingde Zhao, Chenqing Hua, Xiao-Wen Chang, Doina Precup

    Abstract: The core operation of current Graph Neural Networks (GNNs) is the aggregation enabled by the graph Laplacian or message passing, which filters the neighborhood node information. Though effective for various tasks, in this paper, we show that they are potentially a problematic factor underlying all GNN methods for learning on certain datasets, as they force the node representations to be similar, making…

    Submitted 2 November, 2022; v1 submitted 20 August, 2020; originally announced August 2020.

    Comments: New Frontiers in Graph Learning (GLFrontiers) Workshop (Oral), NeurIPS 2022

  21. arXiv:2008.08838  [pdf, ps, other]

    cs.LG stat.ML

    Training Matters: Unlocking Potentials of Deeper Graph Convolutional Neural Networks

    Authors: Sitao Luan, Mingde Zhao, Xiao-Wen Chang, Doina Precup

    Abstract: The performance limit of Graph Convolutional Networks (GCNs) and the fact that we cannot stack more of them to increase the performance, which we usually do for other deep learning paradigms, are pervasively thought to be caused by the limitations of the GCN layers, including insufficient expressive power, etc. However, if so, for a fixed architecture, it would be unlikely to lower the training di…

    Submitted 3 November, 2023; v1 submitted 20 August, 2020; originally announced August 2020.

    Comments: Accepted by 12th International Conference on Complex Networks and Their Applications

  22. arXiv:2006.13681  [pdf, other]

    cs.CV cs.LG stat.ML

    Multi-view Drone-based Geo-localization via Style and Spatial Alignment

    Authors: Siyi Hu, Xiaojun Chang

    Abstract: In this paper, we focus on the task of multi-view multi-source geo-localization, which serves as an important auxiliary method of GPS positioning by matching drone-view image and satellite-view image with pre-annotated GPS tag. To solve this problem, most existing methods adopt metric loss with a weighted classification block to force the generation of common feature space shared by different vie…

    Submitted 8 July, 2020; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: 9 pages 9 figures. arXiv admin note: text overlap with arXiv:2002.12186 by other authors

    ACM Class: I.4.7; I.2.10

  23. arXiv:2006.02903  [pdf, other]

    cs.LG stat.ML

    A Comprehensive Survey of Neural Architecture Search: Challenges and Solutions

    Authors: Pengzhen Ren, Yun Xiao, Xiaojun Chang, Po-Yao Huang, Zhihui Li, Xiaojiang Chen, Xin Wang

    Abstract: Deep learning has made substantial breakthroughs in many fields due to its powerful automatic representation capabilities. It has been proven that neural architecture design is crucial to the feature representation of data and the final performance. However, the design of the neural architecture heavily relies on the researchers' prior knowledge and experience. And due to the limitations of hu…

    Submitted 2 March, 2021; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: Accepted by ACM Computing Surveys 2021

  24. arXiv:2005.11650  [pdf, other]

    cs.LG stat.ML

    Connecting the Dots: Multivariate Time Series Forecasting with Graph Neural Networks

    Authors: Zonghan Wu, Shirui Pan, Guodong Long, Jing Jiang, Xiaojun Chang, Chengqi Zhang

    Abstract: Modeling multivariate time series has long been a subject that has attracted researchers from a diverse range of fields including economics, finance, and traffic. A basic assumption behind multivariate time series forecasting is that its variables depend on one another but, upon looking closely, it is fair to say that existing methods fail to fully exploit latent spatial dependencies between pairs…

    Submitted 24 May, 2020; originally announced May 2020.

    Comments: Accepted by KDD 2020

  25. arXiv:2004.12164  [pdf, other]

    stat.ML cs.LG cs.SI stat.ME

    Randomized spectral co-clustering for large-scale directed networks

    Authors: Xiao Guo, Yixuan Qiu, Hai Zhang, Xiangyu Chang

    Abstract: Directed networks are broadly used to represent asymmetric relationships among units. Co-clustering aims to cluster the senders and receivers of directed networks simultaneously. In particular, the well-known spectral clustering algorithm could be modified as the spectral co-clustering to co-cluster directed networks. However, large-scale networks pose great computational challenges to it. In this…

    Submitted 9 April, 2022; v1 submitted 25 April, 2020; originally announced April 2020.

  26. arXiv:2004.10956  [pdf, other]

    cs.CV cs.LG stat.ML

    Few-Shot Class-Incremental Learning

    Authors: Xiaoyu Tao, Xiaopeng Hong, Xinyuan Chang, Songlin Dong, Xing Wei, Yihong Gong

    Abstract: The ability to incrementally learn new classes is crucial to the development of real-world artificial intelligence systems. In this paper, we focus on a challenging but practical few-shot class-incremental learning (FSCIL) problem. FSCIL requires CNN models to incrementally learn new classes from very few labelled samples, without forgetting the previously learned ones. To address this problem, we…

    Submitted 23 April, 2020; v1 submitted 22 April, 2020; originally announced April 2020.

    Comments: Accepted by CVPR 2020 (oral)

  27. arXiv:2003.07017  [pdf, ps, other]

    stat.ML cs.LG

    Uncertainty Quantification for Demand Prediction in Contextual Dynamic Pricing

    Authors: Yining Wang, Xi Chen, Xiangyu Chang, Dongdong Ge

    Abstract: Data-driven sequential decision has found a wide range of applications in modern operations management, such as dynamic pricing, inventory control, and assortment optimization. Most existing research on data-driven sequential decision focuses on designing an online policy to maximize the revenue. However, the research on uncertainty quantification on the underlying true model function (e.g., deman…

    Submitted 31 August, 2020; v1 submitted 16 March, 2020; originally announced March 2020.

  28. arXiv:2003.03691  [pdf, other]

    stat.ML cs.LG

    Angle-Based Cost-Sensitive Multicategory Classification

    Authors: Yi Yang, Yuxuan Guo, Xiangyu Chang

    Abstract: Many real-world classification problems come with costs which can vary for different types of misclassification. It is thus important to develop cost-sensitive classifiers which minimize the total misclassification cost. Although binary cost-sensitive classifiers have been well-studied, solving multicategory classification problems is still challenging. A popular approach to address this issue is…

    Submitted 7 March, 2020; originally announced March 2020.

  29. arXiv:2002.00839  [pdf, other]

    cs.SI cs.LG stat.ME stat.ML

    Randomized Spectral Clustering in Large-Scale Stochastic Block Models

    Authors: Hai Zhang, Xiao Guo, Xiangyu Chang

    Abstract: Spectral clustering has been one of the widely used methods for community detection in networks. However, large-scale networks bring computational challenges to the eigenvalue decomposition therein. In this paper, we study the spectral clustering using randomized sketching algorithms from a statistical perspective, where we typically assume the network data are generated from a stochastic block mo…

    Submitted 6 January, 2022; v1 submitted 19 January, 2020; originally announced February 2020.

  30. arXiv:2001.02879   

    cs.LG stat.ML

    Adaptive Stopping Rule for Kernel-based Gradient Descent Algorithms

    Authors: Xiangyu Chang, Shao-Bo Lin

    Abstract: In this paper, we propose an adaptive stopping rule for kernel-based gradient descent (KGD) algorithms. We introduce the empirical effective dimension to quantify the increments of iterations in KGD and derive an implementable early stopping strategy. We analyze the performance of the adaptive stopping rule in the framework of learning theory. Using the recently developed integral operator approac…

    Submitted 13 June, 2023; v1 submitted 9 January, 2020; originally announced January 2020.

    Comments: There is a critical error in the proof

  31. arXiv:1906.09205  [pdf, other]

    cs.LG cs.AI cs.RO stat.ML

    Continual Reinforcement Learning with Diversity Exploration and Adversarial Self-Correction

    Authors: Fengda Zhu, Xiaojun Chang, Runhao Zeng, Mingkui Tan

    Abstract: Deep reinforcement learning has made significant progress in the field of continuous control, such as physical control and autonomous driving. However, it is challenging for a reinforcement model to learn a policy for each task sequentially due to catastrophic forgetting. Specifically, the model would forget knowledge it learned in the past when trained on a new task. We consider this challenge fr…

    Submitted 21 June, 2019; originally announced June 2019.

  32. arXiv:1906.02174  [pdf, other]

    cs.LG cs.AI stat.ML

    Break the Ceiling: Stronger Multi-scale Deep Graph Convolutional Networks

    Authors: Sitao Luan, Mingde Zhao, Xiao-Wen Chang, Doina Precup

    Abstract: Recently, neural network based approaches have achieved significant improvement for solving large, complex, graph-structured problems. However, their bottlenecks still need to be addressed, and the advantages of multi-scale information and deep architectures have not been sufficiently exploited. In this paper, we theoretically analyze how existing Graph Convolutional Networks (GCNs) have limited e…

    Submitted 8 September, 2019; v1 submitted 5 June, 2019; originally announced June 2019.

    Comments: Accepted and to be published by NeurIPS 2019

  33. arXiv:1702.08701  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Learning rates for classification with Gaussian kernels

    Authors: Shao-Bo Lin, Jinshan Zeng, Xiangyu Chang

    Abstract: This paper aims at refined error analysis for binary classification using support vector machine (SVM) with Gaussian kernel and convex loss. Our first result shows that for some loss functions such as the truncated quadratic loss and quadratic loss, SVM with Gaussian kernel can reach the almost optimal learning rate, provided the regression function is smooth. Our second result shows that, for a l…

    Submitted 5 October, 2017; v1 submitted 28 February, 2017; originally announced February 2017.

    Comments: This paper has been accepted by Neural Computation

  34. arXiv:1702.01229  [pdf, other]

    cs.LG stat.ML

    Simple to Complex Cross-modal Learning to Rank

    Authors: Minnan Luo, Xiaojun Chang, Zhihui Li, Liqiang Nie, Alexander G. Hauptmann, Qinghua Zheng

    Abstract: The heterogeneity-gap between different modalities brings a significant challenge to multimedia information retrieval. Some studies formalize the cross-modal retrieval tasks as a ranking problem and learn a shared multi-modal embedding space to measure the cross-modality similarity. However, previous methods often establish the shared embedding space based on linear mapping functions which might n…

    Submitted 7 July, 2017; v1 submitted 3 February, 2017; originally announced February 2017.

    Comments: 14 pages; Accepted by Computer Vision and Image Understanding

  35. Mobile Localization in Non-Line-of-Sight Using Constrained Square-Root Unscented Kalman Filter

    Authors: Siamak Yousefi, Xiao-Wen Chang, Benoit Champagne

    Abstract: Localization and tracking of a mobile node (MN) in non-line-of-sight (NLOS) scenarios, based on time of arrival (TOA) measurements, is considered in this work. To this end, we develop a constrained form of square root unscented Kalman filter (SRUKF), where the sigma points of the unscented transformation are projected onto the feasible region by solving constrained optimization problems. The feasi…

    Submitted 1 May, 2014; originally announced May 2014.

    Comments: Under review by IEEE Trans. on Vehicular Technology

  36. arXiv:1403.7890  [pdf, other]

    stat.ML cs.LG stat.ME

    Sparse K-Means with $\ell_{\infty}/\ell_0$ Penalty for High-Dimensional Data Clustering

    Authors: Xiangyu Chang, Yu Wang, Rongjian Li, Zongben Xu

    Abstract: Sparse clustering, which aims to find a proper partition of an extremely high-dimensional data set with redundant noise features, has attracted more and more interest in recent years. The existing studies commonly solve the problem in a framework of maximizing the weighted feature contributions subject to an $\ell_2/\ell_1$ penalty. Nevertheless, this framework has two serious drawbacks: One…

    Submitted 31 March, 2014; originally announced March 2014.

    Comments: 36 pages, 4 figures; presented at ICSA 2013

    Report number: SS-2015-0261

    Journal ref: Statistica Sinica 28 (2018) 1265-1284