Skip to main content

Showing 1–9 of 9 results for author: Zeng, P

Searching in archive stat. Search in all archives.
.
  1. arXiv:2310.16428  [pdf, ps, other

    stat.AP

    Similarity-driven and Task-driven Models for Diversity of Opinion in Crowdsourcing Markets

    Authors: Chen Jason Zhang, Yunrui Liu, Pengcheng Zeng, Ting Wu, Lei Chen, Pan Hui, Fei Hao

    Abstract: The recent boom in crowdsourcing has opened up a new avenue for utilizing human intelligence in the realm of data analysis. This innovative approach provides a powerful means for connecting online workers to tasks that cannot effectively be done solely by machines or conducted by professional experts due to cost constraints. Within the field of social science, four elements are required to constru… ▽ More

    Submitted 28 February, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: 37 pages, 11 figures

  2. arXiv:2305.07481  [pdf, other

    stat.CO

    Extended ADMM for general penalized quantile regression with linear constraints in big data

    Authors: Yongxin Liu, Peng Zeng

    Abstract: Quantile regression (QR) can be used to describe the comprehensive relationship between a response and predictors. Prior domain knowledge and assumptions in application are usually formulated as constraints of parameters to improve the estimation efficiency. This paper develops methods based on multi-block ADMM to fit general penalized QR with linear constraints of regression coefficients. Differe… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

  3. arXiv:2211.10541  [pdf, ps, other

    math.ST stat.CO

    Phase transition and higher order analysis of $L_q$ regularization under dependence

    Authors: Hanwen Huang, Peng Zeng, Qinglong Yang

    Abstract: We study the problem of estimating a $k$-sparse signal ${\mbox{$β$}}_0\in{\bf R}^p$ from a set of noisy observations ${\bf y}\in{\bf R}^n$ under the model ${\bf y}={\bf X}{\mbox{$β$}}+{\bf w}$, where ${\bf X}\in{\bf R}^{n\times p}$ is the measurement matrix the row of which is drawn from distribution $N(0,{\mbox{$Σ$}})$. We consider the class of $L_q$-regularized least squares (LQLS) given by the… ▽ More

    Submitted 1 December, 2022; v1 submitted 18 November, 2022; originally announced November 2022.

    Comments: 35 pages, 11 figures

  4. arXiv:2205.09523  [pdf, other

    stat.ML cs.LG

    scICML: Information-theoretic Co-clustering-based Multi-view Learning for the Integrative Analysis of Single-cell Multi-omics data

    Authors: Pengcheng Zeng, Zhixiang Lin

    Abstract: Modern high-throughput sequencing technologies have enabled us to profile multiple molecular modalities from the same single cell, providing unprecedented opportunities to assay celluar heterogeneity from multiple biological layers. However, the datasets generated from these technologies tend to have high level of noise and are highly sparse, bringing challenges to data analysis. In this paper, we… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: 11 pages; 1 figure

  5. A Model-free Variable Screening Method Based on Leverage Score

    Authors: Wenxuan Zhong, Yiwen Liu, Peng Zeng

    Abstract: With rapid advances in information technology, massive datasets are collected in all fields of science, such as biology, chemistry, and social science. Useful or meaningful information is extracted from these data often through statistical learning or model fitting. In massive datasets, both sample size and number of predictors can be large, in which case conventional methods face computational ch… ▽ More

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: Journal of the American Statistical Association, published online: 21 Jun 2021

  6. arXiv:2011.02304  [pdf, ps, other

    stat.ME

    Joint Curve Registration and Classification with Two-level Functional Models

    Authors: Lin Tang, Pengcheng Zeng, Jian Qing Shi, Won-Seok Kim

    Abstract: Many classification techniques when the data are curves or functions have been recently proposed. However, the presence of misaligned problems in the curves can influence the performance of most of them. In this paper, we propose a model-based approach for simultaneous curve registration and classification. The method is proposed to perform curve classification based on a functional logistic regre… ▽ More

    Submitted 4 November, 2020; originally announced November 2020.

    Comments: 27 pages,8 figures

  7. arXiv:2003.12970  [pdf, other

    stat.ML cs.LG

    Elastic Coupled Co-clustering for Single-Cell Genomic Data

    Authors: Pengcheng Zeng, Zhixiang Lin

    Abstract: The recent advances in single-cell technologies have enabled us to profile genomic features at unprecedented resolution and datasets from multiple domains are available, including datasets that profile different types of genomic features and datasets that profile the same type of genomic features across different species. These datasets typically have different powers in identifying the unknown ce… ▽ More

    Submitted 5 June, 2020; v1 submitted 29 March, 2020; originally announced March 2020.

    Comments: 18 pages, 3 figures, 2 tables

  8. arXiv:1711.04761  [pdf, other

    stat.ME

    Simultaneous Registration and Clustering for Multi-dimensional Functional Data

    Authors: Pengcheng Zeng, Jian Qing Shi, Won-Seok Kim

    Abstract: The clustering for functional data with misaligned problems has drawn much attention in the last decade. Most methods do the clustering after those functional data being registered and there has been little research using both functional and scalar variables. In this paper, we propose a simultaneous registration and clustering (SRC) model via two-level models, allowing the use of both types of var… ▽ More

    Submitted 13 November, 2017; originally announced November 2017.

    Comments: 36 pages, 13 figures

  9. arXiv:1408.2794  [pdf, other

    q-fin.ST stat.AP

    Sector-Based Factor Models for Asset Returns

    Authors: Angela Gu, Patrick Zeng

    Abstract: Factor analysis is a statistical technique employed to evaluate how observed variables correlate through common factors and unique variables. While it is often used to analyze price movement in the unstable stock market, it does not always yield easily interpretable results. In this study, we develop improved factor models by explicitly incorporating sector information on our studied stocks. We ad… ▽ More

    Submitted 11 August, 2014; originally announced August 2014.

    Comments: 10 pages, 6 figures