Skip to main content

Showing 1–33 of 33 results for author: Zeng, J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2407.00292  [pdf, other

    stat.OT stat.AP

    Interpret the estimand framework from a causal inference perspective

    Authors: **ghong Zeng

    Abstract: The estimand framework proposed by ICH in 2017 has brought fundamental changes in the pharmaceutical industry. It clearly describes how a treatment effect in a clinical question should be precisely defined and estimated, through attributes including treatments, endpoints and intercurrent events. However, ideas around the estimand framework are commonly in text, and different interpretations on thi… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

  2. arXiv:2302.14505  [pdf

    stat.AP stat.ME

    Nonlinear regression models to forecast PM$_{2.5}$ concentration in Wuhan, China

    Authors: **ghong Zeng

    Abstract: Forecasting PM$_{2.5}$ concentration is important to solving air pollution problems in Wuhan. This paper proposes a PM$_{2.5}$ concentration forecast model based on nonlinear regression, including a single-value forecast model and an interval forecast model. The single-value forecast model can precisely forecast PM$_{2.5}$ concentration for the next day, with forecast bias about 6 $μg/m^3$ in good… ▽ More

    Submitted 28 February, 2023; originally announced February 2023.

    Comments: In Chinese, supervised by Yurong Chen

  3. arXiv:2302.14469  [pdf, other

    stat.ME stat.AP

    Bayesian inference on average treatment effects in the PreventS trial data in the presence of unmeasured confounding

    Authors: **ghong Zeng

    Abstract: Using the PreventS trial data, our objective is to estimate average effects of a Health Wellness Coaching (HWC) intervention on improvement of cardiovascular health at 9 months post randomization and in three consecutive 3-month periods over 9 months post randomization. Conventional approaches, including instrumental variable models, are not applicable in the presence of multiple correlated multiv… ▽ More

    Submitted 28 February, 2023; originally announced February 2023.

    Comments: Supervised by Alain C. Vandal

  4. arXiv:2210.11834  [pdf, other

    cs.LG stat.ML

    Optimal Contextual Bandits with Knapsacks under Realizability via Regression Oracles

    Authors: Yuxuan Han, Jialin Zeng, Yang Wang, Yang Xiang, Jiheng Zhang

    Abstract: We study the stochastic contextual bandit with knapsacks (CBwK) problem, where each action, taken upon a context, not only leads to a random reward but also costs a random resource consumption in a vector form. The challenge is to maximize the total reward without violating the budget for each resource. We study this problem under a general realizability setting where the expected reward and expec… ▽ More

    Submitted 22 February, 2023; v1 submitted 21 October, 2022; originally announced October 2022.

    Comments: AISTATS2023

  5. arXiv:2209.05742  [pdf, other

    cs.LG cs.CR cs.GT stat.ML

    A Tale of HodgeRank and Spectral Method: Target Attack Against Rank Aggregation Is the Fixed Point of Adversarial Game

    Authors: Ke Ma, Qianqian Xu, **shan Zeng, Guorong Li, Xiaochun Cao, Qingming Huang

    Abstract: Rank aggregation with pairwise comparisons has shown promising results in elections, sports competitions, recommendations, and information retrieval. However, little attention has been paid to the security issue of such algorithms, in contrast to numerous research work on the computational and statistical characteristics. Driven by huge profits, the potential adversary has strong motivation and in… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: 33 pages, https://github.com/alphaprime/Target_Attack_Rank_Aggregation

    Journal ref: Early Access by TPAMI 2022 (https://ieeexplore.ieee.org/document/9830042)

  6. arXiv:2209.00869  [pdf, other

    stat.ME

    A Survey of Causal Inference Frameworks

    Authors: **gying Zeng, Run Wang

    Abstract: Causal inference is a science with multi-disciplinary evolution and applications. On the one hand, it measures effects of treatments in observational data based on experimental designs and rigorous statistical inference to draw causal statements. One of the most influential framework in quantifying causal effects is the potential outcomes framework. On the other hand, causal graphical models utili… ▽ More

    Submitted 2 September, 2022; originally announced September 2022.

  7. arXiv:2207.12630  [pdf, other

    stat.ME

    Bayesian Causal Inference in Sequentially Randomized Experiments with Noncompliance

    Authors: **gying Zeng

    Abstract: Scientific researchers utilize randomized experiments to draw casual statements. Most early studies as well as current work on experiments with sequential intervention decisions has been focusing on estimating the causal effects among sequential treatments, ignoring the non-compliance issues that experimental units might not be compliant with the treatment assignments that they were originally all… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

  8. arXiv:2207.11932  [pdf, other

    stat.ME

    Semiparametric Estimation on Multi-treatment Causal Effects via Cross-Fitting

    Authors: **gying Zeng

    Abstract: Causal inference is a critical research area with multi-disciplinary origins and applications, ranging from statistics, computer science, economics, psychology to public health. In many scientific research, randomized experiments provide a golden standard for estimation of causal effects for decades. However, in many situations, randomized experiments are not feasible in practice so that practitio… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

  9. arXiv:2101.11190  [pdf, other

    stat.AP

    Boost-S: Gradient Boosted Trees for Spatial Data and Its Application to FDG-PET Imaging Data

    Authors: Reza Iranzad, Xiao Liu, W. Art Chaovalitwongse, Daniel S. Hippe, Shouyi Wang, Jie Han, Phawis Thammasorn, Chunyan Duan, **g Zeng, Stephen R. Bowen

    Abstract: Boosting Trees are one of the most successful statistical learning approaches that involve sequentially growing an ensemble of simple regression trees (i.e., "weak learners"). However, gradient boosted trees are not yet available for spatially correlated data. This paper proposes a new gradient Boosted Trees algorithm for Spatial Data (Boost-S) with covariate information. Boost-S integrates the sp… ▽ More

    Submitted 3 February, 2021; v1 submitted 26 January, 2021; originally announced January 2021.

  10. arXiv:2009.05923  [pdf, other

    cs.LG stat.ML

    Contrastive Self-supervised Learning for Graph Classification

    Authors: Jiaqi Zeng, Pengtao Xie

    Abstract: Graph classification is a widely studied problem and has broad applications. In many real-world problems, the number of labeled graphs available for training classification models is limited, which renders these models prone to overfitting. To address this problem, we propose two approaches based on contrastive self-supervised learning (CSSL) to alleviate overfitting. In the first approach, we use… ▽ More

    Submitted 13 September, 2020; originally announced September 2020.

  11. arXiv:2009.02528  [pdf, other

    stat.AP eess.SP

    Structured Sparsity Modeling for Improved Multivariate Statistical Analysis based Fault Isolation

    Authors: Wei Chen, Jiusun Zeng, Xiaobin Xu, Shihua Luo, Chuanhou Gao

    Abstract: In order to improve the fault diagnosis capability of multivariate statistical methods, this article introduces a fault isolation framework based on structured sparsity modeling. The developed method relies on the reconstruction based contribution analysis and the process structure information can be incorporated into the reconstruction objective function in the form of structured sparsity regular… ▽ More

    Submitted 21 December, 2020; v1 submitted 5 September, 2020; originally announced September 2020.

    Comments: 36 pages, 12 figures

  12. arXiv:2009.02517  [pdf, other

    stat.CO

    Uncertainty modelling and computational aspects of data association

    Authors: Jeremie Houssineau, Jiajie Zeng, Ajay Jasra

    Abstract: A novel solution to the smoothing problem for multi-object dynamical systems is proposed and evaluated. The systems of interest contain an unknown and varying number of dynamical objects that are partially observed under noisy and corrupted observations. An alternative representation of uncertainty is considered in order to account for the lack of information about the different aspects of this ty… ▽ More

    Submitted 5 September, 2020; originally announced September 2020.

  13. arXiv:2008.03733  [pdf, other

    stat.ME

    Generalized Liquid Association Analysis for Multimodal Data Integration

    Authors: Lexin Li, **g Zeng, Xin Zhang

    Abstract: Multimodal data are now prevailing in scientific research. A central question in multimodal integrative analysis is to understand how two data modalities associate and interact with each other given another modality or demographic variables. The problem can be formulated as studying the associations among three sets of random variables, a question that has received relatively less attention in the… ▽ More

    Submitted 24 April, 2021; v1 submitted 9 August, 2020; originally announced August 2020.

  14. arXiv:2007.02010  [pdf, other

    cs.CV cs.LG math.DS stat.AP stat.ML

    DessiLBI: Exploring Structural Sparsity of Deep Networks via Differential Inclusion Paths

    Authors: Yanwei Fu, Chen Liu, Donghao Li, Xinwei Sun, **shan Zeng, Yuan Yao

    Abstract: Over-parameterization is ubiquitous nowadays in training neural networks to benefit both optimization in seeking global optima and generalization in reducing prediction error. However, compressive networks are desired in many real world applications and direct training of small networks may be trapped in local optima. In this paper, instead of pruning or distilling over-parameterized models to com… ▽ More

    Submitted 4 July, 2020; originally announced July 2020.

    Comments: conference , 23 pages https://github.com/corwinliu9669/dS2LBI. arXiv admin note: text overlap with arXiv:1905.09449

    Journal ref: ICML 2020

  15. arXiv:2005.07916  [pdf

    physics.comp-ph cs.AI cs.LG cs.NE stat.ML

    Deep-learning of Parametric Partial Differential Equations from Sparse and Noisy Data

    Authors: Hao Xu, Dongxiao Zhang, Junsheng Zeng

    Abstract: Data-driven methods have recently made great progress in the discovery of partial differential equations (PDEs) from spatial-temporal data. However, several challenges remain to be solved, including sparse noisy data, incomplete candidate library, and spatially- or temporally-varying coefficients. In this work, a new framework, which combines neural network, genetic algorithm and adaptive methods,… ▽ More

    Submitted 16 May, 2020; originally announced May 2020.

    Comments: 30 pages, 6 figures, and 7 tables

    Journal ref: Phys. Fluids, 33, 037132, 10.1063/5.0042868, 2021

  16. arXiv:2004.03329  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    MedDialog: Two Large-scale Medical Dialogue Datasets

    Authors: Xuehai He, Shu Chen, Zeqian Ju, Xiangyu Dong, Hongchao Fang, Sicheng Wang, Yue Yang, Jiaqi Zeng, Ruisi Zhang, Ruoyu Zhang, Meng Zhou, Penghui Zhu, Pengtao Xie

    Abstract: Medical dialogue systems are promising in assisting in telemedicine to increase access to healthcare services, improve the quality of patient care, and reduce medical costs. To facilitate the research and development of medical dialogue systems, we build two large-scale medical dialogue datasets: MedDialog-EN and MedDialog-CN. MedDialog-EN is an English dataset containing 0.3 million conversations… ▽ More

    Submitted 7 July, 2020; v1 submitted 7 April, 2020; originally announced April 2020.

  17. arXiv:2004.00179  [pdf, other

    cs.LG math.ST stat.ML

    Fully-Corrective Gradient Boosting with Squared Hinge: Fast Learning Rates and Early Stop**

    Authors: **shan Zeng, Min Zhang, Shao-Bo Lin

    Abstract: Boosting is a well-known method for improving the accuracy of weak learners in machine learning. However, its theoretical generalization guarantee is missing in literature. In this paper, we propose an efficient boosting method with theoretical generalization guarantees for binary classification. Three key ingredients of the proposed boosting method are: a) the \textit{fully-corrective greedy} (FC… ▽ More

    Submitted 31 March, 2020; originally announced April 2020.

    Comments: 14 pages

  18. arXiv:2002.12135  [pdf, other

    cs.LG eess.SP stat.ML

    Block Hankel Tensor ARIMA for Multiple Short Time Series Forecasting

    Authors: Qiquan Shi, Jiaming Yin, Jiajun Cai, Andrzej Cichocki, Tatsuya Yokota, Lei Chen, Mingxuan Yuan, Jia Zeng

    Abstract: This work proposes a novel approach for multiple time series forecasting. At first, multi-way delay embedding transform (MDT) is employed to represent time series as low-rank block Hankel tensors (BHT). Then, the higher-order tensors are projected to compressed core tensors by applying Tucker decomposition. At the same time, the generalized tensor Autoregressive Integrated Moving Average (ARIMA) i… ▽ More

    Submitted 25 February, 2020; originally announced February 2020.

    Comments: Accepted by AAAI 2020

  19. arXiv:1912.04521  [pdf, other

    cs.LG stat.ML

    Transfer Learning-Based Outdoor Position Recovery with Telco Data

    Authors: Yige Zhang, Aaron Yi Ding, Jorg Ott, Mingxuan Yuan, Jia Zeng, Kun Zhang, Weixiong Rao

    Abstract: Telecommunication (Telco) outdoor position recovery aims to localize outdoor mobile devices by leveraging measurement report (MR) data. Unfortunately, Telco position recovery requires sufficient amount of MR samples across different areas and suffers from high data collection cost. For an area with scarce MR samples, it is hard to achieve good accuracy. In this paper, by leveraging the recently de… ▽ More

    Submitted 10 December, 2019; originally announced December 2019.

  20. arXiv:1912.00362  [pdf, other

    cs.LG math.OC stat.ML

    Fast Stochastic Ordinal Embedding with Variance Reduction and Adaptive Step Size

    Authors: Ke Ma, **shan Zeng, Qianqian Xu, Xiaochun Cao, Wei Liu, Yuan Yao

    Abstract: Learning representation from relative similarity comparisons, often called ordinal embedding, gains rising attention in recent years. Most of the existing methods are based on semi-definite programming (\textit{SDP}), which is generally time-consuming and degrades the scalability, especially confronting large-scale data. To overcome this challenge, we propose a stochastic algorithm called \textit{… ▽ More

    Submitted 1 December, 2019; originally announced December 2019.

    Comments: 19 pages, 5 figures, accepted by IEEE Transaction on Knowledge and Data Engineering, Conference Version: arXiv:1711.06446

  21. arXiv:1911.10558  [pdf, other

    cs.LG math.OC stat.ML

    Fast Polynomial Kernel Classification for Massive Data

    Authors: **shan Zeng, Minrun Wu, Shao-Bo Lin, Ding-Xuan Zhou

    Abstract: In the era of big data, it is desired to develop efficient machine learning algorithms to tackle massive data challenges such as storage bottleneck, algorithmic scalability, and interpretability. In this paper, we develop a novel efficient classification algorithm, called fast polynomial kernel classification (FPC), to conquer the scalability and storage challenges. Our main tools are a suitable s… ▽ More

    Submitted 11 November, 2022; v1 submitted 24 November, 2019; originally announced November 2019.

    Comments: arXiv admin note: text overlap with arXiv:1402.4735 by other authors

  22. arXiv:1905.09449  [pdf, other

    cs.LG math.DS math.OC stat.ML

    Exploring Structural Sparsity of Deep Networks via Inverse Scale Spaces

    Authors: Yanwei Fu, Chen Liu, Donghao Li, Zuyuan Zhong, Xinwei Sun, **shan Zeng, Yuan Yao

    Abstract: The great success of deep neural networks is built upon their over-parameterization, which smooths the optimization landscape without degrading the generalization ability. Despite the benefits of over-parameterization, a huge amount of parameters makes deep networks cumbersome in daily life applications. Though techniques such as pruning and distillation are developed, they are expensive in fully… ▽ More

    Submitted 21 April, 2022; v1 submitted 22 May, 2019; originally announced May 2019.

    Comments: This is the journal extension version of the ICML conference paper, "DessiLBI: Exploring Structural Sparsity of Deep Networks via Differential Inclusion Paths"

    Journal ref: International Conference on Machine Learning. PMLR, 2020, pp. 3315--3326

  23. arXiv:1902.02060  [pdf, other

    cs.LG math.OC stat.ML

    On ADMM in Deep Learning: Convergence and Saturation-Avoidance

    Authors: **shan Zeng, Shao-Bo Lin, Yuan Yao, Ding-Xuan Zhou

    Abstract: In this paper, we develop an alternating direction method of multipliers (ADMM) for deep neural networks training with sigmoid-type activation functions (called \textit{sigmoid-ADMM pair}), mainly motivated by the gradient-free nature of ADMM in avoiding the saturation of sigmoid-type activations and the advantages of deep neural networks with sigmoid-type activations (called deep sigmoid nets) ov… ▽ More

    Submitted 15 September, 2021; v1 submitted 6 February, 2019; originally announced February 2019.

    Comments: This is a revised version of our previous one entitled "A Convergence Analysis of Nonlinearly Constrained ADMM in Deep Learning, arXiv:1902.02060" with some significantly changes

    Journal ref: Journal of Machine Learning Research 22 (2021) 1-67

  24. arXiv:1811.12535  [pdf, other

    cs.LG cs.AI stat.ML

    The Relevance of Bayesian Layer Positioning to Model Uncertainty in Deep Bayesian Active Learning

    Authors: Jiaming Zeng, Adam Lesnikowski, Jose M. Alvarez

    Abstract: One of the main challenges of deep learning tools is their inability to capture model uncertainty. While Bayesian deep learning can be used to tackle the problem, Bayesian neural networks often require more time and computational power to train than deterministic networks. Our work explores whether fully Bayesian networks are needed to successfully capture model uncertainty. We vary the number and… ▽ More

    Submitted 29 November, 2018; originally announced November 2018.

    Journal ref: Third workshop on Bayesian Deep Learning (NeurIPS 2018)

  25. arXiv:1808.03425  [pdf, other

    quant-ph cs.LG stat.ML

    Learning and Inference on Generative Adversarial Quantum Circuits

    Authors: **feng Zeng, Yufeng Wu, **-Guo Liu, Lei Wang, Jiang** Hu

    Abstract: Quantum mechanics is inherently probabilistic in light of Born's rule. Using quantum circuits as probabilistic generative models for classical data exploits their superior expressibility and efficient direct sampling ability. However, training of quantum circuits can be more challenging compared to classical neural networks due to lack of efficient differentiable learning algorithm. We devise an a… ▽ More

    Submitted 10 August, 2018; originally announced August 2018.

    Comments: 7 pages, 6 figures

    Journal ref: Phys. Rev. A 99, 052306 (2019)

  26. arXiv:1803.09082  [pdf, other

    stat.ML cs.LG math.OC

    A Proximal Block Coordinate Descent Algorithm for Deep Neural Network Training

    Authors: Tim Tsz-Kit Lau, **shan Zeng, Baoyuan Wu, Yuan Yao

    Abstract: Training deep neural networks (DNNs) efficiently is a challenge due to the associated highly nonconvex optimization. The backpropagation (backprop) algorithm has long been the most widely used algorithm for gradient computation of parameters of DNNs and is used along with gradient descent-type algorithms for this optimization task. Recent work have shown the efficiency of block coordinate descent… ▽ More

    Submitted 24 March, 2018; originally announced March 2018.

    Comments: The 6th International Conference on Learning Representations (ICLR 2018), Workshop Track

  27. arXiv:1803.00225  [pdf, other

    math.OC cs.LG stat.ML

    Global Convergence of Block Coordinate Descent in Deep Learning

    Authors: **shan Zeng, Tim Tsz-Kit Lau, Shaobo Lin, Yuan Yao

    Abstract: Deep learning has aroused extensive attention due to its great empirical success. The efficiency of the block coordinate descent (BCD) methods has been recently demonstrated in deep neural network (DNN) training. However, theoretical studies on their convergence properties are limited due to the highly nonconvex nature of DNN training. In this paper, we aim at providing a general methodology for p… ▽ More

    Submitted 12 May, 2019; v1 submitted 1 March, 2018; originally announced March 2018.

    Comments: 27 pages, 2 figures

    Journal ref: Proceeding of the 36th International Conference on Machine Learning (ICML), 2019

  28. arXiv:1711.06446  [pdf, other

    stat.ML cs.IR cs.LG math.OC

    Stochastic Non-convex Ordinal Embedding with Stabilized Barzilai-Borwein Step Size

    Authors: Ke Ma, **shan Zeng, Jiechao Xiong, Qianqian Xu, Xiaochun Cao, Wei Liu, Yuan Yao

    Abstract: Learning representation from relative similarity comparisons, often called ordinal embedding, gains rising attention in recent years. Most of the existing methods are batch methods designed mainly based on the convex optimization, say, the projected gradient descent method. However, they are generally time-consuming due to that the singular value decomposition (SVD) is commonly adopted during the… ▽ More

    Submitted 30 January, 2018; v1 submitted 17 November, 2017; originally announced November 2017.

    Comments: 11 pages, 3 figures, 2 tables, accepted by AAAI2018

    MSC Class: aaai.org

  29. arXiv:1702.08701  [pdf, ps, other

    cs.LG math.OC stat.ML

    Learning rates for classification with Gaussian kernels

    Authors: Shao-Bo Lin, **shan Zeng, Xiangyu Chang

    Abstract: This paper aims at refined error analysis for binary classification using support vector machine (SVM) with Gaussian kernel and convex loss. Our first result shows that for some loss functions such as the truncated quadratic loss and quadratic loss, SVM with Gaussian kernel can reach the almost optimal learning rate, provided the regression function is smooth. Our second result shows that, for a l… ▽ More

    Submitted 5 October, 2017; v1 submitted 28 February, 2017; originally announced February 2017.

    Comments: This paper has been accepted by Neural Computation

  30. arXiv:1503.07810  [pdf, other

    stat.ML stat.AP

    Interpretable Classification Models for Recidivism Prediction

    Authors: Jiaming Zeng, Berk Ustun, Cynthia Rudin

    Abstract: We investigate a long-debated question, which is how to create predictive models of recidivism that are sufficiently accurate, transparent, and interpretable to use for decision-making. This question is complicated as these models are used to support different decisions, from sentencing, to determining release on probation, to allocating preventative social services. Each use case might have an ob… ▽ More

    Submitted 7 July, 2016; v1 submitted 26 March, 2015; originally announced March 2015.

    Comments: 45 pages, 17 figures

    Journal ref: Journal of Royal Statistics - Series A (2017)

  31. arXiv:1312.5465  [pdf, ps, other

    cs.LG stat.ML

    Learning rates of $l^q$ coefficient regularization learning with Gaussian kernel

    Authors: Shaobo Lin, **shan Zeng, Jian Fang, Zongben Xu

    Abstract: Regularization is a well recognized powerful strategy to improve the performance of a learning machine and $l^q$ regularization schemes with $0<q<\infty$ are central in use. It is known that different $q$ leads to different properties of the deduced estimators, say, $l^2$ regularization leads to smooth estimators while $l^1$ regularization leads to sparse estimators. Then, how does the generalizat… ▽ More

    Submitted 24 September, 2014; v1 submitted 19 December, 2013; originally announced December 2013.

    Comments: 26 pages, 3 figures

    MSC Class: 68T05 ACM Class: F.2.1

  32. arXiv:1311.4150  [pdf, ps, other

    cs.LG cs.DC cs.IR stat.ML

    Towards Big Topic Modeling

    Authors: Jian-Feng Yan, Jia Zeng, Zhi-Qiang Liu, Yang Gao

    Abstract: To solve the big topic modeling problem, we need to reduce both time and space complexities of batch latent Dirichlet allocation (LDA) algorithms. Although parallel LDA algorithms on the multi-processor architecture have low time and space complexities, their communication costs among processors often scale linearly with the vocabulary size and the number of topics, leading to a serious scalabilit… ▽ More

    Submitted 17 November, 2013; originally announced November 2013.

    Comments: 14 pages

  33. arXiv:1307.6616   

    cs.LG stat.ML

    Does generalization performance of $l^q$ regularization learning depend on $q$? A negative example

    Authors: Shaobo Lin, Chen Xu, **gshan Zeng, Jian Fang

    Abstract: $l^q$-regularization has been demonstrated to be an attractive technique in machine learning and statistical modeling. It attempts to improve the generalization (prediction) capability of a machine (model) through appropriately shrinking its coefficients. The shape of a $l^q$ estimator differs in varying choices of the regularization order $q$. In particular, $l^1… ▽ More

    Submitted 13 June, 2023; v1 submitted 24 July, 2013; originally announced July 2013.

    Comments: There is critical wrong in the proof