Showing 1–29 of 29 results for author: Zheng, T

  1. arXiv:2212.12978  [pdf, other]

    math.OC cs.LG stat.ML

    Universal Gradient Descent Ascent Method for Nonconvex-Nonconcave Minimax Optimization

    Authors: Taoli Zheng, Linglingzhi Zhu, Anthony Man-Cho So, Jose Blanchet, Jiajin Li

    Abstract: Nonconvex-nonconcave minimax optimization has received intense attention over the last decade due to its broad applications in machine learning. Most existing algorithms rely on one-sided information, such as the convexity (resp. concavity) of the primal (resp. dual) functions, or other specific structures, such as the Polyak-Łojasiewicz (PŁ) and Kurdyka-Łojasiewicz (KŁ) conditions. However, verif…

    Submitted 30 October, 2023; v1 submitted 25 December, 2022; originally announced December 2022.

  2. arXiv:2211.10314  [pdf, other]

    stat.ME

    Prediction scoring of data-driven discoveries for reproducible research

    Authors: Anna L. Smith, Tian Zheng, Andrew Gelman

    Abstract: Predictive modeling uncovers knowledge and insights regarding a hypothesized data generating mechanism (DGM). Results from different studies on a complex DGM, derived from different data sets, and using complicated models and algorithms, are hard to quantitatively compare due to random noise and statistical uncertainty in model results. This has been one of the main contributors to the replication…

    Submitted 18 November, 2022; originally announced November 2022.

  3. arXiv:2209.04991  [pdf, other]

    stat.ME stat.ML

    Wasserstein Distributional Learning

    Authors: Chengliang Tang, Nathan Lenssen, Ying Wei, Tian Zheng

    Abstract: Learning conditional densities and identifying factors that influence the entire distribution are vital tasks in data-driven applications. Conventional approaches work mostly with summary statistics, and are hence inadequate for a comprehensive investigation. Recently, there have been developments on functional regression methods to model density curves as functional outcomes. A major challenge fo…

    Submitted 11 September, 2022; originally announced September 2022.

  4. arXiv:2205.07384  [pdf, other]

    cs.LG cs.AI stat.ML

    Incorporating Prior Knowledge into Neural Networks through an Implicit Composite Kernel

    Authors: Ziyang Jiang, Tongshu Zheng, Yiling Liu, David Carlson

    Abstract: It is challenging to guide neural network (NN) learning with prior knowledge. In contrast, many known properties, such as spatial smoothness or seasonality, are straightforward to model by choosing an appropriate kernel in a Gaussian process (GP). Many deep learning applications could be enhanced by modeling such known properties. For example, convolutional neural networks (CNNs) are frequently us…

    Submitted 28 February, 2024; v1 submitted 15 May, 2022; originally announced May 2022.

    Comments: 27 pages, 13 figures, 5 tables, 3 algorithms, published in Transactions on Machine Learning Research (TMLR)

    ACM Class: I.5.1

  5. arXiv:2112.03270  [pdf, other]

    cs.LG stat.ME

    Toward a Taxonomy of Trust for Probabilistic Machine Learning

    Authors: Tamara Broderick, Andrew Gelman, Rachael Meager, Anna L. Smith, Tian Zheng

    Abstract: Probabilistic machine learning increasingly informs critical decisions in medicine, economics, politics, and beyond. We need evidence to support that the resulting decisions are well-founded. To aid development of trust in these decisions, we develop a taxonomy delineating where trust in an analysis can break down: (1) in the translation of real-world goals to goals on a particular set of availabl…

    Submitted 5 December, 2021; originally announced December 2021.

    Comments: 18 pages, 2 figures

  6. arXiv:2106.01485  [pdf, other]

    stat.ML cs.LG stat.AP

    Weakly Supervised Learning Creates a Fusion of Modeling Cultures

    Authors: Chengliang Tang, Gan Yuan, Tian Zheng

    Abstract: The past two decades have witnessed the great success of the algorithmic modeling framework advocated by Breiman et al. (2001). Nevertheless, the excellent prediction performance of these black-box models relies heavily on the availability of strong supervision, i.e. a large set of accurate and exact ground-truth labels. In practice, strong supervision can be unavailable or expensive, which calls fo…

    Submitted 2 June, 2021; originally announced June 2021.

  7. arXiv:2105.05532  [pdf, other]

    stat.ME econ.EM

    Generalized Autoregressive Moving Average Models with GARCH Errors

    Authors: Tingguo Zheng, Han Xiao, Rong Chen

    Abstract: One of the important and widely used classes of models for non-Gaussian time series is the generalized autoregressive moving average (GARMA) models, which specify an ARMA structure for the conditional mean process of the underlying time series. However, in many applications one often encounters conditional heteroskedasticity. In this paper we propose a new class of models, referred to as GARMA-GA…

    Submitted 12 May, 2021; originally announced May 2021.

  8. arXiv:2012.09598  [pdf, other]

    stat.AP stat.ME

    Network Hawkes Process Models for Exploring Latent Hierarchy in Social Animal Interactions

    Authors: Owen G. Ward, Jing Wu, Tian Zheng, Anna L. Smith, James P. Curley

    Abstract: Group-based social dominance hierarchies are of essential interest in animal behavior research. Studies often record aggressive interactions observed over time, and models that can capture such dynamic hierarchy are therefore crucial. Traditional ranking methods summarize interactions across time, using only aggregate counts. Instead, we take advantage of the interaction timestamps, proposing a se…

    Submitted 16 July, 2022; v1 submitted 17 December, 2020; originally announced December 2020.

    Comments: To appear in Journal of the Royal Statistical Society, Series C

  9. arXiv:2009.01742  [pdf, other]

    cs.SI cs.LG stat.ML

    Online Estimation and Community Detection of Network Point Processes for Event Streams

    Authors: Guanhua Fang, Owen G. Ward, Tian Zheng

    Abstract: A common goal in network modeling is to uncover the latent community structure present among nodes. For many real-world networks, the true connections consist of events arriving as streams, which are then aggregated to form edges, ignoring the dynamic temporal component. A natural way to take account of these temporal dynamics of interactions is to use point processes as the foundation of network…

    Submitted 26 October, 2023; v1 submitted 3 September, 2020; originally announced September 2020.

    Comments: 45 pages

  10. arXiv:2007.05385  [pdf, ps, other]

    stat.ML cs.LG stat.AP

    Next Waves in Veridical Network Embedding

    Authors: Owen G. Ward, Zhen Huang, Andrew Davison, Tian Zheng

    Abstract: Embedding nodes of a large network into a metric (e.g., Euclidean) space has become an area of active research in statistical machine learning, which has found applications in natural and social sciences. Generally, a representation of a network object is learned in a Euclidean geometry and is then used for subsequent tasks regarding the nodes and/or edges of the network, such as community detecti…

    Submitted 12 August, 2021; v1 submitted 10 July, 2020; originally announced July 2020.

  11. arXiv:2005.07347  [pdf, other]

    cs.LG cs.CR stat.ML

    Towards Assessment of Randomized Smoothing Mechanisms for Certifying Adversarial Robustness

    Authors: Tianhang Zheng, Di Wang, Baochun Li, Jinhui Xu

    Abstract: As a certified defensive technique, randomized smoothing has received considerable attention due to its scalability to large datasets and neural networks. However, several important questions remain unanswered, such as (i) whether the Gaussian mechanism is an appropriate option for certifying $\ell_2$-norm robustness, and (ii) whether there is an appropriate randomized (smoothing) mechanism to cer…

    Submitted 7 June, 2020; v1 submitted 14 May, 2020; originally announced May 2020.

    Comments: Correct some details of the theorems and proofs

  12. arXiv:2001.09359  [pdf, other]

    stat.AP

    Diagnostics and Visualization of Point Process Models for Event Times on a Social Network

    Authors: Jing Wu, Anna L. Smith, Tian Zheng

    Abstract: Point process models have been used to analyze interaction event times on a social network, in the hope of providing valuable insights for social science research. However, the diagnostics and visualization of the modeling results from such an analysis have received limited discussion in the literature. In this paper, we develop a systematic set of diagnostic tools and visualizations for point proc…

    Submitted 25 January, 2020; originally announced January 2020.

  13. arXiv:1904.12052  [pdf, ps, other]

    cs.LG cs.AI cs.CR stat.ML

    Data Poisoning Attack against Knowledge Graph Embedding

    Authors: Hengtong Zhang, Tianhang Zheng, Jing Gao, Chenglin Miao, Lu Su, Yaliang Li, Kui Ren

    Abstract: Knowledge graph embedding (KGE) is a technique for learning continuous embeddings for entities and relations in the knowledge graph. Due to its benefit to a variety of downstream tasks such as knowledge graph completion, question answering and recommendation, KGE has gained significant attention recently. Despite its effectiveness in a benign environment, KGE's robustness to adversarial attacks is n…

    Submitted 24 June, 2019; v1 submitted 26 April, 2019; originally announced April 2019.

    Comments: Fix typos and version conflicts

  14. arXiv:1903.03223  [pdf, other]

    stat.AP

    Markov-Modulated Hawkes Processes for Sporadic and Bursty Event Occurrences

    Authors: Jing Wu, Owen G. Ward, James Curley, Tian Zheng

    Abstract: Modeling event dynamics is central to many disciplines. Patterns in observed event arrival times are commonly modeled using point processes. Such event arrival data often exhibits self-exciting, heterogeneous and sporadic trends, which is challenging for conventional models. It is reasonable to assume that there exists a hidden state process that drives different event dynamics at different states…

    Submitted 12 August, 2021; v1 submitted 7 March, 2019; originally announced March 2019.

  15. arXiv:1810.05665   

    cs.LG stat.ML

    Is PGD-Adversarial Training Necessary? Alternative Training via a Soft-Quantization Network with Noisy-Natural Samples Only

    Authors: Tianhang Zheng, Changyou Chen, Kui Ren

    Abstract: Recent work on adversarial attack and defense suggests that PGD is a universal $l_\infty$ first-order attack, and PGD adversarial training can significantly improve network robustness against a wide range of first-order $l_\infty$-bounded attacks, represented as the state-of-the-art defense method. However, an obvious weakness of PGD adversarial training is its high computational cost in generat…

    Submitted 19 October, 2018; v1 submitted 9 October, 2018; originally announced October 2018.

    Comments: Further improvement

  16. arXiv:1808.05537  [pdf, other]

    cs.LG cs.CR stat.ML

    Distributionally Adversarial Attack

    Authors: Tianhang Zheng, Changyou Chen, Kui Ren

    Abstract: Recent work on adversarial attack has shown that Projected Gradient Descent (PGD) Adversary is a universal first-order adversary, and the classifier adversarially trained by PGD is robust against a wide range of first-order attacks. It is worth noting that the original objective of an attack/defense model relies on a data distribution $p(\mathbf{x})$, typically in the form of risk maximization/min…

    Submitted 5 December, 2018; v1 submitted 16 August, 2018; originally announced August 2018.

    Comments: accepted to AAAI-19

  17. arXiv:1801.04587  [pdf]

    stat.AP

    A Bayesian Evidence Synthesis Approach to Estimate Disease Prevalence in Hard-To-Reach Populations: Hepatitis C in New York City

    Authors: Sarah Tan, Susanna Makela, Daliah Heller, Kevin Konty, Sharon Balter, Tian Zheng, James H. Stark

    Abstract: Existing methods to estimate the prevalence of chronic hepatitis C (HCV) in New York City (NYC) are limited in scope and fail to assess hard-to-reach subpopulations with highest risk such as injecting drug users (IDUs). To address these limitations, we employ a Bayesian multi-parameter evidence synthesis model to systematically combine multiple sources of data, account for bias in certain data sou…

    Submitted 14 January, 2018; originally announced January 2018.

  18. arXiv:1709.02899  [pdf, other]

    stat.ME

    Estimating the theoretical error rate for prediction

    Authors: Herman Chernoff, Shaw-Hwa Lo, Tian Zheng, Adeline Lo

    Abstract: Prediction for very large data sets is typically carried out in two stages, variable selection and pattern recognition. Ordinarily variable selection involves seeing how well individual explanatory variables are correlated with the dependent variable. This practice neglects the possible interactions among the variables. Simulations have shown that a statistic I, that we used for variable selection…

    Submitted 8 September, 2017; originally announced September 2017.

  19. arXiv:1604.06498  [pdf, other]

    stat.ML cs.LG

    Stabilized Sparse Online Learning for Sparse Data

    Authors: Yuting Ma, Tian Zheng

    Abstract: Stochastic gradient descent (SGD) is commonly used for optimization in large-scale machine learning problems. Langford et al. (2009) introduce a sparse online learning method to induce sparsity via truncated gradient. With high-dimensional sparse data, however, the method suffers from slow convergence and high variance due to the heterogeneity in feature sparsity. To mitigate this issue, we introd…

    Submitted 8 May, 2017; v1 submitted 21 April, 2016; originally announced April 2016.

    Comments: 45 pages, 4 figures

  20. arXiv:1604.04899  [pdf, other]

    stat.ME

    Phase-Aligned Spectral Filtering for Decomposing Spatiotemporal Dynamics

    Authors: Lu Meng, Tian Zheng

    Abstract: Spatiotemporal dynamics is central to a wide range of applications from climatology, computer vision to neural sciences. From temporal observations taken on a high-dimensional vector of spatial locations, we seek to derive knowledge about such dynamics via data assimilation and modeling. It is assumed that the observed spatiotemporal data represent superimposed lower-rank smooth oscillations and m…

    Submitted 17 April, 2016; originally announced April 2016.

    Comments: 29 pages, 10 figures

    MSC Class: 37M10

    ACM Class: G.3; I.5.4

  21. arXiv:1512.03396  [pdf, other]

    stat.ML cs.LG

    Boosted Sparse Non-linear Distance Metric Learning

    Authors: Yuting Ma, Tian Zheng

    Abstract: This paper proposes a boosting-based solution addressing metric learning problems for high-dimensional data. Distance measures have been used as natural measures of (dis)similarity and served as the foundation of various learning methods. The efficiency of distance-based learning methods heavily depends on the chosen distance metric. With increasing dimensionality and complexity of data, however,…

    Submitted 10 December, 2015; originally announced December 2015.

  22. arXiv:1502.07190  [pdf, other]

    stat.ML cs.LG

    Topic-adjusted visibility metric for scientific articles

    Authors: Linda S. L. Tan, Aik Hui Chan, Tian Zheng

    Abstract: Measuring the impact of scientific articles is important for evaluating the research output of individual scientists, academic institutions and journals. While citations are raw data for constructing impact measures, there exist biases and potential issues if factors affecting citation patterns are not properly accounted for. In this work, we address the problem of field variation and introduce an…

    Submitted 16 October, 2015; v1 submitted 25 February, 2015; originally announced February 2015.

    Journal ref: Annals of Applied Statistics, Volume 10, Number 1 (2016), 1-31

  23. arXiv:1412.2183  [pdf, other]

    stat.AP stat.CO stat.ME

    Reduced-Rank Covariance Estimation in Vector Autoregressive Modeling

    Authors: Richard A. Davis, Pengfei Zang, Tian Zheng

    Abstract: We consider reduced-rank modeling of the white noise covariance matrix in a large dimensional vector autoregressive (VAR) model. We first propose the reduced-rank covariance estimator under the setting where independent observations are available. We derive the reduced-rank estimator based on a latent variable model for the vector observation and give the analytical form of its maximum likelihood…

    Submitted 5 December, 2014; originally announced December 2014.

    Comments: 36 pages, 5 figures

  24. arXiv:1304.4851  [pdf, ps, other]

    stat.ME

    Integrative Analysis of Prognosis Data on Multiple Cancer Subtypes using Penalization

    Authors: Jin Liu, Jian Huang, Yawei Zhang, Qing Lan, Nathaniel Rothman, Tongzhang Zheng, Shuangge Ma

    Abstract: In cancer research, profiling studies have been extensively conducted, searching for genes/SNPs associated with prognosis. Cancer is a heterogeneous disease. Examining similarity and difference in the genetic basis of multiple subtypes of the same cancer can lead to better understanding of their connections and distinctions. Classic meta-analysis approaches analyze each subtype separately and then…

    Submitted 17 April, 2013; originally announced April 2013.

    Comments: 23 pages (main text) 17 pages (appendix), 12 figures

  25. arXiv:1302.2142  [pdf, other]

    stat.CO

    Simulation-efficient shortest probability intervals

    Authors: Ying Liu, Andrew Gelman, Tian Zheng

    Abstract: Bayesian highest posterior density (HPD) intervals can be estimated directly from simulations via empirical shortest intervals. Unfortunately, these can be noisy (that is, have a high Monte Carlo error). We derive an optimal weighting strategy using bootstrap and quadratic programming to obtain a more computationally stable HPD, or in general, shortest probability interval (Spin). We prove the c…

    Submitted 8 February, 2013; originally announced February 2013.

    Comments: 22 pages, 13 figures

  26. Latent demographic profile estimation in hard-to-reach groups

    Authors: Tyler H. McCormick, Tian Zheng

    Abstract: The sampling frame in most social science surveys excludes members of certain groups, known as hard-to-reach groups. These groups, or subpopulations, may be difficult to access (the homeless, e.g.), camouflaged by stigma (individuals with HIV/AIDS), or both (commercial sex workers). Even basic demographic information about these groups is typically unknown, especially in many developing nations. W…

    Submitted 11 January, 2013; originally announced January 2013.

    Comments: Published in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org) at http://dx.doi.org/10.1214/12-AOAS569

    Report number: IMS-AOAS-AOAS569

    Journal ref: Annals of Applied Statistics 2012, Vol. 6, No. 4, 1795-1813

  27. arXiv:1207.0520  [pdf, other]

    stat.AP stat.CO

    Sparse Vector Autoregressive Modeling

    Authors: Richard A. Davis, Pengfei Zang, Tian Zheng

    Abstract: The vector autoregressive (VAR) model has been widely used for modeling temporal dependence in a multivariate time series. For large (and even moderate) dimensions, the number of AR coefficients can be prohibitively large, resulting in noisy estimates, unstable predictions and difficult-to-interpret temporal dependence. To overcome such drawbacks, we propose a 2-stage approach for fitting sparse V…

    Submitted 2 July, 2012; originally announced July 2012.

    Comments: 39 pages, 7 figures

  28. Comment: Quantifying the Fraction of Missing Information for Hypothesis Testing in Statistical and Genetic Studies

    Authors: Tian Zheng, Shaw-Hwa Lo

    Abstract: Comment on "Quantifying the Fraction of Missing Information for Hypothesis Testing in Statistical and Genetic Studies" [arXiv:1102.2774]

    Submitted 15 February, 2011; originally announced February 2011.

    Comments: Published in Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org) at http://dx.doi.org/10.1214/08-STS244A

    Report number: IMS-STS-STS244A

    Journal ref: Statistical Science 2008, Vol. 23, No. 3, 321-324

  29. Discovering influential variables: A method of partitions

    Authors: Herman Chernoff, Shaw-Hwa Lo, Tian Zheng

    Abstract: A trend in all scientific disciplines, based on advances in technology, is the increasing availability of high dimensional data in which important information is buried. A current urgent challenge to statisticians is to develop effective methods of finding the useful information from the vast amounts of messy and noisy data available, most of which are noninformative. This paper presents a genera…

    Submitted 28 September, 2010; originally announced September 2010.

    Comments: Published in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org) at http://dx.doi.org/10.1214/09-AOAS265

    Report number: IMS-AOAS-AOAS265

    Journal ref: Annals of Applied Statistics 2009, Vol. 3, No. 4, 1335-1369