Skip to main content

Showing 1–41 of 41 results for author: Zhu, B

Searching in archive stat. Search in all archives.
.
  1. arXiv:2404.09194  [pdf, other

    stat.ME

    Bayesian modeling of co-occurrence microbial interaction networks

    Authors: Tejasv Bedi, Bencong Zhu, Michael L. Neugent, Kevin C. Lutz, Nicole J. De Nisco, Qiwei Li

    Abstract: The human body consists of microbiomes associated with the development and prevention of several diseases. These microbial organisms form several complex interactions that are informative to the scientific community for explaining disease progression and prevention. Contrary to the traditional view of the microbiome as a singular, assortative network, we introduce a novel statistical approach usin… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: 25 pages

  2. arXiv:2403.05803  [pdf, other

    econ.EM stat.ME

    Semiparametric Inference for Regression-Discontinuity Designs

    Authors: Rong J. B. Zhu, Weiwei Jiang

    Abstract: Treatment effects in regression discontinuity designs (RDDs) are often estimated using local regression methods. However, global approximation methods are generally deemed inefficient. In this paper, we propose a semiparametric framework tailored for estimating treatment effects in RDDs. Our global approach conceptualizes the identification of treatment effects within RDDs as a partially linear mo… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

  3. arXiv:2401.16335  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Iterative Data Smoothing: Mitigating Reward Overfitting and Overoptimization in RLHF

    Authors: Banghua Zhu, Michael I. Jordan, Jiantao Jiao

    Abstract: Reinforcement Learning from Human Feedback (RLHF) is a pivotal technique that aligns language models closely with human-centric values. The initial phase of RLHF involves learning human values using a reward model from ranking data. It is observed that the performance of the reward model degrades after one epoch of training, and optimizing too much against the learned reward model eventually hinde… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  4. arXiv:2312.08369  [pdf, other

    stat.ML cs.AI cs.LG

    The Effective Horizon Explains Deep RL Performance in Stochastic Environments

    Authors: Cassidy Laidlaw, Banghua Zhu, Stuart Russell, Anca Dragan

    Abstract: Reinforcement learning (RL) theory has largely focused on proving minimax sample complexity bounds. These require strategic exploration algorithms that use relatively limited function classes for representing the policy or value function. Our goal is to explain why deep RL algorithms often perform well in practice, despite using random exploration and much more expressive function classes like neu… ▽ More

    Submitted 12 April, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Journal ref: ICLR 2024 (Spotlight)

  5. arXiv:2312.08324  [pdf, other

    stat.AP

    Bayesian Nonparametric Clustering with Feature Selection for Spatially Resolved Transcriptomics Data

    Authors: Bencong Zhu, Guanyu Hu, Yang Xie, Lin Xu, Xiaodan Fan, Qiwei Li

    Abstract: The advent of next-generation sequencing-based spatially resolved transcriptomics (SRT) techniques has reshaped genomic studies by enabling high-throughput gene expression profiling while preserving spatial and morphological context. Nevertheless, there are inherent challenges associated with these new high-dimensional spatial data, such as zero-inflation, over-dispersion, and heterogeneity. These… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  6. arXiv:2312.07930  [pdf, other

    cs.LG cs.CL cs.CR cs.IT stat.ML

    Towards Optimal Statistical Watermarking

    Authors: Baihe Huang, Hanlin Zhu, Banghua Zhu, Kannan Ramchandran, Michael I. Jordan, Jason D. Lee, Jiantao Jiao

    Abstract: We study statistical watermarking by formulating it as a hypothesis testing problem, a general framework which subsumes all previous statistical watermarking methods. Key to our formulation is a coupling of the output tokens and the rejection region, realized by pseudo-random generators in practice, that allows non-trivial trade-offs between the Type I error and Type II error. We characterize the… ▽ More

    Submitted 6 February, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

  7. arXiv:2310.07838  [pdf, other

    cs.LG cs.AI cs.IT math.ST stat.ML

    Towards the Fundamental Limits of Knowledge Transfer over Finite Domains

    Authors: Qingyue Zhao, Banghua Zhu

    Abstract: We characterize the statistical efficiency of knowledge transfer through $n$ samples from a teacher to a probabilistic student classifier with input space $\mathcal S$ over labels $\mathcal A$. We show that privileged information at three progressive levels accelerates the transfer. At the first level, only samples with hard labels are known, via which the maximum likelihood estimator attains the… ▽ More

    Submitted 14 November, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: 41 pages, 2 figures; Appendix polished

  8. arXiv:2308.12016  [pdf, ps, other

    stat.ML cs.LG

    MKL-$L_{0/1}$-SVM

    Authors: Bin Zhu, Yijie Shi

    Abstract: This paper presents a Multiple Kernel Learning (abbreviated as MKL) framework for the Support Vector Machine (SVM) with the $(0, 1)$ loss function. Some KKT-like first-order optimality conditions are provided and then exploited to develop a fast ADMM algorithm to solve the nonsmooth nonconvex optimization problem. Numerical experiments on real data sets show that the performance of our MKL-… ▽ More

    Submitted 3 September, 2023; v1 submitted 23 August, 2023; originally announced August 2023.

    Comments: 26 pages in the JMLR template, 3 figures, and 2 tables, submitted to the Journal of Machine Learning Research, with minor text overlap with arXiv: 2303.04445 (conference version). arXiv admin note: text overlap with arXiv:2303.04445

  9. arXiv:2306.02584  [pdf, other

    econ.EM stat.ME

    Synthetic Regressing Control Method

    Authors: Rong J. B. Zhu

    Abstract: Estimating weights in the synthetic control method, typically resulting in sparse weights where only a few control units have non-zero weights, involves an optimization procedure that simultaneously selects and aligns control units to closely match the treated unit. However, this simultaneous selection and alignment of control units may lead to a loss of efficiency. Another concern arising from th… ▽ More

    Submitted 23 October, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

  10. arXiv:2306.02003  [pdf, other

    cs.LG cs.AI cs.PF eess.SY stat.ML

    On Optimal Caching and Model Multiplexing for Large Model Inference

    Authors: Banghua Zhu, Ying Sheng, Lianmin Zheng, Clark Barrett, Michael I. Jordan, Jiantao Jiao

    Abstract: Large Language Models (LLMs) and other large foundation models have achieved noteworthy success, but their size exacerbates existing resource consumption and latency challenges. In particular, the large-scale deployment of these models is hindered by the significant resource requirements during inference. In this paper, we study two approaches for mitigating these challenges: employing a cache to… ▽ More

    Submitted 28 August, 2023; v1 submitted 3 June, 2023; originally announced June 2023.

  11. arXiv:2306.00265  [pdf, other

    cs.LG cs.AI cs.CV eess.IV stat.ML

    Doubly Robust Self-Training

    Authors: Banghua Zhu, Mingyu Ding, Philip Jacobson, Ming Wu, Wei Zhan, Michael Jordan, Jiantao Jiao

    Abstract: Self-training is an important technique for solving semi-supervised learning problems. It leverages unlabeled data by generating pseudo-labels and combining them with a limited labeled dataset for training. The effectiveness of self-training heavily relies on the accuracy of these pseudo-labels. In this paper, we introduce doubly robust self-training, a novel semi-supervised algorithm that provabl… ▽ More

    Submitted 2 November, 2023; v1 submitted 31 May, 2023; originally announced June 2023.

  12. arXiv:2303.04445  [pdf, ps, other

    stat.ML cs.LG

    An ADMM Solver for the MKL-$L_{0/1}$-SVM

    Authors: Yijie Shi, Bin Zhu

    Abstract: We formulate the Multiple Kernel Learning (abbreviated as MKL) problem for the support vector machine with the infamous $(0,1)$-loss function. Some first-order optimality conditions are given and then exploited to develop a fast ADMM solver for the nonconvex and nonsmooth optimization problem. A simple numerical experiment on synthetic planar data shows that our MKL-$L_{0/1}$-SVM framework could b… ▽ More

    Submitted 30 March, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

    Comments: 8 pages, 3 figures, 2 tables. Submitted to the 62nd IEEE Conference on Decision and Control as a Regular paper, with a shortened version (arXiv version 1) submitted to the 3rd Chinese Conference on Predictive Control and Intelligent Decision (CPCID) as an Extended Abstract

  13. arXiv:2301.11270  [pdf, other

    cs.LG cs.AI cs.HC math.ST stat.ML

    Principled Reinforcement Learning with Human Feedback from Pairwise or $K$-wise Comparisons

    Authors: Banghua Zhu, Jiantao Jiao, Michael I. Jordan

    Abstract: We provide a theoretical framework for Reinforcement Learning with Human Feedback (RLHF). Our analysis shows that when the true reward function is linear, the widely used maximum likelihood estimator (MLE) converges under both the Bradley-Terry-Luce (BTL) model and the Plackett-Luce (PL) model. However, we show that when training a policy based on the learned reward model, MLE fails while a pessim… ▽ More

    Submitted 7 February, 2024; v1 submitted 26 January, 2023; originally announced January 2023.

  14. arXiv:2211.03710  [pdf, other

    cs.LG stat.ML

    Graph Contrastive Learning with Implicit Augmentations

    Authors: Huidong Liang, Xingjian Du, Bilei Zhu, Zejun Ma, Ke Chen, Junbin Gao

    Abstract: Existing graph contrastive learning methods rely on augmentation techniques based on random perturbations (e.g., randomly adding or drop** edges and nodes). Nevertheless, altering certain edges or nodes can unexpectedly change the graph characteristics, and choosing the optimal perturbing ratio for each dataset requires onerous manual tuning. In this paper, we introduce Implicit Graph Contrastiv… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

  15. arXiv:2210.15801  [pdf, ps, other

    stat.ME

    Clustering High-dimensional Data via Feature Selection

    Authors: Tianqi Liu, Yu Lu, Biqing Zhu, Hongyu Zhao

    Abstract: High-dimensional clustering analysis is a challenging problem in statistics and machine learning, with broad applications such as the analysis of microarray data and RNA-seq data. In this paper, we propose a new clustering procedure called Spectral Clustering with Feature Selection (SC-FS), where we first obtain an initial estimate of labels via spectral clustering, then select a small fraction of… ▽ More

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: Accepted at Biometrics Journal (https://onlinelibrary.wiley.com/doi/epdf/10.1111/biom.13665)

  16. arXiv:2208.10059  [pdf, ps, other

    stat.ME eess.SY

    Sampling Gaussian Stationary Random Fields: A Stochastic Realization Approach

    Authors: Bin Zhu, Jiahao Liu, Zhengshou Lai, Tao Qian

    Abstract: Generating large-scale samples of stationary random fields is of great importance in the fields such as geomaterial modeling and uncertainty quantification. Traditional methodologies based on covariance matrix decomposition have the diffculty of being computationally expensive, which is even more serious when the dimension of the random field is large. This paper proposes an effcient stochastic re… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: 17 pages, 9 figures

  17. arXiv:2205.11765  [pdf, ps, other

    cs.LG cs.AI cs.CR cs.DC stat.ML

    Byzantine-Robust Federated Learning with Optimal Statistical Rates and Privacy Guarantees

    Authors: Banghua Zhu, Lun Wang, Qi Pang, Shuai Wang, Jiantao Jiao, Dawn Song, Michael I. Jordan

    Abstract: We propose Byzantine-robust federated learning protocols with nearly optimal statistical rates. In contrast to prior work, our proposed protocols improve the dimension dependence and achieve a tight statistical rate in terms of all the parameters for strongly convex losses. We benchmark against competing protocols and show the empirical superiority of the proposed protocols. Finally, we remark tha… ▽ More

    Submitted 18 March, 2023; v1 submitted 24 May, 2022; originally announced May 2022.

  18. arXiv:2202.01269  [pdf, ps, other

    cs.LG eess.SP math.ST stat.CO stat.ML

    Robust Estimation for Nonparametric Families via Generative Adversarial Networks

    Authors: Banghua Zhu, Jiantao Jiao, Michael I. Jordan

    Abstract: We provide a general framework for designing Generative Adversarial Networks (GANs) to solve high dimensional robust statistics problems, which aim at estimating unknown parameter of the true distribution given adversarially corrupted samples. Prior work focus on the problem of robust mean and covariance estimation when the true distribution lies in the family of Gaussian distributions or elliptic… ▽ More

    Submitted 2 February, 2022; originally announced February 2022.

  19. arXiv:2103.12021  [pdf, other

    cs.LG cs.AI math.OC math.ST stat.ML

    Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism

    Authors: Paria Rashidinejad, Banghua Zhu, Cong Ma, Jiantao Jiao, Stuart Russell

    Abstract: Offline (or batch) reinforcement learning (RL) algorithms seek to learn an optimal policy from a fixed dataset without active data collection. Based on the composition of the offline dataset, two main categories of methods are used: imitation learning which is suitable for expert datasets and vanilla offline RL which often requires uniform coverage datasets. From a practical standpoint, datasets o… ▽ More

    Submitted 3 July, 2023; v1 submitted 22 March, 2021; originally announced March 2021.

    Journal ref: Published at NeurIPS 2021 and IEEE Transactions on Information Theory

  20. arXiv:2102.03240  [pdf

    physics.ao-ph econ.GN stat.OT

    De-carbonization of global energy use during the COVID-19 pandemic

    Authors: Zhu Liu, Biqing Zhu, Philippe Ciais, Steven J. Davis, Chenxi Lu, Haiwang Zhong, Piyu Ke, Yanan Cui, Zhu Deng, Duo Cui, Taochun Sun, Xinyu Dou, Jianguang Tan, Rui Guo, Bo Zheng, Katsumasa Tanaka, Wenli Zhao, Pierre Gentine

    Abstract: The COVID-19 pandemic has disrupted human activities, leading to unprecedented decreases in both global energy demand and GHG emissions. Yet a little known that there is also a low carbon shift of the global energy system in 2020. Here, using the near-real-time data on energy-related GHG emissions from 30 countries (about 70% of global power generation), we show that the pandemic caused an unprece… ▽ More

    Submitted 5 February, 2021; originally announced February 2021.

  21. arXiv:2101.07781  [pdf, other

    stat.ML cs.LG math.ST

    Minimax Off-Policy Evaluation for Multi-Armed Bandits

    Authors: Cong Ma, Banghua Zhu, Jiantao Jiao, Martin J. Wainwright

    Abstract: We study the problem of off-policy evaluation in the multi-armed bandit model with bounded rewards, and develop minimax rate-optimal procedures under three settings. First, when the behavior policy is known, we show that the Switch estimator, a method that alternates between the plug-in and importance sampling estimators, is minimax rate-optimal for all sample sizes. Second, when the behavior poli… ▽ More

    Submitted 19 January, 2021; originally announced January 2021.

  22. arXiv:2101.04750  [pdf, other

    cs.LG cs.AI stat.ML

    Linear Representation Meta-Reinforcement Learning for Instant Adaptation

    Authors: Matt Peng, Banghua Zhu, Jiantao Jiao

    Abstract: This paper introduces Fast Linearized Adaptive Policy (FLAP), a new meta-reinforcement learning (meta-RL) method that is able to extrapolate well to out-of-distribution tasks without the need to reuse data from training, and adapt almost instantaneously with the need of only a few samples during testing. FLAP builds upon the idea of learning a shared linear representation of the policy so that whe… ▽ More

    Submitted 12 January, 2021; originally announced January 2021.

  23. arXiv:2010.12636  [pdf, ps, other

    cs.LG cs.CE stat.ML

    Nonseparable Symplectic Neural Networks

    Authors: Shiying Xiong, Yun** Tong, Xingzhe He, Shuqi Yang, Cheng Yang, Bo Zhu

    Abstract: Predicting the behaviors of Hamiltonian systems has been drawing increasing attention in scientific machine learning. However, the vast majority of the literature was focused on predicting separable Hamiltonian systems with their kinematic and potential energy terms being explicitly decoupled while building data-driven paradigms to predict nonseparable Hamiltonian systems that are ubiquitous in fl… ▽ More

    Submitted 19 February, 2022; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: ICLR2021

  24. arXiv:2007.08165  [pdf, other

    eess.AS cs.LG cs.SD stat.ML

    Audio Tagging by Cross Filtering Noisy Labels

    Authors: Boqing Zhu, Kele Xu, Qiuqiang Kong, Huaimin Wang, Yuxing Peng

    Abstract: High quality labeled datasets have allowed deep learning to achieve impressive results on many sound analysis tasks. Yet, it is labor-intensive to accurately annotate large amount of audio data, and the dataset may contain noisy labels in the practical settings. Meanwhile, the deep neural networks are susceptive to those incorrect labeled data because of their outstanding memorization ability. In… ▽ More

    Submitted 16 July, 2020; originally announced July 2020.

    Comments: Accepted by IEEE/ACM Transactions on Audio, Speech and Language Processing

  25. arXiv:2006.12972  [pdf, ps, other

    cs.LG physics.comp-ph stat.ML

    Sparse Symplectically Integrated Neural Networks

    Authors: Daniel M. DiPietro, Shiying Xiong, Bo Zhu

    Abstract: We introduce Sparse Symplectically Integrated Neural Networks (SSINNs), a novel model for learning Hamiltonian dynamical systems from data. SSINNs combine fourth-order symplectic integration with a learned parameterization of the Hamiltonian obtained using sparse regression through a mathematically elegant function space. This allows for interpretable models that incorporate symplectic inductive b… ▽ More

    Submitted 28 October, 2020; v1 submitted 9 June, 2020; originally announced June 2020.

    Comments: Accepted as a conference paper to NeurIPS 2020. Main paper has 9 pages and 4 figures

  26. arXiv:2006.07900  [pdf, other

    cs.LG eess.SP stat.ML

    ResOT: Resource-Efficient Oblique Trees for Neural Signal Classification

    Authors: Bingzhao Zhu, Masoud Farivar, Mahsa Shoaran

    Abstract: Classifiers that can be implemented on chip with minimal computational and memory resources are essential for edge computing in emerging applications such as medical and IoT devices. This paper introduces a machine learning model based on oblique decision trees to enable resource-efficient classification on a neural implant. By integrating model compression with probabilistic routing and implement… ▽ More

    Submitted 14 June, 2020; originally announced June 2020.

  27. arXiv:2006.05044  [pdf, other

    cs.LG cs.AI stat.ML

    Neural Physicist: Learning Physical Dynamics from Image Sequences

    Authors: Baocheng Zhu, Shijun Wang, James Zhang

    Abstract: We present a novel architecture named Neural Physicist (NeurPhy) to learn physical dynamics directly from image sequences using deep neural networks. For any physical system, given the global system parameters, the time evolution of states is governed by the underlying physical laws. How to learn meaningful system representations in an end-to-end way and estimate accurate state transition dynamics… ▽ More

    Submitted 9 June, 2020; originally announced June 2020.

    Comments: 19 pages, 20 figures

  28. arXiv:2005.14073  [pdf, other

    stat.ML cs.LG eess.SP math.ST stat.CO

    Robust estimation via generalized quasi-gradients

    Authors: Banghua Zhu, Jiantao Jiao, Jacob Steinhardt

    Abstract: We explore why many recently proposed robust estimation problems are efficiently solvable, even though the underlying optimization problems are non-convex. We study the loss landscape of these robust estimation problems, and identify the existence of "generalized quasi-gradients". Whenever these quasi-gradients exist, a large family of low-regret algorithms are guaranteed to approximate the global… ▽ More

    Submitted 28 May, 2020; originally announced May 2020.

  29. arXiv:2005.09195  [pdf, other

    cs.LG stat.ML

    Riemannian Proximal Policy Optimization

    Authors: Shijun Wang, Baocheng Zhu, Chen Li, Mingzhe Wu, James Zhang, Wei Chu, Yuan Qi

    Abstract: In this paper, We propose a general Riemannian proximal optimization algorithm with guaranteed convergence to solve Markov decision process (MDP) problems. To model policy functions in MDP, we employ Gaussian mixture model (GMM) and formulate it as a nonconvex optimization problem in the Riemannian space of positive semidefinite matrices. For two given policy functions, we also provide its lower b… ▽ More

    Submitted 18 May, 2020; originally announced May 2020.

    Comments: 12 pages, 1 figures

  30. A Riemannian Primal-dual Algorithm Based on Proximal Operator and its Application in Metric Learning

    Authors: Shijun Wang, Baocheng Zhu, Lintao Ma, Yuan Qi

    Abstract: In this paper, we consider optimizing a smooth, convex, lower semicontinuous function in Riemannian space with constraints. To solve the problem, we first convert it to a dual problem and then propose a general primal-dual algorithm to optimize the primal and dual variables iteratively. In each optimization iteration, we employ a proximal operator to search optimal solution in the primal space. We… ▽ More

    Submitted 18 May, 2020; originally announced May 2020.

    Comments: 8 pages, 2 figures, published as a conference paper in 2019 International Joint Conference on Neural Networks (IJCNN)

  31. arXiv:2005.06546  [pdf

    cs.LG stat.ML

    Triaging moderate COVID-19 and other viral pneumonias from routine blood tests

    Authors: Forrest Sheng Bao, Youbiao He, Jie Liu, Yuanfang Chen, Qian Li, Christina R. Zhang, Lei Han, Baoli Zhu, Yaorong Ge, Shi Chen, Ming Xu, Liu Ouyang

    Abstract: The COVID-19 is swee** the world with deadly consequences. Its contagious nature and clinical similarity to other pneumonias make separating subjects contracted with COVID-19 and non-COVID-19 viral pneumonia a priority and a challenge. However, COVID-19 testing has been greatly limited by the availability and cost of existing methods, even in developed countries like the US. Intrigued by the wid… ▽ More

    Submitted 13 May, 2020; originally announced May 2020.

    ACM Class: I.5.4

  32. arXiv:2001.07805  [pdf, other

    math.ST cs.LG eess.SP stat.ML

    When does the Tukey median work?

    Authors: Banghua Zhu, Jiantao Jiao, Jacob Steinhardt

    Abstract: We analyze the performance of the Tukey median estimator under total variation (TV) distance corruptions. Previous results show that under Huber's additive corruption model, the breakdown point is 1/3 for high-dimensional halfspace-symmetric distributions. We show that under TV corruptions, the breakdown point reduces to 1/4 for the same set of distributions. We also show that a certain projection… ▽ More

    Submitted 31 March, 2020; v1 submitted 21 January, 2020; originally announced January 2020.

  33. arXiv:1909.08755  [pdf, ps, other

    math.ST cs.LG stat.ML

    Generalized Resilience and Robust Statistics

    Authors: Banghua Zhu, Jiantao Jiao, Jacob Steinhardt

    Abstract: Robust statistics traditionally focuses on outliers, or perturbations in total variation distance. However, a dataset could be corrupted in many other ways, such as systematic measurement errors and missing covariates. We generalize the robust statistics approach to consider perturbations under any Wasserstein distance, and show that robust estimation is possible whenever a distribution's populati… ▽ More

    Submitted 13 December, 2020; v1 submitted 18 September, 2019; originally announced September 2019.

  34. arXiv:1903.00906  [pdf, other

    cs.LG stat.ML

    Understanding Feature Selection and Feature Memorization in Recurrent Neural Networks

    Authors: Bokang Zhu, Richong Zhang, Dingkun Long, Yongyi Mao

    Abstract: In this paper, we propose a test, called Flagged-1-Bit (F1B) test, to study the intrinsic capability of recurrent neural networks in sequence learning. Four different recurrent network models are studied both analytically and experimentally using this test. Our results suggest that in general there exists a conflict between feature selection and feature memorization in sequence learning. Such a co… ▽ More

    Submitted 3 March, 2019; originally announced March 2019.

  35. arXiv:1901.09465  [pdf, other

    cs.LG cs.AI stat.ML

    Deconstructing Generative Adversarial Networks

    Authors: Banghua Zhu, Jiantao Jiao, David Tse

    Abstract: We deconstruct the performance of GANs into three components: 1. Formulation: we propose a perturbation view of the population target of GANs. Building on this interpretation, we show that GANs can be viewed as a generalization of the robust statistics framework, and propose a novel GAN architecture, termed as Cascade GANs, to provably recover meaningful low-dimensional generator approximations… ▽ More

    Submitted 19 May, 2019; v1 submitted 27 January, 2019; originally announced January 2019.

  36. arXiv:1609.09272  [pdf, ps, other

    stat.ME math.ST

    A New Algorithm for Circulant Rational Covariance Extension and Applications to Finite-interval Smoothing

    Authors: Giorgio Picci, Bin Zhu

    Abstract: The partial stochastic realization of periodic processes from finite covariance data has recently been solved by Lindquist and Picci based on convex optimization of a generalized entropy functional. The meaning and the role of this criterion have an unclear origin. In this paper we propose a solution based on a nonlinear generalization of the classical Yule-Walker type equations and on a new itera… ▽ More

    Submitted 29 September, 2016; originally announced September 2016.

    Comments: Submitted

  37. arXiv:1212.0181  [pdf, ps, other

    stat.AP

    Stochastic Volatility Regression for Functional Data Dynamics

    Authors: Bin Zhu, David B. Dunson

    Abstract: Although there are many methods for functional data analysis (FDA), little emphasis is put on characterizing variability among volatilities of individual functions. In particular, certain individuals exhibit erratic swings in their trajectory while other individuals have more stable trajectories. There is evidence of such volatility heterogeneity in blood pressure trajectories during pregnancy, fo… ▽ More

    Submitted 1 December, 2012; originally announced December 2012.

  38. arXiv:1201.5169  [pdf, ps, other

    stat.AP

    Signal extraction and breakpoint identification for array CGH data using robust state space model

    Authors: Bin Zhu, Jeremy M. G. Taylor, Peter X. -K. Song

    Abstract: Array comparative genomic hybridization(CGH) is a high resolution technique to assess DNA copy number variation. Identifying breakpoints where copy number changes will enhance the understanding of the pathogenesis of human diseases, such as cancers. However, the biological variation and experimental errors contained in array CGH data may lead to false positive identification of breakpoints. We pro… ▽ More

    Submitted 24 January, 2012; originally announced January 2012.

  39. arXiv:1201.4403  [pdf, ps, other

    stat.ME

    Locally Adaptive Bayes Nonparametric Regression via Nested Gaussian Processes

    Authors: Bin Zhu, David B. Dunson

    Abstract: We propose a nested Gaussian process (nGP) as a locally adaptive prior for Bayesian nonparametric regression. Specified through a set of stochastic differential equations (SDEs), the nGP imposes a Gaussian process prior for the function's $m$th-order derivative. The nesting comes in through including a local instantaneous mean function, which is drawn from another Gaussian process inducing adaptiv… ▽ More

    Submitted 20 January, 2012; originally announced January 2012.

  40. arXiv:1111.5563  [pdf, ps, other

    stat.AP

    Adverse Subpopulation Regression for Multivariate Outcomes with High-Dimensional Predictors

    Authors: Bin Zhu, David B. Dunson, Allison E. Ashley-Koch

    Abstract: Biomedical studies have a common interest in assessing relationships between multiple related health outcomes and high-dimensional predictors. For example, in reproductive epidemiology, one may collect pregnancy outcomes such as length of gestation and birth weight and predictors such as single nucleotide polymorphisms in multiple candidate genes and environmental exposures. In such settings, ther… ▽ More

    Submitted 23 November, 2011; originally announced November 2011.

  41. arXiv:1111.5551  [pdf, ps, other

    stat.AP

    Generalized Admixture Map** for Complex Traits

    Authors: Bin Zhu, Allison E. Ashley-Koch, David B. Dunson

    Abstract: Admixture map** is a popular tool to identify regions of the genome associated with traits in a recently admixed population. Existing methods have been developed primarily for identification of a single locus influencing a dichotomous trait within a case-control study design. We propose a generalized admixture map** (GLEAM) approach, a flexible and powerful regression method for both quantitat… ▽ More

    Submitted 23 November, 2011; originally announced November 2011.