Skip to main content

Showing 1–50 of 147 results for author: Zhao, Z

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.02424  [pdf, ps, other

    cs.LG math.ST stat.ME

    Contextual Dynamic Pricing: Algorithms, Optimality, and Local Differential Privacy Constraints

    Authors: Zifeng Zhao, Feiyu Jiang, Yi Yu

    Abstract: We study the contextual dynamic pricing problem where a firm sells products to $T$ sequentially arriving consumers that behave according to an unknown demand model. The firm aims to maximize its revenue, i.e. minimize its regret over a clairvoyant that knows the model in advance. The demand model is a generalized linear model (GLM), allowing for a stochastic feature vector in $\mathbb R^d$ that en… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  2. arXiv:2405.13794  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Conditioning diffusion models by explicit forward-backward bridging

    Authors: Adrien Corenflos, Zheng Zhao, Simo Särkkä, Jens Sjölund, Thomas B. Schön

    Abstract: Given an unconditional diffusion model $π(x, y)$, using it to perform conditional simulation $π(x \mid y)$ is still largely an open question and is typically achieved by learning conditional drifts to the denoising SDE after the fact. In this work, we express conditional simulation as an inference problem on an augmented space corresponding to a partial SDE bridge. This perspective allows us to im… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 24 pages, 12 figures

  3. arXiv:2405.09485  [pdf, other

    stat.ME

    Predicting Future Change-points in Time Series

    Authors: Chak Fung Choi, Chunxue Li, Chun Yip Yau, Zifeng Zhao

    Abstract: Change-point detection and estimation procedures have been widely developed in the literature. However, commonly used approaches in change-point analysis have mainly been focusing on detecting change-points within an entire time series (off-line methods), or quickest detection of change-points in sequentially observed data (on-line methods). Both classes of methods are concerned with change-points… ▽ More

    Submitted 23 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

    Comments: 37 pages, 4 figures

    MSC Class: 62M10

  4. arXiv:2404.07451  [pdf, other

    stat.CO

    SNSeg: An R Package for Time Series Segmentation via Self-Normalization

    Authors: Shubo Sun, Zifeng Zhao, Feiyu Jiang, Xiaofeng Shao

    Abstract: Time series segmentation aims to identify potential change-points in a sequence of temporally dependent data, so that the original sequence can be partitioned into several homogeneous subsequences. It is useful for modeling and predicting non-stationary time series and is widely applied in natural and social sciences. Existing segmentation methods primarily focus on only one type of parameter chan… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  5. arXiv:2404.02594  [pdf, other

    stat.ME stat.AP

    Comparison of the LASSO and Integrative LASSO with Penalty Factors (IPF-LASSO) methods for multi-omics data: Variable selection with Type I error control

    Authors: Charlotte Castel, Zhi Zhao, Magne Thoresen

    Abstract: Variable selection in relation to regression modeling has constituted a methodological problem for more than 60 years. Especially in the context of high-dimensional regression, develo** stable and reliable methods, algorithms, and computational tools for variable selection has become an important research topic. Omics data is one source of such high-dimensional data, characterized by diverse gen… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 9 pages, 4 figures

  6. arXiv:2402.12710  [pdf, other

    stat.ME cs.LG stat.ML

    Integrating Active Learning in Causal Inference with Interference: A Novel Approach in Online Experiments

    Authors: Hongtao Zhu, Sizhe Zhang, Yang Su, Zhenyu Zhao, Nan Chen

    Abstract: In the domain of causal inference research, the prevalent potential outcomes framework, notably the Rubin Causal Model (RCM), often overlooks individual interference and assumes independent treatment effects. This assumption, however, is frequently misaligned with the intricate realities of real-world scenarios, where interference is not merely a possibility but a common occurrence. Our research e… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: conference paper

  7. arXiv:2402.05336  [pdf, other

    stat.AP cs.SI

    Treatment Effect Estimation Amidst Dynamic Network Interference in Online Gaming Experiments

    Authors: Yu Zhu, Zehang Richard Li, Yang Su, Zhenyu Zhao

    Abstract: The evolving landscape of online multiplayer gaming presents unique challenges in assessing the causal impacts of game features. Traditional A/B testing methodologies fall short due to complex player interactions, leading to violations of fundamental assumptions like the Stable Unit Treatment Value Assumption (SUTVA). Unlike traditional social networks with stable and long-term connections, networ… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  8. arXiv:2402.00440  [pdf, other

    stat.AP

    Optimal investment, consumption and life insurance decisions for households with consumption habits under the health shock risk

    Authors: Zhen Zhao, Wei Liu, Xiaoyi Tang

    Abstract: This paper investigates the optimal investment, consumption, and life insurance strategies for households under the impact of health shock risk. Considering the uncertainty of the future health status of family members, a non-homogeneous Markov process is used to model the health status of the breadwinner. Drawing upon the theory of habit formation, we investigate the influence of different consum… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  9. arXiv:2401.17008  [pdf, other

    stat.ME

    A Unified Three-State Model Framework for Analysis of Treatment Crossover in Survival Trials

    Authors: Zile Zhao, Ye Li, Xiaodong Luo, Ray Bai

    Abstract: We present a unified three-state model (TSM) framework for evaluating treatment effects in clinical trials in the presence of treatment crossover. Researchers have proposed diverse methodologies to estimate the treatment effect that would have hypothetically been observed if treatment crossover had not occurred. However, there is little work on understanding the connections between these different… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: 27 pages, 5 figures, 2 tables

  10. arXiv:2311.15322  [pdf, other

    stat.ME

    False Discovery Rate Control For Structured Multiple Testing: Asymmetric Rules And Conformal Q-values

    Authors: Zinan Zhao, Wenguang Sun

    Abstract: The effective utilization of structural information in data while ensuring statistical validity poses a significant challenge in false discovery rate (FDR) analyses. Conformal inference provides rigorous theory for grounding complex machine learning methods without relying on strong assumptions or highly idealized models. However, existing conformal methods have limitations in handling structured… ▽ More

    Submitted 16 June, 2024; v1 submitted 26 November, 2023; originally announced November 2023.

  11. arXiv:2310.19608  [pdf, other

    cs.LG stat.ML

    On Feynman--Kac training of partial Bayesian neural networks

    Authors: Zheng Zhao, Sebastian Mair, Thomas B. Schön, Jens Sjölund

    Abstract: Recently, partial Bayesian neural networks (pBNNs), which only consider a subset of the parameters to be stochastic, were shown to perform competitively with full Bayesian neural networks. However, pBNNs are often multi-modal in the latent variable space and thus challenging to approximate with parametric models. To address this problem, we propose an efficient sampling-based training strategy, wh… ▽ More

    Submitted 27 February, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: In AISTATS 2024

  12. arXiv:2310.00646  [pdf, other

    cs.LG cs.AI stat.ML

    WASA: WAtermark-based Source Attribution for Large Language Model-Generated Data

    Authors: **gtan Wang, Xinyang Lu, Zitong Zhao, Zhongxiang Dai, Chuan-Sheng Foo, See-Kiong Ng, Bryan Kian Hsiang Low

    Abstract: The impressive performances of large language models (LLMs) and their immense potential for commercialization have given rise to serious concerns over the intellectual property (IP) of their training data. In particular, the synthetic texts generated by LLMs may infringe the IP of the data being used to train the LLMs. To this end, it is imperative to be able to (a) identify the data provider who… ▽ More

    Submitted 1 October, 2023; originally announced October 2023.

  13. arXiv:2303.17290  [pdf, other

    math.OC stat.AP stat.CO

    Gaussian-Based Parametric Bijections For Automatic Projection Filters

    Authors: Muhammad F. Emzir, Zheng Zhao, Lahouari Cheded, Simo Särkkä

    Abstract: The automatic projection filter is a recently developed numerical method for projection filtering that leverages sparse-grid integration and automatic differentiation. However, its accuracy is highly sensitive to the accuracy of the cumulant-generating function computed via the sparse-grid integration, which in turn is also sensitive to the choice of the bijection from the canonical hypercube to t… ▽ More

    Submitted 21 September, 2023; v1 submitted 30 March, 2023; originally announced March 2023.

  14. arXiv:2303.13895  [pdf, other

    stat.ME math.PR stat.CO

    Stochastic filtering with moment representation

    Authors: Zheng Zhao, Juha Sarmavuori

    Abstract: Stochastic filtering refers to estimating the probability distribution of the latent stochastic process conditioned on the observed measurements in time. In this paper, we introduce a new class of convergent filters that represent the filtering distributions by their moments. The key enablement is a quadrature method that uses orthonormal polynomials spanned by the moments. We prove that this mome… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

    Comments: Code: https://github.com/zgbkdlm/mfs

  15. arXiv:2303.07570  [pdf, ps, other

    stat.ME math.ST

    High-Dimensional Dynamic Pricing under Non-Stationarity: Learning and Earning with Change-Point Detection

    Authors: Zifeng Zhao, Feiyu Jiang, Yi Yu, Xi Chen

    Abstract: We consider a high-dimensional dynamic pricing problem under non-stationarity, where a firm sells products to $T$ sequentially arriving consumers that behave according to an unknown demand model with potential changes at unknown times. The demand model is assumed to be a high-dimensional generalized linear model (GLM), allowing for a feature vector in $\mathbb R^d$ that encodes products and consum… ▽ More

    Submitted 20 March, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

  16. Tutorial on survival modeling with applications to omics data

    Authors: Zhi Zhao, John Zobolas, Manuela Zucknick, Tero Aittokallio

    Abstract: Motivation: Identification of genomic, molecular and clinical markers prognostic of patient survival is important for develo** personalized disease prevention, diagnostic and treatment approaches. Modern omics technologies have made it possible to investigate the prognostic impact of markers at multiple molecular levels, including genomics, epigenomics, transcriptomics, proteomics and metabolomi… ▽ More

    Submitted 4 March, 2024; v1 submitted 24 February, 2023; originally announced February 2023.

    Journal ref: Bioinformatics, 2024

  17. arXiv:2211.11255  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Diffusion Denoising Process for Perceptron Bias in Out-of-distribution Detection

    Authors: Lu** Liu, Yi Ren, Xize Cheng, Rongjie Huang, Chongxuan Li, Zhou Zhao

    Abstract: Out-of-distribution (OOD) detection is a crucial task for ensuring the reliability and safety of deep learning. Currently, discriminator models outperform other methods in this regard. However, the feature extraction process used by discriminator models suffers from the loss of critical information, leaving room for bad cases and malicious attacks. In this paper, we introduce a new perceptron bias… ▽ More

    Submitted 3 June, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

  18. arXiv:2210.14080  [pdf, other

    cs.LG cs.AI cs.SI stat.ME

    Learning Individual Treatment Effects under Heterogeneous Interference in Networks

    Authors: Ziyu Zhao, Yuqi Bai, Kun Kuang, Ruoxuan Xiong, Fei Wu

    Abstract: Estimates of individual treatment effects from networked observational data are attracting increasing attention these days. One major challenge in network scenarios is the violation of the stable unit treatment value assumption (SUTVA), which assumes that the treatment assignment of a unit does not influence others' outcomes. In network data, due to interference, the outcome of a unit is influence… ▽ More

    Submitted 25 January, 2024; v1 submitted 25 October, 2022; originally announced October 2022.

  19. arXiv:2210.05873  [pdf, other

    stat.ME math.ST

    On the testing of multiple hypothesis in sliced inverse regression

    Authors: Zhigen Zhao, Xin Xing

    Abstract: We consider the multiple testing of the general regression framework aiming at studying the relationship between a univariate response and a p-dimensional predictor. To test the hypothesis of the effect of each predictor, we construct an Angular Balanced Statistic (ABS) based on the estimator of the sliced inverse regression without assuming a model of the conditional distribution of the response.… ▽ More

    Submitted 16 June, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

  20. Consistent Covariance estimation for stratum imbalances under minimization method for covariate-adaptive randomization

    Authors: Zixuan Zhao, Yanglei Song, Wenyu Jiang, Dongsheng Tu

    Abstract: Pocock and Simon's minimization method is a popular approach for covariate-adaptive randomization in clinical trials. Valid statistical inference with data collected under the minimization method requires the knowledge of the limiting covariance matrix of within-stratum imbalances, whose existence is only recently established. In this work, we propose a bootstrap-based estimator for this limit and… ▽ More

    Submitted 26 December, 2023; v1 submitted 26 September, 2022; originally announced September 2022.

    Comments: 29 pages, peer reviewed version, will appear in Scandinavian Journal of Statistics

  21. arXiv:2207.12453  [pdf, other

    math.ST stat.ME

    Change point inference in high-dimensional regression models under temporal dependence

    Authors: Haotian Xu, Daren Wang, Zifeng Zhao, Yi Yu

    Abstract: This paper concerns about the limiting distributions of change point estimators, in a high-dimensional linear regression time series context, where a regression object $(y_t, X_t) \in \mathbb{R} \times \mathbb{R}^p$ is observed at every time point $t \in \{1, \ldots, n\}$. At unknown time points, called change points, the regression coefficients change, with the jump sizes measured in $\ell_2$-nor… ▽ More

    Submitted 1 October, 2023; v1 submitted 25 July, 2022; originally announced July 2022.

  22. arXiv:2207.05642  [pdf, other

    astro-ph.IM stat.AP stat.ML

    Scalable Bayesian Inference for Detection and Deblending in Astronomical Images

    Authors: Derek Hansen, Ismael Mendoza, Run**g Liu, Ziteng Pang, Zhe Zhao, Camille Avestruz, Jeffrey Regier

    Abstract: We present a new probabilistic method for detecting, deblending, and cataloging astronomical sources called the Bayesian Light Source Separator (BLISS). BLISS is based on deep generative models, which embed neural networks within a Bayesian model. For posterior inference, BLISS uses a new form of variational inference known as Forward Amortized Variational Inference. The BLISS inference routine is… ▽ More

    Submitted 12 July, 2022; originally announced July 2022.

    Comments: Accepted to the ICML 2022 Workshop on Machine Learning for Astrophysics. 5 pages, 2 figures

  23. arXiv:2207.02985  [pdf, other

    math.OC cs.CV eess.IV q-bio.BM stat.AP

    Orthogonal Matrix Retrieval with Spatial Consensus for 3D Unknown-View Tomography

    Authors: Shuai Huang, Mona Zehni, Ivan Dokmanić, Zhizhen Zhao

    Abstract: Unknown-view tomography (UVT) reconstructs a 3D density map from its 2D projections at unknown, random orientations. A line of work starting with Kam (1980) employs the method of moments (MoM) with rotation-invariant Fourier features to solve UVT in the frequency domain, assuming that the orientations are uniformly distributed. This line of work includes the recent orthogonal matrix retrieval (OMR… ▽ More

    Submitted 10 June, 2023; v1 submitted 6 July, 2022; originally announced July 2022.

    Comments: Keywords: unknown view tomography, single-particle cryo-electron microscopy, method of moments, autocorrelation, spherical harmonics

    MSC Class: 92C55; 68U10; 33C55; 78M05

  24. arXiv:2206.12276  [pdf, other

    cs.SI cs.LG stat.ML

    Multi-Frequency Joint Community Detection and Phase Synchronization

    Authors: Lingda Wang, Zhizhen Zhao

    Abstract: This paper studies the joint community detection and phase synchronization problem on the \textit{stochastic block model with relative phase}, where each node is associated with an unknown phase angle. This problem, with a variety of real-world applications, aims to recover the cluster structure and associated phase angles simultaneously. We show this problem exhibits a \textit{``multi-frequency''… ▽ More

    Submitted 8 December, 2023; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: Fixed a minor error and several typos. Accepted by IEEE Transactions on Signal and Information Processing over Networks

  25. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  26. arXiv:2205.09252  [pdf, other

    stat.ME

    Change-point Detection for Sparse and Dense Functional Data in General Dimensions

    Authors: Carlos Misael Madrid Padilla, Daren Wang, Zifeng Zhao, Yi Yu

    Abstract: We study the problem of change-point detection and localisation for functional data sequentially observed on a general d-dimensional space, where we allow the functional curves to be either sparsely or densely sampled. Data of this form naturally arise in a wide range of applications such as biology, neuroscience, climatology, and finance. To achieve such a task, we propose a kernel-based algorith… ▽ More

    Submitted 18 May, 2022; originally announced May 2022.

  27. arXiv:2205.06306  [pdf, other

    stat.ML eess.SP stat.AP

    Probabilistic Estimation of Instantaneous Frequencies of Chirp Signals

    Authors: Zheng Zhao, Simo Särkkä, Jens Sjölund, Thomas B. Schön

    Abstract: We present a continuous-time probabilistic approach for estimating the chirp signal and its instantaneous frequency function when the true forms of these functions are not accessible. Our model represents these functions by non-linearly cascaded Gaussian processes represented as non-linear stochastic differential equations. The posterior distribution of the functions is then estimated with stochas… ▽ More

    Submitted 13 February, 2023; v1 submitted 12 May, 2022; originally announced May 2022.

    Comments: Accepted for publication in IEEE Transactions on Signal Processing

  28. arXiv:2203.11469  [pdf, other

    stat.ME

    A new class of composite GBII regression models with varying threshold for modelling heavy-tailed data

    Authors: Zhengxiao Li, Fei Wang, Zhengtang Zhao

    Abstract: The four-parameter generalized beta distribution of the second kind (GBII) has been proposed for modelling insurance losses with heavy-tailed features. The aim of this paper is to present a parametric composite GBII regression modelling by splicing two GBII distributions using mode matching method. It is designed for simultaneous modeling of small and large claims and capturing the policyholder he… ▽ More

    Submitted 26 January, 2024; v1 submitted 22 March, 2022; originally announced March 2022.

  29. arXiv:2203.06552  [pdf, other

    cs.CY stat.CO

    Mathematically Quantifying Non-responsiveness of the 2021 Georgia Congressional Districting Plan

    Authors: Zhanzhan Zhao, Cyrus Hettle, Swati Gupta, Jonathan Mattingly, Dana Randall, Gregory Herschlag

    Abstract: To audit political district maps for partisan gerrymandering, one may determine a baseline for the expected distribution of partisan outcomes by sampling an ensemble of maps. One approach to sampling is to use redistricting policy as a guide to precisely codify preferences between maps. Such preferences give rise to a probability distribution on the space of redistricting plans, and Metropolis-Has… ▽ More

    Submitted 9 October, 2022; v1 submitted 12 March, 2022; originally announced March 2022.

    Comments: 29 pages, 20 figures, oral presentation at ACM conference on Equity and Access in Algorithms, Mechanisms, and Optimization, 2022

  30. arXiv:2202.09778  [pdf, other

    cs.CV cs.LG math.NA stat.ML

    Pseudo Numerical Methods for Diffusion Models on Manifolds

    Authors: Lu** Liu, Yi Ren, Zhijie Lin, Zhou Zhao

    Abstract: Denoising Diffusion Probabilistic Models (DDPMs) can generate high-quality samples such as image and audio samples. However, DDPMs require hundreds to thousands of iterations to produce final samples. Several prior works have successfully accelerated DDPMs through adjusting the variance schedule (e.g., Improved Denoising Diffusion Probabilistic Models) or the denoising equation (e.g., Denoising Di… ▽ More

    Submitted 31 October, 2022; v1 submitted 20 February, 2022; originally announced February 2022.

    Comments: ICLR 2022

  31. arXiv:2201.10617  [pdf, other

    cs.HC stat.AP

    Inform Product Change through Experimentation with Data-Driven Behavioral Segmentation

    Authors: Zhenyu Zhao, Yan He, Miao Chen

    Abstract: Online controlled experimentation is widely adopted for evaluating new features in the rapid development cycle for web products and mobile applications. Measurement of the overall experiment sample is a common practice to quantify the overall treatment effect. In order to understand why the treatment effect occurs in a certain way, segmentation becomes a valuable approach to a finer analysis of ex… ▽ More

    Submitted 25 January, 2022; originally announced January 2022.

    Comments: 2017 IEEE International Conference on Data Science and Advanced Analytics (DSAA). IEEE, 2017

  32. arXiv:2112.13199  [pdf, other

    stat.ML cs.LG cs.SI

    A Spectral Method for Joint Community Detection and Orthogonal Group Synchronization

    Authors: Yifeng Fan, Yuehaw Khoo, Zhizhen Zhao

    Abstract: Community detection and orthogonal group synchronization are both fundamental problems with a variety of important applications in science and engineering. In this work, we consider the joint problem of community detection and orthogonal group synchronization which aims to recover the communities and perform synchronization simultaneously. To this end, we propose a simple algorithm that consists o… ▽ More

    Submitted 15 September, 2022; v1 submitted 25 December, 2021; originally announced December 2021.

  33. arXiv:2112.10594  [pdf, other

    math.OC math.DS stat.ME

    Multidimensional Projection Filters via Automatic Differentiation and Sparse-Grid Integration

    Authors: Muhammad Fuady Emzir, Zheng Zhao, Simo Särkkä

    Abstract: The projection filter is a technique for approximating the solutions of optimal filtering problems. In projection filters, the Kushner--Stratonovich stochastic partial differential equation that governs the propagation of the optimal filtering density is projected to a manifold of parametric densities, resulting in a finite-dimensional stochastic differential equation. Despite the fact that projec… ▽ More

    Submitted 14 September, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

  34. arXiv:2112.05331  [pdf, ps, other

    stat.ME math.ST

    Segmenting Time Series via Self-Normalization

    Authors: Zifeng Zhao, Feiyu Jiang, Xiaofeng Shao

    Abstract: We propose a novel and unified framework for change-point estimation in multivariate time series. The proposed method is fully nonparametric, enjoys effortless tuning and is robust to temporal dependence. One salient and distinct feature of the proposed method is its versatility, where it allows change-point detection for a broad class of parameters (such as mean, variance, correlation and quantil… ▽ More

    Submitted 8 September, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

  35. arXiv:2111.12604  [pdf, other

    stat.ME eess.SP stat.ML

    State-space deep Gaussian processes with applications

    Authors: Zheng Zhao

    Abstract: This thesis is mainly concerned with state-space approaches for solving deep (temporal) Gaussian process (DGP) regression problems. More specifically, we represent DGPs as hierarchically composed systems of stochastic differential equations (SDEs), and we consequently solve the DGP regression problem by using state-space filtering and smoothing methods. The resulting state-space DGP (SS-DGP) model… ▽ More

    Submitted 24 November, 2021; originally announced November 2021.

    Comments: See reproducible codes in https://github.com/zgbkdlm/dissertation. Permanent link http://urn.fi/URN:ISBN:978-952-64-0603-9

    Journal ref: Doctoral dissertation, Aalto University, 2021

  36. Non-linear Gaussian smoothing with Taylor moment expansion

    Authors: Zheng Zhao, Simo Särkkä

    Abstract: This letter is concerned with solving continuous-discrete Gaussian smoothing problems by using the Taylor moment expansion (TME) scheme. In the proposed smoothing method, we apply the TME method to approximate the transition density of the stochastic differential equation in the dynamic model. Furthermore, we derive a theoretical error bound (in the mean square sense) of the TME smoothing estimate… ▽ More

    Submitted 4 November, 2021; v1 submitted 30 September, 2021; originally announced October 2021.

    Comments: 5 pages, 1 figure

    Journal ref: IEEE Signal Processing Letters, 2021

  37. Identifying Hidden Visits from Sparse Call Detail Record Data

    Authors: Zhan Zhao, Haris N. Koutsopoulos, **hua Zhao

    Abstract: Despite a large body of literature on trip inference using call detail record (CDR) data, a fundamental understanding of their limitations is lacking. In particular, because of the sparse nature of CDR data, users may travel to a location without being revealed in the data, which we refer to as a "hidden visit". The existence of hidden visits hinders our ability to extract reliable information abo… ▽ More

    Submitted 24 June, 2021; originally announced June 2021.

  38. arXiv:2106.03760  [pdf, other

    cs.LG math.OC stat.ML

    DSelect-k: Differentiable Selection in the Mixture of Experts with Applications to Multi-Task Learning

    Authors: Hussein Hazimeh, Zhe Zhao, Aakanksha Chowdhery, Maheswaran Sathiamoorthy, Yihua Chen, Rahul Mazumder, Lichan Hong, Ed H. Chi

    Abstract: The Mixture-of-Experts (MoE) architecture is showing promising results in improving parameter sharing in multi-task learning (MTL) and in scaling high-capacity neural networks. State-of-the-art MoE models use a trainable sparse gate to select a subset of the experts for each input example. While conceptually appealing, existing sparse gates, such as Top-k, are not smooth. The lack of smoothness ca… ▽ More

    Submitted 31 December, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: Appeared in NeurIPS 2021

  39. arXiv:2105.09695  [pdf, other

    stat.ME stat.CO stat.ML

    Hierarchical Non-Stationary Temporal Gaussian Processes With $L^1$-Regularization

    Authors: Zheng Zhao, Rui Gao, Simo Särkkä

    Abstract: This paper is concerned with regularized extensions of hierarchical non-stationary temporal Gaussian processes (NSGPs) in which the parameters (e.g., length-scale) are modeled as GPs. In particular, we consider two commonly used NSGP constructions which are based on explicitly constructed non-stationary covariance functions and stochastic differential equations, respectively. We extend these NSGPs… ▽ More

    Submitted 20 May, 2021; originally announced May 2021.

    Comments: 20 pages. Submitted to Statistics and Computing

  40. arXiv:2105.06031  [pdf, other

    stat.ML cs.LG cs.SI

    Joint Community Detection and Rotational Synchronization via Semidefinite Programming

    Authors: Yifeng Fan, Yuehaw Khoo, Zhizhen Zhao

    Abstract: In the presence of heterogeneous data, where randomly rotated objects fall into multiple underlying categories, it is challenging to simultaneously classify them into clusters and synchronize them based on pairwise relations. This gives rise to the joint problem of community detection and synchronization. We propose a series of semidefinite relaxations, and prove their exact recovery when extendin… ▽ More

    Submitted 14 September, 2023; v1 submitted 12 May, 2021; originally announced May 2021.

  41. arXiv:2104.14008  [pdf, other

    stat.ME stat.CO

    BayesSUR: An R package for high-dimensional multivariate Bayesian variable and covariance selection in linear regression

    Authors: Zhi Zhao, Marco Banterle, Leonardo Bottolo, Sylvia Richardson, Alex Lewin, Manuela Zucknick

    Abstract: In molecular biology, advances in high-throughput technologies have made it possible to study complex multivariate phenotypes and their simultaneous associations with high-dimensional genomic and other omics data, a problem that can be studied with high-dimensional multi-response regression, where the response variables are potentially highly correlated. To this purpose, we recently introduced sev… ▽ More

    Submitted 28 April, 2021; originally announced April 2021.

    Journal ref: Journal of Statistical Software. 100 (2021) 1-32

  42. arXiv:2103.00674  [pdf, other

    stat.ME cs.LG math.ST stat.AP stat.ML

    BEAUTY Powered BEAST

    Authors: Kai Zhang, Zhigen Zhao, Wen Zhou

    Abstract: We study distribution-free goodness-of-fit tests with the proposed Binary Expansion Approximation of UniformiTY (BEAUTY) approach. This method generalizes the renowned Euler's formula, and approximates the characteristic function of any copula through a linear combination of expectations of binary interactions from marginal binary expansions. This novel theory enables a unification of many importa… ▽ More

    Submitted 16 October, 2023; v1 submitted 28 February, 2021; originally announced March 2021.

  43. arXiv:2102.09964  [pdf, other

    cs.LG stat.CO stat.ME

    Temporal Gaussian Process Regression in Logarithmic Time

    Authors: Adrien Corenflos, Zheng Zhao, Simo Särkkä

    Abstract: The aim of this article is to present a novel parallelization method for temporal Gaussian process (GP) regression problems. The method allows for solving GP regression problems in logarithmic O(log N) time, where N is the number of time steps. Our approach uses the state-space representation of GPs which in its original form allows for linear O(N) time GP regression by leveraging the Kalman filte… ▽ More

    Submitted 17 May, 2021; v1 submitted 19 February, 2021; originally announced February 2021.

  44. Multivariate Bayesian structured variable selection for pharmacogenomic studies

    Authors: Zhi Zhao, Marco Banterle, Alex Lewin, Manuela Zucknick

    Abstract: Precision cancer medicine aims to determine the optimal treatment for each patient. In-vitro cancer drug sensitivity screens combined with multi-omics characterization of the cancer cells have become an important tool to achieve this aim. Analyzing such pharmacogenomic studies requires flexible and efficient joint statistical models for associating drug sensitivity with high-dimensional multi-omic… ▽ More

    Submitted 13 February, 2023; v1 submitted 14 January, 2021; originally announced January 2021.

    Journal ref: Journal of the Royal Statistical Society, Series C. 2024, 73, 420-443

  45. arXiv:2101.03996  [pdf, other

    cs.LG cs.AI stat.ML

    Individual Mobility Prediction: An Interpretable Activity-based Hidden Markov Approach

    Authors: Baichuan Mo, Zhan Zhao, Haris N. Koutsopoulos, **hua Zhao

    Abstract: Individual mobility is driven by demand for activities with diverse spatiotemporal patterns, but existing methods for mobility prediction often overlook the underlying activity patterns. To address this issue, this study develops an activity-based modeling framework for individual mobility prediction. Specifically, an input-output hidden Markov model (IOHMM) framework is proposed to simultaneously… ▽ More

    Submitted 11 January, 2021; originally announced January 2021.

  46. arXiv:2012.00460  [pdf, ps, other

    math.ST stat.ME

    Functional Linear Regression with Mixed Predictors

    Authors: Daren Wang, Zifeng Zhao, Yi Yu, Rebecca Willett

    Abstract: We study a functional linear regression model that deals with functional responses and allows for both functional covariates and high-dimensional vector covariates. The proposed model is flexible and nests several functional regression models in the literature as special cases. Based on the theory of reproducing kernel Hilbert spaces (RKHS), we propose a penalized least squares estimator that can… ▽ More

    Submitted 23 August, 2022; v1 submitted 1 December, 2020; originally announced December 2020.

  47. arXiv:2011.13993  [pdf, ps, other

    stat.ME

    Functional Autoregressive Processes in Reproducing Kernel Hilbert Spaces

    Authors: Daren Wang, Zifeng Zhao, Rebecca Willett, Chun Yip Yau

    Abstract: We study the estimation and prediction of functional autoregressive~(FAR) processes, a statistical tool for modeling functional time series data. Due to the infinite-dimensional nature of FAR processes, the existing literature addresses its inference via dimension reduction and theoretical results therein require the (unrealistic) assumption of fully observed functional time series. We propose an… ▽ More

    Submitted 27 November, 2020; originally announced November 2020.

  48. arXiv:2009.10064  [pdf, other

    quant-ph cs.CR cs.LG stat.ML

    Optimal Provable Robustness of Quantum Classification via Quantum Hypothesis Testing

    Authors: Maurice Weber, Nana Liu, Bo Li, Ce Zhang, Zhikuan Zhao

    Abstract: Quantum machine learning models have the potential to offer speedups and better predictive accuracy compared to their classical counterparts. However, these quantum algorithms, like their classical counterparts, have been shown to also be vulnerable to input perturbations, in particular for classification problems. These can arise either from noisy implementations or, as a worst-case type of noise… ▽ More

    Submitted 26 May, 2021; v1 submitted 21 September, 2020; originally announced September 2020.

    Comments: 28 pages, 5 figures

    Journal ref: npj Quantum Information 7, 76 (2021)

  49. arXiv:2009.08868  [pdf

    q-bio.BM cs.LG stat.ML

    Review of Machine-Learning Methods for RNA Secondary Structure Prediction

    Authors: Qi Zhao, Zheng Zhao, Xiaoya Fan, Zhengwei Yuan, Qian Mao, Yudong Yao

    Abstract: Secondary structure plays an important role in determining the function of non-coding RNAs. Hence, identifying RNA secondary structures is of great value to research. Computational prediction is a mainstream approach for predicting RNA secondary structure. Unfortunately, even though new methods have been proposed over the past 40 years, the performance of computational prediction methods has stagn… ▽ More

    Submitted 31 August, 2020; originally announced September 2020.

    Comments: 25 pages, 5 figures, 1 table

    MSC Class: I.2.0 General

  50. arXiv:2008.05808  [pdf, other

    cs.LG stat.ML

    Small Towers Make Big Differences

    Authors: Yuyan Wang, Zhe Zhao, Bo Dai, Christopher Fifty, Dong Lin, Lichan Hong, Ed H. Chi

    Abstract: Multi-task learning aims at solving multiple machine learning tasks at the same time. A good solution to a multi-task learning problem should be generalizable in addition to being Pareto optimal. In this paper, we provide some insights on understanding the trade-off between Pareto efficiency and generalization as a result of parameterization in multi-task deep learning models. As a multi-objective… ▽ More

    Submitted 13 August, 2020; originally announced August 2020.